Predicted Trend Value of a Time Series
A time series shows data on a variable’s outcome in different periods—for example, a time series showing EURUSD exchange rates between a given time interval. Time series models explain the past and predict the future of a time series using…
Evaluating the Fit of a Machine Learning Algorithm
Model Training Suppose that the target variable (y) for the ML training model has the sentiment class labels (positive and negative). To ease the calculation of the performance metrics, we relabel them as 1 (for positive) and 0 (for negative)….
Feature Extraction, Selection, and Engineering of Textual Data
Feature Extraction Feature extraction entails mapping the textual data to real-valued vectors. After the text has been normalized, the next step is to create a bag-of-words (BOW). It is a representation of analyzing text. It does not, however, represent the…
Preparing, Wrangling, and Exploring Textual Data for Financial Forecasting
Sentiment analysis refers to the analysis of opinions or emotions from text data. In other words, it refers to how positive, negative, or neutral a particular phrase or statement is regarding a “target.” Such sentiment can provide critical predictive power…
Model Training
Machine learning (ML) model training entails three tasks: method selection, performance evaluation, and tuning. While there are no standard rules for training an ML model, having a fundamental understanding of domain-specific training data and ML algorithm principles is key to…
Data Exploration
The main objective of data exploration is to investigate and comprehend data distributions and relationships. Data exploration involves three critical tasks: exploratory data analysis, feature selection, and feature engineering. Exploratory Data Analysis Exploratory Data Analysis (EDA) is the first step…
Preparing and Wrangling Data
Data preparation and wrangling is a crucial step that entails cleaning and organizing raw data in a consolidated format that allows for more convenient consumption of the data. Data collection precedes the data preparation and wrangling stage. Recall that before…
Steps in a Data Analysis Project
The term “big data” refers to structured or unstructured data that is significant, fast, or complex, thus strenuous or even impossible to process using traditional methods. The incorporation of big data has prompt implications for building a machine learning model…
Neural Networks (NNs), Deep Learning Nets (DLNs), and Reinforcement Learning (RL)
Neural networks, deep learning nets, and reinforcement learning are sophisticated algorithms that handle complex tasks with non-linearity and interactions between large numbers of feature inputs. Some of these complicated tasks include image classification, speech recognition, and face recognition. We describe…
Unsupervised Machine Learning Algorithms
Recall that unlike supervised learning, unsupervised learning does not use labeled data. The algorithm finds patterns within the data. The two main categories of unsupervised ML algorithms are dimension reduction, using principal components analysis, and clustering, which includes k-means and…