This repository showcases a research project that uses Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRU) to predict cryptocurrency trends. The study evaluates these time series models on historical Bitcoin price data from the last four years, a period shaped by significant market events such as COVID-19 and the 2024 halving.
As Machine Learning (ML) and Finance have increasingly intersected in recent years, this study aims to bridge the two fields by using ML models to predict cryptocurrency time series data accurately. The project evaluates time series models, namely Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), to investigate their performance on historical price data without bias. The dataset covers the last four years, a period whose market conditions make it well suited to model analysis. Model performance is measured through metrics such as Mean Squared Error, using data sourced from public interfaces such as Yahoo Finance.
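As a rough illustration of the models under evaluation, the sketch below builds a small LSTM regressor trained against a Mean Squared Error loss. The use of TensorFlow/Keras, the window length, and the layer sizes are assumptions for illustration, not the project's exact configuration.

```python
# Minimal sketch of an LSTM price regressor (TensorFlow/Keras assumed;
# window length and layer sizes are illustrative, not the project's settings).
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, GRU, Dense

WINDOW = 30  # days of history per sample (assumed)

model = Sequential([
    LSTM(64, input_shape=(WINDOW, 1)),  # swap for GRU(64) to compare the two cells
    Dense(1),                           # next-day closing price
])
model.compile(optimizer="adam", loss="mse")  # MSE matches the evaluation metric

# X_train: (samples, WINDOW, 1) sliding windows of scaled prices; y_train: next-day price
# model.fit(X_train, y_train, epochs=50, batch_size=32, validation_split=0.1)
```

Swapping the LSTM layer for a GRU layer of the same size gives a direct comparison between the two recurrent cells under otherwise identical conditions.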
- Open the VSCode terminal and run `git clone https://github.com/xLightless/uni-machine-learning.git`.
- Then create a virtual environment with `py -m venv .venv`.
- Next, activate the virtual environment (`.venv\Scripts\activate` on Windows, or `source .venv/bin/activate` on macOS/Linux).
- Run `pip install -r requirements.txt` to install the dependencies.
- Open `main.ipynb` and run the install requirements cell.
Data can be acquired publicly via Yahoo Finance. This project uses an unofficial API to collect that data. The API also has a large community behind it, which makes it a good choice when exploring further Machine Learning configurations such as SVMs with Particle Swarm Optimisation (PSO).
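As a sketch of how this collection step might look, assuming the unofficial `yfinance` package is the interface in question, daily BTC-USD candles can be downloaded as follows; the date range is illustrative.

```python
# Sketch of pulling daily Bitcoin prices from Yahoo Finance via the
# unofficial yfinance package (assumed here); the date range is illustrative.
import yfinance as yf

btc = yf.download("BTC-USD", start="2020-01-01", end="2024-01-01", interval="1d")
print(btc.head())  # OHLCV columns, indexed by date
```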
Support Vector Machines (SVM) are supervised learning algorithms used for classification and regression tasks. They separate inputs, or features, into classes, with the training points closest to the decision boundary acting as the support vectors. SVM employs kernel functions, such as the Radial Basis Function (RBF) and linear kernels, to classify features in higher-dimensional spaces. The study investigates the performance of linear models, which define decision boundaries (hyperplanes) for the input features. Hyperparameter optimization is conducted using Grid Search Cross Validation to enhance accuracy and precision, as sketched below.
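A minimal sketch of that tuning step, assuming scikit-learn's `SVC` and `GridSearchCV`, is shown below; the parameter grid is illustrative rather than the project's actual search space.

```python
# Sketch of SVM hyperparameter tuning with Grid Search Cross Validation
# (scikit-learn assumed; the parameter grid below is illustrative).
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV

param_grid = {
    "kernel": ["linear", "rbf"],   # the kernels discussed above
    "C": [0.1, 1, 10],             # regularisation strength
    "gamma": ["scale", "auto"],    # RBF kernel width
}

search = GridSearchCV(SVC(), param_grid, cv=5, scoring="accuracy")
# search.fit(X_train, y_train)   # X_train/y_train: scaled features and class labels
# print(search.best_params_, search.best_score_)
```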
Ensemble methods combine multiple base learning models, particularly decision trees, to improve predictive performance. Techniques such as Random Forest and boosting methods like XGBoost and AdaBoost are utilized to enhance accuracy while managing time complexity. For this project, a Random Forest classifier is employed as the base model, augmented by Adaptive Boosting to achieve high predictive accuracy without requiring feature scaling.
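A minimal sketch of that combination, assuming scikit-learn, is shown below; the tree counts, depth, and random seed are illustrative.

```python
# Sketch of Adaptive Boosting over a Random Forest base estimator
# (assumes scikit-learn >= 1.2, where the argument is named `estimator`;
# tree counts, depth, and seed are illustrative).
from sklearn.ensemble import RandomForestClassifier, AdaBoostClassifier

base = RandomForestClassifier(n_estimators=100, max_depth=5, random_state=42)
model = AdaBoostClassifier(estimator=base, n_estimators=50, random_state=42)

# model.fit(X_train, y_train)        # no feature scaling needed for tree-based models
# accuracy = model.score(X_test, y_test)
```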
For additional reading, refer to the following resources:
- Does Random Forest Need Feature Scaling or Normalization? (2023)
- Lecture 3: The Perceptron (2024)
- Lecture 9: SVM (2024)
- YouTube Video on SVM
In addition to the time series models, the project explores Support Vector Machines (SVM) and ensemble methods. Key aspects of the data preparation workflow include:
- Domain Knowledge: Understanding the underlying data.
- Data Cleaning: Managing missing values for accuracy.
- Feature Engineering: Transforming features to enhance model performance.
- Normalization/Standardization: Adjusting data to similar ranges for improved analysis (see the scaling sketch below).
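A minimal sketch of the scaling step, assuming scikit-learn's `MinMaxScaler`, is shown below; the DataFrame and its columns are placeholders.

```python
# Sketch of adjusting features to a similar range (MinMaxScaler assumed;
# the DataFrame and its columns are placeholders).
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

df = pd.DataFrame({"Close": [100.0, 105.0, 98.0], "Volume": [1200, 900, 1500]})

scaler = MinMaxScaler(feature_range=(0, 1))
scaled = pd.DataFrame(scaler.fit_transform(df), columns=df.columns, index=df.index)
print(scaled)  # every column now lies in [0, 1]
```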