This repository contains five lectures and three workshop sessions on introducing machine learning concepts in the advanced physical chemistry module at UoE.
Dr Antonia Mey -- antonia.mey@ed.ac.uk.
Dr Matteo Degiacomi -- matteo.t.degiacomi@durham.ac.uk
Jasmin Güven, Rohan Gorantla, Ryan Zhu
Ryan Zhu and Dominic Philips
-
Install anaconda.
-
Create a new environment:
conda create -n ml_chem
-
Activate the environment:
conda activate ml_chem
-
Install mamba to make the installation of packages faster.
conda install -c conda-forge mamba
-
Install all the required packages with mamba:
mamba install -c conda-forge scikit-learn matplotlib pandas
For Unit_03 you will also need to install
mamba install -c conda-forge rdkit seaborn
and
mamba install pytorch torchvision torchinfo -c pytorch
Release: week 4
Report Deadline: TBC
Weight: 20%
- What is machine learning?
- Examples of machine learning (in Chemistry)
- Linear Regressions
- Introduction to unsupervised learning: - Clustering (k-means and others)
- Chemistry data is high dimensional
- Dimensionality reduction:
- Principle component analysis (PCA)
- Time lagged component analysis (tICA)
- t-stochastic neighbour embedding (t-SNE)
- Classification problems
- Classifications in practice:
- Random Forests
- Support vector machine
- Shallow Learning
- Deep Learning part I
- Multilayer perceptron
- Deep Learning Part II
- Transformers
- Graph Neural Networks
- Understand the main pillars of machine learning
- Know about different clustering techniques as part of unsupervised learning
- Be able to use common nomenclature used in machine learning
- Use Principle component analysis to reduce the dimensions of a data set
- Understand how a regression problem can be cast as a machine learning problem
- Be aware of how random forests and multilayer perceptrons can be used in a classification problem
A handout with additional resources can be found here.