To get this code on GitHub, it was copied from somewhere else on my local machine, so a lot of the paths may be broken. Currently, this project is a work in progress. The main idea is to have a graph neural network encode the molecular strucure and properties of different bioactive molecules found on the internet. Then, using these encodings, we will attempt to decode the original molecule, some molecular characterisitcs and much more. I have ideas for a few novel pretraining tasks that will start to take shape as I write this project out more and more. The main libraries I am using for this project are Numpy, Pytorch Geometric, and RdKit. All of this is written in Python 3. Before this project is finished, if you have any questions or just want to reach out, you can contact me at thomasgust@seattleacademy.org.