Package that uses dimensionality reduction and classification model from sklearn, keeping compatibility with the rest of the sklearn framework
The package can be installed by directly cloning from repository:
git clone https://github.com/simomounir/GeneClassifier.git
Or by running pip
:
pip install git+https:https://github.com/simomounir/GeneClassifier.git
- [sklearn-url]: https://scikit-learn.org/stable/
- [seaborn-url]: https://seaborn.pydata.org/
- [smote-url]: https://imbalanced-learn.org/stable/references/generated/imblearn.over_sampling.SMOTE.html
As a start, two importants input parameters are passed when creating an instance of GeneClassifier: - Model: a valid model from the sklearn family. - Reduction: a dimensionality reduction method.
Python 3.8 o higher required to run the class.
The GeneClassifier class can take the prediction model and reduction method as inputs. The class also takes care of generating the plots after reduction and also balances the labels in order to avoid a big discrepancy in the different label classes. the method used for balancing is called SMOTE (Synthetic Minority Oversampling Technique).
A Grid Search is alro run to find the combination of parameters that gives the best prediction results.
This project has been set up using PyScaffold 4.3.1. For details and usage information:
PyScaffold see https://pyscaffold.org/.