GitHub - LaureBerti/Learn2Clean: Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning

Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Data Cleaning

Learn2Clean is a Python library for data preprocessing and cleaning based on Q-Learning, a model-free reinforcement learning technique. It selects, for a given dataset, a ML model, and a quality performance metric, the optimal sequence of tasks for preparing the data such that the quality of the ML model result is maximized.

You can try it for composing your own data preprocessing pipelines or for automizing data preparation before clustering, regression, and classification.

For more details, please refer to the paper presented at the Web Conf 2019 and the related tutorial.

Laure Berti-Equille. Learn2Clean: Optimizing the Sequence of Tasks for Web Data Preparation. Proceedings of the Web Conf 2019, San Francisco, May 2019. Preprint
Laure Berti-Equille. ML to Data Management: A Round Trip. Tutorial Part I, ICDE 2018. Tutorial

How to Contribute

Learn2Clean is a research prototype. Your help is very valuable to make it better for everyone.

Check out call for contributions to see what can be improved, or open an issue if you want something.
Contribute to the tests to make it more reliable.
Contribute to the documents to make it clearer for everyone.
Contribute to the examples to share your experience with other users.
Open issue if you met problems during development.

For more details, please refer to CONTRIBUTING.

Licence

Learn2Clean is licensed under the BSD 3-Clause "New" or "Revised" License.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
datasets		datasets
docs		docs
examples		examples
html		html
python-package		python-package
save		save
.coveragerc		.coveragerc
.gitignore		.gitignore
.travis.yml		.travis.yml
AUTHORS.rst		AUTHORS.rst
LICENSE		LICENSE
Makefile		Makefile
readme.rst		readme.rst
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to Contribute

Licence

About

Releases

Packages

Languages

License

LaureBerti/Learn2Clean

Folders and files

Latest commit

History

Repository files navigation

How to Contribute

Licence

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages