DataGorri

DataGorri is an application for extracting data from tables on websites.

Getting Started

See the quick start guide (DataGorri_Manual_1.1.pdf) or the Wiki page, which cover most topics.

Prerequisites

On operating systems such as Mac OS or Ubuntu, DataGorri is run from source: install Python 3, then start the program from the console/terminal. The documentation for developers describes how to install Python and the necessary third-party libraries.

Installing

Windows:

An installer is provided for Windows (32-bit and 64-bit). Installation is straightforward and needs no further explanation.

Mac OS, Ubuntu:

Clone the project, or download it as a .zip file and extract it. Then run the following in a terminal:

cd {datagorri}
python3 DataGorri.py

Examples

Steps to scrape a table:

  • Create a page model that defines the content you are interested in.
  • Collect links of websites that should be scraped.
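
To illustrate the idea behind these two steps, here is a minimal sketch in plain Python. It is not DataGorri's implementation, and none of the names come from the project: `TableParser`, `scrape`, the `page_model` dictionary layout, and the sample HTML are all hypothetical. The sketch assumes a page model can be reduced to "which table, which columns", and it uses an inline HTML string in place of a page fetched from a collected link so that it runs offline. Nested tables are not handled.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collects every <table> in a page as a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per <table> seen so far
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def scrape(html, page_model):
    """Return the rows of the selected table, reduced to the wanted columns."""
    parser = TableParser()
    parser.feed(html)
    table = parser.tables[page_model["table_index"]]
    header, rows = table[0], table[1:]
    wanted = [header.index(name) for name in page_model["columns"]]
    return [[row[i] for i in wanted] for row in rows]


# Hypothetical page model: first table on the page, two of its columns.
PAGE_MODEL = {"table_index": 0, "columns": ["Item", "Price"]}

# Made-up sample page standing in for the content behind one collected link.
SAMPLE_HTML = (
    "<table>"
    "<tr><th>Item</th><th>Unit</th><th>Price</th></tr>"
    "<tr><td>apple</td><td>kg</td><td>1.20</td></tr>"
    "<tr><td>pear</td><td>kg</td><td>0.80</td></tr>"
    "</table>"
)

for row in scrape(SAMPLE_HTML, PAGE_MODEL):
    print(row)
```

With real links, the HTML string would come from downloading each page (for example with `urllib.request.urlopen`) before passing it to `scrape`; the same page model is then applied to every link in the list.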

For some examples, have a look at the samples folder.

Contributing

Please read CONTRIBUTING.md for details on our code of conduct and the process for submitting pull requests.

Versioning

For the versions available, see the releases on this repository.

Authors

Further, the following persons (in alphabetical order) have contributed to the current or previous versions of this software and agreed to being named as contributors:

  • Ivaylo Dimitrov
  • Matthias Franze
  • Julian Hackinger
  • Stefan Hentschel
  • Lukas Holzner
  • Florian Kreitmair
  • Daniel Krieger
  • Michael Legenc
  • Marc Müller

See also the list of contributors who participated in this project.

License

This project is licensed under the Citeware License; see the LICENSE.txt file for details.