ETDMiner consists of multiple AI-based applications and datasets that help parse, extract, classify, and mine Electronic Theses and Dissertations (ETDs).
This application is version 1.1 of etd_crf, which automatically extracts metadata from scanned ETDs.
It contains the dataset used to extract metadata from scanned ETDs.
This is version 1.0 of the AutoMeta tool.
This is an ETD segmentation tool that classifies ETD pages.
This is an application that fills in missing metadata in the database (i.e., pates_etds).
It contains the sample dataset that was tested in the above process.
It contains a handful of source files for preprocessing the dataset.
It contains the code and instructions to get ETDs as HTML files.
It contains the crawlers and parsers developed for different universities to collect ETDs and extract metadata from their webpages.
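
To give a rough idea of what the parser side of such a tool might look like, here is a minimal sketch that pulls `<meta>` tags out of an ETD landing page using only the Python standard library. It is an illustration under stated assumptions, not the ETDMiner code: the names `MetaTagParser` and `extract_metadata`, the sample page, and the `citation_*` tag names are all hypothetical, and real university repositories may expose metadata in quite different markup.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collect <meta name="..." content="..."> pairs from an HTML page.

    Hypothetical helper for illustration; not part of ETDMiner.
    """

    def __init__(self):
        super().__init__()
        # Maps each meta name to the list of content values seen for it,
        # since a page may repeat a name (e.g. several authors).
        self.metadata = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = attr_map.get("name")
        content = attr_map.get("content")
        # Skip meta tags that lack either a name or a content value.
        if name and content:
            self.metadata.setdefault(name, []).append(content)


def extract_metadata(html):
    """Return a dict of meta-tag names to lists of content strings."""
    parser = MetaTagParser()
    parser.feed(html)
    parser.close()
    return parser.metadata


# Made-up landing page; the tag names are assumptions, not a known schema
# of any particular university repository.
SAMPLE_PAGE = """
<html><head>
<meta name="citation_title" content="A Hypothetical Thesis Title">
<meta name="citation_author" content="Doe, Jane">
<meta name="citation_publication_date" content="2001">
</head><body></body></html>
"""

if __name__ == "__main__":
    for field, values in extract_metadata(SAMPLE_PAGE).items():
        print(field, values)
```

In practice a crawler would first download each page and then pass its HTML to a site-specific parser along these lines; the download step is omitted here to keep the sketch self-contained.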