Corpus on software library classification
This dataset was used for the DATA2021 conference paper "Similarity of Software Libraries: A Tag-based Classification Approach". Besides the data, we also provide results, including various models, confusion matrices, and metrics. Please contact us if you need any further details.
The software in this repository is available under the MIT license (LICENSE).
The corpus itself, however, is provided under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license. By using the corpus, you agree to this license.