This repository has been archived by the owner on May 8, 2024. It is now read-only.

Segmentation training & eval data #460

Answered by ninpnin
Lauler asked this question in Q&A
  • We are in the process of creating proper training, validation and test sets for paragraph classification covering the whole 1867-2023 period (#189), and will subsequently train a new, improved classifier on them (#462).
  • The training set you found is for an older version of the classifier. Images were included in that data set to make the annotators' job easier. We have links now, so images are no longer necessary. The classification itself is (and will probably remain) text-based.
  • The code for text-based paragraph classification is at https://github.com/welfare-state-analytics/bert-riksdagen-classifier/, which also contains another data set that, IIRC, is sampled from the 1920-1989 era, di…

Answer selected by Lauler