GitHub - tmro98/financial-news-classification: binary topic and 3-class sentiment classification of full-length news articles

Classification of full-length news articles

task	representation/model	metric
binary topic classification	RoBERTa-large fine-tuned	93% accuracy, 91% recall (test size = 0.2, 2890)
3-class sentiment classification	RoBERTa-large fine-tuned	75% accuracy (test size = 0.2, 1530)

class	#
financial	7647
non-financial	6799

*a major source of uncertainty and error(during manual classification and modeling),

Resources

Evaluation of Sentiment Analysis in Finance: From Lexicons to Transformers
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9142175

FinBERT: Financial Sentiment Analysis with Pre-trained Language Models
https://arxiv.org/pdf/1908.10063.pdf

How to Fine-Tune BERT for Text Classification?
https://arxiv.org/pdf/1905.05583.pdf

Financial Sentiment Analysis: An Investigation into Common Mistakes and Silver Bullets
https://aclanthology.org/2020.coling-main.85.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
binary_topic_RoBERTa_finetuned.ipynb		binary_topic_RoBERTa_finetuned.ipynb
multiclass_sentiment_RoBERTa_finetuned.ipynb		multiclass_sentiment_RoBERTa_finetuned.ipynb
readme.md		readme.md