Skip to content

Indonesian Twitter Emotion Recognition using Feature Engineering

License

Notifications You must be signed in to change notification settings

rhiosutoyo/emotion-recognition-model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Indonesian Twitter Emotion Recognition using Feature Engineering

This research explores feature engineering techniques to find the best features for building an emotion recognition model on the Indonesian Twitter dataset. Two different text data representations were used, namely TF-IDF and word embedding.

This research proposed 12 feature engineering configurations in TF-IDF by combining data stemming, data augmentation, and machine learning classifiers. Moreover, this research proposed 27 feature engineering configurations in word embedding by combining three-word embedding models, three pooling techniques, and three machine-learning classifiers. In total, there are 39 feature engineering combinations. The full report can be seen in this link.

Dataset & Experiment Results

  • Dataset
    The preprocessed dataset was derived from the Indonesian Twitter Emotion Dataset from Saputri et al.. In the data preprocessing, this work performs case-folding, removes irrelevant information, applies standardization, data stemming, and removes stop words. The file can be downloaded in this CSV file.

  • Experiment Results
    The full results of the experiment can be seen in this CSV file. The order is sorted by f1 score (descending).

BibTeX Citation

If you use this dataset in a scientific publication, we would appreciate using the following citations:

@article{Sutoyo2023,
  title = {Indonesian Twitter Emotion Recognition Model using Feature Engineering},
  journal = {International Journal of Advanced Computer Science and Applications},
  doi = {10.14569/IJACSA.2023.01412108},
  url = {http://dx.doi.org/10.14569/IJACSA.2023.01412108},
  year = {2023},
  publisher = {The Science and Information Organization},
  volume = {14},
  number = {12},
  author = {Rhio Sutoyo and Harco Leslie Hendric Spits Warnars and Sani Muhamad Isa and Widodo Budiharto}
}

About

Indonesian Twitter Emotion Recognition using Feature Engineering

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published