Indonesian Twitter Emotion Recognition using Feature Engineering

This research explores feature engineering techniques to find the best features for building an emotion recognition model on an Indonesian Twitter dataset. Two text representations are used: TF-IDF and word embeddings.

This research proposes 12 feature engineering configurations for TF-IDF by combining data stemming, data augmentation, and machine learning classifiers. It also proposes 27 feature engineering configurations for word embeddings by combining three word embedding models, three pooling techniques, and three machine learning classifiers (3 × 3 × 3 = 27). In total, there are 39 feature engineering configurations. The full report can be seen in this link.
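The configuration counts above can be reproduced with a small enumeration. This is only a sketch: the README does not name the embedding models, pooling techniques, or classifiers, so the labels below are placeholders. It also assumes that stemming and augmentation are each on/off switches and that the TF-IDF experiments use the same three classifiers, which is one way to arrive at 2 × 2 × 3 = 12.

```python
from itertools import product

# Placeholder labels: the README does not name the specific models,
# pooling techniques, or classifiers, so these are hypothetical.
CLASSIFIERS = ["classifier_a", "classifier_b", "classifier_c"]
EMBEDDINGS = ["embedding_a", "embedding_b", "embedding_c"]
POOLINGS = ["pooling_a", "pooling_b", "pooling_c"]

# Assumption: stemming and augmentation are each either applied or not,
# and TF-IDF is paired with the same three classifiers (2 x 2 x 3 = 12).
tfidf_configs = [
    {"representation": "tfidf", "stemming": s, "augmentation": a, "classifier": c}
    for s, a, c in product([False, True], [False, True], CLASSIFIERS)
]

# Stated in the README: three embedding models, three pooling techniques,
# and three classifiers (3 x 3 x 3 = 27).
embedding_configs = [
    {"representation": "embedding", "embedding": e, "pooling": p, "classifier": c}
    for e, p, c in product(EMBEDDINGS, POOLINGS, CLASSIFIERS)
]

all_configs = tfidf_configs + embedding_configs
print(len(tfidf_configs), len(embedding_configs), len(all_configs))  # 12 27 39
```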

Dataset & Experiment Results

Dataset
The preprocessed dataset is derived from the Indonesian Twitter Emotion Dataset by Saputri et al. Preprocessing consists of case-folding, removal of irrelevant information, standardization, stemming, and stop-word removal. The preprocessed data can be downloaded as this CSV file.
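As a rough illustration of some of these steps, the sketch below applies case-folding, a guess at what "irrelevant information" might cover (URLs, mentions, hashtags, digits, punctuation), and stop-word removal. It is not the authors' pipeline: the function name `preprocess` and the tiny stop-word list are hypothetical, and standardization and stemming are left out because they need Indonesian-specific resources that the README does not specify.

```python
import re

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"yang", "dan", "di", "ke", "ini", "itu"}

def preprocess(text, stop_words=STOP_WORDS):
    """Case-fold, strip tweet noise, and drop stop words (sketch only)."""
    text = text.lower()                          # case-folding
    text = re.sub(r"https?://\S+", " ", text)    # assumed irrelevant: URLs
    text = re.sub(r"[@#]\w+", " ", text)         # assumed irrelevant: mentions, hashtags
    text = re.sub(r"[^a-z\s]", " ", text)        # assumed irrelevant: digits, punctuation
    tokens = [t for t in text.split() if t not in stop_words]
    return " ".join(tokens)

print(preprocess("Senang banget hari ini! @teman https://t.co/abc"))
# senang banget hari
```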
Experiment Results
The full experiment results are available in this CSV file, sorted by F1 score in descending order.
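A minimal sketch for inspecting the best configurations with the standard library follows. The helper `top_rows` and the column name `f1_score` are assumptions; check the CSV header for the actual column names before using it.

```python
import csv

def top_rows(csv_file, score_column, n=5):
    """Return the n rows with the highest numeric value in score_column."""
    rows = list(csv.DictReader(csv_file))
    rows.sort(key=lambda r: float(r[score_column]), reverse=True)
    return rows[:n]

# Usage (the column name is hypothetical):
# with open("full-experiment result.csv", newline="", encoding="utf-8") as f:
#     best = top_rows(f, score_column="f1_score")
```

Sorting again is redundant if the file is already ordered, but it keeps the helper usable on filtered or reordered copies of the results.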

BibTeX Citation

If you use this dataset in a scientific publication, we would appreciate a citation of the following article:

@article{Sutoyo2023,
  title = {Indonesian Twitter Emotion Recognition Model using Feature Engineering},
  journal = {International Journal of Advanced Computer Science and Applications},
  doi = {10.14569/IJACSA.2023.01412108},
  url = {http://dx.doi.org/10.14569/IJACSA.2023.01412108},
  year = {2023},
  publisher = {The Science and Information Organization},
  volume = {14},
  number = {12},
  author = {Rhio Sutoyo and Harco Leslie Hendric Spits Warnars and Sani Muhamad Isa and Widodo Budiharto}
}

