SemEval-2019-task-6-HAD

Evaluation of offensive tweets with target classification. For more details, see CodaLab: OffensEval 2019 (SemEval 2019 - Task 6).

Sub-tasks

Sub-task A - Offensive language identification (Offensive / Not Offensive)

15 Jan 2019: Test data release - 17 Jan 2019: Submission deadline

Sub-task B - Automatic categorization of offense types (Targeted Insult and Threats / Untargeted)

22 Jan 2019: Test data release - 24 Jan 2019: Submission deadline

Sub-task C - Offense target identification (Target: Individual / Group / Other)

29 Jan 2019: Test data release - 31 Jan 2019: Submission deadline

Contributors

Himanshu Bansal University of Tübingen
Daniel Nagel University of Tübingen
Anita Soloveva Lomonosov MSU, University of Tübingen

Preprocessing

Removing URLs and @USER
Parsing hashtags (see Christos Baziotis et al. 2017)
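
The two preprocessing steps above can be sketched roughly as follows. This is a minimal illustration, not the code used in this repository: the function names and regular expressions are assumptions, and the hashtag handling is a simple camel-case split standing in for the more elaborate segmentation described by Baziotis et al. If the data set already replaces links with a placeholder token, that token would need to be stripped instead of raw URLs.

```python
import re

# Hypothetical patterns; the repository's actual rules are not shown in the README.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
USER_RE = re.compile(r"@USER\b")
HASHTAG_RE = re.compile(r"#(\w+)")
# Zero-width split points: lower->Upper, letter->digit, digit->letter.
SPLIT_RE = re.compile(
    r"(?<=[a-z])(?=[A-Z])|(?<=[A-Za-z])(?=\d)|(?<=\d)(?=[A-Za-z])"
)


def split_hashtag(match):
    """Turn '#MakeItWork' into 'Make It Work' (camel-case heuristic only)."""
    body = match.group(1).replace("_", " ")
    return SPLIT_RE.sub(" ", body)


def preprocess(tweet):
    """Remove URLs and @USER mentions, then expand hashtags into words."""
    tweet = URL_RE.sub(" ", tweet)
    tweet = USER_RE.sub(" ", tweet)
    tweet = HASHTAG_RE.sub(split_hashtag, tweet)
    return " ".join(tweet.split())  # collapse leftover whitespace


print(preprocess("@USER check https://t.co/abc #MakeAmericaGreatAgain now"))
```

An all-lowercase hashtag such as `#notmypresident` is left as a single word by this heuristic; splitting it would need a dictionary-based segmenter.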

Our approaches

We use a long short-term memory (LSTM) network.
LSTM with fastText vectors
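
As an illustration of how word vectors might be turned into LSTM input, the sketch below reads vectors in the fastText text (`.vec`) layout, which starts with a `count dim` header line followed by one `word v1 v2 ...` line per word, and maps a token list to a fixed-length sequence of vectors. The helper names `load_vec` and `embed`, the zero vector for unknown words, and the padding length are all assumptions made for this example; the README does not say which file format or padding scheme the model actually uses.

```python
import io


def load_vec(handle):
    """Read vectors from a file-like object in fastText .vec text format."""
    _count, dim = map(int, handle.readline().split())
    vectors = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors, dim


def embed(tokens, vectors, dim, max_len):
    """Map tokens to vectors; unknown words and padding become zero vectors."""
    zero = [0.0] * dim
    rows = [vectors.get(tok, zero) for tok in tokens[:max_len]]
    rows += [zero] * (max_len - len(rows))
    return rows


# Tiny in-memory stand-in for a real .vec file.
SAMPLE = "2 3\nhello 0.1 0.2 0.3\nworld 0.4 0.5 0.6\n"
vectors, dim = load_vec(io.StringIO(SAMPLE))
matrix = embed(["hello", "unknown"], vectors, dim, max_len=4)
```

The resulting `max_len x dim` matrix is the shape of input an LSTM layer expects for one tweet. Binary fastText models can also compose vectors for unseen words from subword units, which this simplified sketch does not attempt.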

Sub-task A

Using an additional preprocessed training set of tweets
Postprocessing with emojis set and an offensive word list
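
One plausible reading of the postprocessing step is a rule that overrides a "not offensive" prediction when the tweet contains an entry from either list. The sketch below shows that idea only. The override rule, the `OFF`/`NOT` label strings, and the placeholder list contents are assumptions for illustration, since the README does not describe how the lists are applied.

```python
# Placeholder entries; the real lists are not included in the README.
OFFENSIVE_WORDS = {"badword", "slurword"}
OFFENSIVE_EMOJIS = {"\U0001F595"}  # middle-finger emoji, as an example


def postprocess(tweet, predicted):
    """Flip a 'NOT' prediction to 'OFF' if a listed word or emoji occurs."""
    tokens = {tok.strip(".,!?").lower() for tok in tweet.split()}
    has_word = bool(tokens & OFFENSIVE_WORDS)
    has_emoji = any(emoji in tweet for emoji in OFFENSIVE_EMOJIS)
    if predicted == "NOT" and (has_word or has_emoji):
        return "OFF"
    return predicted  # model output is kept otherwise


print(postprocess("you are a Badword!", "NOT"))
```

A rule like this can only add offensive labels, never remove them, so it trades precision for recall.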

Sub-task B & Sub-task C

Using a list of ethnic slurs
Using a list of top Twitter profiles from the United States, United Kingdom, Saudi Arabia, Brazil and Spain
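
The two lists could, for example, be turned into binary lexicon features that accompany the LSTM output. The sketch below is a guess at such a setup rather than the repository's method: the feature names, the matching strategy, and the placeholder list entries are all hypothetical.

```python
# Placeholder entries; the actual lists are not part of the README.
SLURS = {"slur_a", "slur_b"}
TOP_PROFILES = {"famous person", "some club"}


def lexicon_features(tweet):
    """Return 0/1 flags for slur tokens and known profile names in a tweet."""
    text = " ".join(tweet.lower().split())  # lowercase, normalize spaces
    tokens = set(text.split())
    return {
        "has_slur": int(bool(tokens & SLURS)),
        # Substring match so that multi-word names are found as well.
        "mentions_profile": int(any(name in text for name in TOP_PROFILES)),
    }


print(lexicon_features("Famous  Person is a slur_a"))
```

A slur hit may hint at a group target and a profile-name hit at an individual target, though that mapping is an assumption here, not something the README states.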
