
7ananAhmed/Arabic-Hate-speech

 
 


Arabic-Hate-speech

This repository accompanies our work on hate speech detection in the Arabic Twittersphere. You can find the training and test sets that we used to train and evaluate our models (will be published soon). Both files (train.csv and test.csv) contain only tweets that are annotated as hate or non-hate. Each file lists tweet IDs along with their annotations (hate = 1, non-hate = 0).

You can also find the full dataset (GHSD.csv), which contains 9,316 tweets annotated as abusive = 2, hateful = 1, or normal = 0.
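As a quick sanity check on the label codes, the sketch below counts rows per class. It is only an illustration: the column names `id` and `label` are assumptions (the README does not state the CSV header), and the sample rows are made up to stand in for the real file.

```python
import csv
import io
from collections import Counter

# Label codes as stated in this README for GHSD.csv.
LABEL_NAMES = {0: "normal", 1: "hateful", 2: "abusive"}


def label_distribution(csv_file, label_column="label"):
    """Count rows per class.

    `label_column` is a hypothetical column name; adjust it to the
    actual header of the file.
    """
    reader = csv.DictReader(csv_file)
    counts = Counter()
    for row in reader:
        counts[LABEL_NAMES[int(row[label_column])]] += 1
    return counts


# Made-up sample standing in for the real file.
sample = io.StringIO("id,label\n1,0\n2,1\n3,2\n4,0\n")
print(label_distribution(sample))
```

For the real file, pass an open file handle instead of the in-memory sample, for example `open("GHSD.csv", newline="", encoding="utf-8")`.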

The repo also contains two lexicons of hate terms extracted from the dataset using two corpus-based statistical approaches: chi-square and Pointwise Mutual Information (PMI).
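To illustrate what these two scores measure, here is a minimal sketch that computes document-level PMI and the 2x2 chi-square statistic for each term against a target class. It uses the textbook formulas and is not necessarily the exact procedure used to build the lexicons in this repo. The function name `term_scores`, the tokenization, and the toy tokens are all assumptions made for the example.

```python
import math


def term_scores(docs, labels, target=1):
    """Return {term: (pmi, chi2)} for association with class `target`.

    docs   : list of token lists (one per tweet), already tokenized
    labels : list of class codes aligned with docs
    """
    n = len(docs)
    n_c = sum(1 for y in labels if y == target)  # docs in target class
    doc_freq = {}    # docs containing the term
    class_freq = {}  # docs in target class containing the term
    for tokens, y in zip(docs, labels):
        for t in set(tokens):  # count each term once per doc
            doc_freq[t] = doc_freq.get(t, 0) + 1
            if y == target:
                class_freq[t] = class_freq.get(t, 0) + 1

    scores = {}
    for t, n_t in doc_freq.items():
        a = class_freq.get(t, 0)  # term present, target class
        b = n_t - a               # term present, other classes
        c = n_c - a               # term absent, target class
        d = n - a - b - c         # term absent, other classes
        # PMI = log2( P(t, c) / (P(t) * P(c)) )
        pmi = math.log2(a * n / (n_t * n_c)) if a > 0 else float("-inf")
        # Chi-square for the 2x2 contingency table [[a, b], [c, d]]
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        chi2 = n * (a * d - b * c) ** 2 / denom if denom else 0.0
        scores[t] = (pmi, chi2)
    return scores


# Toy corpus with placeholder tokens (not taken from the dataset).
docs = [["x", "bad"], ["bad", "y"], ["x", "good"], ["good", "y"]]
labels = [1, 1, 0, 0]
for term, (pmi, chi2) in sorted(term_scores(docs, labels).items()):
    print(term, pmi, chi2)
```

Note that chi-square is symmetric: a term that appears only outside the target class scores as high as one that appears only inside it. A lexicon built from chi-square therefore needs an extra direction check (for example, keeping only terms with positive PMI).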
