fake-news-challenge

This is my first project to learn Data Science

This project is about Fake News Detection. I use the data and materials from this competition site: http://www.fakenewschallenge.org/

The purpose of the project is to try to train and predict the category of a piece of new based on the compatibility between its headline and its body. This is called as Stance Detection.

Problem statement:

Input: A headline and a body text
Output: label/decision of whether the body agrees/ disargees/ discusses/ unrelated with the topic.

Data:

File train_stances.csv has the information of stances for train data
File train_bodies.csv maps bodyID with body content
File test_stances.csv and test_bodies.csv has the same information for the test data.

I did some data cleaning, feature extraction using tf-idf and experiment with some common classification algorithms.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
Stance Detection.ipynb		Stance Detection.ipynb
competition_test_bodies.csv		competition_test_bodies.csv
competition_test_stances.csv		competition_test_stances.csv
train_bodies.csv		train_bodies.csv
train_stances.csv		train_stances.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fake-news-challenge

This is my first project to learn Data Science

Problem statement:

Data:

About

Releases

Packages

Languages

manhtuanbn12/fake-news-challenge

Folders and files

Latest commit

History

Repository files navigation

fake-news-challenge

This is my first project to learn Data Science

Problem statement:

Data:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages