Fake News Detection

My solution to a DataCup competition Leaders Prize: Fact or Fake News?.

Problem Overview

The goal is to predict the truth ratings that human fact checkers would assign to each claim in the dataset based on some related articles and the metadata associated with each claim.

Data

The dataset contains claims and the associated metadata from 9 fact checking websites. On those websites, professional fact checkers publish a truth rating for each claim with links to the related articles. The truth ratings provided were mapped to the labels:

0 (false)
1 (partly true)
2 (true)

My solution

First, the claim and the related articles are preprocessed by converting each sentence into a TF-IDF representation. The 5 sentences that have the highest cosine similarity with a claim are extracted and concatenated with the metadata.

Then, Bi-directional Encoder Representations from Transformers (BERT) is fine-tuned based on these sentences and the metadata to predict the label of each claim.

See BERT_claim_classification.ipynb.

Other attempts

Used RNN to encode claim, metadata with a Feed Forward network on top to predict the labels. See fake_news_detection_rnn.ipynb.
Used Transformer encoder (implemented from scratch) with a Feed Forward network to predict the labels. See fake_news_detection_transformer.ipynb.

Transformer implementation based on:

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
BERT_claim_classification.ipynb		BERT_claim_classification.ipynb
README.md		README.md
fake_news_detection_rnn.ipynb		fake_news_detection_rnn.ipynb
fake_news_detection_rnn.py		fake_news_detection_rnn.py
fake_news_detection_transformer.ipynb		fake_news_detection_transformer.ipynb
fake_news_detection_transformer.py		fake_news_detection_transformer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fake News Detection

Problem Overview

Data

My solution

Other attempts

About

Releases

Packages

Languages

lidiyam/fake-news

Folders and files

Latest commit

History

Repository files navigation

Fake News Detection

Problem Overview

Data

My solution

Other attempts

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages