clickbait-detector-en

Clickbait detector for English tweets, trained on the Webis-17 dataset. This repo contains three data-centric approaches built on RoBERTa (Robustly optimized BERT approach, where BERT stands for Bidirectional Encoder Representations from Transformers). More specifically, we use the XLM-RoBERTa-base version (a multilingual/cross-lingual model with 12 attention layers, trained on ~100 languages, including English) from Hugging Face.
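As a rough illustration of the setup described above, the sketch below loads `xlm-roberta-base` from Hugging Face with a two-class classification head. It is not the notebook's code: the label names, the helper names `load_detector` and `classify`, and the use of the `Auto*` classes are assumptions made for this example. The classification head is randomly initialized until the model is fine-tuned, so predictions are meaningless before training.

```python
MODEL_NAME = "xlm-roberta-base"                   # base checkpoint named in this README
ID2LABEL = {0: "no-clickbait", 1: "clickbait"}    # assumed label names


def load_detector(model_name=MODEL_NAME):
    """Return (tokenizer, model) with a 2-class head on top of XLM-RoBERTa."""
    # Imported lazily so the module can be read without transformers installed.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_name,
        num_labels=len(ID2LABEL),
        id2label=ID2LABEL,
        label2id={name: idx for idx, name in ID2LABEL.items()},
    )
    return tokenizer, model


def classify(texts, tokenizer, model):
    """Predict a label name for each tweet text in `texts`."""
    import torch

    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**batch).logits
    return [model.config.id2label[i] for i in logits.argmax(dim=-1).tolist()]
```

Calling `load_detector()` downloads the checkpoint on first use; `classify(["some tweet"], tokenizer, model)` then returns one label name per input.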

Dataset:

Webis website, or
Here and here.

Code

Up-to-date Colab notebook or here.

Results

Run	F1*	macro-F1	Acc.	macro-Acc.
	"All Dataset" (XROBERTa_clickbait)
0	0.6846	0.7914	0.846	0.7966
1	0.6738	0.7889	0.8517	0.7802
0(2)	0.3844	0.1922	0.2379	0.5
0(3)	0.3844	0.1922	0.2379	0.5
1(3)	0.6743	0.7901	0.854	0.7788
2	0.6609	0.783	0.8516	0.7678
	"Without outliers" (XROBERTa_clickbait_wo_outlier)
0	0.6664	0.7852	0.8509	0.7735
1	0.663	0.783	0.8494	0.7714
2	0.685	0.796	0.8564	0.7876
3	0.6704	0.7877	0.8524	0.7763
4	0.6653	0.7861	0.8543	0.7699
	"Annotations with strong agreement" (XROBERTa_clickbait_agree)
0	0.6677	0.7879	0.8559	0.7709
1	0.7064	0.8107	0.8683	0.7987
2^	0.7112	0.8134	0.8693	0.803
3	0.6969	0.8067	0.869	0.7879
4	0.7053	0.8104	0.8686	0.7971

(*) Binary F1 (positive class only). (^) Best result.
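To make the four columns concrete, here is a minimal sketch that computes them from true and predicted labels in plain Python. Two things are assumptions, not statements from the notebook: "macro-Acc." is read as the mean of per-class recall (balanced accuracy), and the toy data uses a 23.79% clickbait share chosen for illustration. Under those assumptions, a degenerate classifier that labels every tweet as clickbait yields the same values as runs 0(2) and 0(3), so those rows are at least consistent with a model that collapsed to a single class.

```python
def f1(tp, fp, fn):
    """F1 score from counts; 0.0 when the class is never predicted or present."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def report(y_true, y_pred):
    """Compute the four table columns for binary labels (1 = clickbait)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    f1_pos = f1(tp, fp, fn)   # F1 of the clickbait class
    f1_neg = f1(tn, fn, fp)   # F1 of the no-clickbait class
    recall_pos = tp / (tp + fn) if tp + fn else 0.0
    recall_neg = tn / (tn + fp) if tn + fp else 0.0

    return {
        "F1": f1_pos,
        "macro-F1": (f1_pos + f1_neg) / 2,
        "Acc.": (tp + tn) / len(pairs),
        "macro-Acc.": (recall_pos + recall_neg) / 2,  # assumed: balanced accuracy
    }


# Toy data (assumed class ratio): 2379 clickbait tweets out of 10000.
y_true = [1] * 2379 + [0] * 7621
y_pred = [1] * len(y_true)   # degenerate model: everything is clickbait

scores = {name: round(value, 4) for name, value in report(y_true, y_pred).items()}
print(scores)
# {'F1': 0.3844, 'macro-F1': 0.1922, 'Acc.': 0.2379, 'macro-Acc.': 0.5}
```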

Error analysis

Confusion matrix here.

Training

Here you can see the detailed hyperparameters and all results.

mmaguero/clickbait-detector-en
