Language identification of code mixed data.
-
Updated
Dec 12, 2020 - Python
Language identification of code mixed data.
Indonglish Dataset created based on Jaksel Sociolinguistic phenomenon
EACL 2021 paper (SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification)
Add a description, image, and links to the codemixed topic page so that developers can more easily learn about it.
To associate your repository with the codemixed topic, visit your repo's landing page and select "manage topics."