Skip to content

Latest commit

 

History

History
22 lines (16 loc) · 1.93 KB

README.md

File metadata and controls

22 lines (16 loc) · 1.93 KB

Twitterreko Euskal Komunitatearen Eduki Azterketa Pandemia Garaian

[EU] Hizkuntzaren Prozesamenduak eskaintzen dituen teknika ez-gainbegiratuak erabiliz Twittereko euskal komunitatean COVID-19aren pandemiak izan duen eragina aztertzea da lan honen asmoa. Azterketa hau aurrera eramateko sare sozial horretako erabiltzaileen euskarazko txioak masiboki bildu eta denboraren arabera ordenatu dira. Pandemiaren eragina neurtzeko, denboraren arabera edukia nola aldatu den aztertu da, horretarako testuetan azaltzen diren hitz zein emojien aldaketa kuantitatibo zein kualitatiboak baliatu dira. Azterketa kuantitatiboan, terminoek garai desberdinetan izan duten maiztasunaren aldaketari erreparatu zaio, maiztasunen erregresio lineala erabiliz. Azterketa kualitatiboan, hitzen bektore trinkoak baliatu dira, pandemiaren garai desberdinetan hitz eta emoji adierazgarrienek esanahian izan duten bilakaera aztertzeko.

[EN] The aim of this work is to study the impact of the COVID-19 pandemic on the Basque Twitter community using unsupervised techniques based on Natural Language Processing. In order to carry out this study, large quantities of tweets were gathered and sorted by time from Basque Twitter users. To analyze the impact of the pandemic, the variability of the content over time has been studied, through quantitative and qualitative changes in the words and emojis that appear in the texts. In the quantitative analysis, the shift at the frequency of the terms was calculated using linear regression over frequencies. In the qualitative analysis, Word Embeddings were used to study the changes in the meaning of the most significant words and emojis at different times during the pandemic.

Slopes word2vec
Joseba Fernandez de Landa
Iker García
Ander Salaberria
Jon Ander Campos

HiTZ Zentroa - Ixa, Euskal Herriko Unibertsitatea (UPV/EHU)