This dataset contains Twitter handles extracted from Wikidata. For the task of classifying handles by entity type, they are grouped into five categories: person, location, organization, product, and character.

ardax/TwikiHandles

TwikiHandles Classification Dataset

This repository contains data extracted from Wikidata via a SPARQL query on April 15, 2017. All available Twitter handles were collected, along with the Wikidata entry ID and the instance-of information of each entry.
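The original query is not included here, so the following is only a sketch of what such an extraction might look like. It assumes the Wikidata properties `P2002` (Twitter username) and `P31` (instance of) and the public Wikidata Query Service endpoint; the variable names `?item`, `?handle`, and `?type` and the helper `build_request_url` are illustrative choices, not the author's. The code only builds the request URL and does not contact the network.

```python
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# Assumed query shape: every item with a Twitter username (P2002),
# plus its instance-of values (P31) when present.
HANDLE_QUERY = """
SELECT ?item ?handle ?type WHERE {
  ?item wdt:P2002 ?handle .
  OPTIONAL { ?item wdt:P31 ?type . }
}
"""


def build_request_url(query: str, endpoint: str = WDQS_ENDPOINT) -> str:
    """Return a GET URL that asks the endpoint for JSON results."""
    return endpoint + "?" + urlencode({"query": query, "format": "json"})


url = build_request_url(HANDLE_QUERY)
```

The resulting URL could be fetched with any HTTP client. A query over all handles is large, so a real run may need paging or a `LIMIT` clause.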

The TwikiHandles dataset was created for classifying the entity type of Twitter handles in the context of the named entity recognition task. Twitter handles are grouped into the following categories: person, location, organization, product, and character. This is done by identifying which Wikidata entries correspond to these entity types and then checking whether the Wikidata entry that contains the Twitter handle is an instance of one of those entries.
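The grouping step described above can be sketched as a lookup from instance-of values to categories. This is a minimal illustration, not the code used to build the dataset: the type table below is a small assumed sample (`Q5` human, `Q515` city, `Q6256` country, `Q43229` organization), the function `classify_entry` and the example handles are hypothetical, and the product and character categories would need their own type IDs, which are not listed in this README.

```python
from typing import Dict, Iterable, List, Optional, Tuple

# Assumed sample mapping from Wikidata type IDs to dataset categories.
# The full type lists used for TwikiHandles are not given in this README.
TYPE_TO_CATEGORY: Dict[str, str] = {
    "Q5": "person",            # human
    "Q515": "location",        # city
    "Q6256": "location",       # country
    "Q43229": "organization",  # organization
}


def classify_entry(
    instance_of: Iterable[str],
    type_to_category: Dict[str, str] = TYPE_TO_CATEGORY,
) -> Optional[str]:
    """Return the category of the first recognized instance-of value, or None."""
    for type_id in instance_of:
        category = type_to_category.get(type_id)
        if category is not None:
            return category
    return None


# Hypothetical (handle, instance-of IDs) records for illustration only.
records: List[Tuple[str, List[str]]] = [
    ("example_person", ["Q5"]),
    ("example_city", ["Q515"]),
    ("example_unknown", ["Q999999999"]),
]

labels = {handle: classify_entry(types) for handle, types in records}
```

Entries whose instance-of values match none of the listed types get `None` and would be left out of the labeled set. This sketch checks direct instance-of values only and does not follow subclass relations.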
