Skip to content

Latest commit

 

History

History
28 lines (15 loc) · 927 Bytes

README.md

File metadata and controls

28 lines (15 loc) · 927 Bytes

wiktionary

Doing Wiktionary things

Scripts

  • parse-wiktionary.py

SAX-based XML parser for Wiktionary dumps

See Parsing Wiktionary XML dumps

  • postgresql_v1

Import Wiktionary dump into PostgreSQL database (single table)

See Importing Wiktionary XML dumps into PostgreSQL database

  • postgresql_v2

Import Wiktionary dump into PostgreSQL database (sites and namespaces)

Wiktionary Datamodel

See Importing Wiktionary XML dumps into PostgreSQL database, again

  • extract-reconstruction.sh

Extract reconstructed languages from en.wiktionary dump