The dataset contains editions from the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFS were obtained from the Vuk'uzenzele website.
language dataset african-languages south-africa nlproc africanlp africannlp aldlf african-language-data-liberation-front dsfsi-datasets
-
Updated
Dec 6, 2023 - Jupyter Notebook