Czech Subjectivity Dataset

This is the repository for the newly created Czech Subjectivity Dataset (Subj-CS) and our paper:

Czech Dataset for Cross-lingual Subjectivity Classification

Accepted to the LREC 2022 conference.

Dataset Download:

The Czech Subjectivity Dataset is available for download at https://drive.google.com/file/d/1R0bPPWJ7sdIaCxyPrO_rmTVFNNsd9RaI/view?usp=sharing

The dataset is also available on HuggingFace Datasets.
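
For scripted downloads, the Google Drive share link above can be converted into a direct-download URL. The snippet below is a minimal sketch, not part of this repository: the helper names `drive_file_id` and `direct_download_url` are hypothetical, and the `uc?export=download&id=...` pattern is an assumption about how Google Drive currently serves files (larger files may still require a confirmation step in the browser).

```python
import re

# Share link copied from the "Dataset Download" section of this README.
SHARE_URL = (
    "https://drive.google.com/file/d/"
    "1R0bPPWJ7sdIaCxyPrO_rmTVFNNsd9RaI/view?usp=sharing"
)


def drive_file_id(share_url: str) -> str:
    """Extract the file ID from a Google Drive '/file/d/<id>/' share link.

    Hypothetical helper for illustration only.
    """
    match = re.search(r"/file/d/([A-Za-z0-9_-]+)", share_url)
    if match is None:
        raise ValueError(f"not a Google Drive file link: {share_url}")
    return match.group(1)


def direct_download_url(share_url: str) -> str:
    """Build a direct-download URL (assumed Google Drive URL pattern)."""
    return (
        "https://drive.google.com/uc?export=download&id="
        + drive_file_id(share_url)
    )


if __name__ == "__main__":
    # Print the URL so it can be passed to a download tool of your choice.
    print(direct_download_url(SHARE_URL))
```

The script only builds the URL and does not perform the download, so it runs without network access.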

Usage:

We will add usage and setup soon.

python3 baseline.py...

Setup:

Create the conda environment:

  1. Clone the GitHub repository

    git clone git@github.com:pauli31/czech-subjectivity-dataset.git
    
  2. Set up conda

  3. Set up the data

License:

The dataset and code can be freely used for academic and research purposes. It is strictly prohibited to use the dataset for any commercial purpose.

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Publication:

If you use our dataset or software for academic research, please cite our paper:

@inproceedings{priban-steinberger-2022-czech,
    title = "{C}zech Dataset for Cross-lingual Subjectivity Classification",
    author = "P{\v{r}}ib{\'a}{\v{n}}, Pavel  and
      Steinberger, Josef",
    booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
    month = jun,
    year = "2022",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://aclanthology.org/2022.lrec-1.148",
    pages = "1381--1391",
}

Contact:

pribanp@kiv.zcu.cz

http://nlp.kiv.zcu.cz