- Crawler:
  - requests
  - lxml
  - bs4
  - tqdm
- Training:
  - torch
  - transformers (3.5.1)
  - wandb
- Web:
  - flask
https://ckip.iis.sinica.edu.tw/service/gpt2/
Dataset: Romance of the Three Kingdoms (三國演義)
Source: Wikisource
python crawler.py
The crawled data is saved to ./data.
The first chapter is held out as evaluation data; the remaining chapters are used for training.
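
The split described above can be sketched as follows. This is only an illustration, not the code in crawler.py: the function name `split_chapters` and the in-memory list of chapter strings are assumptions, and the real on-disk layout under ./data may differ.

```python
from typing import List, Tuple


def split_chapters(chapters: List[str]) -> Tuple[List[str], List[str]]:
    """Hold out the first chapter for evaluation and train on the rest.

    `chapters` is assumed to be ordered as in the book (hypothetical input).
    """
    if len(chapters) < 2:
        raise ValueError("need at least two chapters to make a split")
    eval_chapters = chapters[:1]   # first chapter only
    train_chapters = chapters[1:]  # everything after it
    return train_chapters, eval_chapters


# Placeholder strings standing in for the crawled chapter text.
chapters = ["chapter 1 text", "chapter 2 text", "chapter 3 text"]
train, evaluation = split_chapters(chapters)
```

Holding out a whole chapter, rather than random lines, keeps the evaluation text contiguous and fully disjoint from the training text.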
CUDA_VISIBLE_DEVICES=0 ./train.sh
The trained model is saved to ./output.
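
`CUDA_VISIBLE_DEVICES=0` restricts the training process to the first GPU. If you would rather launch training from Python than from the shell, a minimal sketch might look like the one below. The helper names `gpu_env` and `run_training` are hypothetical and not part of this repository; only `./train.sh` and the environment variable come from the command above.

```python
import os
import subprocess
from typing import Dict, Optional


def gpu_env(gpu_ids: str, base: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Return a copy of the environment with CUDA_VISIBLE_DEVICES set.

    `gpu_ids` is a comma-separated string such as "0" or "0,1".
    The `base` mapping (default: the current environment) is not modified.
    """
    env = dict(os.environ if base is None else base)
    env["CUDA_VISIBLE_DEVICES"] = gpu_ids
    return env


def run_training(gpu_ids: str = "0", script: str = "./train.sh") -> int:
    """Run the training script with only the given GPUs visible (hypothetical helper)."""
    completed = subprocess.run([script], env=gpu_env(gpu_ids), check=True)
    return completed.returncode
```

Calling `run_training("0")` should behave like the shell command above; passing `"0,1"` would expose two GPUs, though whether train.sh makes use of more than one is not something this sketch assumes.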