Commit
Use deep learning .. woohoo (#28)
* chore: update baselines
* refactor: make filter_tab reusable
* refactor: remove useless results
* docs: update todo
* feat: support tokenization. But! Don't do that ... it doesn't work well
* feat: RNN training
* feat: same eval as train_crf, results not too bad
* refactor: reorganize 1
* fix: do cross validation properly
* feat: add (scaled) vector features
* fix: slightly better scaling
* feat: more features
* docs: record of scores
* fix: featnames redundant
* feat: use the knowledge based features too
* fix: use 4 dimensions
* feat: embed things and report
* feat: take best model
* fix: scale by page height for robustness
* feat: 64
* feat: weight up B
* fix: smaller
* fix: same width for all embeddings is best
* docs: some results
* docs: more todo
* docs: todo
* refactor: refactor
* feat: CRF output layer, working finally!
* fix: horrible python error
* fix: lower lr
* feat: more parameters
* docs: scores
* docs: more todos
* refactor: move CSV without PDF to a subdirectory
* chore: retrain
* feat: derp lerning for the win!
* feat: refactor out rnn/rnncrf stuff and add dimensions
* feat: parameters (and best results of search)
* docs: todo
* feat: better updates
* feat: data processing for LayoutLM
* fix: Figure has no business being there
* fix: normalize box
* fix: ensure box
* fix: remove bogus line
* fix: make repeatable so it does not seem like random crashing!
* fix: fix some errors
* fix(test): fix test
* fix: dropout not useful
* chore: update scores
* feat: enable test mode for rnn
* docs: todo
* fix: poutyne removal
* feat: RNN/CRF equivalence for segmentation
* fix: handle the case where there is no zoning
* fix: no need for batch in predict
* feat: do not early stop by default
* feat: cross validation
* docs: cross validation layoutlm
* feat: try to weight
* feat: weight labels
* chore: updates
* feat: support bonly, tonly, iobonly
* feat: use tonly
* docs: various only results
* docs: more scores
* Revert "fix: Figure has no business being there" This reverts commit 27a39e1.
* refactor: patches to patches
* fix: better crf-rnn with allennlp
* feat: try label weights
* feat: make train_rnn_crf work with acc/f1 and weights
* feat: synchronize train_rnn_crf with train_rnn
* feat: standardize rnn and rnncrf scripts
* feat: test with majority vote rnn
* fix: label weights are exponential it seems
* fix: weight transitions too (it is better)
* docs: minor updates
* docs: scores for best rnn-crf
* fix: no need for separate test_rnn_crf
* feat: add voting for CRF
* feat: reuse RNN code
* feat: enable --labels bonly and decoding
* feat: initialize RNN-CRF from RNN (helps a lot)
* feat: train and support RNN and RNN-CRF
* feat: add rnn+crf training
* chore: retrain
* feat: workflow (will it work...flow?)
* fix: format and lint
Showing 71 changed files with 20,905 additions and 480 deletions.