Hypothesis testing with approximate randomisation
Approximate randomisation is a significance testing approach suitable for NLP problems.
Randomisation tests are just as good as analytical approaches such as the t-test when the assumptions of the latter hold, and they are preferable when those assumptions are not met. They are also quite simple to implement.
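To illustrate the idea, here is a minimal sketch of a paired, two-sided approximate randomisation test in plain Python. It is not this package's API: the function name `approximate_randomisation`, its parameters, and the choice of test statistic (absolute difference of summed scores) are assumptions made for the example. Under the null hypothesis the two systems are interchangeable, so swapping their scores on any item (equivalently, flipping the sign of the paired difference) should not matter.

```python
import random


def approximate_randomisation(scores_a, scores_b, trials=10000, seed=0):
    """Estimate a two-sided p-value for paired per-item scores.

    Hypothetical helper for illustration only; not part of any library.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both systems must be scored on the same items")

    rng = random.Random(seed)
    # Swapping a pair (a, b) is the same as negating the difference a - b.
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    observed = abs(sum(diffs))

    at_least_as_extreme = 0
    for _ in range(trials):
        # Randomly swap each pair with probability 0.5.
        shuffled = sum(d if rng.random() < 0.5 else -d for d in diffs)
        if abs(shuffled) >= observed:
            at_least_as_extreme += 1

    # Add-one smoothing keeps the estimate strictly above zero.
    return (at_least_as_extreme + 1) / (trials + 1)


if __name__ == "__main__":
    system_a = [1, 1, 0, 1, 1, 1, 0, 1, 1, 1]
    system_b = [0, 1, 0, 0, 1, 0, 0, 1, 0, 1]
    print(approximate_randomisation(system_a, system_b))
```

A small p-value suggests that the observed difference is unlikely to arise from random assignment of outputs to systems. More trials give a finer-grained estimate at higher cost.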
pip install randhy
- William Morgan, Statistical Hypothesis Tests for NLP - Stanford Computer Science (slides)
- Wassily Hoeffding. 1952. The Large-Sample Power of Tests Based on Permutations of Observations. Annals of Mathematical Statistics, 23, 169–192.
- Eric W. Noreen. 1989. Computer Intensive Methods for Testing Hypotheses. John Wiley & Sons.