f7f69003f56d
1 2 3 4
Datafiles for ucto, the rule-based tokenization package that is used to parse texts in different languages. WWW: https://languagemachines.github.io/ucto/