9 projects
bicleaner-ai
Parallel corpus classifier, indicating the likelihood of a pair of sentences being mutual translations or not (neural version)
bicleaner-hardrules
Pre-filtering step for obvious noise based on rules, poor language based on general language modelling and vulgar language based on specific language modelling
bicleaner
Parallel corpus classifier, indicating the likelihood of a pair of sentences being mutual translations or not
monocleaner
Monolingual corpus fluency filter
bifixer
bicleaner-ai-glove
glove-python fork for bicleaner-ai
loomchild-segment
Python wrapper for Loomchild segmenter
doommoses
DoomMoses
binonymizer
Binonymizer is a tool in Python that aims at tagging personal data in a parallel corpus.