iCoSys is proud to present #SwissCrawl, the largest Swiss German text corpus to date!

The corpus was built by Lucy Linder under the supervision of Jean Hennebert and Andreas Fischer. It comprises more than half a million sentences, collected with a customized web scraping tool that could also be applied to other low-resource languages.

Want to inspect the code? Click here.

Want to know a bit more about how the corpus was built? Read the arXiv paper here.

And/or read the #LREC2020 paper here.

 

11 June 2020