iCoSys is proud to present #SwissCrawl, the largest Swiss German text corpus to date!

The tool was built by Lucy Linder with the supervision of Jean Hennebert & Andreas Fischer and is composed of more than half a million sentences, which were generated using a customized web scraping tool that could be applied to other low-resource languages as well.

Want to inspect the code? Click here.

Want to know a bit more about the proceedings? Read the arXiv paper here.

And/or read the #LREC2020 paper here.

 

11 juin 2020

Proseguendo la navigazione su www.heia-fr.ch e i siti connessi, acconsentire l'utilizzo dei cookies al fine di migliorare l’esperienza di navigazione e di  stabilire statistiche di visite. Leggere le avvertenze legali