iCoSys is proud to present #SwissCrawl, the largest Swiss German text corpus to date!

The corpus was built by Lucy Linder under the supervision of Jean Hennebert and Andreas Fischer. It contains more than half a million sentences, collected with a customized web scraping tool that could be applied to other low-resource languages as well.

Want to inspect the code? Click here.

Want to know a bit more about how the corpus was built? Read the arXiv paper here.

And/or read the #LREC2020 paper here.

 

11 June 2020
