iCoSys is proud to present #SwissCrawl, the largest Swiss German text corpus to date!

The corpus was built by Lucy Linder under the supervision of Jean Hennebert and Andreas Fischer. It comprises more than half a million sentences, collected using a customized web scraping tool that could be applied to other low-resource languages as well.

Want to inspect the code? Click here.

Want to know a bit more about the process? Read the arXiv paper here.

And/or read the #LREC2020 paper here.

 

11 June 2020
