Bicrawler, Gema Ramirez (Prompsit)

Preview:

Citation preview

Bicrawler:create bitexts from multilingual websitesBy Gema Ramírez/Prompsit

Translations are our best friends: we learn from them, reuse them, exploit them…

But we lack translations!!

More data? REALLY???

Deep learning is data-hungry!!

Not enough for some languages and domains…

Automotive in en-ar UN corpus?

No translations found

But transla-tions are…

…out there

There is plenty of multilingual contentLet’s get it!!!Lurking in the world wide web

All I want is to get translations from a website!!!

Arrastre la imagen al marcador de posición o haga clic en el icono para agregar

Thanks!

Bicrawler: create bitexts from multilingual websites Gema Ramírez/Prompsit