Curva Fin Bloque
NEWS 31 MARCH, 2021

Pangeanic’s machine translation chosen as the main tool for a multilingual Europeana

Pangeanic’s machine translation chosen as the main tool for a multilingual Europeana

CEF chooses the Europeana Translate project so that Pangeanic’s ECO platform (machine translation, anonymization, etc.) can translate the content and metadata of over 25 million records available in the European digital library.

In the year 2000, the digital preservation of the heritage of the European Community was launched, which led to the creation of Europeana. The digitization of millions of documents provided by renowned cultural institutions from the 27 member states of the European Union sets out the aim of facilitating access to Europe’s cultural and scientific heritage.

 

Get to know Europeana Translate

The platform is currently available in 30 languages of Europeana’s cultural community, which are English, Spanish, German, French, Portuguese, Bosnian, Bulgarian, Catalan, Czech, Danish, Slovakian, Slovenian, Estonian, Finnish, Greek, Dutch, Hungarian, Irish, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Romanian, Russian, Swedish, Ukrainian, and Basque. It also stores more than 25 million documents in 45 different languages.

La biblioteca digital europea contiene más de 25 millones de registros a traducir

As a result of this, Europeana Translate was born. In order to provide access to all of this cultural heritage and remove the source language barriers, this project aims to connect the digital service infrastructures (DSI) of the platform with those of machine translation.

As a result, the project will translate the metadata of these millions of records available on Europeana into English and will send them back to the Europeana core service platform as enrichments.

 

PangeaMT, developers of  customized machine translation engines

Pangeanic, as a technology partner of the project, will contribute by bringing the experience it already gained in the customization of machine translation engines from the previous NTEU project for European Public Administrations. The machine translation engines were developed by Pangeanic’s technology division, PangeaMT. The aim of the project is to achieve translations of the cultural content and metadata with a KPI close to human parity (90%), which will be verified by the national content managers and annotators themselves. This will enable hundreds of millions of words to be translated in a scalable way in the future and will ensure that Europe’s cultural content is available.

La biblioteca digital europea contiene más de 25 millones de registros a traducirLos motores de traducción automática de Pangeanic, traducirán documentos de más de 45 idiomas distintos

As a team of language technology professionals, Pangeanic works on a daily basis on the implementation of language processing tools that allow to structure data so that humans or machines can extract actionable insights. These tools allow companies and institutions from all industries (law, finance, culture and tourism, among many others) to improve their services by gaining greater knowledge and saving time and resources when managing their projects.

Deep Adaptive technology for machine translation allows to clone engines that learn from content previously generated by the user, or similar content, and that imitate vocabulary and style. Each learning level produces deep learning algorithms that enable the data to be weighted and allow the engines to become a fundamental tool for those users that have specialized terminology and/or large-scale language content generation or processing needs.

This technology will help Europeana, which is currently in its fourth DSI operational cycle, to overcome language barriers and be able to offer European citizens and institutions access to its content according to their multilingual needs.

This project falls within the framework of Europe’s commitment to becoming one of the most competitive and dynamic knowledge-based economies. With just one click, it offers the world a cultural background that includes books, paintings, films, audio material, maps and newspapers, as well as other kinds of highly valuable records that soon we will be able to enjoy and understand with no restrictions.

CTA_Pangeanic sponsor MT summit 2021

 

Where we are

USA

Boston

One Boston Place
Suite 2600
Boston MA 02108
(617) 621-4084
boston@pangeanic.com

New York

228 E 45TH St Rm 9E
10017-3337 New York, NY

info@pangeanic.com  

Europe

Valencia

Pangeanic Headquarters

Av. Cortes Valencianas, 26-5,

Ofi 107

46015 Valencia (Spain)

(+34) 96 333 63 33
info@pangeanic.com

London

Flat8, 279 Church Road,
Crystal Palace
SE19 2QQ
United Kingdom
+44 203 5400 256

london@pangeanic.net

Madrid

Atrium
Castellana 91
Madrid 28046
Spain
(+34) 91 326 29 33
info@pangeanic.com

Asia

Hong Kong

21st Floor, CMA Building
64 Connaught Road Central
Hong Kong
Toll Free: +852 2157 3950
info@pangeanic.hk

Tokyo

Ogawa Building 3F

3-37 Kanda Sakuma-cho

Chiyoda-ku, Tokyo

101-0025

tokyo@pangeanic.net

Shanghai

Tomson Commercial Building,
Room 316-317
710 Dong Fang Road
Pu Dong, Shanghai 200122, China

shanghai@pangeanic.net