Founder and CEO

Manuel Herranz

Manuel Herranz is the founder and CEO of Pangeanic, a European language technology and artificial intelligence company specializing in multilingual AI, AI data operations (data-for-AI), machine translation workflows, anonymization, model adaptation and sovereign AI deployments. He has been involved in the creation of some European LLMs (Spanish ALIA and low-resourced Catalan Salamandra) with datasets and managing the testing, alignment, bias detection, improving the instruct and other tasks.

His work sits at the intersection of language, data, software and public infrastructure: from early machine translation systems and multilingual corpora to AI data services, model alignment, human feedback and safe deployment in regulated environments.

Contact Pangeanic View research and publications Read Manuel’s essays

Profile

From multilingual technology to operational AI systems

Manuel Herranz founded Pangeanic in Valencia in 2000 as BI Europa, the subsidiary of the now extinct B.I Corporation of Japan, acting as European Director. He joined B.I Corporation after a career shaped by engineering at Gifdings & Lewis / Ford Valencia, Rolls Royce Industrial & Marine (co-generation projects), languages and the practical demands of multilingual communication.

What began as a language services company evolved into a research-driven technology organization working on machine translation, neural language technologies, AI data operations, anonymization and enterprise AI systems.

His work has followed a consistent thread: language technologies only create value when they can be deployed in real organizations, under real constraints, with reliable data, clear governance, human supervision and measurable quality. That operational view has guided Pangeanic’s transition from translation automation to multilingual AI infrastructure.

Multilingual AI

Pangeanic’s work covers machine translation, multilingual data provision to large AI Labs, language resources, model adaptation, AI evaluation and language-aware enterprise systems across European and global languages. He personally proposed four back-to-back language technology EU projects (iADAAPTA/MT-Hub, NEC TM, NTEU and MAPA) and has partnered in many others, including ELE (European Language Equality), for Europeana, and recently, Mosaic Media with several European broadcasters.

AI data operations

Manuel’s current focus includes the data layer behind AI: sourcing, cleaning, annotation, human review, evaluation, anonymization, feedback and continuous improvement.

Sovereign deployment

Pangeanic develops technologies for organizations that need control over data, infrastructure, privacy, language coverage and AI behavior in regulated or sensitive environments.

Work at Pangeanic

Building the data and language layer for enterprise AI

Under Manuel Herranz’s leadership, Pangeanic has moved from early statistical and neural machine translation work to a broader AI portfolio that includes data for AI, multilingual datasets, human feedback, model alignment, MT quality estimation, anonymization and private AI deployments.

Area	Strategic role	Related Pangeanic capability
AI data operations	The managed service layer that turns raw, licensed or collected data into usable material for training, evaluation, adaptation and governance.	Data for AI and AI Data Operations
Multilingual datasets	Text, speech, audio, image, video, OCR, parallel corpora, instruction data and evaluation data for multilingual AI systems.	Datasets for AI
Human feedback and alignment	Expert review, preference ranking, evaluation, RLHF workflows and behavioral refinement for domain-specific and multilingual models.	Model Alignment and RLHF
Evaluation and QA	Benchmark design, multilingual quality assessment, model comparison, regression testing and release validation for AI systems.	Evaluation and AI QA
Privacy and anonymization	Secure processing of sensitive content through multilingual anonymization, data masking and controlled deployment models.	Data Masking
Adaptive translation	Domain-aware machine translation, terminology control, style adaptation and quality estimation for enterprise multilingual workflows.	Deep Adaptive AI Translation

Trajectory

A language technology path built over two decades

Pangeanic’s development reflects a broader change in the language technology industry: from translation memories and statistical machine translation to neural systems, multilingual AI data, custom model adaptation and AI systems that require continuous evaluation.

2000

Pangeanic founded in Valencia

Manuel Herranz founded B.I Europa as a subsidiary of B.I Corporation, rebranding as Pangeanic in 2005. During these early years, Pangeanic was a multilingual company combining language expertise with an early interest in software, automation and machine translation.

2010s

From translation automation to machine translation technology

Pangeanic expanded its work in machine translation, corpora, customization, data selection and neural language technologies, building the technical foundations for later AI data and model adaptation services.

EU projects

European digital language infrastructure

Pangeanic has participated in European projects involving multilingual language technologies, anonymization, machine translation, cultural heritage access and public-sector digital infrastructure.

2020s

AI data, anonymization and model alignment

The company’s portfolio expanded into AI data services, multilingual anonymization, human feedback, model alignment, evaluation, quality estimation and private AI deployment.

Today

Building sovereign AI systems with European values and Ethical Data

Manuel’s current work is centered on production AI systems that combine data governance, multilingual depth, human supervision, secure deployment and operational reliability.

Research provenance

Applied research with commercial discipline

Pangeanic’s technology direction has been shaped by years of multilingual research, public-sector deployment, European project work and production use cases. The company’s research trail connects machine translation, data selection, anonymization, evaluation, speech resources, multilingual datasets and model alignment.

25+

years of multilingual language technology and AI deployment work

200+

Languages supported in production-grade multilingual workflows

10×

selected for EU-funded AI and language technology projects

data, alignment, anonymization, evaluation and deployment infrastructure

View research and publications View EU-funded projects See BSC model alignment use case

Thought leadership

Essays on language, data and artificial intelligence

Manuel Herranz writes about the industrial shift from generic AI demonstrations to operational systems: the role of data-for-AI, RL with human feedback, multilingual evaluation, small task-specific models, governance and the changing nature of language in the age of machine-mediated communication.

Language and AI

Reflections on how large language models, translation systems and multilingual AI are changing the way organizations communicate, synthesize knowledge and operate across languages.

Data as infrastructure

Analysis of the data factories behind AI systems: collection, licensing, annotation, evaluation, feedback, governance and the quiet labor that determines model quality.

Sovereign AI

Commentary on European AI infrastructure, private deployment, public-sector systems, regulatory constraints and the need for organizations to control their data and models.

Read Manuel’s essays at manolito.info Read Pangeanic blog

Selected areas of work

These pages provide the commercial, technical and research context for Pangeanic’s current work in multilingual AI and data-centric deployment.

Data for AI

Data services for AI systems that need to work in the real world

Sourcing, licensing, cleaning, annotation, review, evaluation, anonymization and governance for production AI systems.

Datasets for AI

Multilingual, multimodal and domain-specific datasets

Text, speech, audio, image, video, OCR, parallel corpora, instruction data and evaluation data for AI builders.

AI Data Operations

The operating layer between data, models and deployment

Continuous data preparation, annotation, feedback, quality control and governance for AI systems.

Model Alignment

Human feedback and RLHF for controlled model behavior

Preference data, expert review, multilingual evaluation and alignment workflows for domain-specific AI models.

Evaluation and QA

Testing AI systems before they reach users

Benchmark design, multilingual QA, model comparison, regression testing and release validation.

Research

Research and publications behind Pangeanic’s technology

Academic, European and applied research connecting multilingual data, translation, evaluation and anonymization.

Speaking and commentary

Available for selected interviews, panels and executive briefings

Manuel Herranz regularly contributes to global conversations about multilingual AI, data-centric AI, AI governance, public-sector language infrastructure, the future of translation, model alignment and enterprise deployment. For media, research, institutional or partnership inquiries, please contact Pangeanic.

Topics

Multilingual AI, AI data operations, model alignment, sovereign AI, machine translation, data anonymization, evaluation and public-sector AI deployment.

Audiences

Enterprises, governments, research institutions, AI labs, language technology organizations, media and European digital infrastructure forums.

Format

Executive briefings, interviews, panels, conference talks, webinars, strategic commentary and thought leadership articles.

Discuss multilingual AI, AI data operations or sovereign deployment

Pangeanic works with enterprises, AI labs and public institutions that need reliable multilingual AI systems, governed data pipelines, human feedback, evaluation and private deployment options.

Contact Pangeanic Explore Data for AI