Meta unveils Omnilingual AI that understands over 1,600 languages, including many Indian dialects The Mainstream

Meta has launched Omnilingual ASR (Automatic Speech Recognition), a breakthrough AI framework capable of transcribing speech into text in more than 1,600 languages. Developed by Meta’s Fundamental AI Research (FAIR) team, the system marks a major leap in global language coverage, including support for around 500 lesser-known and low-resource languages — some being recognised by AI for the first time.

The company has also introduced the Omnilingual ASR Corpus, a large dataset featuring transcribed audio from 350 underserved languages. Alongside this, Meta unveiled Omnilingual wav2vec 2.0, a multilingual speech model that powers the new system and enables accurate transcription across a vast range of linguistic contexts.

Omnilingual ASR expands significantly beyond traditional speech-to-text systems, which typically support only a few dozen major languages. With this advancement, Meta aims to bring speech recognition, translation, and voice-enabled AI tools to communities whose languages have long been excluded from mainstream technology. The framework already includes regional Indian languages such as Awadhi, Maithili, Chhattisgarhi, and Tulu, making digital communication more inclusive for millions of speakers.

One of the system’s key strengths is its ability to learn new languages with minimal data. Rather than relying on massive labelled audio datasets, Omnilingual ASR can adapt using just a small set of paired audio and text samples. This feature is particularly valuable for languages with limited digital or written resources, allowing them to become part of modern speech technologies for the first time.

Built on Meta’s wav2vec 2.0 model, scaled up to 7 billion parameters, the framework applies techniques similar to large language models (LLMs). This enables the system to produce accurate transcriptions, even from limited examples, and makes it easier to scale to new linguistic contexts. The design also supports both lightweight versions for low-power devices and advanced models for high-performance applications.

Currently, Omnilingual ASR is available as an open research model under the Apache 2.0 license, allowing developers and researchers worldwide to freely use, adapt, and build upon it.

While Meta has not yet announced when this technology will be integrated into its consumer products, it could soon power voice-based features across WhatsApp, Instagram, and Meta AI, including voice message transcription and automatic captions.

Also read: Viksit Workforce for a Viksit Bharat

Do Follow: The Mainstream formerly known as CIO News LinkedIn Account | The Mainstream formerly known as CIO News Facebook | The Mainstream formerly known as CIO News Youtube | The Mainstream formerly known as CIO News Twitter

About us:

The Mainstream formerly known as CIO News is a premier platform dedicated to delivering latest news, updates, and insights from the tech industry. With its strong foundation of intellectual property and thought leadership, the platform is well-positioned to stay ahead of the curve and lead conversations about how technology shapes our world. From its early days as CIO News to its rebranding as The Mainstream on November 28, 2024, it has been expanding its global reach, targeting key markets in the Middle East & Africa, ASEAN, the USA, and the UK. The Mainstream is a vision to put technology at the center of every conversation, inspiring professionals and organizations to embrace the future of tech.

Top 5 This Week

Axis Bank outlines six-point GenAI roadmap to drive growth and efficiency

Telangana ramps up life sciences and AI investments with USD 25 billion target

PSBs accelerate local officer hiring and language readiness after Finance Minister raises concerns

AI Impact Summit 2026 Day 2: Policy push, health tech launches & call for inclusive AI

RBI Governor Sanjay Malhotra urges MSMEs to adopt formal and digital practices

Related News

Steam Deck OLED faces stock outage in US amid AI-driven memory crunch

Google may bring Gemini split-screen support to regular smartphones

Apple rolls out encrypted RCS messaging in iOS 26.4 developer beta

Flapping Airplanes raises $180M to rethink how AI models learn

Motorola Moto G86 Power review highlights massive battery with practical compromises

iPhone 18 Pro series tipped to bring major design, camera and battery upgrades

Meta unveils Omnilingual AI that understands over 1,600 languages, including many Indian dialects

LEAVE A REPLY Cancel reply

Popular Articles

Axis Bank outlines six-point GenAI roadmap to drive growth and efficiency

Telangana ramps up life sciences and AI investments with USD 25 billion target

PSBs accelerate local officer hiring and language readiness after Finance Minister raises concerns

AI Impact Summit 2026 Day 2: Policy push, health tech launches & call for inclusive AI

RBI Governor Sanjay Malhotra urges MSMEs to adopt formal and digital practices

Latest Articles

How dining spaces are becoming the new differentiator for GCCs in India

Thales inaugurates R&D centre in Bengaluru to strengthen global innovation network

Sanofi plans major expansion of Hyderabad GCC

Most Popular

Computer Security Day 2026: Why Protecting Your Devices Matters More Than Ever

The Mainstream: Leading Digital Advertising Company in Mumbai Powering India’s AdTech Growth

The Future of Advertising: How the Modern Ad Tech Company in India Uses AI and Data to Win

Subscribe

Subscribe to newsletter