Ministry of Science & Technology
azadi ka amrit mahotsav

PARLIAMENT QUESTION: ROLE OF BHARATGEN AI

Posted On: 05 FEB 2026 3:23PM by PIB Delhi

BharatGen is the first government supported national initiative to develop a range of sovereign foundational AI models tailored to Indian languages and societal contexts. It spans multiple modalities, including text (via Large Language Models), speech (Text-to-Speech and Automatic Speech Recognition), and vision-language systems.

Currently, BharatGen’s AI models support 15 Indian languages which include Hindi, Assamese, Bengali, Gujarati, Kannada, Maithili, Malayalam, Marathi, Nepali, Oriya, Punjabi, Sanskrit, Sindhi, Tamil and Telugu. Soon, all 22 scheduled Indian languages will be covered.

BharatGen has released domain specific fine-tuned models for Ayurveda (Ayur Param), Indian agriculture (Agri Param) and Indian legal domain (Legal Param). In addition, all BharatGen models (text, speech and vision) are useful for applications across healthcare, agriculture, education and governance.

Two Technology Innovation Hubs namely TIH Foundation for IoT and IoE, IIT Bombay and IITM Pravartak Technologies Foundation, IIT Madras under National Mission on Interdisciplinary Cyber-Physical Systems (NM-ICPS) of Department of Science and Technology (DST) are currently active as part of BharatGen network.

The following institutions are a part of the BharatGen consortium:

Institution Name

Role in BharatGen

Indian Institute of Technology, Bombay

Lead institution, guiding research and integration across consortium partners

International Institute of Information Technology, Hyderabad

Vision-language document modeling

Indian Institute of Technology, Madras

Speech foundation model development and evaluation

Indian Institute of Technology, Kanpur

Legal AI research, domain-specific datasets, and developing tokenization strategies for multilingual models

Indian Institute of Technology, Hyderabad

Advanced tokenization and vocabulary optimization for large multilingual LLMs

Indian Institute of Technology, Mandi

Inclusive multilingual model development and research on efficient training strategies for LLMs

Indian Institute of Management, Indore

Bharat-centric evaluation and benchmarking of LLMs, multilingual and multimodal data collection

 

*****


NKR/FK


(Release ID: 2223738) Visitor Counter : 135