PROJECTS > SPONSORED

Text-to-speech synthesizer in nine Indian languages

Funding agency: Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) GmbH

Speech Recognition in Agriculture and Finance for the Poor in India

Funding agency: Bill & Melinda Gates Foundation

This project aims to address the gap between technological advancements and clinical intervention tools by developing automatic/semi-automatic analytical tools for speech and voice data that can be used to assist in the objective, quantitatively based diagnosis and treatment of speech and language difficulties associated with Autism Spectrum Disorder (ASD) and make it accessible to individuals of a variety of linguistic, cultural, and socioeconomic backgrounds in India.


Project Members: Dr. Prasanta Kumar Ghosh, Dr. BK Yamini, Prof. Shrikanth (Shri) S. Narayanan, Dr. Christina Hagedorn, Mr. Aravind Illa
'English Gyani' aims at improving one's English comprehension and reading skill with guidance in vernacular to cater to the increasing demand of English learning particularly for employment and promotions of individuals under-resourced to avail English learning materials or schools. The lessons are categorized to improve the vocabulary, grammar and sentence construction skills as parts of English comprehension.

Learn more about the project

Project Members: Dr. Prasanta Kumar Ghosh, Mr. Chiranjeevi Yarra, Mr. Ananth Nagraj, Mr. Ganesh Gopalan

Speech based Neuro-degenerative Diseases Monitoring

Funding agency: DST, Govt. of India

One of the typical symptoms of neuro-degenerative diseases like Amyotrophic Lateral Sclerosis (ALS), Parkinson's Disease (PD) and Alzheimer's Disease is dysarthria, that is, difficulty in speech production. This project aims to develop robust algorithms for early detection and monitoring of neuro-degenerative diseases using speech cues and for enhancement of the naturalness and intelligibility of the dysarthric speech.


Project Members: Jhansi Mallela, Tanuka Bhattacharjee, Aravind Illa
This project aims to develop a robust keyword spotting system for VoIP speech in real time which involves several unique challenges that demand very fast processing of concurrent sessions, high accuracy and minimal false alarms in the outputs, handling an unrestricted vocabulary and robust performance to codec and channel variations.

Learn more about the project

Project Members: Chiranjeevi Yarra, Nisha Meenakshi, Sanjeev Mittal, Srinivasa Raghavan K M, Samik Sadhu
Understanding of how various linguistic and paralinguistic components are exchanged in a human speech communication system could potentially be useful for making human-machine interaction natural.


Natural non-native English Speech synthesis

Funding agency: DST, Govt. of India

Developing models for synthesizing facial expressions in various affective states would be useful for creating a natural agent in a human-machine communication, which could be used in a variety of applications including customer support which requires presence of an agent for interacting with the customers.


Exploring the brain mechanisms associated with micro-sleep architecture dynamics among healthy practitioners of Vipassana meditation (who differ in their meditation proficiency) and control subjects.


Developing a supervised system that predicts quantitative scores corresponding to the degree of nativity in the Indian spoken English and provides qualitative feedback according to the learners' nativity in an automated way.