Neelesh Samptur, Tanuka Bhattacharjee, Anirudh Chakravarty K, Seena Vengalil, Yamini BK, Nalini Atchayaram, Prasanta Kumar Ghosh, "Exploring syllable discriminability during diadochokinetic task with increasing dysarthria severity for patients with amyotrophic lateral sclerosis", accepted in Proc. Interspeech, Kos Island, Greece, 2024, Page(s): 4114-4118. [pdf] [slides] Chetan Sharma, Vaishnavi Chwanshi, Prasanta Kumar Ghosh, "A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production", accepted in Proc. Interspeech, Kos Island, Greece, 2024. [pdf] Sathvik Udupa, Soumi Maiti, Prasanta Kumar Ghosh, "IndicMOS: Multilingual MOS prediction for 7 Indian languages", accepted in Proc. Interspeech, Kos Island, Greece, 2024. [pdf] [slides] [codes] Sathvik Udupa, Jesuraja, Saurabh, Deekshitha, Sandhya B, Abhayjeet, Savitha, Priyanka, Srinivasa, Raoul, Prasanta Kumar Ghosh, "Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models", accepted in Proc. Interspeech, Kos Island, Greece, 2024. [pdf] [poster] [codes] Jesuraja Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh, "Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss", accepted in Proc. Interspeech, Kos Island, Greece, 2024. [pdf] Alex Paul Kamson, Akshay V. Sawant, Prasanta Kumar Ghosh, Satish S Jeevannavar, "Exploring wav2vec 2.0 model for heart sound analysis", accepted in EMBC 2024. [pdf] Anjali Jayakumar, Tanuka Bhattacharjee, Seena Vengalil, Yamini Belur, Nalini Atchayaram, Prasanta Kumar Ghosh, "Low complexity model with single dimensional feature for speech based classification of amyotrophic lateral sclerosis patients and healthy individuals", accepted in Proc. IEEE International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2024, Page(s): 1-5. 
[pdf] [slides] Jesuraja Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh, "Discovering phoneme-specific critical articulators through a data driven approach", accepted in ISSP 2024. [pdf] Satyadev Badireddi, Shreya Shrikant Karkun, Prasanta Kumar Ghosh, "Inter-subject variation in tongue shape during vowel production in /b/V/t/ sequence: An rtMRI study using 8 vowels from 74 subjects", accepted in ISSP 2024. [pdf] [poster] Shivani Yadav, Dipanjan Gope, Uma Maheswari K, Prasanta Kumar Ghosh, "An unsupervised segmentation of vocal breath sounds", accepted in ICASSP 2024. [pdf] Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Seena Vengalil, Saraswati Nashi, Madassu Keerthipriya, Yamini Belur, Nalini Atchayaram, Prasanta Kumar Ghosh, "Spectral analysis of vowels and fricatives at varied levels of dysarthria severity for Amyotrophic Lateral Sclerosis", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Republic of Korea, 2024, Page(s): 12767-12771. [pdf] [poster] Aalok Varma, Sathvik Udupa, Mohini Sengupta, Prasanta Kumar Ghosh and Vatsala Thirumalai, "A machine-learning tool to identify bistable states from calcium imaging data", accepted in The Journal of Physiology, 602(7), 1243-1271. Abhayjeet S., Amala N., Anjali J., Deekshitha G, Jesuraja B., Roopa R, Sandhya B, Sathvik U, Prasanta Kumar Ghosh, Hema A Murthy, Heiga Zen, Pranaw Kumar, Kamal Kant, Amol Bole, Bira Chandra Singh, Keiichi Tokuda, Mark Hasegawa Johnson, Philipp Olbrich, "Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech", accepted in OJSP, Vol-, No.-, pp:, Mar. 2024. [pdf] Veerababu Dharanalakota and Prasanta Kumar Ghosh, "Neural network based approach for solving problems in plane wave duct acoustics", accepted in Journal of Sound and Vibration, Volume 585, 2024, 118476, ISSN 0022-460X. 
[pdf] [codes] Veerababu Dharanalakota, Namra Quasim and Prasanta Kumar Ghosh, "Estimation of the Acoustic Field in a Uniform Duct with Mean Flow using Neural Networks", accepted in International Journal Of Acoustics And Vibration (IJAV). Tanuka Bhattacharjee, Seena Vengalil, Yamini Belur, Nalini Atchayaram, and Prasanta Kumar Ghosh, "Inter-speaker acoustic differences of sustained vowels at varied dysarthria severities for Amyotrophic Lateral Sclerosis", accepted in Journal of Acoustical Society of America, 2024. [pdf] Sathvik Udupa, Jesuraja Bandekar, Abhayjeet Singh, Deekshitha G, Saurabh Kumar, Sandhya Badiger, Amala Nagireddi, Roopa R, Prasanta Kumar Ghosh, Hema A Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich, "LIMMITS’24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning", accepted in 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). [codes] Veerababu Dharanalakota, Prasanta Kumar Ghosh, "Prediction of one-dimensional acoustic field with axial temperature gradient using neural networks", accepted in INTER-NOISE24, Nantes, France, Pages 4996 - 5994, pp. 5930-5937(8). [pdf]
Alex Paul Kamson, Macline Crecsilla Lewis, Akshay Sawant, Vishnu Sunil B N, Prasanta Kumar Ghosh, Satish S Jeevannavar, "E2E Multi-Scale CNN with LSTM for murmur detection in PCG or noise identification", accepted in 2023 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), IEEE, 2023. [pdf] Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, Prasanta Kumar Ghosh, "SPIRE-SIES: A spontaneous Indian English speech corpus", accepted in 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), IEEE, 2023. [pdf] Sathvik Udupa, Jesuraja Bandekar, Deekshitha G, Saurabh Kumar, Prasanta Kumar Ghosh, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Rohan Saxena, "Gated multi encoders and multitask objectives for dialectal speech recognition in Indian languages", accepted in ASRU 2023. [pdf] Priyanshi Pal, Shelly Jain, Chiranjeevi Yarra, Prasanta Kumar Ghosh, Anil Kumar Vuppala, "Study of Indian English pronunciation variabilities relative to received pronunciation", accepted in SPECOM 2023. [pdf] Navneet Kaur, Prasanta Kumar Ghosh, "Curriculum learning based approach for faster convergence of TTS model", accepted in SPECOM 2023. [pdf] Abhayjeet Singh, Anjali Jayakumar, Deekshitha G, Hitesh Tiwari, Jesuraja Bandekar, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, "An end-to-end TTS model in Chhattisgarhi, a low-resource Indian language", accepted in SPECOM 2023. [pdf]
Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Sai Praneeth Reddy, Mora Srinivasa Raghavan, "An ASR corpus in Chhattisgarhi, a low resource Indian language", accepted in SPECOM 2023. [pdf] Veerababu Dharanalakota, J. Pavan Kumar, Prasanta Kumar Ghosh, "Loss-based optimizer switching to solve 1-D Helmholtz equation using neural networks", accepted in Acoustics 2023 Sydney, Australia, 4 - 8 Dec, (2023). [pdf] Veerababu Dharanalakota, R. Ashwin, Prasanta Kumar Ghosh, "Achieving stable convergence of neural networks for estimating acoustic field in uniform ducts", accepted in Acoustics 2023 Sydney, Australia, 4 - 8 Dec, (2023). [pdf] Chowdam Venkata Thirumala Kumar, Meenakshi Sirigiraju, Rakesh Vaideeswaran Mahesh, Prasanta Kumar Ghosh, Chiranjeevi Yarra, "Can the decoded text from automatic speech recognition effectively detect spoken grammar errors?", accepted in SLATE 2023. [pdf] Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Yamini BK, Nalini Atchayaram, Ravi Yadav, Prasanta Kumar Ghosh, "Classification of multi-class vowels and fricatives from patients having amyotrophic lateral sclerosis with varied levels of dysarthria severity", accepted in Proc. Interspeech, Dublin, Ireland, 2023, Page(s): 146-150. [pdf] Shelly Jain, Priyanshi Pal, Anil Vuppala, Prasanta Kumar Ghosh, Chiranjeevi Yarra, "An investigation of Indian native language phonemic influences on L2 English pronunciations", accepted in INTERSPEECH 2023. [pdf] Varun Belagali, Prasanta Kumar Ghosh, Achuth Rao, "Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels", accepted in INTERSPEECH 2023. 
[pdf] Mohammad Shaique Solanki, Ashutosh Bharadwaj, Jeevan Kylash, Prasanta Kumar Ghosh, "Do vocal breath sounds encode gender cues for automatic gender classification?", accepted in INTERSPEECH 2023. [pdf] Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit, Prasanta Kumar Ghosh, "A study on the importance of formant transitions for stop-consonant classification in VCV sequence", accepted in INTERSPEECH 2023. [pdf] Jesuraja Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh, "Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion", accepted in INTERSPEECH 2023. [pdf] Tanuka Bhattacharjee, Anjali Jayakumar, Yamini BK, Nalini Atchayaram, Ravi Yadav, Prasanta Kumar Ghosh, "Transfer learning to aid dysarthria severity classification for patients with amyotrophic lateral sclerosis", accepted in Proc. Interspeech, Dublin, Ireland, 2023, Page(s): 1543-1547. [pdf] [poster] Dharanalakota Veerababu, Prasanta Kumar Ghosh, "Solution of 1-D Helmholtz equation using artificial neural networks", accepted for publication in the 29th International Congress on Sound and Vibration (ICSV29). [pdf] Tanuka Bhattacharjee, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Prasanta Kumar Ghosh, "Exploring the role of fricatives in classifying healthy subjects and patients with amyotrophic lateral sclerosis and Parkinson's disease", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, Page(s): 1-5. [pdf] [slides] [poster] Tanuka Bhattacharjee, Chowdam Venkata Thirumala Kumar, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Prasanta Kumar Ghosh, "Static and dynamic source and filter cues for classification of amyotrophic lateral sclerosis patients and healthy subjects", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, Page(s): 1-5. 
[pdf] [slides] [poster] Sathvik Udupa, Siddarth C, Prasanta Kumar Ghosh, "Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models", accepted in ICASSP 2023. [pdf] Sathvik Udupa, Prasanta Kumar Ghosh, "Real-time MRI video synthesis from time aligned phonemes with sequence-to-sequence networks", accepted in ICASSP 2023. [pdf] Srikanth Raj Chetupalli, Prashant Krishnan, Neeraj Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, "Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms", accepted in IEEE Journal of Translational Engineering in Health and Medicine (Volume: 11). [pdf]
Priyanshi Pal, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "voisTUTOR 2.0: A speech corpus with phonetic transcription for pronunciation evaluation of Indian L2 English learners", accepted in O-COCOSDA 2022. [pdf] Abinay Reddy Naini, Achuth Rao M V, Prasanta Kumar Ghosh, "Whisper to neutral mapping using i-Vector space likelihood and a cosine similarity based iterative optimization for whispered speaker verification", accepted in NCC 2022. [pdf] Aravind Illa, Aanish Nair, Prasanta Kumar Ghosh, "The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis", accepted in ICASSP 2022. [pdf] Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh, "An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production", accepted in ICASSP 2022. [pdf] Siddharth Subramani, Achuth Rao M V, Anwesha Roy, Prasanna Suresh Hegde, Prasanta Kumar Ghosh, "Segnet-based deep representation learning for dysphagia classification", accepted in ICASSP 2022. [pdf] Abinay Reddy Naini, Bhavuk Singhal, Prasanta Kumar Ghosh, "Dual attention pooling network for recording device classification using neutral and whispered speech", accepted in ICASSP 2022. [pdf]
Karthik G.R., Prasanta Kumar Ghosh, "Towards a calibration-free approach to deep learning based single-incidence inverse scattering", accepted in 2021 PhotonIcs & Electromagnetics Research Symposium (PIERS). [pdf] Chiranjeevi Yarra, Prasanta Kumar Ghosh, "Automatic syllable stress detection under non-parallel label and data condition", accepted in Speech Communication, Elsevier. [pdf] Karthik G.R., Prasanta Kumar Ghosh, "A scalable deep learning model for arbitrary transmitter configurations in inverse scattering", accepted in 2021 IEEE Antennas and Propagation Society International Symposium (APS-URSI). [pdf] Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, "Convolutional dense neural network based spirometry variable FVC prediction using sustained phonations", accepted in MLSP 2021. [pdf] Abhayjeet Singh, Achuth Rao M V, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "A study on native American English speech recognition by Indian listeners with varying word familiarity level", accepted in Oriental COCOSDA 2021. [pdf] [slides] Tilak Purohit, Tejas Umesh, Shankar Narayanan, Minulakshmi S, Prasanta Kumar Ghosh, "SPIRE VCV: An acoustic-articulatory corpus with three different speaking rates", accepted in Oriental COCOSDA 2021. [pdf] Bhavuk Singhal, Abinay Reddy Naini, Prasanta Kumar Ghosh, "WSPIRE: A parallel multi-device corpus in neutral and whisper speech", accepted in Oriental COCOSDA 2021. [pdf] [slides] Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, "Role of breath phase and breath boundaries for the classification between asthmatic and healthy subjects", accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021. 
[pdf] Drishti Ramesh Megalmani, Shailesh B G, Achuth Rao M V, Satish S Jeevannavar, Prasanta Kumar Ghosh, "Unsegmented heart sound classification using hybrid CNN-LSTM neural networks", accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021. [pdf] Achuth Rao M V, Shailesh B G, Drishti Ramesh Megalmani, Satish S Jeevannavar, Prasanta Kumar Ghosh, "Noise robust detection of fundamental heart sound using parametric mixture gaussian and dynamic programming", accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021. [pdf] Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh, "Web Interface for estimating articulatory movements in speech production from acoustics and text", accepted in Show and Tell, Interspeech 2021, Brno, Czech Republic. [pdf] [poster] [codes] Manthan Sharma, Navaneetha Gaddam, Tejas Umesh, Aditya Murthy, Prasanta Kumar Ghosh, "A comparative study of different EMG features for acoustic-to-EMG mapping", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh, "Source and vocal tract cues for speech-based classification of patients with Parkinson’s disease and healthy subjects", accepted in Proc. Interspeech, Brno, Czechia, 2021, Page(s): 2961-2965. [pdf] [slides] Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh, "Estimating articulatory movements in speech production with transformer networks", accepted in Interspeech 2021, Brno, Czech Republic. 
[pdf] [slides] Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan K M, Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajapuria, Chiranjeevi Yarra, Ashish Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan, "Multilingual and code-switching ASR challenges for low resource Indian languages", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] Chiranjeevi Yarra, Prasanta Kumar Ghosh, "Noise robust pitch stylization using minimum mean absolute error criterion", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] Ananya Muguli, Lancelot Pinto, Nirmala R, Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda, "DiCOVA challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] Pavan Kumar J, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "DNN based phrase boundary detection using knowledge-based features and feature representations from CNN", accepted in 2021 National Conference on Communications (NCC). [pdf] Aparna Srinivasan, Diviya Singh, Chiranjeevi Yarra, Aravind Illa, Prasanta Kumar Ghosh, "A robust speaking rate estimator using a CNN-BLSTM network", accepted in Circuits, Systems, and Signal Processing, 2021. [pdf] Shankar Narayanan, Aravind Illa, Nayan Anand, Ganesh Sinisetty, Karthick Narayanan, Prasanta Kumar Ghosh, "An acoustic investigation on the effect of speaking rate on vowel space and coarticulation in Toda VCV sequences", accepted in Sādhanā, 2021. 
[pdf] Achuth Rao M V, Yamini B K, Ketan J, Preetie Shetty A, Pal P, Shivashankar N, Prasanta Kumar Ghosh, "Automatic classification of healthy subjects and patients with essential vocal tremor using probabilistic source-filter model based noise robust pitch estimation", accepted in Journal of Voice, 2021. [pdf] Tilak Purohit, Achuth Rao M V, Prasanta Kumar Ghosh, "Impact of speaking rate on the source filter interaction in speech: a study", accepted in ICASSP 2021. [pdf] [poster] Sarthak Kumar Maharana, Aravind Illa, Renuka Mannem, Yamini Belur, Preetie Shetty, Veeramani Preethish Kumar, Seena Vengalil, Kiran Polavarapu, Nalini Atchayaram, Prasanta Kumar Ghosh, "Acoustic-to-articulatory inversion for dysarthric speech by using cross-corpus acoustic-articulatory data", accepted in ICASSP 2021. [pdf] [poster] [codes] Renuka Mannem, Prasanta Kumar Ghosh, "A deep neural network based correction scheme for improved air-tissue boundary prediction in real-time magnetic resonance imaging video", accepted in Computer Speech and Language, March 2021, Volume 66. [pdf] [poster] Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh, "Effect of noise and model complexity on detection of amyotrophic lateral sclerosis and Parkinson’s disease using pitch and MFCC", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 2021, Page(s): 7313-7317. [pdf] [slides] [poster]
Aravind Illa, Prasanta Kumar Ghosh, "Complexity-Performance Trade-off In Acoustic-to-Articulatory Inversion", accepted in International Seminar on Speech Production (ISSP) 2020. [pdf] [poster] Chiranjeevi Yarra, Kausthubha N K, Prasanta Kumar Ghosh, "SPIRE-ABC: An online tool for acoustic-unit boundary correction (ABC) via crowdsourcing", accepted in Oriental COCOSDA 2020. [pdf] [slides] Tilak Purohit, Prasanta Kumar Ghosh, "An investigation of the virtual lip trajectories during the production of bilabial stops and nasal at different speaking rates", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] Aravind Illa, Prasanta Kumar Ghosh, "Speaker conditioned acoustic-to-articulatory inversion using x-vectors", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] Jhansi Mallela, Aravind Illa, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh, "Raw speech waveform based classification of patients with ALS, Parkinson’s Disease and healthy controls using CNN-BLSTM", accepted in Proc. Interspeech, Shanghai, China, 2020, Page(s): 4586-4590. [pdf] [slides] Renuka Mannem, Himajyothi Rajamahendravarapu, Aravind Illa, Prasanta Kumar Ghosh, "Speech rate task-specific representation learning from acoustic-articulatory data", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [codes] Renuka Mannem, Navaneetha Gaddam, Prasanta Kumar Ghosh, "Air-tissue boundary segmentation in real time Magnetic Resonance Imaging video using 3-D convolutional neural network", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [codes] Divya Degala, Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prakash T K, Prasanta Kumar Ghosh, "Automatic Glottis Detection and Segmentation in Stroboscopic videos using Convolutional Networks", accepted in Interspeech 2020, Shanghai, China. 
[pdf] [slides] Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh, "Attention and Encoder-Decoder based models for transforming articulatory movements at different speaking rates", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] Abinay Reddy Naini, Satyapriya Malla, Prasanta Kumar Ghosh, "Whisper activity detection using CNN-LSTM based attention pooling network trained for a speaker identification task", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] Neeraj Sharma, Prashant Krishnan, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Nirmala R, Prasanta Kumar Ghosh, Sriram Ganapathy, "A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] Suhas BN, Jhansi Mallela, Aravind Illa, Yamini BK, Nalini Atchayaram, Ravi Yadav, Dipanjan Gope, Prasanta Kumar Ghosh, "Speech task based automatic classification of ALS and Parkinson’s Disease and their severity using log mel spectrograms", accepted in Proc. IEEE International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2020, Page(s): 1-5. [pdf] [slides] Renuka Mannem, Himajyothi Rajamahendravarapu, Aravind Illa, Prasanta Kumar Ghosh, "Speech rate estimation using representations learned from speech with convolutional neural network", accepted in SPCOM 2020. [pdf] [slides] Jhansi Mallela, Aravind Illa, Suhas B N, Sathvik Udupa, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh, "Voice based classification of patients with amyotrophic lateral sclerosis, Parkinson's disease and healthy controls with CNN-LSTM using transfer learning", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, Page(s): 6784-6788. 
[pdf] [slides] Avni Rajpal, Achuth Rao MV, Chiranjeevi Yarra, Ritu Aggarwal, Prasanta Kumar Ghosh, "Pseudo likelihood correction technique for low resource accented ASR", accepted in ICASSP 2020. [pdf] [slides] Siddharth Subramani, Achuth Rao M V, Divya Giridhar, Prasanna Suresh Hegde, Prasanta Kumar Ghosh, "Automatic classification of volumes of water using swallow sounds from cervical auscultation", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] Shivani Yadav, Merugu Keerthana, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, "Analysis of acoustic features for speech sound based classification of asthmatic and healthy subjects", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh, "Automatic identification of speakers from head gestures in a narration", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh, "A comparative study of estimating articulatory movements from phoneme sequences and acoustic features", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] Aravind Illa, Prasanta Kumar Ghosh, "Closed-set speaker conditioned acoustic-to-articulatory inversion using bi-directional long short term memory network", accepted in The Journal of the Acoustical Society of America, Vol. 147, Issue 2, February 2020, EL171–EL176. [pdf] Varun Belagali, Achuth Rao M V, Pebbili Gopikishore, Rahul Krishnamurthy, Prasanta Kumar Ghosh, "Two step convolutional neural network for automatic glottis localization and segmentation in stroboscopic videos", accepted in Biomed. Opt. Express 11, 4695-4713, 2020. 
[pdf] Achuth Rao MV, Prasanta Kumar Ghosh, "SFnet: A computationally efficient source filter model based neural speech synthesis", accepted in IEEE Signal Processing Letters, Volume 27, July 2020, pp 1170–1174. [pdf] Anusuya P K, Aravind Illa and Prasanta Kumar Ghosh, "A data-driven phoneme-specific analysis of articulatory importance", accepted in 12th International Seminar on Speech Production (ISSP) 2020. [pdf] [poster] Aravind Illa, Prasanta Kumar Ghosh, "The impact of speaking rate on acoustic-to-articulatory inversion", accepted in Computer Speech & Language. [pdf] Prasanta Kumar Ghosh, Shantanu R. Godbole, Sachindra Joshi, Srujana Merugu, Ashish Verma, "Labeling of data for machine learning", accepted in US Patent Application US16/734,570, granted on 26 January 2021.
Divya Giridhar, Achuth Rao M V, Prasanna Suresh Hegde, Prasanta Kumar Ghosh, "Analysis of swallow sounds of healthy controls for different volumes of water", accepted in International Conference on Engineering in Medicine and Life Sciences, PSG College of Technology, Coimbatore, India, December 19-21, 2019. [pdf] Chiranjeevi Yarra, Ritu Aggarwal, Avni Rajpal, Prasanta Kumar Ghosh, "Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations", accepted in Oriental COCOSDA 2019. [pdf] [slides] Chiranjeevi Yarra, Aparna Srinivasan, Chandana Srinivasa, Ritu Aggarwal, Prasanta Kumar Ghosh, "voisTUTOR corpus: A speech corpus of Indian L2 English learners for pronunciation assessment", accepted in Oriental COCOSDA 2019. [pdf] Shankar Narayanan, Aravind Illa, Nayan Anand, Ganesh Sinisetty, Karthick Narayanan, Prasanta Kumar Ghosh, "An acoustic-articulatory database of VCV sequences and words in Toda at different speaking rates", accepted in Oriental COCOSDA 2019. [pdf] [slides] Chiranjeevi Yarra, Prasanta Kumar Ghosh, "voisTUTOR: Virtual operator for interactive spoken english TUTORing", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [slides] Chiranjeevi Yarra, Manoj Kumar Ramanathi, Prasanta Kumar Ghosh, "Comparison of automatic syllable stress detection quality with time-aligned boundaries and context dependencies", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [slides] Aparna Srinivasan, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "Automatic assessment of pronunciation and its dependent factors by exploring their interdependencies using DNN and LSTM", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. 
[pdf] [slides] Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Anurag Das, Prasanta Kumar Ghosh, "Noise robust goodness of pronunciation measures using teacher's utterance", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [poster] Suhas BN, Deep Patel, Nithin Rao, Yamini Belur, Pradeep Reddy, Nalini Atchayaram, Ravi Yadav, Dipanjan Gope, Prasanta Kumar Ghosh, "Comparison of speech tasks and recording devices for voice based automatic classification of healthy subjects and patients with amyotrophic lateral sclerosis", accepted in Proc. Interspeech, Graz, Austria, 2019, Page(s): 4564-4568. [pdf] [poster] Manoj Kumar Ramanathi, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "ASR inspired syllable stress detection for pronunciation evaluation without using a supervised classifier and syllable level features", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Abinay Reddy Naini, Achuth Rao MV, Prasanta Kumar Ghosh, "Whisper to neutral mapping using cosine similarity maximization in i-vector space for speaker verification", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Renuka Mannem, Jhansi Mallela, Aravind Illa, Prasanta Kumar Ghosh, "Acoustic and articulatory feature based speech rate estimation using a convolutional dense neural network", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Atreyee Saha, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "Low resource automatic intonation classification using gated recurrent unit (GRU) networks pre-trained with synthesized pitch patterns", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "An improved goodness of pronunciation (GoP) measure for pronunciation evaluation with DNN-HMM system considering HMM transition probabilities", accepted in Interspeech 2019, Graz, Austria. 
[pdf] [poster] Aravind Illa, Prasanta Kumar Ghosh, "An investigation on speaker specific articulatory synthesis with speaker independent articulatory inversion", accepted in Interspeech 2019, Graz, Austria. [pdf] [slides] Chiranjeevi Yarra, Aparna Srinivasan, Sravani Gottimukkala, Prasanta Kumar Ghosh, "SPIRE-fluent: A self-learning app for tutoring oral fluency to second language English learners", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Achuth Rao M V, Prasanta Kumar Ghosh, Tanuka Bhattacharjee, Anirban Dutta Choudhury, "Trend statistics network and channel invariant EEG network for sleep arousal study", accepted in The 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’19), Berlin, Germany. [pdf] [slides] Aravind Illa, Prasanta Kumar Ghosh, "Representation learning using convolution neural network for acoustic-to-articulatory inversion", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5931-5935. [pdf] [slides] Gokul Srinivasan, Aravind Illa, Prasanta Kumar Ghosh, "A study on robustness of articulatory features for automatic speech recognition of neutral and whispered speech", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5936-5940. [pdf] [slides] Valliappan CA, Avinash Kumar, Renuka Mannem, Karthik Girija Ramesan, Prasanta Kumar Ghosh, "An improved air tissue boundary segmentation technique for real time magnetic resonance imaging video using segnet", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5921-5925. 
[pdf] Renuka Mannem, Prasanta Kumar Ghosh, "Air-tissue boundary segmentation in real time magnetic resonance imaging video using a convolutional encoder-decoder network", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5941-5945. [pdf] [slides] Abinay Reddy Naini, Achuth Rao M V, Prasanta Kumar Ghosh, "Formant-gaps features for speaker verification using whispered speech", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, Pages: 6231-6235. [pdf] [poster] Renuka Mannem, Valliappan C A, Prasanta Kumar Ghosh, "A SegNet based image enhancement technique for air-tissue boundary segmentation in real-time magnetic resonance imaging video", accepted in National Conference on Communications (NCC) 2019, Bangalore, India, Pages: 1-6. [pdf] [slides] Chiranjeevi Yarra, Supriya Nagesh, Prasanta Kumar Ghosh, "Noise robust speech rate estimation using SNR dependent sub-band selection and peak detection strategy", accepted in The Journal of the Acoustical Society of America. [pdf] Achuth Rao MV, Prakhar Gupta, Prasanta Kumar Ghosh, "P- and T-wave delineation in ECG signals using parametric mixture Gaussian and dynamic programming", accepted in Biomedical Signal Processing and Control, May 2019, Volume 51, pages: 328-337. [pdf] Anurendra Kumar, Tanaya Guha, Prasanta Kumar Ghosh, "Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing", accepted in IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 27, no. 5 (2019): 919-931. [pdf] Achuth Rao M V, Prasanta Kumar Ghosh, "Glottal inverse filtering using probabilistic weighted linear prediction", accepted in IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Vol. 27, Issue 1, (2019): 114-124. 
[pdf] Tanuka Bhattacharjee, Shreyasi Datta, Deepan Das, Anirban Dutta Choudhury, Arpan Pal, Prasanta Kumar Ghosh, "Heart rate driven unsupervised techniques for continuous monitoring of arousal trend of users", accepted in US Patent Application US16/190,800, granted on 29 June 2021. Tanuka Bhattacharjee, Deepan Das, Shahnawaz Alam, Rohan Banerjee, Anirban Dutta Choudhury, Arpan Pal, Achuth Rao Melavarige Venkatagiri, Prasanta Kumar Ghosh, Ayush Ranjan Lohani, "System and method for non-apnea sleep arousal detection", accepted in US Patent Application US16/578,270 granted on 23 August 2022.
Aravind Illa, Prasanta Kumar Ghosh, "Inferring speaker identity from articulatory motion during speech", accepted in Machine Learning in Speech and Language Processing Workshop (MLSLP) 2018. [pdf] [slides] Aravind Illa, Prasanta Kumar Ghosh, "Low resource acoustic-to-articulatory inversion using bi-directional long short term memory", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 3122-3126. [pdf] [codes] G. Nisha Meenakshi, Prasanta Kumar Ghosh, "Whispered speech to neutral speech conversion using bidirectional LSTMs", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 491-495. [pdf] [slides] Pavan Karjol, Prasanta Kumar Ghosh, "Speech enhancement using deep mixture of experts based on hard expectation maximization", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 3254-3258. [pdf] [poster] Abinay Reddy N, Achuth Rao M V, G. Nisha Meenakshi, Prasanta Kumar Ghosh, "Reconstructing neutral speech from tracheoesophageal speech", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 1541-1545. [pdf] [poster] Astha Singh, G. Nisha Meenakshi, Prasanta Kumar Ghosh, "Relating articulatory motions in different speaking rates", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 2992-2996. [pdf] [poster] Chandana S, Chiranjeevi Yarra, Ritu Aggarwal, Sanjeev Kumar Mittal, Kausthubha N K, Raseena K T, Astha Singh, Prasanta Kumar Ghosh, "Automatic visual augmentation for concatenation based synthesized articulatory videos from real-time MRI data for spoken language training", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 3127-3131. [pdf] Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadarshini, Prasanta Kumar Ghosh, "Automatic glottis localization and segmentation in stroboscopic videos using deep neural network", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 3007-3011.
[pdf] Girija Ramesan Karthik, Parth Suresh, Prasanta Kumar Ghosh, "Subband weighting for binaural speech source localization", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 861-865. [pdf] [poster] Valliappan CA, Renuka Mannem, Prasanta Kumar Ghosh, "Air-tissue boundary segmentation in real-time magnetic resonance imaging video using semantic segmentation with fully convolutional networks", accepted in Interspeech, Hyderabad, India, 2018, Page(s): 3132-3136. [pdf] [slides] Anand P A, Chiranjeevi Yarra, Kausthubha N K, Prasanta Kumar Ghosh, "Intonation tutor by SPIRE (In-SPIRE): An online tool for an automatic feedback to the second language learners in learning intonation", accepted in Proc. Interspeech, Hyderabad, India, 2018, Page(s): 546-547. [pdf] Chiranjeevi Yarra, Anand P A, Kausthubha N K, Prasanta Kumar Ghosh, "SPIRE-SST: An automatic web-based self-learning tool for syllable stress tutoring (SST) to the second language learners", accepted in Proc. Interspeech, Hyderabad, India, 2018, Page(s): 2390-2391. [pdf] Valliappan C A, Anurag Das, Prasanta Kumar Ghosh, "Classification of story-telling and poem recitation using head gesture of the talker", accepted in Proc. International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2018, Page(s): 36-40. [pdf] [slides] Pavan Karjol, Prasanta Kumar Ghosh, "Broad phoneme class specific deep neural network based speech enhancement", accepted in Proc. International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2018, Page(s): 372-376. [pdf] [slides] Shivani Yadav, Kausthubha N K, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, "Comparison of cough, wheeze and sustained phonations for automatic classification between healthy subjects and asthmatic patients", accepted in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’18), Honolulu, HI, USA, 2018, Page(s): 1400-1403.
[pdf] [slides] Raseena K T, Prasanta Kumar Ghosh, "A maximum likelihood formulation to exploit heart rate variability for robust heart rate estimation from facial video", accepted in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’18), Honolulu, HI, USA, 2018, Page(s): 5191-5194. [pdf] [slides] Urvish Desai, Chiranjeevi Yarra, Prasanta Kumar Ghosh, "Concatenative articulatory video synthesis using real-time MRI data for spoken language training", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 4999-5003. [pdf] [slides] Aravind Illa, Deep Patel, Yamini B K, Meera S S, Shivashankar N, Preethish Kumar Veeramani, Seena Vengalil, Kiran Polavarapu, Saraswati Nashi, Atchayaram Nalini, Prasanta Kumar Ghosh, "Comparison of speech tasks for automatic classification of patients with amyotrophic lateral sclerosis and healthy subjects", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 6014-6018. [pdf] [poster] Anurendra Kumar, Tanaya Guha, Prasanta Kumar Ghosh, "A dynamic latent variable model for source separation", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 2871-2875. [pdf] [poster] Advait Koparkar, Prasanta Kumar Ghosh, "A supervised air-tissue boundary segmentation technique in real-time magnetic resonance imaging video using a novel measure of contrast and dynamic programming", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5004-5008. [pdf] Karthik Girija Ramesan, Prasanta Kumar Ghosh, "Binaural speech source localization using template matching of interaural time difference patterns", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5164-5168.
[pdf] [poster] Pavan Karjol, Ajay Kumar M, Prasanta Kumar Ghosh, "Speech enhancement using multiple deep neural networks", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5049-5052. [pdf] Chiranjeevi Yarra, Prasanta Kumar Ghosh, "Automatic intonation classification using temporal patterns in utterance-level pitch contour and perceptually motivated pitch transformation", accepted in The Journal of the Acoustical Society of America 144.5 (2018): EL471-EL476. [pdf] Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh, "A frame selective dynamic programming approach for noise robust pitch estimation", accepted in The Journal of the Acoustical Society of America, 143, no. 4 (2018): 2289-2300. [pdf] Nisha Meenakshi, Prasanta Kumar Ghosh, "Reconstruction of articulatory movements during neutral speech from those during whispered speech", accepted in Journal of Acoustical Society of America, June 2018, 143 (6), pages 3352-3364. [pdf] Achuth Rao M V, Prasanta Kumar Ghosh, "PSFM - A probabilistic source filter model for noise robust Glottal closure instant detection", accepted in IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 26, no. 9 (2018): 1645-1657. [pdf] Ratna Jyothi Kakumanu, Ajay Kumar Nair, Rahul Venugopal, Arun Sasidharan, Prasanta Kumar Ghosh, John P. John, Seema Mehrotra, Ravindra Panth, Bindu M. Kutty, "Dissociating meditation proficiency and experience dependent EEG changes during traditional Vipassana meditation practice", accepted in Biological Psychology 135 (2018): 65-75. [pdf] Achuth Rao M V, Shiny Victory, Prasanta Kumar Ghosh, "Effect of source filter interaction on isolated vowel-consonant-vowel perception", accepted in The Journal of the Acoustical Society of America, 144.2 (2018): EL95-EL99. [pdf]
Vijitha Periyasamy, Manojit Pramanik, Prasanta Kumar Ghosh, "Review on heart-rate estimation from photoplethysmography and accelerometer signals during physical exercise", accepted in Journal of the Indian Institute of Science, 97(3), 313-324 (2017). [pdf] Pattem Ashok Kumar, Aravind Illa, Amber Afshan, Prasanta Kumar Ghosh, "Optimal sensor placement in electromagnetic articulography recording for speech production study", accepted in Computer Speech & Language 47 (2018): 157-174. [pdf] [slides] Sahil Bansal, Anindita Ghosh, Chandra Sekhar Seelamantula, Gurunath Gurrala, Prasanta Kumar Ghosh, "Adaptive frequency estimation approach using iterative DESA with RDFT-based filter", accepted in Proc. IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC), 2017, pp. 1-6. [pdf] Nazreen P. M., A. G. Ramakrishnan, Prasanta Kumar Ghosh, "A joint enhancement-decoding formulation for noise robust phoneme recognition", accepted in Proc. INDICON-2017, IIT Roorkee, pp. 1-6. [pdf] Samik Sadhu, Prasanta Kumar Ghosh, "Low resource point process models for keyword spotting using unsupervised online learning", accepted in Signal Processing Conference (EUSIPCO), 2017 25th European (pp. 538-542). IEEE. [pdf] [poster] Achuth Rao M V, Prasanta Kumar Ghosh, "Pitch prediction from mel-generalized cepstrum - a computationally efficient pitch modeling approach for speech synthesis", accepted in Signal Processing Conference (EUSIPCO), 2017 25th European (pp. 1629-1633). IEEE. [pdf] [poster] Achuth Rao M V, Kausthubha N K, Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, "Automatic prediction of spirometry readings from cough and wheeze for monitoring of asthma severity", accepted in Signal Processing Conference (EUSIPCO), 2017 25th European (pp. 41-45). IEEE.
[pdf] [poster] Akshay Kalkunte Suresh, Srinivasa Raghavan K M, Prasanta Kumar Ghosh, "Phoneme state posteriorgram features for speech based automatic classification of speakers in cold and healthy condition", accepted in Proc. Interspeech 2017, 3462-3466. [pdf] [poster] Gaurav Fotedar, Prasanta Kumar Ghosh, "An information theoretic analysis of the temporal synchrony between head gestures and prosodic patterns in spontaneous speech", accepted in Proc. Interspeech 2017, 157-161. [pdf] [poster] Girija Ramesan Karthik, Prasanta Kumar Ghosh, "Subband selection for binaural speech source localization", accepted in Proc. Interspeech 2017, Stockholm, Sweden, 1929-1933. [pdf] [poster] G. Nisha Meenakshi, Prasanta Kumar Ghosh, "A robust voiced/unvoiced phoneme classification from whispered speech using the 'color' of whispered phonemes and deep neural network", accepted in Proc. Interspeech 2017, 503-507. [pdf] [poster] Abhishek Narwekar, Prasanta Kumar Ghosh, "PRAV: A phonetically rich audio visual corpus", accepted in Proc. Interspeech 2017, 3747-3751. [pdf] [poster] Achuth Rao M V, Shivani Yadav, Prasanta Kumar Ghosh, "A dual source-filter model of snore audio for snorer group classification", accepted in Proc. Interspeech 2017, Stockholm, Sweden, 3502-3506. [pdf] Srinivasa Raghavan, Nisha Meenakshi, Sanjeev Kumar Mittal, Chiranjeevi Yarra, Anupam Mandal, K R Prasanna Kumar, Prasanta Kumar Ghosh, "A comparative study on the effect of different codecs on speech recognition accuracy using various acoustic modeling techniques", accepted in Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] [poster] Pradyumna Suresha, Supriya Nagesh, Priyadarshini Savan Roshan, Aditya Gaonkar P, Nisha Meenakshi, Prasanta Kumar Ghosh, "A high resolution ENF based multi stage classifier for location forensics of media recordings", accepted in Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE.
[pdf] Mekhala H S, Yamini B K, Ketan J, Pal P, Shivashankar N, Prasanta Kumar Ghosh, "Classification of healthy subjects and patients with essential vocal tremor using empirical mode decomposition of high resolution pitch contour", accepted in In Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] [poster] Achuth Rao MV, Prasanta Kumar Ghosh, "Pitch prediction from mel-frequency cepstral coefficients using sparse spectrum recovery", accepted in In Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] [poster] Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh, "An automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, (pp. 5845-5849). [pdf] [poster] Aravind Illa, Nisha Meenakshi G, Prasanta Kumar Ghosh, "A comparative study of acoustic-to-articulatory inversion for neutral and whispered speech", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 (pp. 5075-5079). [pdf] [poster] Rao, Nithin K, Meenakshi, Nisha, Prasanta Kumar Ghosh, "Spectrogram Enhancement Using Multiple Window Savitzky-Golay (MWSG) Filter for Robust Bird Sound Detection", accepted in IEEE/ACM Transactions on Audio, Speech, and Language Processing 25.6 (2017): 1183-1192. [pdf] [slides] [codes]
Fotedar, Gaurav, Aditya Gaonkar, Saikat Chatterjee, Prasanta Kumar Ghosh, "Automatic recognition of social roles using long term role transitions in small group interactions", accepted in Proc. Interspeech 2016 (2016): 2065-2069. [pdf] [poster] Nazreen P. M., A. G. Ramakrishnan, Prasanta Kumar Ghosh, "A class-specific speech enhancement for phoneme recognition: a dictionary learning approach", accepted in Proc. Interspeech 2016 (2016): 3728-3732. [pdf] [poster] Aditya Gaonkar P, Bhuthesh R, Dipanjan Gope, Prasanta Kumar Ghosh, "Robust real-time pulse rate estimation from facial video using sparse spectral peak tracking", accepted in Proc. International Conference on Signal Processing and Communications (SPCOM), 2016, pp. 1-5. IEEE, 2016. [pdf] [poster] Abhishek Narwekar, Prasanta Kumar Ghosh, "A comparative study of articulatory features from facial video and acoustic-to-articulatory inversion for phonetic discrimination", accepted in Proc. International Conference on Signal Processing and Communications (SPCOM), pp. 1-5. IEEE, 2016. [pdf] [poster] Nagesh, Supriya, Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh, "A robust speech rate estimation based on the activation profile from the selected acoustic unit dictionary", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5400-5404. 2016. [pdf] Afshan, Amber, Prasanta Kumar Ghosh, "Better acoustic normalization in subject independent acoustic-to-articulatory inversion: Benefit to recognition", accepted in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5395-5399. 2016. [pdf] Prasad, Abhay, Prasanta Kumar Ghosh, "Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition", accepted in Computer Speech & Language 39 (2016): 108-128. [pdf] Yarra, Chiranjeevi, Om D. 
Deshmukh, Prasanta Kumar Ghosh, "A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection", accepted in Speech Communication 78 (2016): 62-71. [pdf] Prathosh A. P., Sujith P., Ramakrishnan A. G., Prasanta Kumar Ghosh, "Cumulative impulse strength for epoch extraction", accepted in IEEE Signal Processing Letters, 23(4) (2016), 424-428. [pdf] Ming Li, Jangwon Kim, Adam Lammert, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth Narayanan, "Speaker verification based on the fusion of speech acoustics and inverted articulatory signals", accepted in Computer Speech and Language, 2016. [pdf]
Prasad, Abhay, Prasanta Kumar Ghosh, "Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers", accepted in Proc. of INTERSPEECH. ISCA. Dresden, Germany: ISCA (2015): 884-888. [pdf] Parida, Satyabrata, Ashok Kumar Pattem, Prasanta Kumar Ghosh, "Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data", accepted in Sixteenth Annual Conference of the International Speech Communication Association. 2015. [pdf] Meenakshi, G. Nisha, Prasanta Kumar Ghosh, "A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages", accepted in Sixteenth Annual Conference of the International Speech Communication Association. 2015. [pdf] Sujith P., Prathosh A. P., A. G. Ramakrishnan, Prasanta Kumar Ghosh, "An error correction scheme for GCI detection algorithms using pitch smoothness criterion", accepted in Sixteenth Annual Conference of the International Speech Communication Association. 2015. [pdf] Adria Casamitjana, Martin Sundin, Prasanta Kumar Ghosh, Saikat Chatterjee, "Bayesian learning for time-varying linear prediction of speech", accepted in Proc. EUSIPCO, Aug 31- Sep 4 2015, pp 325-329. [pdf] A. Prasad, V. Periyasamy, Prasanta Kumar Ghosh, "Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification", accepted in Proc. ICASSP, 19-24 April, 2015, pp 4265-4269. [pdf] [poster] Nisha Meenakshi, Prasanta Kumar Ghosh, "Automatic gender classification using the mel frequency Cepstrum of neutral and whispered speech: a comparative study", accepted in Proc. 21st National Conference on Communications, 27 Feb - 1 March, 2015, pp 1-6. [pdf] Murthy, Navaneet K. Lakshminarasimha, Pavan C. 
Madhusudana, Pradyumna Suresha, Vijitha Periyasamy, Prasanta Kumar Ghosh, "Multiple spectral peak tracking for heart rate monitoring from photoplethysmography signal during intensive physical exercise", accepted in IEEE Signal Processing Letters 22.12 (2015): 2391-2395. [pdf] Nisha Meenakshi, Prasanta Kumar Ghosh, "Robust whisper activity detection using long-term log energy variation of sub-band signal", accepted in IEEE Signal Processing Letters, Volume 22, Issue 11, June 2015, pp 1859-1863. [pdf] Afshan A., Prasanta Kumar Ghosh, "Improved subject-independent acoustic-to-articulatory inversion", accepted in Speech Communication, Elsevier. Vol. 66, 2015, January, 1-16. [pdf]
Prasad Sudhakar, Prasanta Kumar Ghosh, "Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: Benefit to speech recognition", accepted in InterSpeech, 2014. [pdf] Abhay Prasad, Prasanta Kumar Ghosh, Shrikanth Narayanan, "Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection", accepted in InterSpeech, 2014. [pdf] Sujith P , Prasanta Kumar Ghosh, "Missing samples estimation in electromagnetic articulography data using equality constrained Kalman smoother", accepted in InterSpeech, 2014. [pdf] Nisha Meenakshi, Chiranjeevi Yarra, B. K. Yamini, Prasanta Kumar Ghosh, "Comparison of speech quality with and without sensors in electromagnetic articulograph AG 501 recording", accepted in InterSpeech, 2014. [pdf] Sujith P., Prasanta Kumar Ghosh, "Maximum a-posteriori estimation of missing samples with continuity constraint in electromagnetic articulography data", accepted in ICASSP 2014. [pdf] Abhijith Mundanad Narayanan, Prasanta Kumar Ghosh, K. Rajgopal, "Multi-Pitch Tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform", accepted in ICASSP 2014. [pdf] Prasad Sudhakar, Laurent Jacques, Prasanta Kumar Ghosh, "A sparse smoothing approach for Gaussian mixture model based acoustic-to-articulatory inversion", accepted in ICASSP 2014. [pdf] Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis Georgiou, Shrikanth S. Narayanan, "Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design", accepted in ICASSP 2014. 
[pdf] Shrikanth Narayanan, Asterios Toutios, Vikram Ramanarayanan, Adam Lammert, Jangwon Kim, Sungbok Lee, Krishna Nayak, Yoon-Chul Kim, Yinghua Zhu, Louis Goldstein, Dani Byrd, Erik Bresch, Prasanta Kumar Ghosh, Athanasios Katsamanis, Michael Proctor, "Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research", accepted in Journal of Acoustical Society of America, Volume 136, 2014, September, 1307. [pdf] Jangwon Kim, Adam Lammert, Prasanta Kumar Ghosh, Shrikanth S. Narayanan, "Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging", accepted in Journal of the Acoustical Society of America Express Letter (JASAEL), Volume 135, Issue 2, 2014, EL115-EL121. [pdf]
Bhuthesh R, Prasanta Kumar Ghosh, Dipanjan Gope, "Robust real-time pulse rate estimation from facial video using sparse spectral peak tracking", accepted in IEEE workshop on Computational Intelligence, IIT Kanpur, 2013, July. [pdf] M. Li, A. Lammert, J. Kim, Prasanta Kumar Ghosh, S. Narayanan, "Automatic Classification of Palatal and Pharyngeal Wall Shape Categories from Speech Acoustics and Inverted Articulatory Signals", accepted in Workshop on Speech Production in Automatic Speech Recognition, Interspeech, Lyon, France, 2013. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Information theoretic acoustic feature selection for acoustic-to-articulatory inversion", accepted in Proc. Interspeech 2013, Lyon, France, 2013. [pdf] A. Tsiartas, T. Chaspari, N. Katsamanis, Prasanta Kumar Ghosh, M. Li, M. V. Sebroeck, A. Potamianos, S. Narayanan, "Multi-band long-term signal variability features for robust voice activity detection", accepted in Proc. Interspeech 2013, Lyon, France, 2013. [pdf] M. Li, J. Kim, Prasanta Kumar Ghosh, V. Ramanarayanan, S. Narayanan, "Speaker verification based on fusion of acoustic and articulatory information", accepted in Proc. Interspeech 2013, Lyon, France, 2013. [pdf] Shajith Ikbal, Ashish Verma, Prasanta Kumar Ghosh, Kenneth Church, Jeffrey Marcus, "Intent focused summarization of caller-agent conversations", accepted in Proc. ICASSP 2013. [pdf] Jangwon Kim, Adam Lammert, Prasanta Kumar Ghosh, Shrikanth Narayanan, "Spatial and temporal alignment of multimodal human speech production data: real time imaging, flesh point tracking and audio", accepted in Proc. ICASSP 2013. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "On smoothing articulatory trajectories obtained from Gaussian mixture model based acoustic-to-articulatory inversion", accepted in Journal of the Acoustical Society of America Express Letter (JASAEL), Volume 134, Issue 2, 2013, July. [pdf] Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. 
Narayanan, "High-quality bilingual subtitle document alignments with application to spontaneous speech translation", accepted in Computer Speech and Language, 2013. [pdf] Shajith Ikbal Mohamed, Kenneth W. Church, Ashish Verma, Prasanta Kumar Ghosh, Jeffrey N. Marcus, "System and method for identification of intent segment(s) in caller-agent conversations", accepted in US Patent Application US13/781,351, granted on 16 July 2019.
J. Kim, Prasanta Kumar Ghosh, S. Lee, S. S. Narayanan, "A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion", accepted in APSIPA ASC, 2012. [pdf] V. Ramanarayanan, Prasanta Kumar Ghosh, A. Lammert, S. Narayanan, "Exploiting speech production information for automatic speech and speaker modeling and recognition: possibilities and new opportunities", accepted in APSIPA ASC, 2012. [pdf]
Prasanta Kumar Ghosh, Shrikanth Narayanan, "Analysis of inter-articulator correlation in acoustic-to-articulatory inversion using generalized smoothness criterion", accepted in Proc. Interspeech, Florence, Italy, 2011. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "A subject-independent acoustic-to-articulatory inversion", accepted in Proceedings ICASSP, Prague, Czech Republic, 22-27 May, 2011. [pdf] S. Narayanan, E. Bresch, Prasanta Kumar Ghosh, L. Goldstein, A. Katsamanis, Y. Kim, A. Lammert, M. Proctor, V. Ramanarayanan, Y. Zhu, "A multimodal real-time MRI articulatory corpus for speech research", accepted in Proc. Interspeech, Florence, Italy, 2011. [pdf] Bo Xiao, Prasanta Kumar Ghosh, Panayiotis Georgiou, Shrikanth Narayanan, "Overlapped speech detection using long-term spectro-temporal similarity in stereo recording", accepted in Proc. ICASSP, Prague, Czech Republic, 2011. [pdf] Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis Georgiou, Shrikanth Narayanan, "Bilingual audio-subtitle extraction using automatic segmentation of movie audio", accepted in Proc. ICASSP, Prague, Czech Republic, 2011. [pdf] Prasanta Kumar Ghosh, Andreas Tsiartas, Shrikanth Narayanan, "Robust voice activity detection using long-term signal variability", accepted in IEEE Trans. Audio, Speech and Language Processing, Volume 19, No. 3, March 2011, pp 600-613. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion", accepted in J. Acoust. Soc. Am. Express Letters (JASAEL), Volume 130, Issue 4, 2011, Aug, EL251-EL257. [pdf] Prasanta Kumar Ghosh, Louis M. Goldstein, Shrikanth Narayanan, "Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures", accepted in J. Acoust. Soc. Am., Volume 129, Issue 6, 2011, Jun, 4014-4022. [pdf] Prasanta Kumar Ghosh, Louis M. 
Goldstein, Shrikanth Narayanan, "Auditory-like filterbank: An optimal speech processor for efficient human speech communication", accepted in Springer Proceedings of Indian Academy of Sciences (Sadhana), Special Issue on Speech Processing, 2011, October, 699-712. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter", accepted in Speech Communication, Elsevier, Volume 53, No. 1, 2011, January, 98-109. [pdf]
Prasanta Kumar Ghosh, Andreas Tsiartas, Panayiotis G. Georgiou, Shrikanth Narayanan, "Robust voice activity detection in stereo recording with crosstalk", accepted in Proc. InterSpeech, Makuhari, Japan, 2010, Sep. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "A generalized smoothness criterion for acoustic-to-articulatory inversion", accepted in Journal of Acoustical Society of America, Volume 128, No. 4, 2010, Oct, 2162-2172. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Bark frequency transform using an arbitrary order allpass filter", accepted in IEEE Signal Processing Letters, Volume 17, No. 6, 2010, June, 543-546. [pdf]
Prasanta Kumar Ghosh, Shrikanth Narayanan, Pierre Divenyi, Louis Goldstein, Elliot Saltzman, "Estimation of articulatory gesture patterns from speech acoustics", accepted in Proc. InterSpeech, 6-10 Sep, 2009, Brighton, UK, 2803-2806. [pdf] Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth Narayanan, "Context-driven bilingual movie subtitle alignment", accepted in Proc. InterSpeech, 6-10 Sep, 2009, Brighton, UK, 444-447. [pdf] Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth Narayanan, "Robust word boundary detection in spontaneous speech using acoustic and lexical cues", accepted in Proceedings of ICASSP, Taipei, Taiwan, 2009, Apr. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Pitch contour stylization using an optimal piecewise polynomial approximation", accepted in IEEE Signal Processing Letters, Volume 16, No. 9, 2009, Sept, 810-813. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Closure duration analysis of incomplete stop consonants due to stop-stop interaction", accepted in J. Acoust. Soc. Am. Express Letters, Volume 126, Issue 1, 2009, Jul, EL1-EL7. [pdf]
Prasanta Kumar Ghosh, Antonio Ortega, Shrikanth Narayanan, "Pitch Period Estimation using Multipulse Model and Wavelet Transform", accepted in Proceedings of InterSpeech ICSLP, Antwerp, Belgium, 2007, Aug, 2761-2764. [pdf] Prasanta Kumar Ghosh, "Speech Segmentation using Extrema-Based Signal Track Length Measure", accepted in ICASSP 2007, Volume 4, 15-20, 2007, April, IV-1065 - IV-1068. [pdf]
Prasanta Kumar Ghosh, T.V. Sreenivas, "Dynamic Programming Based Optimum Non-Uniform Samples For Speech Reconstruction and Coding", accepted in ICASSP, Volume 1, 2006. [pdf] Prasanta Kumar Ghosh, T.V. Sreenivas, "Extrema based Unwarping for Time-varying Pitch Estimation", accepted in 12th National Conference on Communication (NCC), 2006. [pdf] Amitava Das, Manoj Balwani, Rahul Thota, Prasanta Kumar Ghosh, "Face Recognition from Images with High Pose Variations by Transform Vector Quantization", accepted in ICVGIP, 2006, 674-685. [pdf] Das, A., Prasanta Kumar Ghosh, "Audio-Visual Biometric Recognition by Vector Quantization", accepted in IEEE Spoken Language Technology (SLT) Workshop, 2006, Dec, 166-169. [pdf] Prasanta Kumar Ghosh, T.V. Sreenivas, "Time-varying filter interpretation of Fourier transform and its variants", accepted in Signal Processing (Elsevier), Volume 86, Issue 11, 2006, November, 3258-3263. [pdf]
Prasanta Kumar Ghosh, A. Konar, "Modification of the LMS Predictor to Reduce Signal Prediction Error in Linear Prediction", accepted in International Conference on Communication, Devices and Intelligent Systems (CODIS 2004), Jan 2004, Kolkata, India.