Neelesh Samptur, Tanuka Bhattacharjee, Anirudh Chakravarty K, Seena Vengalil, Yamini BK, Nalini Atchayaram, P. K. Ghosh, , "Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis", accepted in Interspeech 2024. Chetan Sharma, Vaishnavi Chwanshi, P. K. Ghosh, , "A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production", accepted in Interspeech 2024. Sathvik Udupa, Soumi Maiti, P. K. Ghosh, , "IndicMOS: Multilingual MOS Prediction for 7 Indian languages", accepted in Interspeech 2024. Sathvik, Jersuraj, Saurabh, Deekshitha, Shya B, Abhayjeet, Savitha, Priyanka, Srinivasa, Raoul, P. K. Ghosh, , "Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models", accepted in Interspeech 2024. Jesuraja Bandekar, Sathvik Udupa, P. K. Ghosh, , "Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss", accepted in Interspeech 2024. Alex Paul Kamson, Akshay V. Sawant, P. K. Ghosh Satish S Jeevannavar, , "Exploring wav2vec 2.0 Model for Heart Sound Analysis", ", accepted in EMBC 2024. Anjali Jayakumar, Tanuka Bhattacharjee, Seena Vengalil, Yamini Belur, Nalini Atchayaram P. K. Ghosh, , "Low Complexity Model with Single Dimensional Feature for Speech Based Classification of Amyotrophic Lateral Sclerosis Patients and Healthy Individuals", accepted in SPCOM 2024. Jesuraja Bandekar, Sathvik Udupa P. K. Ghosh, , "Discovering phoneme-specific critical articulators through a data driven approach", accepted in ISSP 2024. Satyadev Badireddi, Shreya Shrikant Karkun, P. K. Ghosh, , "Inter-subject variation in tongue shape during vowel production in /b/V/t/ sequence: An rtMRI study using 8 vowels from 74 subjects", accepted in ISSP 2024. SHIVANI YADAV, DIPANJAN GOPE, UMA MAHESWARI K., P. K. Ghosh, , "AN UNSUPERVISED SEGMENTATION OF VOCAL BREATH SOUNDS", ", accepted in ICASSP 2024. Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Seena Vengalil, Saraswati Nashi, Madassu Keerthipriya, Yamini Belur, Nalini Atchayaram, P. K. Ghosh, , "SPECTRAL ANALYSIS OF VOWELS AND FRICATIVES AT VARIED LEVELS OF DYSARTHRIA SEVERITY FOR AMYOTROPHIC LATERAL SCLEROSIS", accepted in ICASSP 2024.
Alex Paul Kamson, Macline Crecsilla Lewis, Akshay Sawant, Vishnu Sunil B N, P. K. Ghosh, Satish S Jeevannavar. , "E2E Multi-Scale CNN with LSTM for Murmur Detection in PCG or Noise Identification", accepted in In 2023 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), IEEE, 2023. Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, & P. K. Ghosh, "SPIRE-SIES: A Spontaneous Indian English Speech Corpus", accepted in 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA). IEEE 2023. Sathvik Udupa, Jesuraja Bandekar, Deekshitha G, Saurabh Kumar, P. K. Ghosh, Shya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Rohan Saxena, "Gated Multi Encoders and multitask objectives for Dialectal Speech recognition in Indian languages", accepted in ASRU 2023. Priyanshi Pal, Shelly Jain, Chiranjeevi Yarra, P. K. Ghosh Anil Kumar Vuppala, "Study of Indian English Pronunciation variabilities relative to Received Pronunciation", accepted in SPECOM 2023. Navneet Kaur P. K. Ghosh, "Curriculum Learning based approach for faster convergence of TTS model", accepted in SPECOM 2023. Abhayjeet Singh, Anjali Jayakumar, Deekshitha G, Hitesh Tiwari, Jesuraja Bandekar, Shya Badiger, Sathvik Udupa, Saurabh Kumar P. K. Ghosh, "An end-to-end TTS model in Chhattisgarhi, a low-resource Indian language", accepted in SPECOM 2023. Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Shya Badiger, Sathvik Udupa, Saurabh Kumar, P. K. Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Sai Praneeth Reddy Mora Srinivasa Raghavan, "An ASR corpus in Chhattisgarhi, a low resource Indian language", accepted in SPECOM 2023. Veerababu Dharanalakota, Namra Quasim, P. K. Ghosh, "Estimation of Acoustic Field in a Uniform Duct with Mean Flow using Neural Networks", accepted in presentation at the 2024 AIAA SciTech. Veerababu Dharanalakota, J. Pavan Kumar, P. K. Ghosh, "Loss-based Optimizer Switching to Solve 1-D Helmholtz Equation using Neural Networks", accepted in Acoustics 2023 Sydney, Australia, 4 - 8 Dec, (2023). Veerababu Dharanalakota, R. Ashwin, P. K. Ghosh , "Achieving Stable Convergence of Neural Networks for Estimating Acoustic Field in Uniform Ducts", accepted in Acoustics 2023 Sydney, Australia, 4 - 8 Dec, (2023). Chowdam Venkata Thirumala Kumar, MEENAKSHI SIRIGIRAJU, Rakesh Vaideeswaran Mahesh, P. K. Ghosh, Chiranjeevi Yarra, "Can the decoded text from automatic speech recognition effectively detect spoken grammar errors?", accepted in SLATE 2023. Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Yamini BK, Nalini Atchayaram, Ravi Yadav, P. K. Ghosh, "Classification of multi-class vowels and fricatives from patients having Amyotrophic Lateral Sclerosis with varied levels of dysarthria severity", accepted in INTERSPEECH 2023. Shelly Jain, Priyanshi Pal, Anil Vuppala, P. K. Ghosh, Chiranjeevi Yarra, "An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations", accepted in INTERSPEECH 2023. Varun Belagali, P. K. Ghosh, Achuth Rao, "Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels", accepted in INTERSPEECH 2023. Mohammad Shaique Solanki, Ashutosh Bharadwaj, Jeevan Kylash, P. K. Ghosh, "Do Vocal Breath Sounds Encode Gender Cues for Automatic Gender Classification?", accepted in INTERSPEECH 2023. Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit, P. K. Ghosh, "A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence", accepted in INTERSPEECH 2023. Jesuraja Bandekar, Sathvik Udupa, P. K. Ghosh, "Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion", accepted in INTERSPEECH 2023. Tanuka Bhattacharjee, Anjali Jayakumar, Yamini BK, Nalini Atchayaram, Ravi Yadav, P. K. Ghosh, "Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis", accepted in INTERSPEECH 2023. Dharanalakota Veerababu, P. K. Ghosh, "SOLUTION OF 1-D HELMHOLTZ EQUATION USING ARTIFICIAL NEURAL NETWORKS", accepted in publication in the 29th International Congress on Sound and Vibration (ICSV29). Tanuka Bhattacharjee, Yamini Belur, Nalini Atchayaram, Ravi Yadav, P. K. Ghosh, "EXPLORING THE ROLE OF FRICATIVES IN CLASSIFYING HEALTHY SUBJECTS AND PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS AND PARKINSON'S DISEASE", accepted in ICASSP 2023. Tanuka Bhattacharjee, Chowdam Venkata Thirumala Kumar, Yamini Belur, lini Atchayaram, Ravi Yadav, P. K. Ghosh, "STATIC AND DYNAMIC SOURCE AND FILTER CUES FOR CLASSIFICATION OF AMYOTROPHIC LATERAL SCLEROSIS PATIENTS AND HEALTHY SUBJECTS", accepted in ICASSP 2023. Sathvik Udupa, Siddarth C, P. K. Ghosh, "IMPROVED ACOUSTIC-TO-ARTICULATORY INVERSION USING REPRESENTATIONS FROM PRETRAINED SELF-SUPERVISED LEARNING MODELS", accepted in ICASSP 2023. Sathvik Udupa, P. K. Ghosh, "REAL-TIME MRI VIDEO SYNTHESIS FROM TIME ALIGNED PHONEMES WITH SEQUENCE-TO-SEQUENCE NETWORKS", accepted in ICASSP 2023.
Priyanshi Pal, Chiranjeevi Yarra, P. K. Ghosh, "voisTUTOR 2.0: A speech corpus with phonetic transcription for pronunciation evaluation of Indian L2 English learners", accepted in O-COCOSDA 2022. Abinay Reddy Naini, Achuth Rao M V, P. K. Ghosh, "Whisper to Neutral Mapping Using i-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification", accepted in NCC 2022. Aravind Illa, Aanish Nair, P. K. Ghosh, , "The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis", accepted in ICASSP 2022. [pdf] Anwesha Roy, Varun Belagali, P. K. Ghosh, , "An Error Correction Scheme for improved Air-Tissue boundary in real-time Mri video for speech production", accepted in ICASSP 2022. [pdf] Siddharth Subramani, Achuth Rao M V, Anwesha Roy, Prasanna Suresh Hegde, P. K. Ghosh, , "Segnet-based Deep Representation Learning for Dysphagia classification", accepted in ICASSP 2022. [pdf] Abinay Reddy Naini, Bhavuk Singhal, P. K. Ghosh, , "Dual Attention Pooling Network for recording device classification using Neutral and Whispered Speech", accepted in ICASSP 2022. [pdf]
Karthik G.R., P. K. Ghosh, , "Towards a Calibration-free Approach to Deep Learning based Single-incidence Inverse Scattering", accepted in 2021 PhotonIcs & Electromagnetics Research Symposium (PIERS). [pdf] Chiranjeevi Yarra , P. K. Ghosh, , "Automatic syllable stress detection under non-parallel label and data condition", accepted in Speech Communication, Elsevier. Karthik G.R., P. K. Ghosh, , "A Scalable Deep Learning Model for Arbitrary Transmitter Configurations in Inverse Scattering", accepted in 2021 IEEE Antennas and Propagation Society International Symposium (APS-URSI). [pdf] Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, P. K. Ghosh, , "Convolutional Dense Neural Network based Spirometry Variable FVC Prediction using Sustained Phonations", accepted in MLSP 2021. [pdf] Abhayjeet Singh, Achuth Rao M V, Rakesh Vaideeswaran, Chiranjeevi Yarra, P. K. Ghosh, , "A Study on native American English speech recognition by Indian listeners with varying word familiarity level", accepted in Oriental COCOSDA 2021. [pdf] [slides] [presentation] Tilak Purohit, Tejas Umesh, Shankar Narayanan, Minulakshmi S, P. K. Ghosh, , "SPIRE VCV: An acoustic-articulatory corpus with three different speaking rates", accepted in Oriental COCOSDA 2021. [pdf] Bhavuk Singhal, Abinay Reddy Naini, P. K. Ghosh, , "WSPIRE: A parallel multi-device corpus in neutral and whisper speech", accepted in Oriental COCOSDA 2021. [pdf] [slides] [presentation] Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy , P. K. Ghosh, , "Role of breath phase and breath boundaries for the classification between asthmatic and healthy subjects", accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021. Drishti Ramesh Megalmani, Shailesh B G, Achuth Rao M V, Satish S Jeevannava, P. K. Ghosh, , "Unsegmented Heart Sound Classification Using Hybrid CNN-LSTM Neural Networks", accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021. Achuth Rao M V, Shailesh B G, Drishti Ramesh Megalmani, Satish S Jeevannava, P. K. Ghosh, , "Noise Robust Detection of Fundamental Heart Sound using Parametric Mixture Gaussian and Dynamic Programming", accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021. Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, P. K. Ghosh,, "Web Interface for estimating articulatory movements in speech production from acoustics and text,", accepted in Show and Tell, Interspeech 2021, Brno, Czech Republic. [pdf] [poster] Manthan Sharma, Navaneetha Gaddam, Tejas Umesh, Aditya Murthy, P. K. Ghosh,, "A Comparative Study Of Different EMG Features For Acoustic-to-EMG Mapping", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] [presentation] Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh, , "Source and Vocal Tract Cues for Speech-based Classification of Patients with Parkinson’s Disease and Healthy Subjects", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] [presentation] Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, P. K. Ghosh, , "Estimating articulatory movements in speech production with transformer networks", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] [presentation] Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan K M, Shreya Khare, Vinit Unni, saurabh vyas, Akash Rajapuria, Chiranjeevi Yarra, Ashish Mittal, P. K. Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati , Karthik Sankaranarayanan, , "Multilingual and code-switching ASR challenges for low resource Indian languages", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] [presentation] Chiranjeevi Yarra , P. K. Ghosh, , "Noise robust pitch stylization using minimum mean absolute error criterion", accepted in Interspeech 2021, Brno, Czech Republic. [pdf] [slides] [presentation] Ananya Muguli, Lacelot Pinto, Nirmala R, Neeraj Sharma, Prashant Krishnan, P. K. Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji , Viral N,a , "DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics", accepted in Interspeech 2021, Brno, Czech Republic. Pavan Kumar J, Chiranjeevi Yarra, P. K. Ghosh,, "DNN Based Phrase Boundary Detection Using Knowledge-Based Features and Feature Representations from CNN", accepted in 2021 National Conference on Communications(NCC). Aparna Srinivasan, Diviya Singh, Chiranjeevi Yarra, Aravind Illa, P. K. Ghosh,, "A robust speaking rate estimator using a CNN-BLSTM network", accepted in Circuits, Systems, and Signal Processing, 2021. Shankar Narayanan, Arvind Illa, Nayan An,, Ganesh Sinisetty, Karthick Narayanan, P. K. Ghosh,, "“An Acoustic Investigation on the Effect of Speaking Rate on Vowel Space and Coarticulation in Toda VCV Sequences”", accepted in Sādhanā, 2021. Achuth Rao M V, Yamini B K, Ketan J, Preetie Shetty A, Pal P, Shivashankar N, P. K. Ghosh. , "Automatic Classification of Healthy Subjects and Patients With Essential Vocal Tremor Using Probabilistic Source-Filter Model Based Noise Robust Pitch Estimation", accepted in Journal of Voice, 2021. [pdf] Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh,, "Effect of noise and model complexity on detection of Amyotrophic Lateral Sclerosis and Parkinson’s disease using pitch and MFCC", accepted in ICASSP 2021. [pdf] [slides] [poster] Tilak Purohit, Achuth Rao M V, P. K. Ghosh,, "Impact of speaking rate on the source filter Interaction in speech: a study", accepted in ICASSP 2021. [pdf] [poster] Sarthak Kumar Maharana, Aravind Illa, Renuka Mannem, Yamini Belur, Preetie Shetty, Veeramani Preethish Kumar, Seena Vengalil, Kiran Polavarapu, Nalini Atchayaram, P. K. Ghosh, , "Acoustic-to-articulatory inversion for dysarthric speech by using cross-corpus acoustic-articulatory data", accepted in ICASSP 2021. [pdf] [poster] [codes] Renuka Mannem , P. K. Ghosh. , "A deep neural network based correction scheme for improved air-tissue boundary prediction in real-time magnetic resonance imaging video", accepted in Computer Speech and Language, March 2021, Volume 66. [pdf]
Anusuya, P. K., Aravind Illa, Prasanta Kumar Ghosh , "A Data-Driven Phoneme-Specific Analysis of Articulatory Importance", accepted in International Seminar on Speech Production (ISSP) 2020. [pdf] [poster] Aravind Illa, Prasanta Kumar Ghosh. , "Complexity-Performance Trade-off In Acoustic-to-Articulatory Inversion", accepted in International Seminar on Speech Production (ISSP) 2020. [pdf] [poster] Chiranjeevi Yarra, Kausthubha N K , P. K. Ghosh. , "SPIRE-ABC: An online tool for acoustic-unit boundary correction (ABC) via crowdsourcing", accepted in Oriental COCOSDA 2020. [pdf] [slides] [presentation] Tilak Purohit , P. K. Ghosh. , "An investigation of the virtual lip trajectories during the production of bilabial stops and nasal at different speaking rates", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Aravind Illa , P. K. Ghosh. , "Speaker conditioned acoustic-to-articulatory inversion using x-vectors", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Jhansi Mallela, Aravind Illa, Yamini Belur, Nalini Atchayaram, Ravi yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh. , "Raw speech waveform based classification of patients with ALS, Parkinson’s Disease and healthy controls using CNN-BLSTM", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Renuka Mannem, Himajyothi Rajamahendravarapu, Aravind Illa , P. K. Ghosh. , "Speech rate task-specific representation learning from acoustic-articulatory data", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [codes] [presentation] Renuka Mannem, Navaneetha Gaddam , P. K. Ghosh. , "Air-tissue boundary segmentation in real time Magnetic Resonance Imaging video using 3-D convolutional neural network", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [codes] [presentation] Divya Degala, Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prakash T K , P. K. Ghosh. , "Automatic Glottis Detection and Segmentation in Stroboscopic videos using Convolutional Networks", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Abhayjeet Singh, Aravind Illa , P. K. Ghosh. , "Attention and Encoder-Decoder based models for transforming articulatory movements at different speaking rates", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Abinay Reddy Naini, Satyapriya Malla , P. K. Ghosh. , "Whisper activity detection using CNN-LSTM based attention pooling network trained for a speaker identification task", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Neeraj Sharma, Prashant Krishnan, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Nirmala R, P. K. Ghosh , Sriram Ganapathy,, "A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis", accepted in Interspeech 2020, Shanghai, China. [pdf] [slides] [presentation] Suhas BN, Jhansi Mallela, Aravind Illa, Yamini BK, Nalini Atchayaram, Ravi Yadav, Dipanjan, Prasanta Kumar Ghosh,, "Speech task based automatic classification of ALS and Parkinson’s Disease and their severity using log mel spectrograms", accepted in SPCOM2020. [pdf] [slides] Renuka Mannem, Himajyothi Rajamahendravarapu, Aravind Illa, Prasanta Kumar Ghosh,, "Speech rate estimation using representations learned from speech with convolutional neural network", accepted in SPCOM2020. [pdf] [slides] Jhansi Mallela, Aravind Illa, Suhas B N, Sathvik Udupa, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh,, "Voice based classification of patients with Amyotrophic Lateral Sclerosis, Parkinson's Disease and Healthy Controls with Cnn-lstm Using transfer learning", accepted in ICASSP 2020. [pdf] [slides] [presentation] Avni Rajpal, Achuth Rao MV, Chiranjeevi Yarra, Ritu Aggarwal, Prasanta Kumar Ghosh,, "Pseudo Likelihood Correction Technique For Low Resource Accented ASR", accepted in ICASSP 2020. [pdf] [slides] Siddharth Subramani, Achuth Rao M V, Divya Giridhar, Prasanna Suresh Hegde, Prasanta Kumar Ghosh. , "Automatic Classification of Volumes of Water using Swallow Sounds From Cervical Auscultation", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] [presentation] Shivani Yadav, Merugu Keerthana, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh. , "Analysis of Acoustic Features for Speech Sound Based Classification of Asthmatic and Healthy Subjects", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] [presentation] Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh. , "Automatic Identification of Speakers From Head Gestures in a Narration", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] [presentation] Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh. , "A Comparative Study of estimating articulatory movements from phoneme sequences and acoustic features", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020. [pdf] [slides] [presentation] Aravind Illa, Prasanta Kumar Ghosh. , "Closed-set speaker conditioned acoustic-to-articulatory inversion using bi-directional long short term memory network", accepted in The Journal of the Acoustical Society of America.Vol.147, Issue 2, February 2020, EL171–EL176. [pdf] Varun Belagali, Achuth Rao M V, Pebbili Gopikishore, Rahul Krishnamurthy, P. K. Ghosh. , "Two step convolutional neural network for automatic glottis localization and segmentation in stroboscopic videos", accepted in Biomed. Opt. Express 11, 4695-4713 2020. [pdf] Achuth Rao MV , P. K. Ghosh. , "SFNet: A Computationally Efficient Source Filter Model Based Neural Speech Synthesis", accepted in IEEE Signal Processing Letters, Volume 27, July 2020, pp 1170–1174. [pdf]
Divya Giridhar, Achuth Rao M V, Prasanna Suresh Hedge, P. K. Ghosh. , "Analysis of swallow sounds of healthy controls for different volumes of water", accepted in International Conference on Engineering in Medicine and Life Sciences, PSG College of Technology, Coimbatore, India, December 19-21, 2019. [pdf] Chiranjeevi Yarra, Ritu Aggarwal, Avni Rajpal, P. K. Ghosh. , "Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations", accepted in Oriental COCOSDA 2019. [pdf] [slides] Chiranjeevi Yarra, Aparna Srinivasan, Chandana Srinivasa, Ritu Aggarwal, P. K. Ghosh. , "voisTUTOR corpus: A speech corpus of Indian L2 English learners for pronunciation assessment", accepted in Oriental COCOSDA 2019. [pdf] Shankar Narayanan, Aravind Illa, Nayan Anand, Ganesh Sinisetty, Karthick Narayanan, P. K. Ghosh. , "An acoustic-articulatory database of VCV sequences and words in Toda at different speaking rates", accepted in Oriental COCOSDA 2019. [pdf] [slides] Chiranjeevi Yarra, P. K. Ghosh. , "voisTUTOR: Virtual Operator for Interactive Spoken English TUTORing", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [slides] Chiranjeevi Yarra, Manoj Kumar Ramanathi, P. K. Ghosh. , "Comparison of automatic syllable stress detection quality with time-aligned boundaries and context dependencies", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [slides] Aparna Srinivasan, Chiranjeevi Yarra, P. K. Ghosh. , "Automatic assessment of pronunciation and its dependent factors by exploring their interdependencies using DNN and LSTM", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [slides] Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Anurag Das, P. K. Ghosh. , "Noise robust goodness of pronunciation measures using teacher's utterance", accepted in 8th Workshop on Speech and Language Technology in Education, 2019. [pdf] [poster] Suhas BN, Deep Patel, Nithin Rao, Yamini Belur, Pradeep Reddy, Nalini Atchayaram, Ravi Yadav, Dipanjan Gope, P. K. Ghosh. , "Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Manoj Kumar Ramanathi, Chiranjeevi Yarra, P. K. Ghosh. , "ASR inspired syllable stress detection for pronunciation evaluation without using a supervised classifier and syllable level features", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Abinay Reddy Naini, Achuth Rao MV, P. K. Ghosh. , "Whisper to neutral mapping using cosine similarity maximization in i-vector space for speaker verification", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Renuka Mannem, Jhansi Mallela, Aravind Illa, P. K. Ghosh. , "Acoustic and articulatory feature based speech rate estimation using a convolutional dense neural network", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Atreyee Saha, Chiranjeevi Yarra, P. K. Ghosh. , "Low resource automatic intonation classification using gated recurrent unit (GRU) networks pre-trained with synthesized pitch patterns", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra , P. K. Ghosh. , "An improved goodness of pronunciation (GoP) measure for pronunciation evaluation with DNN-HMM system considering HMM transition probabilities", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Aravind Illa, P. K. Ghosh. , "An investigation on speaker specific articulatory synthesis with speaker independent articulatory inversion", accepted in Interspeech 2019, Graz, Austria. [pdf] [slides] Chiranjeevi Yarra, Aparna Srinivasan, Sravani Gottimukkala, P. K. Ghosh. , "SPIRE-fluent: A self-learning app for tutoring oral fluency to second language English learners", accepted in Interspeech 2019, Graz, Austria. [pdf] [poster] Achuth Rao M V, P. K. Ghosh, Tanuka Bhattacharjee, Anirban Dutta Choudhury. , "Trend Statistics Network and Channel invariant EEG Network for sleep arousal study", accepted in The 41th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’19), Berlin, Germany. [pdf] [slides] Aravind Illa, Prasanta Kumar Ghosh. , "Representation learning using convolution neural network for acoustic-to-articulatory inversion", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5931-5935. [pdf] [slides] Gokul Srinivasan, Aravind Illa, Prasanta Kumar Ghosh. , "A study on robustness of articulatory features for automatic speech recognition of neutral and whispered speech", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5936-5940. [pdf] [slides] Valliappan CA, Avinash Kumar, Renuka Mannem, Karthik Girija Ramesan, Prasanta Kumar Ghosh. , "An improved air tissue boundary segmentation technique for real time magnetic resonance imaging video using segnet", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5921-5925. [pdf] Renuka Mannem, Prasanta Kumar Ghosh. , "Air-tissue boundary segmentation in real time magnetic resonance imaging video using a convolutional encoder-decoder network", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5941-5945. [pdf] [slides] Abinay Reddy Naini, Achuth Rao M V , Prasanta Kumar Ghosh. , "Formant-gaps features for speaker verification using whispered speech", accepted in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK,2019, Pages: 6231-6235. [pdf] [poster] Renuka Mannem, Valliappan C A, P. K. Ghosh, "A SegNet Based Image Enhancement Technique for Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video", accepted in National Conference on Communications (NCC) 2019, Bangalore, India, Pages: 1-6. [pdf] [slides] [poster] Chiranjeevi Yarra, Supriya Nagesh, Prasanta Kumar Ghosh. , "Noise robust speech rate estimation using SNR dependent sub-band selection and peak detection strategy", accepted in The Journal of the Acoustical Society of America. [pdf] Aravind Illa, Prasanta Kumar Ghosh. , "The impact of speaking rate on acoustic-to-articulatory inversion", accepted in Computer Speech & Language. [pdf] Achuth Rao MV, Prakhar Gupta, Prasanta Kumar Ghosh. , "P- and T-wave delineation in ECG signals using parametric mixture Gaussian and dynamic programming", accepted in Biomedical Signal Processing and Control, May 2019, Volume 51, pages: 328-337. [pdf] Anurendra Kumar, Tanaya Guha , Prasanta Kumar Ghosh. , "Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing", accepted in IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 27, no. 5 (2019): 919-931. [pdf] Achuth Rao M V, Prasanta Kumar Ghosh. , "Glottal inverse filtering using probabilistic weighted linear prediction", accepted in Speech and Language Processing (TASLP), Vol. 27, Issue 1, (2019): 114-124. [pdf]
Aravind Illa, Prasanta Kumar Ghosh. , "Inferring speaker identity from articulatory motion during speech", accepted in Machine Learning in Speech and Language Processing Workshop (MLSLP) 2018. [pdf] [slides] Aravind Illa, Prasanta Kumar Ghosh. , "Low resource acoustic-to-articulatory inversion using bi-directional long short term memory", accepted in Interspeech, Hyderabad, India,2018, Page(s): 3122-3126. [pdf] [codes] G.Nisha Meenakshi, Prasanta Kumar Ghosh. , "Whispered speech to neutral speech conversion using bidirectional LSTMs", accepted in Interspeech, Hyderabad, India,2018, Page(s): 491-495. [pdf] [slides] Pavan Karjol, Prasanta Kumar Ghosh. , "Speech enhancement using deep mixture of experts based on hard expectation maximization", accepted in Interspeech, Hyderabad, India,2018, Page(s): 3254-3258. [pdf] [poster] Abinay Reddy N, Achuth Rao M V, G. Nisha Meenakshi , Prasanta Kumar Ghosh. , "Reconstructing neutral speech from tracheoesophageal speech", accepted in Interspeech, Hyderabad, India,2018, Page(s): 1541-1545. [pdf] [poster] Astha Singh, G. Nisha Meenakshi, Prasanta Kumar Ghosh. , "Relating articulatory motions in different speaking rates", accepted in Interspeech, Hyderabad, India,2018, Page(s): 2992-2996. [pdf] [poster] Chandana S, Chiranjeevi Yarra, Ritu Aggarwal, Sanjeev Kumar Mittal, Kausthubha N K,Raseena K T, Astha Singh , Prasanta Kumar Ghosh. , "Automatic visual augmentation for concatenation based synthesized articulatory videos from real-time MRI data for spoken language training", accepted in Interspeech, Hyderabad, India,2018, Page(s): 3127-3131. [pdf] Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadarshini, Prasanta Kumar Ghosh. , "Automatic glottis localization and segmentation in stroboscopic videos using deep neural network", accepted in Interspeech, Hyderabad, India,2018, Page(s): 3007-3011. [pdf] Girija Ramesan Karthik, Parth Suresh , Prasanta Kumar Ghosh. , "Subband weighting for binaural speech source localization", accepted in [pdf][poster] Valliappan CA, Renuka Mannem, Prasanta Kumar Ghosh. , "Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video using Semantic Segmentation with Fully Convolutional Networks", accepted in [pdf][slides] Anand P A, Chiranjeevi Yarra, Kausthubha N K, Prasanta Kumar Ghosh, , "Intonation tutor by SPIRE (In-SPIRE): An online tool for an automatic feedback to the second language learners in learning intonation", accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 546-547. [pdf] Chiranjeevi Yarra, Anand P A, Kausthubha N K, Prasanta Kumar Ghosh, , "SPIRE-SST: An automatic web-based self-learning tool for syllable stress tutoring (SST) to the second language learners", accepted in In Proc. Interspeech, Hyderabad, India, 2018, Page(s): 2390-2391. [pdf] Valliappan C A, Anurag Das, Prasanta Kumar Ghosh, , "Classification of Story-Telling and Poem Recitation Using Head Gesture of the Talker", accepted in In Proc. International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2018, Page(s): 36-40. [pdf] [slides] Pavan Karjol, Prasanta Kumar Ghosh, , "Broad Phoneme Class Specific Deep Neural Network Based Speech Enhancement", accepted in in Proc. International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2018, Page(s): 372-376. [pdf] [slides] Shivani Yadav, Kausthubha N K, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, , "Comparison of Cough, Wheeze and Sustained Phonations for Automatic Classification between Healthy Subjects and Asthmatic Patients", accepted in in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’18), Honolulu, HI, USA,2018, Page(s): 1400-1403. [pdf] [slides] Raseena K T, Prasanta Kumar Ghosh, , "A Maximum Likelihood Formulation to Exploit Heart Rate Variability for Robust Heart Rate Estimation from Facial Video", accepted in in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’18), Honolulu, HI, USA, 2018, Page(s): 5191-5194. [pdf] [slides] Urvish Desai, Chiranjeevi Yarra, Prasanta Kumar Ghosh, , "CONCATENATIVE ARTICULATORY VIDEO SYNTHESIS USING REAL-TIME MRI DATA FOR SPOKEN LANGUAGE TRAINING", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 4999-5003. [pdf] [slides] Aravind Illa, Deep Patel, Yamini B K, Meera S S, Shivashankar N, Preethish Kumar Veeramani, Seena Vengalil, Kiran Polavarapu, Saraswati Nashi, Atchayaram Nalini, Prasanta Kumar Ghosh, , "COMPARISON OF SPEECH TASKS FOR AUTOMATIC CLASSIFICATION OF PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS AND HEALTHY SUBJECTS", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 6014-6018. [pdf] [poster] Anurendra Kumar, Tanaya Guha, Prasanta Kumar Ghosh, , "A DYNAMIC LATENT VARIABLE MODEL FOR SOURCE SEPARATION", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 2871-2875. [pdf] [poster] Advait Koparkar, Prasanta Kumar Ghosh, , "A SUPERVISED AIR-TISSUE BOUNDARY SEGMENTATION TECHNIQUE IN REAL-TIME MAGNETIC RESONANCE IMAGING VIDEO USING A NOVEL MEASURE OF CONTRAST AND DYNAMIC PROGRAMMING", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5004-5008. [pdf] Karthik Girija Ramesan, Prasanta Kumar Ghosh, , "BINAURAL SPEECH SOURCE LOCALIZATION USING TEMPLATE MATCHING OF INTERAURAL TIME DIFFERENCE PATTERNS", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5164-5168. [pdf] [poster] Pavan Karjol, Ajay Kumar M, Prasanta Kumar Ghosh, , "SPEECH ENHANCEMENT USING MULTIPLE DEEP NEURAL NETWORKS", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5049-5052. [pdf] Chiranjeevi Yarra, Prasanta Kumar Ghosh, , "Automatic intonation classification using temporal patterns in utterance-level pitch contour and perceptually motivated pitch transformation", accepted in The Journal of the Acoustical Society of America 144.5 (2018): EL471-EL476. [pdf] Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh, , "A frame selective dynamic programming approach for noise robust pitch estimation", accepted in The Journal of the Acoustical Society of America, 143, no. 4 (2018): 2289--2300. [pdf] Nisha Meenakshi, Prasanta Kumar Ghosh, , "Reconstruction of Articulatory Movements During Neutral Speech From Those During Whispered Speech", accepted in Journal of Acoustical Society of America, June 2018, 143 (6), page 3352--3364. [pdf] Achuth Rao M V, Prasanta Kumar Ghosh, , "PSFM - A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection", accepted in IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 26, no. 9 (2018): 1645--1657. [pdf] Ratna Jyothi Kakumanu, Ajay Kumar Nair, Rahul Venugopal, Arun Sasidharan, Prasanta Kumar Ghosh, John P. John, Seema Mehrotra, Ravindra Panth, Bindu M. Kutty, , "Dissociating meditation proficiency and experience dependent EEG changes during traditional Vipassana meditation practice", accepted in Biological psychology 135 (2018): 65-75. [pdf] Achuth Rao M V, Shiny Victory, Prasanta Kumar Ghosh, , "Effect of source filter interaction on isolated vowel-consonant-vowel perception", accepted in The Journal of the Acoustical Society of America, 144.2 (2018): EL95--EL99. [pdf]
Vijitha Periyasamy, Manojit Pramanik, Prasanta Kumar Ghosh, , "Review on heart-rate estimation from photoplethysmography and accelerometer signals during physical exercise", accepted in Journal of the Indian Institute of Science, 97(3), 313-324 (2017). [pdf] Pattem Ashok Kumar, Aravind Illa, Amber Afshan, Prasanta Kumar Ghosh, , "Optimal sensor placement in electromagnetic articulography recording for speech production study", accepted in Computer Speech & Language 47 (2018): 157-174. [pdf] [slides] Sahil Bansal, Anindita Ghosh, Chandra Sekhar Seelamantula, Gurunath Gurrala, Prasanta Kumar Ghosh, , "Adaptive Frequency Estimation Approach Using Iterative DESA with RDFT-Based Filter", accepted in In Proc. IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC), 2017, pp. 1-6. [pdf] Nazreen P. M., A. G. Ramakrishnan, Prasanta Kumar Ghosh, , "A Joint Enhancement-Decoding Formulation for Noise Robust Phoneme Recognition", accepted in In Proc. INDICON-2017, IIT Roorkee, pp. 1-6. [pdf] Samik Sadhu, Prasanta Kumar Ghosh, , "Low Resource Point Process Models for Keyword Spotting Using Unsupervised Online Learning", accepted in In Signal Processing Conference (EUSIPCO), 2017 25th European (pp. 538-542). IEEE. [pdf] [poster] Achuth Rao M V, Prasanta Kumar Ghosh, , "Pitch Prediction from Mel-generalized Cepstrum - a Computationally Efficient Pitch Modeling Approach for Speech Synthesis", accepted in In Signal Processing Conference (EUSIPCO), 2017 25th European (pp. 1629-1633). IEEE. [pdf] [poster] Achuth Rao M V, Kausthubha N K, Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh, , "Automatic Prediction of Spirometry Readings from Cough and Wheeze for Monitoring of Asthma Severity", accepted in In Signal Processing Conference (EUSIPCO), 2017 25th European (pp. 41-45). IEEE. [pdf] [poster] Akshay Kalkunte Suresh, Srinivasa Raghavan K M, Prasanta Kumar Ghosh, , "Phoneme state posteriorgram features for speech based automatic classification of speakers in cold and healthy condition", accepted in In Proc. Interspeech 2017, 3462-3466. [pdf] [poster] Gaurav Fotedar, Prasanta Kumar Ghosh, , "An information theoretic analysis of the temporal synchrony between head gestures and prosodic patterns in spontaneous speech", accepted in In Proc. Interspeech 2017, 157-161. [pdf] [poster] Girija Ramesan Karthik, Prasanta Kumar Ghosh, , "Subband selection for binaural speech source localization", accepted in In Proc. Interspeech 2017, Stockholm, Sweden, 1929-1933. [pdf] [poster] G. Nisha Meenakshi, Prasanta Kumar Ghosh, , "A robust Voiced/Unvoiced phoneme classification from whispered speech using the 'color' of whispered phonemes and Deep Neural Network", accepted in In Proc. Interspeech 2017, 503-507. [pdf] [poster] Abhishek Narwekar, Prasanta Kumar Ghosh, , "PRAV: A Phonetically Rich Audio Visual Corpus", accepted in In Proc. Interspeech 2017, 3747-3751. [pdf] [poster] Achuth Rao M V, Shivani Yadav, Prasanta Kumar Ghosh, , "A dual source-filter model of snore audio for snorer group classification", accepted in In Proc. Interspeech 2017, Stockholm, Sweden, 3502-3506. [pdf] [poster] Srinivasa Raghavan, Nisha Meenakshi, Sanjeev Kumar Mittal, Chiranjeevi Yarra, Anupam Mandal, K R Prasanna Kumar, Prasanta Kumar Ghosh, , "A Comparative Study on the Effect of Different Codecs on Speech Recognition Accuracy Using Various Acoustic Modeling Techniques", accepted in In Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] [poster] Pradyumna Suresha, Supriya Nagesh, Priyadarshini Savan Roshan, Aditya Gaonkar P, Nisha Meenakshi, Prasanta Kumar Ghosh, , "A High Resolution ENF Based MultiStage Classifier for Location Forensics of Media Recordings", accepted in In Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] Mekhala H S, Yamini B K, Ketan J, Pal P, Shivashankar N, Prasanta Kumar Ghosh, , "Classification of Healthy Subjects and Patients with Essential Vocal Tremor Using Empirical Mode Decomposition of High Resolution Pitch Contour", accepted in In Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] [poster] Achuth Rao MV, Prasanta Kumar Ghosh, , "Pitch Prediction from Mel-Frequency Cepstral Coefficients Using Sparse Spectrum Recovery", accepted in In Communications (NCC), 2017 Twenty-third National Conference on (pp. 1-6). IEEE. [pdf] [poster] Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh, , "An automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, (pp. 5845-5849). [pdf] [poster] Aravind Illa, Nisha Meenakshi G, Prasanta Kumar Ghosh, , "A comparative study of acoustic-to-articulatory inversion for neutral and whispered speech", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 (pp. 5075-5079). [pdf] [poster] Rao, Nithin K, Meenakshi, Nisha, Prasanta Kumar Ghosh, , "Spectrogram enhancement using multiple window Savitzky Golay (MWSG) filter for robust bird sound detection", accepted in IEEE/ACM Transactions on Audio, Speech, and Language Processing 25.6 (2017): 1183-1192. [pdf] [slides] [poster]
Fotedar, Gaurav, Aditya Gaonkar, Saikat Chatterjee, Prasanta Kumar Ghosh, , "Automatic Recognition of Social Roles Using Long Term Role Transitions in Small Group Interactions", accepted in In Proc. Interspeech 2016 (2016): 2065-2069. [pdf] [poster] Nazreen P. M., A. G. Ramakrishnan, Prasanta Kumar Ghosh, , "A Class-Specific Speech Enhancement for Phoneme Recognition: A Dictionary Learning Approach", accepted in In Proc. Interspeech 2016} (2016): 3728-3732. [pdf] [poster] Aditya Gaonkar P, Bhuthesh R, Dipanjan Gope, Prasanta Kumar Ghosh, , "Robust Real-Time Pulse Rate Estimation From Facial Video Using Sparse Spectral Peak Tracking", accepted in In Proc. International Conference onSignal Processing and Communications (SPCOM), 2016, pp. 1-5. IEEE, 2016. [pdf] [poster] Abhishek Narwekar, Prasanta Kumar Ghosh, , "A Comparative Study of Articulatory Features From Facial Video and Acoustic-To-Articulatory Inversion for Phonetic Discrimination", accepted in In Proc. International Conference on Signal Processing and Communications (SPCOM), pp. 1-5. IEEE, 2016. [pdf] [poster] Nagesh, Supriya, Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh, , "A robust speech rate estimation based on the activation profile from the selected acoustic unit dictionary", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5400-5404. 2016. . [pdf] Afshan, Amber, Prasanta Kumar Ghosh, , "Better acoustic normalization in subject independent acoustic-to-articulatory inversion: Benefit to recognition", accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5395-5399. 2016. [pdf] Prasad, Abhay, Prasanta Kumar Ghosh, , "Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition", accepted in Computer Speech & Language 39 (2016): 108-128. [pdf] Yarra, Chiranjeevi, Om D. Deshmukh, Prasanta Kumar Ghosh, "A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection", accepted in Speech Communication 78 (2016): 62-71. [pdf] Prathosh A. P., Sujith P., Ramakrishnan A. G., Prasanta Kumar Ghosh, "Cumulative Impulse Strength for Epoch Extraction", accepted in IEEE Signal Processing Letters, 23(4) (2016), 424-428. [pdf] Ming Li, Jangwon Kim, Adam Lammert, P. K. Ghosh, Vikram Ramanarayanan, Shrikanth Narayanan, "Speaker verification based on the fusion of speech acoustics and inverted articulatory signals", accepted in Computer Speech and Language,2016. [pdf]
Prasad, Abhay, Prasanta Kumar Ghosh, "Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers", accepted in Proc. of INTERSPEECH. ISCA. Dresden, Germany: ISCA (2015): 884-888. [pdf] Parida, Satyabrata, Ashok Kumar Pattem, Prasanta Kumar Ghosh, "Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data", accepted in Sixteenth Annual Conference of the International Speech Communication Association. 2015. [pdf] Meenakshi, G. Nisha, Prasanta Kumar Ghosh, "A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages", accepted in Sixteenth Annual Conference of the International Speech Communication Association. 2015. [pdf] Sujith P., Prathosh A. P., A. G. Ramakrishnan, Prasanta Kumar Ghosh, "An Error Correction Scheme for GCI Detection Algorithms Using Pitch Smoothness Criterion", accepted in Sixteenth Annual Conference of the International Speech Communication Association. 2015. [pdf] Adria Casamitjana, Martin Sundin, P. K. Ghosh, Saikat Chatterjee, "Bayesian learning for time-varying linear prediction of speech", accepted in Proc. EUSIPCO, Aug 31- Sep 4 2015, pp 325--329. [pdf] A. Prasad, V. Periyasamy, P. K. Ghosh,, "Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification", accepted in Proc. ICASSP, 19-24 April, 2015, pp 4265--4269. [pdf] [poster] Nisha Meenakshi, P. K. Ghosh, "Automatic Gender Classification Using the Mel Frequency Cepstrum of Neutral and Whispered Speech: a Comparative Study", accepted in Proc. 21st National Conference on Communications, 27 Feb - 1 March, 2015, pp 1--6. [pdf] Murthy, Navaneet K. Lakshminarasimha, Pavan C. Madhusudana, Pradyumna Suresha, Vijitha Periyasamy, Prasanta Kumar Ghosh, "Multiple spectral peak tracking for heart rate monitoring from photoplethysmography signal during intensive physical exercise", accepted in IEEE Signal Processing Letters 22.12 (2015): 2391-2395. [pdf] Nisha Meenakshi, P. K. Ghosh, "Robust whisper activity detection using long-term log energy variation of sub-band signal", accepted in IEEE Signal Processing Letters, Volume 22, Issue 11, June 2015, pp 1859-1863. [pdf] Afsan A., P. K. Ghosh, "Improved subject-independent acoustic-to-articulatory inversion", accepted in Speech Communication, Elsevier. Vol. 66,2015,January,1-16. [pdf] [slides]
Prasad Sudhakar, P. K. Ghosh, "Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: Benefit to speech recognition", accepted in InterSpeech, 2014. [pdf] Abhay Prasad, P. K. Ghosh, Shrikanth Narayanan, "Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection", accepted in InterSpeech, 2014. [pdf] Sujith P , P. K. Ghosh, "Missing samples estimation in electromagnetic articulography data using equality constrained Kalman smoother", accepted in InterSpeech, 2014. [pdf] [codes] Nisha Meenakshi, Chiranjeevi Yarra, B. K. Yamini, P. K. Ghosh, "Comparison of speech quality with and without sensors in electromagnetic articulograph AG 501 recording", accepted in InterSpeech, 2014. [pdf] Sujith P., P. K. Ghosh, "Maximum a-posteriori estimation of missing samples with continuity constraint in electromagnetic articulography data", accepted in ICASSP 2014. [pdf] Abhijith Mundanad Narayanan, P. K. Ghosh, K. Rajgopal, "Multi-Pitch Tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform", accepted in ICASSP 2014. [pdf] Prasad Sudhakar, Laurent Jacques, P. K. Ghosh, "A sparse smoothing approach for Gaussian mixture model based acoustic-to-articulatory inversion", accepted in ICASSP 2014. [pdf] Andreas Tsiartas, P. K. Ghosh, Panayiotis Georgiou, Shrikanth S. Narayanan, "Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design", accepted in ICASSP 2014. [pdf] Shrikanth Narayanan, Asterios Toutios, Vikram Ramanarayanan, Adam Lammert, Jangwon Kim, Sungbok Lee, Krishna Nayak, Yoon-Chul Kim, Yinghua Zhu, Louis Goldstein, Dani Byrd, Erik Bresch, Prasanta Ghosh, Athanasios Katsamanis , Michael Proctor,, "Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research", accepted in Journal of Acoustical Society of America, Volume 136,2014,September,1307. [pdf] Jangwon Kim, Adam Lammert, Prasanta Kumar Ghosh, Shrikanth S. Narayanan, "Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging", accepted in Journal of the Acoustical Society of America Express Letter (JASAEL), Volume 135, Issue 2, 2014, EL115-EL121. [pdf]
Bhuthesh R, Prasanta Kumar Ghosh, Dipanjan Gope, "Enhanced Pulse Rate Measurement from Facial video by Automatic Detection of Sensitive Skin Regions", accepted in IEEE workshop on Computational Intelligence, IIT Kanpur,2013,July. M. Li, A. Lammert, J. Kim, Prasanta K Ghosh , S. Narayanan, "Automatic Classification of Palatal and PharyngealWall Shape Categories from Speech Acoustics and Inverted Articulatory Signals", accepted in Workshop on Speech Production in Automatic Speech Recognition, Interspeech, Lyon, France,2013. [pdf] Prasanta K Ghosh, Shrikanth Narayanan, "Information theoretic acoustic feature selection for acoustic-to-articulatory inversion", accepted in Proc. Interspeech 2013, Lyon, France.,2013. [pdf] A. Tsiartas, T. Chaspari, N. Katsamanis, Prasanta K Ghosh, M. Li, M. V. Sebroeck, A. Potamianos, S. Narayanan, "Multi-band long-term signal variability features for robust voice activity detection", accepted in Proc. Interspeech 2013, Lyon, France.,2013. [pdf] M. Li, J. Kim, Prasanta K Ghosh, V. Ramanarayanan , S. Narayanan, "Speaker verification based on fusion of acoustic and articulatory information", accepted in Proc. Interspeech 2013, Lyon, France.,2013. [pdf] Shajith Ikbal, Ashish Verma, Prasanta Ghosh, Kenneth Church, Jeffrey Marcus, "Intent Focused Summarization of Caller-Agent Conversations", accepted in Proc. ICASSP 2013.,2013. [pdf] Jangwon Kim, Adam Lammert, Prasanta Ghosh, Shrikanth Narayanan, "Spatial and temporal alignment of multimodal human speech production data: real time imaging, flesh point tracking and audio", accepted in Proc. ICASSP 2013.,2013. [pdf] Prasanta K Ghosh, Shrikanth Narayanan, "On smoothing articulatory trajectories obtained from Gaussian mixture model based acoustic-to-articulatory inversion", accepted in Journal of the Acoustical Society of America Express Letter (JASAEL), Volume 134, Issue 2,2013,July. [pdf] Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan, "High-quality bilingual subtitle document alignments with application to spontaneous speech translation", accepted in Computer, Speech, and Language,2013. [pdf]
J. Kim, P. K. Ghosh, S. Lee, S. S. Narayanan, "A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion", accepted in APSIPA ASC,2012. [pdf] V. Ramanarayanan, P. K. Ghosh, A. Lammert, S. Narayanan, "Exploiting speech production information for automatic speech and speaker modeling and recognition -- possibilities and new opportunities", accepted in APSIPA ASC,2012. [pdf]
Prasanta Ghosh, Shrikanth Narayanan, "Analysis of inter-articulator correlation in acoustic-to-articulatory inversion using generalized smoothness criterion", accepted in Proc. Interspeech, Florence, Italy,2011. [pdf] Prasanta Ghosh, Shrikanth Narayanan, "A subject-independent acoustic-to-articulatory inversion", accepted in Proceedings ICASSP, Prague, Czech Republic, 22-27 May,2011. [pdf] S. Narayanan, E. Bresch, Prasanta Ghosh, L. Goldstein, A. Katsamanis, Y. Kim, A. Lammert, M. Proctor, V. Ramanarayanan, , Y. Zhu, "A Multimodal Real-Time MRI Articulatory Corpus for Speech Research", accepted in Proc. Interspeech, Florence, Italy,2011. [pdf] Bo Xiao, Prasanta Kumar Ghosh, Panayiotis Georgiou, Shrikanth Narayanan, "Overlapped speech detection using long-term spectro-temporal similarity in stereo recording", accepted in Proc. ICASSP, Prague, Czech Republic, 2011. [pdf] Andreas Tsiartas, Prasanta Ghosh, Panayiotis Georgiou, Shrikanth Narayanan, "Bilingual audio-subtitle extraction using automatic segmentation of movie audio", accepted in Proc. ICASSP, Prague, Czech Republic, 2011. [pdf] Prasanta Kumar Ghosh, Andreas Tsiartas, Shrikanth Narayanan, "Robust voice activity detection using long-term signal variability", accepted in IEEE Trans. Audio, Speech and Language Processing, Volume 19, No. 3, March 2011, pp 600-613. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Automatic Speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion", accepted in J. Acoust. Soc. Am. Express Letters (JASAEL), Volume 130, Issue 4,2011,Aug,EL251-EL257. [pdf] Prasanta Kumar Ghosh, Louis M. Goldstein, Shrikanth Narayanan, "Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures", accepted in J. Acoust. Soc. Am., Volume 129, Issue 6,2011,Jun,4014-4022. [pdf] Prasanta Kumar Ghosh, Louis M. Goldstein, Shrikanth Narayanan, "Auditory-like filterbank: An optimal speech processor for efficient human speech communication", accepted in Springer Proceedings of Indian Academy of Sciences (Sadhana), Special Issue on Speech Processing,2011,October, 699-712. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter", accepted in Speech Communication, Elsevier, Volume 53, No. 1,2011,January,98-109. [pdf]
Prasanta Kumar Ghosh, Andreas Tsiartas, Panayiotis G. Georgiou, Shrikanth Narayanan, "Robust voice activity detection in stereo recording with crosstalk", accepted in Proc. InterSpeech, Makuhari, Japan, 2010, Sep. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "A generalized smoothness criterion for acoustic-to-articulatory inversion", accepted in Journal of Acoustical Society of America, Volume 128, No. 4, 2010, Oct, 2162-2172. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Bark Frequency Transform Using an Arbitrary Order Allpass Filter", accepted in IEEE Signal Processing Letters, Volume 17, No. 6, 2010, June, 543-546. [pdf]
Prasanta Ghosh, Shrikanth Narayanan, Pierre Divenyi, Louis Goldstein, Elliot Saltzman, "Estimation of articulatory gesture patterns from speech acoustics", accepted in Proc. InterSpeech, 6-10 Sep, 2009, Brighton, UK, 2009.2803-2806. [pdf] Andreas Tsiartas, Prasanta Ghosh, Panayiotis G. Georgiou, Shrikanth Narayanan, "Context-driven bilingual movie subtitle alignment", accepted in Proc. InterSpeech, 6-10 Sep, 2009, Brighton, UK, 2009.444-447. [pdf] Andreas Tsiartas, Prasanta Ghosh, Panayiotis G. Georgiou, Shrikanth Narayanan, "Robust word boundary detection in spontaneous speech using acoustic and lexical cues", accepted in Proceedings of ICASSP, Taipei, Taiwan.2009,Apr. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Pitch contour stylization using an optimal piecewise polynomial approximation", accepted in IEEE Signal Processing Letters, Volume 16, No. 9, 2009, Sept, 810-813. [pdf] Prasanta Kumar Ghosh, Shrikanth Narayanan, "Closure duration analysis of incomplete stop consonants due to stop-stop interaction", accepted in J. Acoust. Soc. Am. Express Letters, Volume 126, Issue 1, 2009, Jul, EL1-EL7. [pdf]
Prasanta Ghosh, Antonio Ortega, Shrikanth Narayanan, "Pitch period estimation using multipulse model and wavelet transform", accepted in Proceedings of InterSpeech ICSLP, Antwerp, Belgium, 2007, Aug, 2761-2764. [pdf] Prasanta Kumar Ghosh, "Speech Segmentation using Extrema-Based Signal Track Length Measure", accepted in ICASSP 2007, Volume 4, 15-20, 2007, April, IV-1065 - IV-1068. [pdf]
Prasanta Kumar Ghosh, T.V. Sreenivas, "Dynamic Programming Based Optimum Non-Uniform Samples For Speech Reconstruction and Coding", accepted in ICASSP, Volume 1, 2006. [pdf] Prasanta Kumar Ghosh, T.V. Sreenivas, "Extrema based Unwarping for Time-varying Pitch Estimation", accepted in 12th National Conference on Communication (NCC), 2006. [pdf] Amitava Das, Manoj Balwani, Rahul Thota, Prasanta Ghosh, "Face Recognition from Images with High Pose Variations by Transform Vector Quantization", accepted in ICVGIP, 2006.674-685. [pdf] Das, A., Ghosh, P, "Audio-Visual Biometric Recognition by Vector Quantization", accepted in IEEE Spoken Language Technology(SLT) Workshop, 2006, Dec,166 - 169. [pdf] Prasanta Kumar Ghosh, T.V. Sreenivas, "Time-varying Filter Interpretation of Fourier Transform and its Variants", accepted in Signal Processing (Elsevier), Volume 86, Issue 11, 2006, November, 3258-3263. [pdf]