Neelesh Samptur, Tanuka Bhattacharjee, Anirudh Chakravarty K, Seena Vengalil, Yamini BK, Nalini Atchayaram, P. K. Ghosh "Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis" accepted in Interspeech 2024 [pdf] [codes] [Corrigendum] [GitHub] Chetan Sharma, Vaishnavi Chandwanshi, P. K. Ghosh "A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production" accepted in Interspeech 2024 [pdf] [codes] [Corrigendum] [GitHub] Sathvik Udupa, Soumi Maiti, P. K. Ghosh "IndicMOS: Multilingual MOS Prediction for 7 Indian languages" accepted in Interspeech 2024 [pdf] [codes] [Corrigendum] [GitHub] Sathvik, Jersuraj, Saurabh, Deekshitha, Sandhya B, Abhayjeet, Savitha, Priyanka, Srinivasa, Raoul, P. K. Ghosh "Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models" accepted in Interspeech 2024 [pdf] [codes] [Corrigendum] [GitHub] Jesuraj Bandekar, Sathvik Udupa, P. K. Ghosh "Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss" accepted in Interspeech 2024 [pdf] [codes] [Corrigendum] [GitHub] Alex Paul Kamson, Akshay V. Sawant, P. K. Ghosh, Satish S Jeevannavar "Exploring wav2vec 2.0 Model for Heart Sound Analysis" accepted in EMBC 2024 [pdf] [codes] [Corrigendum] [GitHub] Anjali Jayakumar, Tanuka Bhattacharjee, Seena Vengalil, Yamini Belur, Nalini Atchayaram, P. K. Ghosh "Low Complexity Model with Single Dimensional Feature for Speech Based Classification of Amyotrophic Lateral Sclerosis Patients and Healthy Individuals" accepted in SPCOM 2024 [pdf] [codes] [Corrigendum] [GitHub] Jesuraja Bandekar, Sathvik Udupa, P. K. Ghosh "Discovering phoneme-specific critical articulators through a data driven approach" accepted in ISSP 2024 [pdf] [codes] [Corrigendum] [GitHub] Satyadev Badireddi, Shreya Shrikant Karkun, P. K. Ghosh "Inter-subject variation in tongue shape during vowel production in /b/V/t/ sequence: An rtMRI study using 8 vowels from 74 subjects" accepted in ISSP 2024 [pdf] [codes] [Corrigendum] [GitHub] SHIVANI YADAV, DIPANJAN GOPE, UMA MAHESWARI K, P. K. Ghosh "AN UNSUPERVISED SEGMENTATION OF VOCAL BREATH SOUNDS" accepted in ICASSP 2024 [pdf] [codes] [Corrigendum] [GitHub] Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Seena Vengalil, Saraswati Nashi, Madassu Keerthipriya, Yamini Belur, Nalini Atchayaram, P. K. Ghosh "SPECTRAL ANALYSIS OF VOWELS AND FRICATIVES AT VARIED LEVELS OF DYSARTHRIA SEVERITY FOR AMYOTROPHIC LATERAL SCLEROSIS" accepted in ICASSP 2024 [pdf] [codes] [Corrigendum] [GitHub] Veerababu Dharanalakota, Namra Quasim, P. K. Ghosh "Estimation of Acoustic Field in a Uniform Duct with Mean Flow using Neural Networks" accepted in accepted for presentation at the 2024 AIAA SciTech [pdf] [codes] [Corrigendum] [GitHub]
Alex Paul Kamson, Macline Crecsilla Lewis, Akshay Sawant, Vishnu Sunil B N, P. K. Ghosh, Satish S Jeevannavar "E2E Multi-Scale CNN with LSTM for Murmur Detection in PCG or Noise Identification" accepted in In 2023 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), IEEE, 2023 [pdf] [codes] [Corrigendum] [GitHub] Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, & P. K. Ghosh "SPIRE-SIES: A Spontaneous Indian English Speech Corpus" accepted in 2023 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA). IEEE [pdf] [codes] [Corrigendum] [GitHub] Sathvik Udupa, Jesuraj Bandekar, Deekshitha G, Saurabh Kumar, P. K. Ghosh, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Rohan Saxena "Gated Multi Encoders and multitask objectives for Dialectal Speech recognition in Indian languages" accepted in ASRU 2023 [pdf] [codes] [Corrigendum] [GitHub] Priyanshi Pal, Shelly Jain, Chiranjeevi Yarra, P. K. Ghosh, Anil Kumar Vuppala "Study of Indian English Pronunciation variabilities relative to Received Pronunciation" accepted in accepted for SPECOM 2023 [pdf] [codes] [Corrigendum] [GitHub] Navneet Kaur, P. K. Ghosh "Curriculum Learning based approach for faster convergence of TTS model" accepted in accepted for SPECOM 2023 [pdf] [codes] [Corrigendum] [GitHub] Abhayjeet Singh, Anjali Jayakumar, Deekshitha G, Hitesh Tiwari, Jesuraja Bandekar, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, P. K. Ghosh "An end-to-end TTS model in Chhattisgarhi, a low-resource Indian language" accepted in accepted for SPECOM 2023 [pdf] [codes] [Corrigendum] [GitHub] Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, P. K. Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Sai Praneeth Reddy Mora, Srinivasa Raghavan "An ASR corpus in Chhattisgarhi, a low resource Indian language" accepted in accepted for SPECOM 2023 [pdf] [codes] [Corrigendum] [GitHub] Veerababu Dharanalakota, J. Pavan Kumar, P. K. Ghosh "Loss-based Optimizer Switching to Solve 1-D Helmholtz Equation using Neural Networks" accepted in Acoustics 2023 Sydney, Australia, 4 - 8 Dec, (2023) [pdf] [codes] [Corrigendum] [GitHub] Veerababu Dharanalakota, R. Ashwin, P. K. Ghosh "Achieving Stable Convergence of Neural Networks for Estimating Acoustic Field in Uniform Ducts" accepted in Acoustics 2023 Sydney, Australia, 4 - 8 Dec, (2023) [pdf] [codes] [Corrigendum] [GitHub] Chowdam Venkata Thirumala Kumar, MEENAKSHI SIRIGIRAJU, Rakesh Vaideeswaran Mahesh, P. K. Ghosh, Chiranjeevi Yarra "Can the decoded text from automatic speech recognition effectively detect spoken grammar errors?" accepted in SLATE 2023 [pdf] [codes] [Corrigendum] [GitHub] Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Yamini BK, Nalini Atchayaram, Ravi Yadav, P. K. Ghosh "Classification of multi-class vowels and fricatives from patients having Amyotrophic Lateral Sclerosis with varied levels of dysarthria severity" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Shelly Jain, Priyanshi Pal, Anil Vuppala, P. K. Ghosh, Chiranjeevi Yarra "An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Varun Belagali, P. K. Ghosh, Achuth Rao "Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Mohammad Shaique Solanki, Ashutosh Bharadwaj, Jeevan Kylash, P. K. Ghosh "Do Vocal Breath Sounds Encode Gender Cues for Automatic Gender Classification?" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit, P. K. Ghosh "A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Jesuraj Bandekar, Sathvik Udupa, P. K. Ghosh "Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Tanuka Bhattacharjee, Anjali Jayakumar, Yamini BK, Nalini Atchayaram, Ravi Yadav, P. K. Ghosh "Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis" accepted in INTERSPEECH 2023 [pdf] [codes] [Corrigendum] [GitHub] Dharanalakota Veerababu, P. K. Ghosh "SOLUTION OF 1-D HELMHOLTZ EQUATION USING ARTIFICIAL NEURAL NETWORKS" accepted in accepted for publication in the 29th International Congress on Sound and Vibration (ICSV29) [pdf] [codes] [Corrigendum] [GitHub] Tanuka Bhattacharjee, Yamini Belur, Nalini Atchayaram, Ravi Yadav, P. K. Ghosh "EXPLORING THE ROLE OF FRICATIVES IN CLASSIFYING HEALTHY SUBJECTS AND PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS AND PARKINSON’S DISEASE" accepted in accepted at ICASSP 2023 [pdf] [codes] [Corrigendum] [GitHub] Tanuka Bhattacharjee, Chowdam Venkata Thirumala Kumar, Yamini Belur, lini Atchayaram, Ravi Yadav, P. K. Ghosh "STATIC AND DYNAMIC SOURCE AND FILTER CUES FOR CLASSIFICATION OF AMYOTROPHIC LATERAL SCLEROSIS PATIENTS AND HEALTHY SUBJECTS" accepted in accepted at ICASSP 2023 [pdf] [codes] [Corrigendum] [GitHub] Sathvik Udupa, Siddarth C, P. K. Ghosh "IMPROVED ACOUSTIC-TO-ARTICULATORY INVERSION USING REPRESENTATIONS FROM PRETRAINED SELF-SUPERVISED LEARNING MODELS" accepted in accepted at ICASSP 2023 [pdf] [codes] [Corrigendum] [GitHub] Sathvik Udupa, P. K. Ghosh "REAL-TIME MRI VIDEO SYNTHESIS FROM TIME ALIGNED PHONEMES WITH SEQUENCE-TO-SEQUENCE NETWORKS" accepted in accepted at ICASSP 2023 [pdf] [codes] [Corrigendum] [GitHub]
Priyanshi Pal, Chiranjeevi Yarra, P. K. Ghosh "voisTUTOR 2.0: A speech corpus with phonetic transcription for pronunciation evaluation of Indian L2 English learners" accepted in O-COCOSDA 2022 [pdf] [codes] [Corrigendum] [GitHub] Abinay Reddy Naini, Achuth Rao M V, P. K. Ghosh "Whisper to Neutral Mapping Using i-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification" accepted in NCC 2022 [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, Aanish Nair, P. K. Ghosh "The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis" accepted in ICASSP 2022 [pdf] [codes] [Corrigendum] [GitHub] Anwesha Roy, Varun Belagali, P. K. Ghosh "AN ERROR CORRECTION SCHEME FOR IMPROVED AIR-TISSUE BOUNDARY IN REAL-TIME MRI VIDEO FOR SPEECH PRODUCTION" accepted in ICASSP 2022 [pdf] [codes] [Corrigendum] [GitHub] Siddharth Subramani, Achuth Rao M V, Anwesha Roy, Prasanna Suresh Hegde, P. K. Ghosh "SEGNET-BASED DEEP REPRESENTATION LEARNING FOR DYSPHAGIA CLASSIFICATION" accepted in ICASSP 2022 [pdf] [codes] [Corrigendum] [GitHub] Abinay Reddy Naini, Bhavuk Singhal, P. K. Ghosh "DUAL ATTENTION POOLING NETWORK FOR RECORDING DEVICE CLASSIFICATION USING NEUTRAL AND WHISPERED SPEECH" accepted in ICASSP 2022 [pdf] [codes] [Corrigendum] [GitHub]
Karthik G.R, P. K. Ghosh "Towards a Calibration-free Approach to Deep Learning based Single-incidence Inverse Scattering" accepted in In 2021 Photonics & Electromagnetics Research Symposium (PIERS), pp. 2355-2361. IEEE, 2021 [pdf] [codes] [Corrigendum] [GitHub] Karthik G.R, P. K. Ghosh "A Scalable Deep Learning Model for Arbitrary Transmitter Configurations in Inverse Scattering" accepted in 2021 IEEE Antennas and Propagation Society International Symposium (APS-URSI) to be held in Singapore [pdf] [codes] [Corrigendum] [GitHub] Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, P. K. Ghosh "Convolutional Dense Neural Network based Spirometry Variable FVC Prediction using Sustained Phonations" accepted in MLSP 2021 [pdf] [codes] [Corrigendum] [GitHub] Abhayjeet Singh, Achuth Rao M V, Rakesh Vaideeswaran, Chiranjeevi Yarra, P. K. Ghosh "A study on native American English speech recognition by Indian listeners with varying word familiarity level" accepted in Oriental COCOSDA 2021 [pdf] [codes] [Corrigendum] [GitHub] Tilak Purohit, Tejas Umesh, Shankar Narayanan, Minulakshmi S, P. K. Ghosh "SPIRE VCV: An acoustic-articulatory corpus with three different speaking rates" accepted in Oriental COCOSDA 2021 [pdf] [codes] [Corrigendum] [GitHub] Bhavuk Singhal, Abinay Reddy Naini, P. K. Ghosh "WSPIRE: A parallel multi-device corpus in neutral and whisper speech" accepted in Oriental COCOSDA 2021 [pdf] [codes] [Corrigendum] [GitHub] Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, P. K. Ghosh "Role of breath phase and breath boundaries for the classification between asthmatic and healthy subjects" accepted in accepted for presentation at the 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021 [pdf] [codes] [Corrigendum] [GitHub] Drishti Ramesh Megalmani, Shailesh B G, Achuth Rao M V, Satish S Jeevannava, P. K. Ghosh "Unsegmented Heart Sound Classification Using Hybrid CNN-LSTM Neural Networks" accepted in accepted for presentation at the 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021 [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao M V, Shailesh B G, Drishti Ramesh Megalmani, Satish S Jeevannava, P. K. Ghosh "Noise Robust Detection of Fundamental Heart Sound using Parametric Mixture Gaussian and Dynamic Programming" accepted in accepted for presentation at the 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2021 [pdf] [codes] [Corrigendum] [GitHub] Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, P. K. Ghosh "Web Interface for estimating articulatory movements in speech production from acoustics and text" accepted in Show and Tell, Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Manthan Sharma, Navaneetha Gaddam, Tejas Umesh, Aditya Murthy, P. K. Ghosh "" accepted in Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh "Source and Vocal Tract Cues for Speech-based Classification of Patients with Parkinson’s Disease and Healthy Subjects" accepted in Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, P. K. Ghosh "Estimating articulatory movements in speech production with transformer networks" accepted in Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, SRINIVASA RAGHAVAN K M, Shreya Khare, Vinit Unni, saurabh vyas, Akash Rajapuria, Chiranjeevi Yarra, Ashish Mittal, P. K. Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan "Multilingual and code-switching ASR challenges for low resource Indian languages" accepted in Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, P. K. Ghosh "Noise robust pitch stylization using minimum mean absolute error criterion" accepted in Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Ananya Muguli, Lacelot Pinto, Nirmala R, Neeraj Sharma, Prashant Krishnan, P. K. Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda "DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics" accepted in Interspeech 2021, Brno, Czech Republic [pdf] [codes] [Corrigendum] [GitHub] Pavan Kumar J, Chiranjeevi Yarra, P. K. Ghosh "DNN Based Phrase Boundary Detection Using Knowledge-Based Features and Feature Representations from CNN" accepted in accepted at the 2021 National Conference on Communications (NCC) [pdf] [codes] [Corrigendum] [GitHub] Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh "EFFECT OF NOISE AND MODEL COMPLEXITY ON DETECTION OF AMYOTROPHIC LATERAL SCLEROSIS AND PARKINSON’S DISEASE USING PITCH AND MFCC" accepted in ICASSP 2021 [pdf] [codes] [Corrigendum] [GitHub] Tilak Purohit, Achuth Rao M V, P. K. Ghosh "Impact of speaking rate on the source filter Interaction in speech: a study" accepted in ICASSP 2021 [pdf] [codes] [Corrigendum] [GitHub] Sarthak Kumar Maharana, Aravind Illa, Renuka Mannem, Yamini Belur, Preetie Shetty, Veeramani Preethish Kumar, Seena Vengalil, Kiran Polavarapu, Nalini Atchayaram, P. K. Ghosh "ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA" accepted in ICASSP 2021 [pdf] [codes] [Corrigendum] [GitHub]
Aravind Illa, P. K. Ghosh "Complexity-performance trade-off in acoustic-to-articulatory inversion" accepted in accepted 12th International Seminar on Speech Production (ISSP) 2020 [pdf] [codes] [Corrigendum] [GitHub] Anusuya P K, Aravind Illa, P. K. Ghosh "A data-driven phoneme-specific analysis of articulatory importance" accepted in accepted 12th International Seminar on Speech Production (ISSP) 2020 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Kausthubha N K, P. K. Ghosh "SPIRE-ABC: An online tool for acoustic-unit boundary correction (ABC) via crowdsourcing" accepted in Oriental COCOSDA 2020 [pdf] [codes] [Corrigendum] [GitHub] Tilak Purohit, P. K. Ghosh "An investigation of the virtual lip trajectories during the production of bilabial stops and nasal at different speaking rates" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, P. K. Ghosh "Speaker conditioned acoustic-to-articulatory inversion using x-vectors" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Jhansi Mallela, Aravind Illa, Yamini Belur, Nalini Atchayaram, Ravi yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh "Raw speech waveform based classification of patients with ALS, Parkinson’s Disease and healthy controls using CNN-BLSTM" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Renuka Mannem, Himajyothi Rajamahendravarapu, Aravind Illa, P. K. Ghosh "Speech rate task-specific representation learning from acoustic-articulatory data" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Renuka Mannem, Navaneetha Gaddam, P. K. Ghosh "Air-tissue boundary segmentation in real time Magnetic Resonance Imaging video using 3-D convolutional neural network" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Divya Degala, Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prakash T K, P. K. Ghosh "Automatic Glottis Detection and Segmentation in Stroboscopic videos using Convolutional Networks" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Abhayjeet Singh, Aravind Illa, P. K. Ghosh "Attention and Encoder-Decoder based models for transforming articulatory movements at different speaking rates" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Abinay Reddy Naini, Satyapriya Malla, P. K. Ghosh "Whisper activity detection using CNN-LSTM based attention pooling network trained for a speaker identification task" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Neeraj Sharma, Prashant Krishnan, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Nirmala R, P. K. Ghosh, Sriram Ganapathy "A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis" accepted in Interspeech 2020, Shanghai, China [pdf] [codes] [Corrigendum] [GitHub] Suhas BN, Jhansi Mallela, Aravind Illa, Yamini BK, Nalini Atchayaram, Ravi Yadav, Dipanjan Gope, P. K. Ghosh "Speech task based automatic classification of ALS and Parkinson’s Disease and their severity using log mel spectrograms" accepted in Accepted at SPCOM2020 [pdf] [codes] [Corrigendum] [GitHub] Renuka Mannem, Himajyothi Rajamahendravarapu, Aravind Illa, P. K. Ghosh "Speech rate estimation using representations learned from speech with convolutional neural network" accepted in Accepted at SPCOM2020 [pdf] [codes] [Corrigendum] [GitHub] JHANSI MALLELA, ARAVIND ILLA, SUHAS B N, SATHVIK UDUPA, YAMINI BELUR, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, P. K. Ghosh "VOICE BASED CLASSIFICATION OF PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS, PARKINSON" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pages: [pdf] [codes] [Corrigendum] [GitHub] Avni Rajpal, Achuth Rao MV, Chiranjeevi Yarra, Ritu Aggarwal, P. K. Ghosh "PSEUDO LIKELIHOOD CORRECTION TECHNIQUE FOR LOW RESOURCE ACCENTED ASR" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pages: [pdf] [codes] [Corrigendum] [GitHub] Siddharth Subramani, Achuth Rao M V, Divya Giridhar, Prasanna Suresh Hegde, P. K. Ghosh "AUTOMATIC CLASSIFICATION OF VOLUMES OF WATER USING SWALLOW SOUNDS FROM CERVICAL AUSCULTATION" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pages: [pdf] [codes] [Corrigendum] [GitHub] Shivani Yadav, Merugu Keerthana, Dipanjan Gope, Uma Maheswari Krishnaswamy, P. K. Ghosh "ANALYSIS OF ACOUSTIC FEATURES FOR SPEECH SOUND BASED CLASSIFICATION OF ASTHMATIC AND HEALTHY SUBJECTS" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pages: [pdf] [codes] [Corrigendum] [GitHub] Sanjeev Kadagathur Vadiraj, Achuth Rao M V, P. K. Ghosh "AUTOMATIC IDENTIFICATION OF SPEAKERS FROM HEAD GESTURES IN A NARRATION" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pages: [pdf] [codes] [Corrigendum] [GitHub] ABHAYJEET SINGH, ARAVIND ILLA, P. K. Ghosh "A COMPARATIVE STUDY OF ESTIMATING ARTICULATORY MOVEMENTS FROM PHONEME SEQUENCES AND ACOUSTIC FEATURES" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pages: [pdf] [codes] [Corrigendum] [GitHub]
Divya Giridhar, Achuth Rao M V, Prasanna Suresh Hedge, P. K. Ghosh "Analysis of swallow sounds of healthy controls for different volumes of water" accepted in International Conference on Engineering in Medicine and Life Sciences, PSG College of Technology, Coimbatore, India, December 19-21, 2019 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Ritu Aggarwal, Avni Rajpal, P. K. Ghosh "Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations" accepted in Oriental COCOSDA 2019 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Aparna Srinivasan, Chandana Srinivasa, Ritu Aggarwal, P. K. Ghosh "voisTUTOR corpus: A speech corpus of Indian L2 English learners for pronunciation assessment" accepted in Oriental COCOSDA 2019 [pdf] [codes] [Corrigendum] [GitHub] Shankar Narayanan, Aravind Illa, Nayan Anand, Ganesh Sinisetty, Karthick Narayanan, P. K. Ghosh "An acoustic-articulatory database of VCV sequences and words in Toda at different speaking rates" accepted in Oriental COCOSDA 2019 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, P. K. Ghosh "voisTUTOR: Virtual Operator for Interactive Spoken English TUTORing" accepted in 8th Workshop on Speech and Language Technology in Education, 2019 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Manoj Kumar Ramanathi, P. K. Ghosh "Comparison of automatic syllable stress detection quality with time-aligned boundaries and context dependencies" accepted in 8th Workshop on Speech and Language Technology in Education, 2019 [pdf] [codes] [Corrigendum] [GitHub] Aparna Srinivasan, Chiranjeevi Yarra, P. K. Ghosh "Automatic assessment of pronunciation and its dependent factors by exploring their interdependencies using DNN and LSTM" accepted in 8th Workshop on Speech and Language Technology in Education, 2019 [pdf] [codes] [Corrigendum] [GitHub] Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Anurag Das, P. K. Ghosh "Noise robust goodness of pronunciation measures using teacher" accepted in 8th Workshop on Speech and Language Technology in Education, 2019 [pdf] [codes] [Corrigendum] [GitHub] Suhas BN, Deep Patel, Nithin Rao, Yamini Belur, Pradeep Reddy, Nalini Atchayaram, Ravi Yadav, Dipanjan Gope, P. K. Ghosh "Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis" accepted in in Proc. Interspeech 2019, Graz, Austria, pages(s): 4564-4568 [pdf] [codes] [Corrigendum] [GitHub] Manoj Kumar Ramanathi, Chiranjeevi Yarra, P. K. Ghosh "ASR inspired syllable stress detection for pronunciation evaluation without using a supervised classifier and syllable level features" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Abinay Reddy Naini, Achuth Rao MV, P. K. Ghosh "Whisper to neutral mapping using cosine similarity maximization in i-vector space for speaker verification" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Renuka Mannem, Jhansi Mallela, Aravind Illa, P. K. Ghosh "Acoustic and articulatory feature based speech rate estimation using a convolutional dense neural network" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Atreyee Saha, Chiranjeevi Yarra, P. K. Ghosh "Low resource automatic intonation classification using gated recurrent unit (GRU) networks pre-trained with synthesized pitch patterns" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, P. K. Ghosh "An improved goodness of pronunciation (GoP) measure for pronunciation evaluation with DNN-HMM system considering HMM transition probabilities" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, P. K. Ghosh "An investigation on speaker specific articulatory synthesis with speaker independent articulatory inversion" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Aparna Srinivasan, Sravani Gottimukkala, P. K. Ghosh "SPIRE-fluent: A self-learning app for tutoring oral fluency to second language English learners" accepted in Interspeech 2019, Graz, Austria [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao M V, P. K. Ghosh, Tanuka Bhattacharjee, Anirban Dutta Choudhury "Trend Statistics Network and Channel invariant EEG Network for sleep arousal study" accepted in the 41th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’19), Berlin, Germany [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, P. K. Ghosh "Representation learning using convolution neural network for acoustic-to-articulatory inversion" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5931-5935 [pdf] [codes] [Corrigendum] [GitHub] Gokul Srinivasan, Aravind Illa, P. K. Ghosh "A study on robustness of articulatory features for automatic speech recognition of neutral and whispered speech" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5936-5940 [pdf] [codes] [Corrigendum] [GitHub] Valliappan CA, Avinash Kumar, Renuka Mannem, Karthik Girija Ramesan, P. K. Ghosh "An improved air tissue boundary segmentation technique for real time magnetic resonance imaging video using segnet" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pages: 5921-5925 [pdf] [codes] [Corrigendum] [GitHub] Renuka Mannem, P. K. Ghosh "Air-tissue boundary segmentation in real time magnetic resonance imaging video using a convolutional encoder-decoder network" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Brighton, UK, 2019, pages: 5941-5945 [pdf] [codes] [Corrigendum] [GitHub] Abinay Reddy Naini, Achuth Rao M V, P. K. Ghosh "Formant-gaps features for speaker verification using whispered speech" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Brighton, UK,2019, Pages: 6231-6235 [pdf] [codes] [Corrigendum] [GitHub] Renuka Mannem, Valliappan C A, P. K. Ghosh "A SegNet Based Image Enhancement Technique for Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video" accepted in In Proc. National Conference on Communications (NCC) 2019, Bangalore, India, Pages: 1-6 [pdf] [codes] [Corrigendum] [GitHub]
Aravind Illa, P. K. Ghosh "Inferring speaker identity from articulatory motion during speech" accepted in Workshop on Machine Learning in Speech and Language Processing (MLSLP) 2018, Hyderabad, India [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, P. K. Ghosh "Low resource acoustic-to-articulatory inversion using bi-directional long short term memory" accepted in in Proc. Interspeech, Hyderabad, India,2018, Page(s): 3122-3126 [pdf] [codes] [Corrigendum] [GitHub] G. Nisha Meenakshi, P. K. Ghosh "Whispered speech to neutral speech conversion using bidirectional LSTMs" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 491-495 [pdf] [codes] [Corrigendum] [GitHub] Pavan Karjol, P. K. Ghosh "Speech enhancement using deep mixture of experts based on hard expectation maximization" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 3254-3258 [pdf] [codes] [Corrigendum] [GitHub] Abinay Reddy N, Achuth Rao M V, G. Nisha Meenakshi, P. K. Ghosh "Reconstructing Neutral Speech from Tracheoesophageal Speech" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 1541-1545 [pdf] [codes] [Corrigendum] [GitHub] Astha Singh, G. Nisha Meenakshi, P. K. Ghosh "Relating articulatory motions in different speaking rates" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 2992-2996 [pdf] [codes] [Corrigendum] [GitHub] Chandana S, Chiranjeevi Yarra, Ritu Aggarwal, Sanjeev Kumar Mittal, Kausthubha N K, Raseena K T, Astha Singh, P. K. Ghosh "Automatic visual augmentation for concatenation based synthesized articulatory videos from real-time MRI data for spoken language training" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 3127-3131 [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadarshini, P. K. Ghosh "Automatic glottis localization and segmentation in stroboscopic videos using deep neural network" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 3007-3011. (in Finalist for the best paper award) [pdf] [codes] [Corrigendum] [GitHub] Girija Ramesan Karthik, Parth Suresh, P. K. Ghosh "Subband weighting for binaural speech source localization" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 861-865 [pdf] [codes] [Corrigendum] [GitHub] Valliappan CA, Renuka Mannem, P. K. Ghosh "Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video using Semantic Segmentation with Fully Convolutional Networks" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 3132-3136 [pdf] [codes] [Corrigendum] [GitHub] Ananda P A, Chiranjeevi Yarra, Kausthubha N K, P. K. Ghosh "Intonation tutor by SPIRE (In-SPIRE): An online tool for an automatic feedback to the second language learners in learning intonation" accepted in In Proc. Interspeech, Hyderabad, India,2018, Page(s): 546-547 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Ananda P A, Kausthubha N K, P. K. Ghosh "SPIRE-SST: An automatic web-based self-learning tool for syllable stress tutoring (SST) to the second language learners" accepted in In Proc. Interspeech, Hyderabad, India, 2018, Page(s): 2390-2391 [pdf] [codes] [Corrigendum] [GitHub] Valliappan CA, Anurag Das, P. K. Ghosh "Classification of Story-Telling and Poem Recitation Using Head Gesture of the Talker" accepted in in Proc. International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2018, Page(s): 36-40 [pdf] [codes] [Corrigendum] [GitHub] Pavan Karjol, P. K. Ghosh "Broad Phoneme Class Specific Deep Neural Network Based Speech Enhancement" accepted in in Proc. International Conference on Signal Processing and Communications (SPCOM), Bangalore, India, 2018, Page(s): 372-376 [pdf] [codes] [Corrigendum] [GitHub] Shivani Yadav, Kausthubha NK, Dipanjan Gope, Uma Maheswari Krishnaswamy, P. K. Ghosh "Comparison of Cough, Wheeze and Sustained Phonations for Automatic Classification between Healthy Subjects and Asthmatic Patients" accepted in in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’18), Honolulu, HI, USA,2018, Page(s): 1400-1403 [pdf] [codes] [Corrigendum] [GitHub] Raseena K T, P. K. Ghosh "A Maximum Likelihood Formulation to Exploit Heart Rate Variability for Robust Heart Rate Estimation from Facial Video" accepted in in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’18), Honolulu, HI, USA, 2018, Page(s): 5191-5194 [pdf] [codes] [Corrigendum] [GitHub] Urvish Desai, Chiranjeevi Yarra, P. K. Ghosh "Concatenative articulatory video synthesis using real-time MRI data for spoken language training" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 4999-5003 [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, Deep Patel, Yamini BK, Meera SS, Shivashankar N, Preethish Kumar Veeramani, Seena Vengalil, Kiran Polavarapu, Saraswati Nashi, Atchayaram Nalini, P. K. Ghosh "Comparison of speech tasks for automatic classification of patients with amyotrophic lateral sclerosis and healthy subjects" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 6014-6018 [pdf] [codes] [Corrigendum] [GitHub] Anurendra Kumar, Tanaya Guha, P. K. Ghosh "A dynamic latent variable model for source separation" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 2871-2875 [pdf] [codes] [Corrigendum] [GitHub] Advait Koparkar, P. K. Ghosh "A supervised air-tissue boundary segmentation technique in real-time magnetic resonance imaging video using a novel measure of contrast and dynamic programming" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5004-5008 [pdf] [codes] [Corrigendum] [GitHub] Karthik Girija Ramesan, P. K. Ghosh "Binaural speech source localization using template matching of interaural time difference patterns" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5164-5168 [pdf] [codes] [Corrigendum] [GitHub] Pavan Karjol, Ajay Kumar M, P. K. Ghosh "Speech enhancement using multiple deep neural networks" accepted in In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, page(s): 5049-5052 [pdf] [codes] [Corrigendum] [GitHub]
Sahil Bansal, Anindita Ghosh, Chandra Sekhar Seelamantula, Gurunath Gurrala, P. K. Ghosh "Adaptive Frequency Estimation Approach Using Iterative DESA with RDFT-Based Filter" accepted in In Proc. IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC), 2017, pp. 1-6 [pdf] [codes] [Corrigendum] [GitHub] Nazreen P. M, A. G. Ramakrishnan, P. K. Ghosh "A Joint Enhancement-Decoding Formulation for Noise Robust Phoneme Recognition" accepted in Proc. INDICON-2017, IIT Roorkee, pp. 1-6 [pdf] [codes] [Corrigendum] [GitHub] Samik Sadhu, P. K. Ghosh "Low Resource Point Process Models for Keyword Spotting Using Unsupervised Online Learning" accepted in In Proc. 25th European Signal Processing Conference (EUSIPCO), 2017, pp. 538-542 [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao MV, P. K. Ghosh "Pitch Prediction from Mel-generalized Cepstrum - a Computationally Efficient Pitch Modeling Approach for Speech Synthesis" accepted in In Proc. 25th European Signal Processing Conference (EUSIPCO), 2017, pp. 1629-1633 [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao MV, Kausthubha NK, Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, P. K. Ghosh "Automatic Prediction of Spirometry Readings from Cough and Wheeze for Monitoring of Asthma Severity" accepted in In Proc. 25th European Signal Processing Conference (EUSIPCO), 2017, pp. 41-45 [pdf] [codes] [Corrigendum] [GitHub] Akshay Kalkunte Suresh, Srinivasa Raghavan KM, P. K. Ghosh "Phoneme state posteriorgram features for speech based automatic classification of speakers in cold and healthy condition" accepted in Proc. Interspeech 2017, Stockholm, Sweden, 3462-3466 [pdf] [codes] [Corrigendum] [GitHub] Gaurav Fotedar, P. K. Ghosh "An information theoretic analysis of the temporal synchrony between head gestures and prosodic patterns in spontaneous speech" accepted in Proc. Interspeech 2017, Stockholm, Sweden, 157-161 [pdf] [codes] [Corrigendum] [GitHub] Girija Ramesan Karthik, P. K. Ghosh "Subband selection for binaural speech source localization" accepted in Proc. Interspeech 2017, Stockholm, Sweden, 1929-1933 [pdf] [codes] [Corrigendum] [GitHub] G. Nisha Meenakshi, P. K. Ghosh "A robust Voiced/Unvoiced phoneme classification from whispered speech using the ’color’ of whispered phonemes and Deep Neural Network" accepted in Proc. Interspeech 2017, Stockholm, Sweden, 503-507 [pdf] [codes] [Corrigendum] [GitHub] Abhishek Narwekar, P. K. Ghosh "PRAV: A Phonetically Rich Audio Visual Corpus" accepted in Proc. Interspeech 2017, Stockholm, Sweden, 3747-3751 [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao M V, Shivani Yadav, P. K. Ghosh "A dual source-filter model of snore audio for snorer group classification" accepted in Proc. Interspeech 2017, Stockholm, Sweden, 3502-3506 [pdf] [codes] [Corrigendum] [GitHub] Srinivasa Raghavan, Nisha Meenakshi, Sanjeev Kumar Mittal, Chiranjeevi Yarra, Anupam Mandal, K R Prasanna Kumar, P. K. Ghosh "A Comparative Study on the Effect of Different Codecs on Speech Recognition Accuracy Using Various Acoustic Modeling Techniques" accepted in in Proc. National Conference on Communications (NCC), Chennai, India,2017, Page(s): 1-6 [pdf] [codes] [Corrigendum] [GitHub] Pradyumna Suresha, Supriya Nagesh, Priyadarshini Savan Roshan, Aditya Gaonkar P, Nisha Meenakshi, P. K. Ghosh "A High Resolution ENF Based MultiStage Classifier for Location Forensics of Media Recordings" accepted in in Proc. National Conference on Communications (NCC), Chennai, India,2017, Page(s): 1-6 [pdf] [codes] [Corrigendum] [GitHub] Mekhala H S, Yamini B K, Ketan J, Pal P, Shivashankar N, P. K. Ghosh "Classification of Healthy Subjects and Patients with Essential Vocal Tremor Using Empirical Mode Decomposition of High Resolution Pitch Contour" accepted in in Proc. National Conference on Communications (NCC), Chennai, India,2017, Page(s): 1-6 [pdf] [codes] [Corrigendum] [GitHub] Achuth Rao MV, P. K. Ghosh "Pitch Prediction from Mel-Frequency Cepstral Coefficients Using Sparse Spectrum Recovery" accepted in in Proc. National Conference on Communications (NCC), Chennai, India,2017, Page(s): 1-6 [pdf] [codes] [Corrigendum] [GitHub] Chiranjeevi Yarra, Om D. Deshmukh, P. K. Ghosh "An automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation" accepted in in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans,2017, Page(s): 5845-5849 [pdf] [codes] [Corrigendum] [GitHub] Aravind Illa, Nisha Meenakshi G, P. K. Ghosh "A comparative study of acoustic-to-articulatory inversion for neutral and whispered speech" accepted in in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans,2017, Page(s): 5075-5079 [pdf] [codes] [Corrigendum] [GitHub]
Gaurav Fotedar, Aditya Gaonkar P, Saikat Chatterjee, P. K. Ghosh "Automatic recognition of social roles using long term role transitions in small group interactions" accepted in in Proc. INTERSPEECH, 8-12 September, 2016, Pages(s): 2065-2069 [pdf] [codes] [Corrigendum] [GitHub] Nazreen P.M, A. G. Ramakrishnan, P. K. Ghosh "A class-specific speech enhancement for phoneme recognition: a dictionary learning approach" accepted in in Proc. INTERSPEECH, 8-12 September, 2016, Pages(s): 3728-3732 [pdf] [codes] [Corrigendum] [GitHub] Aditya Gaonkar P, Bhuthesh R, Dipanjan Gope, P. K. Ghosh "Robust Real-Time Pulse Rate Estimation From Facial Video Using Sparse Spectral Peak Tracking" accepted in in Proc. SPCOM, 12-15 June, 2016, Page(s): 1-5 [pdf] [codes] [Corrigendum] [GitHub] Abhishek Narwekar, P. K. Ghosh "A Comparative Study of Articulatory Features From Facial Video and Acoustic-To-Articulatory Inversion for Phonetic Discrimination" accepted in in Proc. SPCOM, 12-15 June, 2016, Page(s): 1-5 [pdf] [codes] [Corrigendum] [GitHub] Supriya Nagesh, Chiranjeevi Yarra, Om D. Deshmukh, P. K. Ghosh "A robust speech rate estimation based on the activation profile from the selected acoustic unit dictionary" accepted in in Proc. ICASSP, 21-25 March, 2016, Page(s): 5400-5404 [pdf] [codes] [Corrigendum] [GitHub] Amber Afshan, P. K. Ghosh "Better acoustic normalization in subject independent acoustic-to-articulatory inversion: benefit to recognition" accepted in in Proc. ICASSP, 21-25 March, 2016, Page(s): 5395-5399 [pdf] [codes] [Corrigendum] [GitHub]
Abhay Prasad, P. K. Ghosh "Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers" accepted in in Proc. Interspeech, Sep 6-10 2015, Page(s): 884-888 [pdf] [codes] [Corrigendum] [GitHub] Satyabrata Parida, Pattem Ashok Kumar, P. K. Ghosh "Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data" accepted in in Proc. Interspeech, Sep 6-10, 2015, Page(s): 2147-2151. (in Finalist for the best paper award) [pdf] [codes] [Corrigendum] [GitHub] Nisha Meenakshi, P. K. Ghosh "A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages" accepted in in Proc. Interspeech, Sep 6-10, 2015, Page(s): 781-785 [pdf] [codes] [Corrigendum] [GitHub] Sujith P, Prathosh A. P, Ramakrishnan A. G, P. K. Ghosh "An Error Correction Scheme for GCI Detection Algorithms using Pitch Smoothness Criterion" accepted in in Proc. Interspeech, Sep 6-10, 2015, Page(s): 3284-3288 [pdf] [codes] [Corrigendum] [GitHub] Adria Casamitjana, Martin Sundin, P. K. Ghosh, Saikat Chatterjee "Bayesian learning for time-varying linear prediction of speech" accepted in in Proc. EUSIPCO, Aug 31- Sep 4 2015, pp 325-329 [pdf] [codes] [Corrigendum] [GitHub] A. Prasad, V. Periyasamy, P. K. Ghosh "Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification" accepted in In Proc. ICASSP 2015, Page(s): 4265-4269 [pdf] [codes] [Corrigendum] [GitHub] Nisha Meenakshi, P. K. Ghosh, Automatic Gender Classification Using the Mel Frequency Cepstrum of Neutral, Whispered Speech: a Comparative Study "In Proc. NCC 2015, Page(s): 1-6" accepted in In Proc. NCC 2015, Page(s): 1-6 [pdf] [codes] [Corrigendum] [GitHub]
Prasad Sudhakar, P. K. Ghosh "Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: Benefit to speech recognition" accepted in in Proc. InterSpeech 2014, Page(s): 169-173 [pdf] [codes] [Corrigendum] [GitHub] Abhay Prasad, P. K. Ghosh, Shrikanth Narayanan "Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection" accepted in in Proc. InterSpeech 2014, Page(s): 1539-1543 [pdf] [codes] [Corrigendum] [GitHub] Sujith P, P. K. Ghosh "Missing samples estimation in electromagnetic articulography data using equality constrained Kalman smoother" accepted in in Proc. InterSpeech 2014, Page(s): 716-720 [pdf] [codes] [Corrigendum] [GitHub] Nisha Meenakshi, Chiranjeevi Yarra, B. K. Yamini, P. K. Ghosh "Comparison of speech quality with and without sensors in electromagnetic articulograph AG 501 recording" accepted in in Proc. InterSpeech 2014, Page(s): 935-939 [pdf] [codes] [Corrigendum] [GitHub] Sujith P, P. K. Ghosh "Maximum a-posteriori estimation of missing samples with continuity constraint in electromagnetic articulography data" accepted in in Proc. ICASSP 2014, page(s): 940-944 [pdf] [codes] [Corrigendum] [GitHub] Abhijith Mundanad Narayanan, P. K. Ghosh, K. Rajgopal "Multi-Pitch Tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform" accepted in in Proc. ICASSP 2014, page(s): 1473-1477 [pdf] [codes] [Corrigendum] [GitHub] Prasad Sudhakar, Laurent Jacques, P. K. Ghosh "A sparse smoothing approach for Gaussian mixture model based acoustic-to-articulatory inversion" accepted in in Proc. ICASSP 2014, page(s): 3032-3036 [pdf] [codes] [Corrigendum] [GitHub] Andreas Tsiartas, P. K. Ghosh, Panayiotis Georgiou, Shrikanth S. Narayanan "Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design" accepted in in Proc. ICASSP 2014, page(s): 121-125 [pdf] [codes] [Corrigendum] [GitHub]
P. K. Ghosh, S. Narayanan "Information theoretic acoustic feature selection for acoustic-to-articulatory inversion" accepted in in Proc. Interspeech, Lyon, France, 2013, page(s): 3177-3181 [pdf] [codes] [Corrigendum] [GitHub] A. Tsiartas, T. Chaspari, N. Katsamanis, P. K. Ghosh, M. Li, M. V. Sebroeck, A. Potamianos, S. Narayanan "Multi-band long-term signal variability features for robust voice activity detection" accepted in in Proc. Interspeech, Lyon, France, 2013, page(s): 718-722 [pdf] [codes] [Corrigendum] [GitHub] M. Li, J. Kim, P. K. Ghosh, V. Ramanarayanan, S. Narayanan "Speaker verification based on fusion of acoustic and articulatory information" accepted in in Proc. Interspeech, Lyon, France, 2013, page(s): 1614-1618 [pdf] [codes] [Corrigendum] [GitHub] Bhuthesh R, P. K. Ghosh, Dipanjan Gope "Enhanced Pulse Rate Measurement from Facial video by Automatic Detection of Sensitive Skin Regions" accepted in in IEEE workshop on Computational Intelligence, IIT Kanpur, July 2013 [pdf] [codes] [Corrigendum] [GitHub] M. Li, A. Lammert, J. Kim, P. K. Ghosh, S. Narayanan "Automatic Classification of Palatal and PharyngealWall Shape Categories from Speech Acoustics and Inverted Articulatory Signals" accepted in in Proc. Workshop on Speech Production in Automatic Speech Recognition, Interspeech, Lyon, France, 2013, page(s): 34-39 [pdf] [codes] [Corrigendum] [GitHub] S. Iqbal, A. Verma, P. K. Ghosh, K. Church, J. Marcus "Intent Focused Summarization of Caller-Agent Conversations" accepted in in Proc. ICASSP, Vancouver, Canada, 2013, page(s): 8352-8356 [pdf] [codes] [Corrigendum] [GitHub] J. Kim, A. Lammert, P. K. Ghosh, S. Narayanan "Spatial and temporal alignment of multimodal human speech production data: real time imaging, flesh point tracking and audio" accepted in iin Proc. ICASSP, Vancouver, Canada, 2013, page(s): 3637-3641 [pdf] [codes] [Corrigendum] [GitHub]
P. K. Ghosh, S. Narayanan "Analysis of inter-articulator correlation in acoustic-to-articulatory inversion using generalized smoothness criterion" accepted in in Proc. Interspeech, Florence, Italy, 2011, page(s): 2685-2688 [pdf] [codes] [Corrigendum] [GitHub] S. Narayanan, E. Bresch, P. K. Ghosh, L. Goldstein, A. Katsamanis, Y. Kim, A. Lammert, M. Proctor, V. Ramanarayanan, Y. Zhu "A Multi-modal Real-Time MRI Articulatory Corpus for Speech Research" accepted in in Proc. Interspeech, Florence, Italy, 2011, page(s): 837-840 [pdf] [codes] [Corrigendum] [GitHub] B. Xiao, P. K. Ghosh, Panayiotis G. Georgiou, S. Narayanan "Overlapped speech detection using long-term spectro-temporal similarity in stereo recording" accepted in in Proc. ICASSP, Prague, Czech Republic, 2011, page(s): 5216-5219 [pdf] [codes] [Corrigendum] [GitHub] A. Tsiartas, P. K. Ghosh, Panayiotis G. Georgiou, S. Narayanan "Bilingual audio-subtitle extraction using automatic segmentation of movie audio" accepted in in Proc. ICASSP, Prague, Czech Republic, 2011, page(s): 5624-5627 [pdf] [codes] [Corrigendum] [GitHub] P. K. Ghosh, S. Narayanan "A subject-independent acoustic-to-articulatory inversion" accepted in in Proc. ICASSP, Prague, Czech Republic, 2011, page(s): 4624-4627 [pdf] [codes] [Corrigendum] [GitHub]
P. K. Ghosh, S. Narayanan, Pierre Divenyi, Louis Goldstein, Elliot Saltzman "Estimation of articulatory gesture patterns from speech acoustics" accepted in Proc. InterSpeech, 6-10 Sep, 2009, Brighton, UK, pp 2803-2806 [pdf] [codes] [Corrigendum] [GitHub] A. Tsiartas, P. K. Ghosh, S. Narayanan "Context-driven bilingual movie subtitle alignment" accepted in Proc. InterSpeech, 6-10 Sep, 2009, Brighton, UK, pp 444-447 [pdf] [codes] [Corrigendum] [GitHub] A. Tsiartas, P. K. Ghosh, P. Georgiou, S. Narayanan "Robust word boundary detection in spontaneous speech using acoustic and lexical cues" accepted in In Proc. ICASSP, Taipei, Taiwan, Apr 2009, page(s): 4785-4788 [pdf] [codes] [Corrigendum] [GitHub]
P. K. Ghosh, T.V. Sreenivas "Dynamic Programming Based Optimum Non-Uniform Samples For Speech Reconstruction and Coding" accepted in ICASSP 2006, Volume 1, Page(s): I-I [pdf] [codes] [Corrigendum] [GitHub] P. K. Ghosh, T.V. Sreenivas "Extrema based Unwarping for Time-varying Pitch Estimation" accepted in in 12th National Conference on Communication (NCC) 2006 [pdf] [codes] [Corrigendum] [GitHub] A. Das, M. Balwani, R. Thota, P. K. Ghosh "Face Recognition from Images with High Pose Variations by Transform Vector Quantization" accepted in ICVGIP 2006, Pages: 674-685 [pdf] [codes] [Corrigendum] [GitHub] A. Das, P. K. Ghosh "Audio-Visual Biometric Recognition by Vector Quantization" accepted in IEEE Spoken Language Technology(SLT) Workshop, Dec 2006, Page(s): 166-169 [pdf] [codes] [Corrigendum] [GitHub]