Dr. S. Umesh
Research Interests
- Automatic Speech Recognition.
- Speaker Normalization & Adaptation.
- Self Supervised Learning.
- Deep Learning and Machine Learning.
- Speaker Recognition & Diarisation.
Awards and Honours
- AICTE Career Award for Young Teachers 1997.
- Alexander von Humboldt Research Fellowship 2004.
Personal Website
For more information, visit my personal website.
Recent Publications
Journals
- S V Bharath Kumar and S Umesh [2008]: Non-Uniform Speaker Normalization Using Affine Transformation,To Appear in Journal of the Acoustical Society of America, Vol 124, No 3, Sep 2008.
- R Sinha and S Umesh [2008]: A Shift based Approach to Speaker Normalization using Non-Linear Frequency-Scaling Model, ISCA Transactions on Speech Communication, Vol 50,No 3, pp 191-202, Mar 2008.
- S Umesh and R Sinha [2007]: A Study of Filter-Bank Smoothing in MFCC Features for Recognition of Children Speech, IEEE Transactions on Audio, Speech and Language Processing, Volume 15, Issue 8, Nov 2007 Page(s): 2418 2430.
- S Umesh, L Cohen and D Nelson [2007]: Fluctuations in Speech, Fluctuations and Noise Letters, 1 Vol 7, No 3, Sep 2007, pp 215224.
- S Umesh, L Cohen and D Nelson [2002]: The Speech Scale, Acoustics Research Letters Online of the Journal of Acoustical Society of America, Vol 3, Issue 3, pp 83-88, July 2002.
- S Umesh, L Cohen and D Nelson [2002]: Frequency Warping and the Mel-scale, IEEE Signal Processing Letters, vol 9, no 3, pp 104-107, March 2002.
- S Umesh, L Cohen, N Marinovic, and D J Nelson [1999]: Scale-Transform in Speech Analysis, IEEE Transactions on Speech and Audio Processing, vol 7, no 1, pp 40-45, Jan 1999.
- S Umesh and D W Tufts [1996]:Estimation of Parameters of Multiple Exponentially Damped Sinusoids using Fast Maximum Likelihood Estimation with Application to NMR Spectroscopy Data,IEEE Trans Signal Processing, vol 44, no 9, pp 2245-2259, Sept 1996.
- D W Tufts, H Ge, and S Umesh [1993]: Fast Maximum Likelihood Estimation of Signal Parameters using the Shape of the Compressed Likelihood Function, IEEE Journal of Oceanic Engg, Vol 18, no 4, pp 388-400, Oct 1993.
Conference Proceedings
- D R Sanand and S Umesh [2008]: Study of Jacobian Compensation Using Linear Transformation of Conventional MFCC for VTLN, To Appear in Interspeech-2008, Brisbane, Sep 2008.
- D R Sanand, V Balaji, R Sandhya Rani and S Umesh [2008]: Use of Spectral Center of Gravity for Generating Speaker Invariant Features for Automatic Speech Recognition, To Appear in Interspeech-2008, Brisbane, Sep 2008.
- P T Akhil, S P Rath, S Umesh and D R Sanand [2008]: A Computationally Efficient Approach to Warp Factor Estimation in VTLN Using EM Algoirthm and Sufficient Statistics’’, To Appear in Interspeech-2008, Brisbane, Sep 2008.
- D R Sanand, D Dinesh Kumar and S Umesh [2007]: Linear Transformation Approach to VTLN Using Dynamic Frequency Warping,’’ Proc of International Conference on Spoken Language Processing (Interspeech 2007), Antwerp, Belgium, August 27-31, 2007.
- S Umesh, L Cohen and D Nelson [2007]: Fluctuations in speech,’’ Proc of Conference on Noise and Fluctuations in Biological, Biophysical, and Biomedical Systems, Florence, Italy, May 2007.
- S Umesh, D Rama Sanand, G Praveen [2007]: Speaker-Invariant Features for Automatic Speech Recognition,’’ Proc. of International Joint Conferences on Artificial Intelligence, (IJCAI-07), pp 1738-1743, Jan 2007.
- S V Bharath, S Umesh and R Sinha [2006]: Study of Non-Linear Frequency Warping Functions for Speaker Normalization,’’ To Appear in Proc of IEEE International Conf on Acoustic, Speech and Signal Processing, (ICASSP Toulouse), April 2006.
- J Loof and H Ney and S Umesh [2006]: VTLN Warping Factor Estimation Using Accumulation of Sufficient Statistics,’’ To Appear in Proc of IEEE International Conf. on Acoustic, Speech and Signal Processing, (ICASSP Toulouse), April 2006.
- S Umesh, A Zolnay and H Ney [2005]: Implementing Frequency-Warping and VTLN Through Linear Transformation of Conventional MFCC,’’ Proc of InterSpeech 2005, (Lisbon, Portugal), Sep’2005.
- S Umesh, L Cohen and D Nelson [2005]: The Speech Scale and Spectral Transformation,’’ Proc of SPIE Conference on Wavelet Applications in Signal & Image Proc, July’2005.
- S V Bharath and S Umesh [2004]: Non-uniform speaker normalization using frequency-dependent scaling function,’’ Proc IEEE International Conference on Signal Processing and Communications, (Bangalore), December 2004.
Books Authored
- Ajit K Chaturvedi, Srinivasan Umesh, Adrish Banerjee, Kameswari Chebrolu, Joseph John, Ayyangar R Harish (Editors): Proceedings of the Thirteenth National Conference on Communications, I I T Kanpur, 26-28 January 2007 ;ISBN Number: 978-81-904444-0-8.
Professional Experience
- Professor, Dept. of Electrical Engineering, IIT Madras, At Present
- Faculty, Dept. of Electrical Engineering, IIT Kanpur, Jun 1996 - Jul 2009
- Visiting Researcher, Computer Science VI, RWTH-Aachen, Germany, May 2004 - Jun 2005
- Visiting Researcher, Machine Intelligence Laboratory-Cambridge University, Jun 2003 - Apr 2004
- Visiting Researcher, AT&T Laboratories-Research, USA, May - Dec 1999
- Visiting Research Faculty, City University of New York, USA, Summer 1997, 1998, 2002
- Post-Doctoral Fellow, City University of New York, USA, 1994 - 1996
- Post-Doctoral Fellow, University of Rhode Island, USA, 1993 - 1994
Other Information
- Faculty, Dept of Electrical Engineering, IIT-Kanpur (June 1996 - July 2009) (First as Assistant Professor and finally as Professor).
- Visiting Researcher, Computer Science VI, RWTH-Aachen, Germany (May 2004 - June 2005) (on Alexander von Humboldt Research Fellowship).
- Visiting Researcher, Machine Intelligence Laboratory-Cambridge University Engg Dept, UK (June 2003 - April 2004).
- Visiting Researcher, AT&T Laboratories-Research, USA (May-Dec 1999).
- Visiting Research Faculty, City Univ of New York, USA (Summer 1997, Summer 1998, Summer 2002).
- Post-Doctoral Fellow, City Univ of New York, USA (1994-1996).
- Post-Doctoral Fellow, Univ of Rhode Island, USA (1993-1994).