03 Apr
|
Sony Research India
|
Tirupati
03 Apr
Sony Research India
Tirupati
Apply on Kit Job: kitjob.in/job/45ndmq
Sony Research India is driving cutting-edge research and development in various locations around the globe, including laboratories in Japan, the United States, Europe, and Asia. We endeavor to create new technology, products, and services while sustaining Sony Group’s diverse businesses in electronics, entertainment, and financial fields. For our research centre to blaze a trail in the latest technologies, we seek to foster the growth of a diverse pool of research and engineering talent and create a technology talent bank to drive research excellence worldwide. Sony Research India is offering outstanding career opportunities around frontline technologies such as AI and data analytics.
Sony Research India is seeking a dynamic and motivated Speech Recognition Consultant to join our innovative research team. As a Consultant, you will work on real-world problems in automatic speech recognition (ASR), focusing on improving noise robustness and minimizing code-switching errors in transcription outputs. You'll gain hands-on experience with state-of-the-art tools and datasets, and contribute to impactful projects alongside experienced researchers and engineers.
Key Responsibility:
- Explore and develop techniques to improve ASR robustness under noisy, low-resource, and domain-shifted conditions.
- Investigate code-switching errors in end-to-end ASR models (e.g., Whisper, Wav2Vec2, Conformer) and propose mitigation strategies.
- Conduct experiments using large-scale speech datasets and evaluate ASR performance across varying noise levels and linguistic diversity.
- Understand and implement open-source libraries and repositories.
- Contribute to research publications, technical reports, or open-source tools resulting from the work.
- Support business-related tasks on a day-to-day basis as required.
Work Location:
- Remote within India,
Duration of the paid contractual role:
- The annual paid direct contractual tenure is extendable.
- Ideally this position will start from first week of May 2026.
- The working hours are from 9:00 to 18:00 (Monday to Friday) full time.
Essential Education:
- Master’s degree (Research) with some industry experience in deep learning or machine learning, or a PhD candidate in the final stage of their program.
- Hands-on experience with speech AI or speech processing applications.
Must Have Skills & Abilities:
- Excellent coding skills, especially in Python and PyTorch
- Experience with speech processing libraries (e.g., Torchaudio, Transformers, etc.).
- Experience with ASR models like Wav2Vec2, Whisper, Conformer, RNN-T is a plus.
- Experience with Nemo, ESPNET toolkits.
- Ability to read and implement academic papers.
- Strong foundation in machine learning and signal processing
Good to Have Skills:
- Familiarity with prompt tuning, contrastive learning, or multi-modal architectures.
- Experience with multilingual ASR.
- Papers in top-tier conferences like ICASSP, Interspeech, NeurIPS, AAAI, ACL, etc.
Apply on Kit Job: kitjob.in/job/45ndmq
📌 Speech Recognition - Research Consultant (Tirupati)
🏢 Sony Research India
📍 Tirupati