Senior Data Scientist- Audio And Speech (Multimodal AI)
- Elbit Systems
- Sharon, Massachusetts
- Full Time
?Are you ready to push the boundaries of Audio Intelligence
We're looking for a Senior Data Scientist with deep expertise in Audio AI, Speech Processing, and Generative Modeling to design and develop advanced onprem multimodal systems capable of understanding, generating, and analyzing complex audio streams in noisy, realworld environments
You'll join a worldclass Defense Tech AI team building speechdriven solutions that enable intelligent communication, operational insight, and nextgeneration humanmachine interaction
:What You'll Do
Fine tune, and evaluate SpeechtoText (STT) models optimized for noisy, lowlatency, and missioncritical environments
Develop speaker identification and diarization ,sentiment and emotional analysis to detect tone, stress levels, and affective patterns
Design and optimize multimodal pipelines combining audio, text, and visual inputs for enhanced semantic understanding and crossmodal reasoning
Contribute to Generative AI innovations - noise reduction, voice conversion, speech enhancement, and conversation insights
Collaborate closely with ML engineers and research peers to deploy, scale, and optimize Audio AI models onprem and edge hardware
Work with domain experts to adapt models for realtime speech understanding, decision support, and behavioral insights
:Your Expertise
Solid background in Machine Learning, Deep Learning, and Audio Signal Processing
5+ years handson experience developing and deploying speech or audiobased AI models
3+ years focused on STT / ASR, TTS, speaker recognition, or sentiment analysis
Deep familiarity with architectures such as Conformer, Whisper, RNNTransducer, FastSpeech / Tacotron, speaker embedding networks, and selfsupervised speech representations
Experience handling noisy, realtime audio, latency optimization, and edgedevice constraints
Understanding of semantic embeddings, multimodal search, and RAG architectures
Strong datadriven mindset and ability to conduct research on novel Audio AI approaches
Comfortable working with Agile workflows, MLOps, and DevOps principles
Publication record, Kaggle or challenge participation, or equivalent - Advantage
:Why Join Us
Work with leading researchers and engineers on nextgeneration Speech and Audio Intelligence
Make a direct impact on speech understanding, generation, and sentiment analytics in realworld applications
Collaborate on cuttingedge multimodal AI systems integrating vision, audio, and language
Be part of a forwardthinking team that values creativity, research excellence, and continuous learning
Shape the future of Audio and Speech AI - from concept to deployment
Only suitable applications will be considered
#Netanya