Senior Data Scientist- Audio And Speech (Multimodal AI)

  • Elbit Systems
  • Sharon, Massachusetts
  • Full Time

?Are you ready to push the boundaries of Audio Intelligence

We're looking for a Senior Data Scientist with deep expertise in Audio AI, Speech Processing, and Generative Modeling to design and develop advanced onprem multimodal systems capable of understanding, generating, and analyzing complex audio streams in noisy, realworld environments

You'll join a worldclass Defense Tech AI team building speechdriven solutions that enable intelligent communication, operational insight, and nextgeneration humanmachine interaction

:What You'll Do

Fine tune, and evaluate SpeechtoText (STT) models optimized for noisy, lowlatency, and missioncritical environments

Develop speaker identification and diarization ,sentiment and emotional analysis to detect tone, stress levels, and affective patterns

Design and optimize multimodal pipelines combining audio, text, and visual inputs for enhanced semantic understanding and crossmodal reasoning

Contribute to Generative AI innovations - noise reduction, voice conversion, speech enhancement, and conversation insights

Collaborate closely with ML engineers and research peers to deploy, scale, and optimize Audio AI models onprem and edge hardware

Work with domain experts to adapt models for realtime speech understanding, decision support, and behavioral insights

:Your Expertise

Solid background in Machine Learning, Deep Learning, and Audio Signal Processing

5+ years handson experience developing and deploying speech or audiobased AI models

3+ years focused on STT / ASR, TTS, speaker recognition, or sentiment analysis

Deep familiarity with architectures such as Conformer, Whisper, RNNTransducer, FastSpeech / Tacotron, speaker embedding networks, and selfsupervised speech representations

Experience handling noisy, realtime audio, latency optimization, and edgedevice constraints

Understanding of semantic embeddings, multimodal search, and RAG architectures

Strong datadriven mindset and ability to conduct research on novel Audio AI approaches

Comfortable working with Agile workflows, MLOps, and DevOps principles

Publication record, Kaggle or challenge participation, or equivalent - Advantage

:Why Join Us

Work with leading researchers and engineers on nextgeneration Speech and Audio Intelligence

Make a direct impact on speech understanding, generation, and sentiment analytics in realworld applications

Collaborate on cuttingedge multimodal AI systems integrating vision, audio, and language

Be part of a forwardthinking team that values creativity, research excellence, and continuous learning

Shape the future of Audio and Speech AI - from concept to deployment

Only suitable applications will be considered

#Netanya

Job ID: 522700039
Originally Posted on: 5/28/2026

Want to find more Technology opportunities?

Check out the 165,053 verified Technology jobs on iHireTechnology