Data & ML Infrastructure Architect Posted 07/24/25
- Protingent
- Tukwila, Washington
- Full Time
Data & ML Infrastructure Architect
Tukwila, WA 98188
Posted: 07/24/2025
Employment Type: Direct Hire
Department: Core Technologies - FTE
Job Number: 30732
Industry: AUTO - Automotive
JOB DESCRIPTION
Position Title: Data & ML Infrastructure Architect
Position Description: Protingent Staffing has an exciting Remote Direct Hire opportunity.
Job Responsibilities:
Own the architecture of ML data infrastructure, enabling scalable ingestion, storage, curation, and access for 100+ engineers and researchers across autonomy teams.
Design and evolve infrastructure to support petabyte-scale machine learning workflows, including multimodal perception data, synthetic data, simulation output, and continuous training pipelines.
Architect high-throughput systems for distributed training on large GPU clusters, driving significant improvements in utilization, throughput, and job efficiency.
Establish robust data governance, observability, and retention strategies to ensurepliance, reproducibility, and long-term data utility.
Collaborate cross-functionally with ML engineers, autonomy researchers, data engineers, and DevOps to ensure tight integration between infrastructure and user workflows.
Lead technical strategy and roadmap development for the ML & Data Platform team, incorporating cutting-edge tools and best practices from industry and open source.
Mentor and influence engineers across teams, promoting engineering excellence in distributed systems, ML platforms, and autonomy-scale data management.
Job Qualifications:
15+ years of meaningful software engineering experience, including significant architecture-level ownership in ML, data infrastructure, or high-scale systems.
Proven experience leading the design of ML platforms that serve large-scale training and inference workloads.
Deep technical fluency in distributed storage, high-volume data pipelines, and datapression strategies for ML use cases.
Strong knowledge of Linux systems, Python, and C++ or similar performance-oriented languages.
Experience operating in hybrid environments: bare metal, HPC, and public cloud (AWS/GCP/Azure).
Comfortable owning cross-org initiatives and influencing system-level design across autonomy, simulation, and platform teams.
Prior work in robotics, autonomous vehicles, or safety-critical domains strongly preferred.
Preferred Job Qualifications:
Experience building or leading infrastructure at a top-tier ML/AIpany or AV program.
Background contributing to open-source ML or data infrastructure projects.
Familiarity with ML experiment tracking, model evaluation pipelines, and versioned data systems.
Job Details:
Job Type: Direct Hire
Location: Remote
Salary Range: $205 - 282k/year.
Tukwila, WA 98188
Posted: 07/24/2025
Employment Type: Direct Hire
Department: Core Technologies - FTE
Job Number: 30732
Industry: AUTO - Automotive
JOB DESCRIPTION
Position Title: Data & ML Infrastructure Architect
Position Description: Protingent Staffing has an exciting Remote Direct Hire opportunity.
Job Responsibilities:
Own the architecture of ML data infrastructure, enabling scalable ingestion, storage, curation, and access for 100+ engineers and researchers across autonomy teams.
Design and evolve infrastructure to support petabyte-scale machine learning workflows, including multimodal perception data, synthetic data, simulation output, and continuous training pipelines.
Architect high-throughput systems for distributed training on large GPU clusters, driving significant improvements in utilization, throughput, and job efficiency.
Establish robust data governance, observability, and retention strategies to ensurepliance, reproducibility, and long-term data utility.
Collaborate cross-functionally with ML engineers, autonomy researchers, data engineers, and DevOps to ensure tight integration between infrastructure and user workflows.
Lead technical strategy and roadmap development for the ML & Data Platform team, incorporating cutting-edge tools and best practices from industry and open source.
Mentor and influence engineers across teams, promoting engineering excellence in distributed systems, ML platforms, and autonomy-scale data management.
Job Qualifications:
15+ years of meaningful software engineering experience, including significant architecture-level ownership in ML, data infrastructure, or high-scale systems.
Proven experience leading the design of ML platforms that serve large-scale training and inference workloads.
Deep technical fluency in distributed storage, high-volume data pipelines, and datapression strategies for ML use cases.
Strong knowledge of Linux systems, Python, and C++ or similar performance-oriented languages.
Experience operating in hybrid environments: bare metal, HPC, and public cloud (AWS/GCP/Azure).
Comfortable owning cross-org initiatives and influencing system-level design across autonomy, simulation, and platform teams.
Prior work in robotics, autonomous vehicles, or safety-critical domains strongly preferred.
Preferred Job Qualifications:
Experience building or leading infrastructure at a top-tier ML/AIpany or AV program.
Background contributing to open-source ML or data infrastructure projects.
Familiarity with ML experiment tracking, model evaluation pipelines, and versioned data systems.
Job Details:
Job Type: Direct Hire
Location: Remote
Salary Range: $205 - 282k/year.
Job ID: 487126083
Originally Posted on: 7/28/2025
Want to find more Technology opportunities?
Check out the 147,816 verified Technology jobs on iHireTechnology
Similar Jobs