Founding Senior Data/ML Infrastructure Engineer

  • CoCo
  • San Francisco, California
  • Full Time
At Coco , our mission is to revolutionize urban logistics by empowering cities, boosting local economies, and delivering delightful customer experiences. We connect people with local restaurants through our fleet of on-demand delivery robots, helping merchants reach their customers faster and more efficiently. By building innovative robotic systems that seamlessly navigate city sidewalks, Coco plays a key role in reshaping the future of last-mile delivery and enhancing local businesses. To deliver on our mission, we are building an autonomy team to develop the AI technology that will enable our robot pilots to scale efficiently, sustainably, and safely. The involves building an autonomy stack ground-up based on our millions of miles of last-mile delivery routes, proprietary video streams, and LiDAR data. What is the scope of this role? As a Founding Data & ML Infrastructure Engineer , you will be responsible to stand up Coco's autonomy stack alongside the CTO and fellow team members in the autonomy team. You will be responsible for developing and maintaining the infrastructure that supports the collection, processing, management, and training of large-scale datasets for our autonomous robots. The impact of this will be massive improvements to our robot-to-pilot ratio thereby allowing every person living in an urban area to benefit from last-mile delivery. In this role, you must accomplish the following: Design and implement a high-performance data engine to mine and identify valuable data samples that enhance model training. Build tools and pipelines for automatically extracting, cleaning, and curating data from various sources (sensors, logs, real-world interactions). Enable seamless interaction with large-scale datasets, ensuring that the team can quickly retrieve and analyze data to drive insights. Collaborate with the autonomy and AI engineers to develop the query layer and workflows for training and testing models Build and maintain tools for dataset management , including data exploration, versioning, and interaction tools. Architect and manage the infrastructure for model training and experimentation. This includes continuously optimizing data pipelines and infra for cost, scalability, and speed. Create and maintain systems for dataset tracking and governance to ensure consistent and reproducible experiments. Must have competencies: 5+ years of experience in software engineering, data engineering, or infrastructure engineering, with a focus on machine learning or AI systems. Extremely well versed in building and managing cloud infrastructure for large-scale data processing and model training (AWS, GCP, Azure). Excellent programming skills. Familiarity with ML frameworks ie TensorFlow, PyTorch. Strong understanding of data pipelines, versioning, and data management best practices. Experience working with containerization and orchestration tools (Docker, Kubernetes). Strong experience with cloud platforms and infrastructure as code (Terraform, CloudFormation). Familiarity with distributed systems, high-performance computing, and optimization for training large models. Hands-on experience with tools for data management and interaction (eg, DVC, Delta Lake, or similar tools). Strong leadership and communication skills. Web Reference AJF/862516727-430 Posted Date Mon, 30 Jun 2025 To apply for this position you will complete an application form on another website provided by or on behalf of CoCo . Please note JobShark - California Jobs is not responsible for the application process on any external website.
Job ID: 483909264
Originally Posted on: 7/3/2025

Want to find more Technology opportunities?

Check out the 156,628 verified Technology jobs on iHireTechnology