We are seeking a senior technical contributor to help support, modernize, and scale our on premise high performance computing platform. This role will work across Linux systems administration, HPC operations, Kubernetes-based services, automation, observability, software tooling, and user-facing platform delivery. The ideal candidate has deep experience administering RHEL based systems in complex compute environments and is comfortable troubleshooting issues across operating systems, schedulers, storage, networking, containers, applications, and user workloads.
This person will play a key role in improving the reliability, usability, and operational maturity of the platform. They will help develop and maintain core HPC services, support users running demanding engineering and AI/ML workloads, and create tooling, scripts, APIs, and integrations. Strong software engineering fundamentals are important, including experience with Python, Go, or similar languages, Git-based development workflows, code reviews, testing practices, CI/CD pipelines, documentation, and maintainable code design. Experience with Slurm or other workload managers is highly valued.
We are looking for someone who can balance strong technical depth with a user-focused delivery mindset. This role requires the ability to work collaboratively with platform engineers, application teams, and technical users to identify pain points, resolve production issues, document repeatable processes, and build durable improvements. The right candidate will be pragmatic, a team player, comfortable in a fast-moving environment, and motivated by making complex, massive on-prem infrastructure easier to operate, automate, observe, and continuously improve.
RESPONSIBILITIES- Administer, troubleshoot, and improve RHEL based high performance computing environments supporting CPU and GPU workloads.
- Create and maintain HPC services across compute, storage, networking, scheduling, Kubernetes, and observability.
- Develop tools, scripts, APIs, integrations, and automation using Python, Go, Bash, or similar languages.
- Apply software engineering best practices, including Git workflows, code reviews, testing, modular design, and CI/CD.
Support and help update HPC scheduling environments, with Slurm experience preferred.
Improve monitoring, alerting, dashboards, and operational visibility using Grafana, Prometheus, Dynatrace, and related tools.
- Partner with users, customers, and internal engineering teams to understand requirements, resolve issues, and improve platform usability.
- Create and maintain documentation, architecture notes, user guides, and operational procedures.
Drive platform modernization focused on reliability, scalability, automation, security, and maintainability.
- Bachelors degree in Computer Science, Engineering, or related field, or equivalent experience
- 10+ years of experience in systems engineering, infrastructure engineering, platform engineering, or a related technical role.
- Strong Linux systems administration experience, preferably with RHEL.
- Experience with Slurm, PBS, or another HPC workload manager.
- Experience creating APIs, applications, and services that support platform operations and user workflows.
- Experience supporting production compute, infrastructure, and large-scale technical environments.
Hands-on experience with scripting and software development using Python, Go, Bash, or similar languages.
- Familiarity with CI/CD concepts, GitHub, and modern software delivery practices.
- Strong troubleshooting skills across operating systems, services, networking, storage, and application layers.
- Ability to write clear documentation and communicate effectively with both technical and non-technical stakeholders.
Strong ownership mindset with the ability to drive issues to resolution.
Ability to use independent judgement to make sound technical decisions.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builderor all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Years Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary
This position is a salary grade 8 .
This position is a salary grade 8 and ranges from $113,580-192,900 .
*Visa Sponsorship is not provided for this role *
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call ....
#LI-Remote
#LI-GH2