OpenStack Infrastructure Engineer

  • GTN Technical Staffing
  • McKinney, Texas
  • Full Time

Job Title: OpenStack Infrastructure Engineer
Location: Onsite in McKinney, TX (with flexibility)
Compensation: $150,000 to $190,000/year (based on experience)
Type: Full-Time, Direct Hire

Overview:

We are seeking an experienced OpenStack Infrastructure Engineer to lead the design, deployment, and optimization of our enterprise-scale infrastructure supporting hybrid cloud and colocation environments. This is a critical, hands-on role focused on delivering resilient, secure, and high-performing OpenStack platforms that power our virtualized services across multiple data centers.

You'll work onsite in McKinney, TX, with flexibility, and play a key role in bridging physical infrastructure, automation, and scalable cloud-native operations. If you're passionate about open infrastructure, bare-metal performance, and running systems at 99.999% uptime, this role is for you.

Key Responsibilities:

Infrastructure Design & Deployment
  • Architect and deploy OpenStack clusters across distributed, geo-redundant data centers.

  • Ensure availability and fault tolerance for hybrid workloads with a target of 99.999% uptime.

  • Evaluate, procure, and manage compute, storage, and networking hardware aligned with OpenStack/Ceph requirements.

Data Center Operations & Capacity Planning
  • Collaborate with colocation providers to ensure appropriate power, cooling, and rack space.

  • Utilize DCIM tools to plan and manage rack-level resources and capacity.

  • Conduct quarterly audits to monitor resource utilization and anticipate upgrade cycles.

Networking & Security
  • Configure SD-WAN, AWS Direct Connect, and hybrid connectivity strategies.

  • Enforce OpenStack and Linux-based security controls (Keystone, Neutron, encrypted Ceph).

  • Conduct regular security audits and lead remediation efforts.

Automation & Observability
  • Automate deployments using Terraform, Ansible, MAAS, and other IaC tooling.

  • Build predictive monitoring pipelines using OpenTelemetry, Grafana, Prometheus, and Loki.

  • Create self-healing infrastructure patterns to minimize MTTR.

Key Performance Indicators (KPIs):
  • Task Timeliness: 80% on-time completion.

  • MTTR (Mean Time to Recovery): Root Cause Analysis: Preliminary RCA within 24 hours; final RCA within 3 days of incident closure.

Required Competencies:
  • Expert in OpenStack (Nova, Neutron, Keystone), Ceph, and virtualization technologies (KVM/QEMU).

  • Strong proficiency in IaC: Terraform, OpenTofu, Pulumi, Ansible.

  • Deep knowledge of Linux internals and performance analysis (eBPF).

  • Experience with DCIM platforms and hardware lifecycle management.

  • Proficient in observability tools: APM, Grafana, Prometheus, Loki, OpenTelemetry.

  • Strong background in networking, protocols, and secure design practices.

  • Proven success managing container environments (Kubernetes, Docker, Helm).

  • Familiarity with Git, CI/CD (GitHub Actions, Argo), and ticketing systems (Jira, Azure DevOps).

Ideal Attributes:
  • High ownership mentality and ability to independently lead initiatives.

  • Curiosity-driven with a passion for continuous learning.

  • Strong communicator with cross-functional team experience.

  • Ability to prioritize and manage competing demands with clarity and confidence.

  • Empathetic leader who thrives in collaborative engineering environments.

Job ID: 478131387
Originally Posted on: 5/22/2025
