Site Reliability Engineer
- Eye Care Partners
- Tampa, Florida
- 14 hours ago
- Full Time
Job Summary
Job Description
EyeCare Partners is the nations leading provider of clinically integrated eye care. Our national network of over 300 ophthalmologists and 700 optometrists provides a lifetime of care to our patients with a mission to enhance vision, advance eye care and improve lives. Based in St. Louis, Missouri, over 650 ECP-affiliated practice locations provide care in 18 states and 80 markets, providing services that span the eye care continuum. For more information, visit click to view .
SUMMARY
We are seeking a highly skilledSite Reliability Engineer (SRE)with deep expertise inAmazon Web Services (AWS)andDatadog monitoringto join our growing I nfrastructure team. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems. You will work closely with development, operations, and security teams to build and maintain robust monitoring, alerting, and automation solutions. The ideal candidate will have a solid understanding of infrastructure and cloud technologies, as well as experience working in a high growth, large enterprise , multi -location environment. The role reports to the Director of Cloud IT and will interface with technology and business stakeholders alike.
ESSENTIAL DUTIES AND RESPONSIBILITIES
- Design, implement, and maintain scalable and reliable infrastructure on AWS and Azure.
- Conduct capacity planning and performance testing.
- Define and track SLIs, SLOs, RTO and RPOs.
- Define and test HA & DR Strategies for in scope cloud applications.
- Develop and manage Datadog dashboards, monitors, and alerts to ensure system health and performance.
- Automate operational tasks using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
- Collaborate with development teams to improve system reliability and performance through observability best practices.
- Conduct root cause analysis and post-mortems for incidents, and drive continuous improvement.
- Participate in on-call rotations and respond to production incidents with a focus on minimizing downtime.
- Ensure compliance with security and operational standards.
- Partner with in-house and contract team members to implement projects and operations related to AWS and Azure.
- Respond to and support operational duties related to the platforms, including requests and performance monitoring
- Assist in the selection, configuration, and deployment of services across cloud platforms
- Promote the use of tooling and architectures to drive best practices and awareness of cloud capabilities across the organization
- Stay up to date on the latest trends, features, and security capabilities for the Cloud IT team
- Contribute, and lead where needed, to the improvement of technological support processes
- In short, you will be responsible for ensuring our cloud environments are reliable, scalable and performing adequately to meet the needs of the business.
- Adheres to all safety policies and procedures in performing job duties and responsibilities while supporting a culture of high quality and great customer service.
- Performs other duties that may be necessary or in the best interest of the organization.
QUALIFICATIONS
- AWS certifications preferred (e.g., AWS Certified DevOps Engineer, Solutions Architect).
- Experience with other observability tools (e.g., Prometheus, Grafana, ELK).
- Knowledge of service-level objectives (SLOs), service-level indicators (SLIs), and error budgets.
- Experience in high-availability, high-traffic environments.
- Strong understanding of cloud computing principles and infrastructure experience
- Application support experience with Microsoft Portfolio of Products, including M365, Windows Server, etc. Linux experience is a plus.
- Professional in appearance and actions, detail oriented and reliable.
- Logical and Critical thinking skills
- Customer-focused with excellent written, listening and verbal communication skills
- Exhibits a positive attitude and is flexible in accepting work assignments and priorities
- Management and organizational skills to support the leadership of this function
- Ability to follow or provide verbal & written instructions with sufficient grammar and spelling skills to avoid mistakes or misinterpretations
- Providing system documentation is a key output of this position
- Interpersonal skills to support customer service, functional, and teammate support as needed
- Able to communicate effectively in English, both verbally and in writing
- Ability for basic to intermediate problem solving, including mathematics
- Intermediate to Advanced computer operation
- Proficiency with Microsoft Excel, Word, and Outlook
- Travel to other site locations may be necessary. Thus, those needing to travel for work must have access to dependable transportation, and driving record must meet company liability carrier standards
- Specialty knowledge of systems relating to job function
- Knowledge of state and federal regulations for this position; general understanding of HIPAA guidelines
EDUCATION AND/OR EXPERIENCE
- Minimum Required: B.S. or B.A. in Computer Science, Information Technology or related field
- Minimum Required: 3+ years of related experience
- 3+ years of experience in a Site Reliability Engineering, DevOps, or Cloud Infrastructure role.
- Strong hands-on experience with AWS services (EC2, ECS/EKS, RDS, S3, Lambda, CloudWatch, etc.).
- Proficiency in Datadog for monitoring, alerting, and performance tuning.
- Experience with scripting and automation (e.g., Python, Bash, or Go).
- Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Solid understanding of networking, security, and system administration.
- Experience with CI/CD tools (e.g., Jenkins, GitHub Actions, GitLab CI).
- 5+ years of experience hands-on deployment of enterprise-level platforms
- 5+ years' experience working on cross-functional technology projects
LICENSES AND CREDENTIALS
- M365, Azure or AWS preferred
SYSTEMS AND TECHNOLOGY
- Proficient in Microsoft Excel, Word, PowerPoint, Outlook
LOCATION
- This position is located in St Louis, Missouri. Candidates living in Alabama, Arizona, Florida, Georgia, Illinois, Indiana, Kansas, Kentucky, Michigan, Minnesota, Missouri, New Jersey, N. Carolina, Ohio, Oklahoma, Pennsylvania, Texas and Virginia may also be considered for remote work.
PHYSICAL REQUIREMENTS
- The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
- While performing the duties of this job, the employee is frequently required to stand, walk, sit, reach with arms and hands, talk and hear. The individual must occasionally lift and/or move up to 50+ pounds. Specific vision abilities required for this job include close vision, distance vision and ability to adjust focus.
If you need assistance with this application, please contact .... Please do not contact the office directly only resumes submitted through this website will be considered.
EyeCare Partners is an equal opportunity/affirmative action employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
Job Summary
Eye Care Partners
Job ID: 485330472
Originally Posted on: 7/15/2025