Site Reliability Developer 3 - US Citizen Required
Oracle Cloud Infrastructure
Site Reliability Developer 3 - US Citizen Required
- Job Identification 305585
- Job Category Product Development
- Posting Date 08/20/2025, 08:56 PM
- Role Individual Contributor
- Job Type Regular Employee
- Does this position require a security clearance? No
- Years 6 to 10+ years
- Applicants Less than 10 applicants
- Additional Info Visa / work permit sponsorship is not available for this position
- Applicants are required to read, write, and speak the following languages English
Job Description
Within the Oracle Health (OHAI) organization, the new EHR and Clinical AI Agent (CAA) cloud services are at the forefront of new generative AI services for healthcare organizations. Building on the success of the established Oracle Digital Assistant (ODA) product, EHR and CAA enable healthcare providers to leverage advanced AI technologies, together with voice commands, to reduce manual work and enable providers to focus on patient care.
Oracle Health EHR and CAA are expanding their Oracle Cloud Infrastructure (OCI) Operations teams and looking to bring in new Site Reliability Engineers. As an SRE engineer, you will be engaged in solving technical challenges on an advanced OCI cloud service platform, focusing on areas such as reliability, scalability, resilience, security, and performance.
You will define how to use latest technologies to optimize the operational efficiency of the service. You will gain a deep understanding of ChatBots, cognitive services, machine learning and analytics. You will work with a team pushing the boundaries of a scalable, self-healing, autonomous platform built on Kubernetes, Docker, Prometheus, and Grafana. You will be exposed to a wide range of OCI cloud services and understand how we interact with many dependent services across the organization.
Areas of responsibility
- Service Ownership
As part of the EHR/CAA team, you will:
- Be responsible for all operational aspects of the OCI services included in our portfolio.
- Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the EHR and CAA products.
- Own end-to-end availability, reliability, and performance of a Cloud Service
- Participate in LiveSite operations, working rapidly to mitigate issues that may arises
- Service Design
- Designing and implement solutions for rolling out software and security updates with zero downtime
- Partner with development and product management to build and maintain platform and automation frameworks to ensure maximum up-time and predictability, preventing outages and service interruptions or degradation
- Analyze system failures and develop rapid response processes
- Operations engineering
- Evaluate the operation of cloud service deployments across commercial and government datacenters
- Monitor the degradation of the service and dependencies under load, and implement solutions to ensure high availability to our customers
- Analyse resource utilization and scaling requirements in a high-end production system
- Resolve security vulnerabilities to conform to corporate and government security standards.
- Automation
- Building on your understanding of automation and orchestration principles, you will be identifying opportunities to automate SRE procedures in production environments
- The solution implemented will be designed to minimize the possibility of errors being introduced into the system
- Technical expertise
- Handle complex, critical issues encountered in production environments, drawing on your accumulated technical knowledge to rapidly identify the issues and apply steps to mitigate.
- Develop an understanding of the underlying AI technologies used to implement the EHR and CAA services
- Minimum 5 years of hands-on Platform Engineering, DevOps or SRE experience
- BS or MS in Computer Science, Computer Engineering, or equivalent
- Excellent team skills, can-do attitude, focus on quality.
- Technical role with a history of embracing automated processes, cloud native application design principles and a CI/CD DevOps model.
- Strong trouble shooting capabilities targeting complicated problems in remote systems
- Experience with production operations and best practices for deploying quality code in production and troubleshooting issues when they arise.
- Experience with public cloud (OCI, AWS, GCP, Azure).
- Knowledge of Infrastructure as Code (IaC), Configuration as Code (CaC), GitOps and tools such as Terraform, Argo CD, Flux, etc.
- Experience and working knowledge in languages like Python or Java.
- Experience deploying, configuring, managing and debugging cloud infrastructure and platform software such as OpenStack, Kubernetes, etc.
- Experience with public cloud managed Kubernetes (such as OCI/OKE, AWS/EKS, GCP/GKE, Azure/AKS).
- Experience with cloud-native administration and monitoring/alerting technologies such as Docker, Helm, Prometheus, Grafana, EFK/ELK, Jaeger, or similar technologies.
- Experience designing and implementing CI/CD pipelines, platforms and components such as Jenkins, Argo CD.
- Knowledge of version control using Git.
- Experience in Linux/Unix environment
- Experience with application frameworks such as Spring, Helidon, Micronaut, etc. is a plus.
- Experience developing or designing healthcare software is a plus.
- Experience working in Agile/Scrum development process is a plus.
- Experience working with MLOps/AIOps tooling is a plus.
- Must be eligible to obtain & maintain a US government security clearance appropriate for this role, which requires you to be a US Citizen.
Responsibilities
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.
Qualifications
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $79,100 to $158,200 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle’s differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
About Us
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Request a referral from an Oracle employee.
Similar Jobs