
Site Reliability Engineer
/10
Job Description
We help the world run better
At SAP, we keep it simple: you bring your best to us, and we'll bring out the best in you. We're builders touching over 20 industries and 80% of global commerce, and we need your unique talents to help shape what's next. The work is challenging – but it matters. You'll find a place where you can be yourself, prioritize your wellbeing, and truly belong. What's in it for you? Constant learning, skill growth, great benefits, and a team that wants you to grow and succeed.
What you’ll do:
We are looking for a seasoned, motivated Site Reliability Engineer to join our expanding team. This key role is integral in optimizing and fine-tuning our operational efficiency, guiding our technological infrastructure to manage, plan and coordinate the software development process. The ideal candidate will work collaboratively to develop automated deployment tools, monitor system health, maintain secure and efficient IT architecture, and ensure swift troubleshooting of issues when they arise.
As a Site Reliability Engineer, you will play a critical role in systems operations for our company. As part of an SRE team, you will be responsible for service deployment, developing and implementing software continuous integration and delivery pipelines, automating operations and development processes, troubleshooting issues, and streamlining application deployment. In addition, you will maintain configuration management solutions and work with various teams to improve DevOps practices throughout the organization. You will also play a vital role in designing infrastructure strategies to ensure reliable, efficient, and secure IT systems – this would also include the development of comprehensive monitoring solutions to provide full visibility to the different platform components using tools and services that integrate with the chosen cloud provider.
Following SRE principles, you will also collaborate with experienced software engineers from around the globe and jointly investigate problems and help improving our products. You will be also expected to share your broad knowledge and experience to educate junior colleagues.
We will count on your strong support with our UPCOMING PROJECTS:
- Deploy and support the Digital Workspace product on K8S runtime
- Extend and Automate Security requirements to key services
- Adopt Multi-AZ requirements – organize, continuously execute and automate chaos testing to ensure service resilience and availability
- Develop, maintain, and extend monitoring, alerting and remediation tooling for SAP Build process automation services (SBPA)
- Deploy and support key services on GCP landscapes
Part of this role includes being available for service support out-of-office hours.
What you bring:
With at least 2 years’ relevant work experience, you should have a good knowledge of modern cloud architectures, debugging and profiling tools. You will also have a passion for automation and experience with different tools. We are looking for a team player who can work efficiently in emergency situations and quickly analyse and solve problems in a worldwide team setup. Excellent communication skills are required in these scenarios to ensure information distributed is precise and factual.
You should also have practical experience in at least one of the following areas and good knowledge of the rest:
- Bachelor’s degree in computer science, Information Technology, or a related field
- Significant previous experience in a DevOps, SRE, System Admin, or similar role
- Strong experience with cloud services (AWS, GCP, Azure) and architecture
- Solid experience with DevOps toolchain (Jenkins, Travis CI, Puppet, Chef, GitHub)
- Solid experience with coding and scripting languages (Python, Go, Bash, Selenium, Groovy)
- Solid experience with monitoring tools such as Grafana and Dynatrace
- Experience with performance tuning, monitoring, and system-level debugging
- Knowledge and experience in automating response to system-related incidents, warnings, and key performance indicators (KPIs) would be a plus
- Knowledge of containerization (Cloud Foundry, Docker, Kubernetes, ECS, or OpenShift)
- Understanding of Infrastructure as Code (IaC) using tools like Terraform, Cloudformation or Ansible
- Excellent problem-solving skills and attention to detail
- Ability to support 24/7 on-call rota schedule within the team
- Knowledge of Agile methodology and processes
We would also look for some key soft skills such as:
- Communication
- Positive “can do” attitude
- Teamwork
- Excellent work ethic – stay focused and complete tasks in a timely manner especially when under pressure
- Willingness to learn
- Leadership skills – ability to mentor/ share knowledge with junior team members
Meet your team
The SAP Build Site Reliability Engineering team plays a fundamental role in managing and maintaining the organization's cloud computing strategy. They are responsible for overseeing the daily operations and maintenance of cloud applications, ensuring their availability, performance, reliability, and security. This involves monitoring system health, handling software upgrades and deployments, identifying and troubleshooting issues, and ensuring optimal resource allocation. They also work closely with other teams to troubleshoot complex system issues, implement necessary updates, and ensure compliance with industry's best practices and regulations. Furthermore, the SRE team is crucial in disaster recovery planning and execution, as well as creating guidelines and procedures for cloud operations.
The Cloud Ops team makes the SAP Services run better by providing 24x7 deep technical coverage for Incident Management applying SRE principles. We share a Live Site First culture and care for the business continuity of our customers running mission critical applications on top of the Cloud Platform.
Bring out your best
SAP innovations help more than four hundred thousand customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with two hundred million users and more than one hundred thousand employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, you can bring out your best.
We win with inclusion
SAP’s culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone – regardless of background – feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better world.
SAP is committed to the values of Equal Employment Opportunity and provides accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to Recruiting Operations Team: Careers@sap.com.
For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program, according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training.
Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, gender (including pregnancy, childbirth, et al), sexual orientation, gender identity or expression, protected veteran status, or disability, in compliance with applicable federal, state, and local legal requirements.
Successful candidates might be required to undergo a background verification with an external vendor.
AI Usage in the Recruitment Process
For information on the responsible use of AI in our recruitment process, please refer to our Guidelines for Ethical Usage of AI in the Recruiting Process.
Please note that any violation of these guidelines may result in disqualification from the hiring process.
Requisition ID: 442795 | Work Area: Software-Development Operations | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: #LI-Hybrid
Company benefits
Working at SAP
Company employees:
Gender diversity (m:f):
Hiring in countries
Argentina
Australia
Austria
Bahrain
Belgium
Brazil
Bulgaria
Canada
Chile
China
Colombia
Costa Rica
Czechia
Denmark
Egypt
Finland
France
Germany
Greece
Hong Kong
Hungary
India
Indonesia
Iraq
Ireland
Israel
Italy
Japan
Kuwait
Malaysia
Mexico
Morocco
Netherlands
New Zealand
Norway
Oman
Pakistan
Panama
Philippines
Poland
Portugal
Qatar
Romania
Saudi Arabia
Serbia
Singapore
Slovakia
Slovenia
South Africa
South Korea
Spain
Sweden
Switzerland
Taiwan
Thailand
Türkiye
Ukraine
United Arab Emirates
United Kingdom
United States
Vietnam
Office Locations
Other jobs you might like
Site Reliability Engineer
Bucuresti, Bucuresti, Romania
9 Dec
Transparency8.8/10
RankingNSL – Site Reliability Engineer
Leeds, United Kingdom
8 Dec
Transparency9/10
RankingSite Reliability Engineer
Sofia, BG
5 Dec
Transparency8.4/10
RankingSenior Site Reliability Engineer
Sofia, BG
18 Nov
Transparency8.4/10
RankingTechnical Lead - Site Reliability Engineer
Sofia, BG
18 Nov
Transparency8.4/10
Ranking


