
Our Company

Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

The opportunity

From the moment you wake up in the morning until you go to bed at night, consider the media you consume, the adverts you see, the apps you use, the websites you browse, and almost all of the shopping you do online throughout the day. Chances are that every single one of those interactions, every single one of those experiences, was touched by an Adobe product.

We have a fantastic opportunity for a Site Reliability Engineer to join our Ethos API Platform team.

The Ethos Platform provides industry-leading API hosting capabilities. Our solutions support high-traffic, highly visible applications with immense amounts of data, numerous third-party integrations, and exciting scalability and performance problems.

The Site Reliability Engineer on the Ethos API Platform NET SRE team is responsible for ensuring optimal performance and uptime of Adobe’s API infrastructure, specifically the API Gateway and our Kubernetes-powered infrastructure. We’re focused on containerization, clusterization, performance, continuous integration / continuous deployment (CI/CD), and pipeline automation. This team is uniquely positioned to make a measurable difference to Adobe’s development culture, reputation, and bottom line!

The successful candidate should have a strong interest in learning new technologies, be able to work independently, and have the ability to drive complex and ambitious projects to conclusion. Strong collaboration with other teams is key to succeeding in this role. This individual should be self-motivated and have a passion for quality.

The Ethos Platform NET SRE team is geographically distributed, and as such we rely heavily on tools like Slack and video conferencing. Our team is in San Francisco, Bucharest, and Ottawa. International travel may be requested.

What You’ll Do

Operate clusters of servers in AWS and Azure running applications that handle billions of transactions. Define and track metrics to monitor and improve the reliability of these systems. Respond to and resolve outages that could impact tens of thousands of customers.
Ensure the highest level of uptime and Quality of Service (QoS) for our customers through operational excellence.
Interact with Kubernetes clusters.
Build, challenge, and secure our automated, multi-cloud, multi-tenant environments: in software, process, and infrastructure.
Engage in service capacity analysis and demand forecasting, software performance analysis and system tuning.
Improve our tools for continuous integration, continuous deployment, automated testing and release management.
Work closely with internal users to debug and fix REST-based APIs and network connectivity.

What You Need To Succeed:

B.Sc. or higher in related field, or equivalent experience.
Two or more years of proven experience in software engineering, site reliability engineering, release engineering, and/or configuration management.
Passion for automating repetitive work using scripting languages (Python, Ruby) and automation platforms like Chef and Ansible.
Experience with cloud service providers. For example: Microsoft Azure, Amazon AWS, Google Cloud, Oracle Cloud Infrastructure.
In-depth experience with Linux and familiarity with containerization (e.g. Docker).
An understanding of IP networking, firewalls, IP addressing, network segmentation, content distribution networks, web application firewalls and the know-how to analyze traffic as it traverses layers of infrastructure.

Nice To Have

Experience deploying, managing and maintaining Kubernetes.
Cloud provider automation, e.g. AWS CloudFormation, Azure ARM templates, Troposphere, Terraform, Heat templates, etc.
Experience with build management tools, preferably Jenkins and/or Argo.
Experience with log aggregation tools such as Splunk and Fluent Bit.
Experience with monitoring solutions: New Relic, Datadog, Runscope, Prometheus, Grafana.
Exposure to Kafka, AWS Kinesis or messaging platforms.
Fluency in scrum terminology and processes.
Previous experience automating build and release processes.
Previous experience consulting and working with customers.

As our many awards will tell you, at Adobe you’ll be immersed in an exceptional work environment that is recognized around the world. You’ll be surrounded by colleagues who are committed to helping each other grow through our unique Check-In approach where ongoing feedback flows freely. If you’re looking to make an impact, Adobe’s the place for you. Discover what our employees are saying about their career experiences on the Adobe Life blog, and explore the fantastic benefits we offer.

Diversity & Inclusivity:

Adobe is proud to be an Equal Employment Opportunity and affirmative action employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law.

Adobe aims to make its website accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email [email protected] or call (408) 536-3015.

Adobe values a free and open marketplace for all employees and has policies in place to ensure that we do not enter into illegal agreements with other companies to not recruit or hire each other’s employees.
