Site Reliability Engineer - Bengaluru, India - Adobe

    Adobe
    Full time
    Description

    Our Company

    Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences. We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

    We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours.

    Role Responsibilities:

  • You will develop software/tools and provide hands-on technical expertise to design, deploy, and optimize cloud services
  • You will build automation using industry-standard tools such as Chef, Jenkins, Terraform, and Spinnaker to deploy services
  • Participate in release cycles of our services, deploying code to staging and production environments and integrating with continuous integration (CI) and continuous delivery (CD) tools, monitoring, and change management
  • Develop plans to improve the security and availability of the services
  • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient solutions
  • Identify system bottlenecks and recommend solutions to the availability issues they cause
  • Participate in the on-call rotation, drive any issues found to resolution, and contribute to post-mortems
  • Proactively work on efficiency and capacity planning to set clear requirements and reduce system resource usage
  • Evangelize SRE principles and guide development teams to build reliable services
  • You will build automation and tools that increase the productivity of teams

    Qualifications:

    MUST:

  • Have at least 6 years of experience as an SRE in cloud engineering
  • Have a minimum of 5 years of experience with containerized environments: Kubernetes, Docker
  • Experience with Argo is a plus
  • Have experience in automation and tool development
  • Have at least 4 years of experience building cloud services and distributed systems: deployment, monitoring, scaling, debugging
  • You are proficient in multi-cloud environments: AWS, Azure
  • Have experience writing applications using Go, Python, or JavaScript
  • Have knowledge of well-known open-source tools for monitoring, trending, and configuration management
  • Have familiarity with observability tools like Prometheus, Cortex, Grafana, New Relic, Datadog, and Splunk
  • Have experience with CI/CD tools like Jenkins/Groovy DSL
  • Have experience scaling high-throughput services to their limits
  • You enjoy working with a large variety of services and technologies