    Senior Site Reliability Engineer - Hyderabad, India - Microsoft

    Microsoft
    Technology / Internet
    Description

    Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into fans.

    We are customer-obsessed problem-solvers. We orchestrate deep engagements in areas like incident management, support, and enablement. We analyze and amplify those customer voices, both within our own team and across the Cloud + AI team, bringing the customer connection to the Quality vision for Azure. We innovate ways to scale what we learn across our customer base. Diversity and inclusion are central to who we are, how we work, and what we enable our customers to achieve. We know that empowering our customers starts with empowering our team to show up authentically, work in ways that are best for them, and achieve their career goals.

    Would you like to join one of the fastest-growing teams within Microsoft Azure Engineering? Are you constantly customer-obsessed and focused on enhancing the customer experience? Are you passionate about cloud computing and love the challenge of solving the most complex technical problems? Are you interested in a start-up-like environment and passionate about building automation, observability, and proactive and SLO monitoring experiences?

    Our organization is looking for you, a customer-obsessed Principal Site Reliability Engineer with extensive experience in implementing Service Level Objective (SLO) monitoring solutions for top Azure customers. As a key member of our Observability team, you will play a critical role in ensuring the reliability, availability, and performance of customer applications hosted in Microsoft Azure. You will be responsible for designing, implementing, and maintaining robust SLO monitoring systems to track and meet the service level objectives defined in our offerings and customer engagement agreements. This position is critical to the success of our team's charter and embodies our inclusive culture, growth and learning mindsets, and unwavering dedication to diversity.

    Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

    "Customer obsession", "measure what matters", "no dead-ends", "get it done", "collaboration", "teamwork", and "whatever it takes" are a few of the characteristics we look for in this role. We are growing fast but remain agile.

    Additional Locations:

    • Bangalore, Karnataka, India
    • Noida, Uttar Pradesh, India

    Qualifications:

    • 10+ years of experience with designing, implementing, debugging and launching commercial software products or web services. 3+ years of SRE experience in cloud - Azure (or AWS/GCP)
    • Degree: bachelor's or master's degree in computer engineering (or equivalent)
    • Customer Obsession: Passion for customers and focus on delivering the right customer experience.
    • Growth Mindset: Openness and ability to learn new skills and technologies in a fast-paced environment.
    • Excellent Communication: Must have the ability to empathize with customers and convey confidence. Able to explain highly technical issues to varied audiences. Able to prioritize customers' needs and advocate for them through the proper channels. Takes ownership and works towards a resolution.

    Technical Skills:

    • Proven expertise in implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud customers.
    • Extensive experience with SLO monitoring tools and platforms.
    • Advanced certifications in SRE or related fields.
    • Experience in observability, SRE, OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, Azure Monitor, AI, and ML.

    Responsibilities:

    • Collaborate with customers to jointly define and establish SLOs and SLIs that align with their business goals and expectations.
    • Instrument code to measure SLOs and develop solutions to detect SLO breaches.
    • Develop automated solutions and troubleshooting guides to remediate or mitigate SLO breaches.
    • Collaborate closely with service engineering teams to develop solutions for correlating customer-defined SLOs with relevant platform SLOs and signals to effectively pinpoint, address, and resolve customer-impacting issues.
    • Ensure customer-centric SLOs are consistently exceeded through cross-functional collaboration.
    • Analyze SLO data for trends, improvements, and reliability risks, proposing remediation plans.
    • Proactively engage customers on SLO performance, addressing concerns and offering insights.
    • Lead optimization efforts for system performance, scalability, and efficiency to exceed SLOs.
    • Develop and maintain documentation related to customer-specific SLOs, SLIs, and monitoring processes.
    • Exemplify Microsoft culture and foster a diverse, inclusive work environment.

    Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

    • Industry leading healthcare
    • Educational resources
    • Discounts on products and services
    • Savings and investments
    • Maternity and paternity leave
    • Generous time away
    • Giving programs
    • Opportunities to network and connect


  • Insight Global Hyderabad, India

    Required Skills and Experience * · Bachelor's or master's degree in computer science, Software Engineering, or a related field. · Proven experience (7+ years) in SRE, automation testing · Strong skills in developing and implementing automation testing strategies and frameworks. · ...


  • Quiktrak, LLC Hyderabad, India

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer · Job Description: · Summary: · As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the ...


  • Wall Street Consulting Services LLC Hyderabad, India

    Role: SRE · Exp: 6+ years · Location: Pune, Bengaluru, Chennai · JOB DESCRIPTION: · The Role: · As a Site Reliability Engineer, you will be critical in ensuring our software products' reliability, scalability, and performance. You will be responsible for designing and implementin ...


  • SID Global Solutions Hyderabad, India

    Job Title: Site Reliability Engineer · Location: Hyderabad - Onsite · Work Mode: 5 Days Working from Office · JOB DESCRIPTION · • Experience in Cloud administration and troubleshooting(GCP is recommended) or AWS or AZURE · • Experience with Kubernetes or comparable technolog ...


  • ValueLabs Hyderabad, India

    Experienced in SRE or Site Reliability Engineer · Design, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps. · Collaborate with cross-functional teams to optimize system performance, reliability, and scalability. · D ...

  • TEKsystems Global Services in India Hyderabad, India

    Experience – 5+ years · Location – Bengaluru / Hyderabad · Notice Period – Immediate to 30 days · Role Overview: · SRE Engineer will play a critical role in ensuring our trading services are always available, scalable, and engineered to withstand unparalleled demand. · You wil ...


  • DATAMTX LLC Hyderabad, India

    The Company · Datamtx (formerly Datamatics), established in 1993 and globally HQ'd in Atlanta, has a stellar history supporting both Tier 1 and 2 ERP rollouts ranging from implementations, data cleanse, migrations, customization, hypercare and Day 1 support. We are also nationall ...


  • Banyan Cloud Hyderabad, India

    About US · Honest Data technologies Pvt Ltd, is a wholly owned subsidiary of Banyan Cloud, USA, the Cyber Security Product Company, headquartered in San Jose, California, USA, owning the SaaS product "Banyan Cloud", first of its kind Cyber Security CNAP Platform that simplifies t ...


  • Coforge Hyderabad, India

    Role: Site Reliability Engineer · Location: Hyderabad · Work Mode: WFO · Experience: 6-10 yrs · Job Description: · Deep knowledge of version control. · Sound knowledge of operating Systems (like LINUX). · Should be aware of DevOps concepts and best practices. · CI/CD implementati ...


  • Zyoin group Hyderabad, India

    We are seeking a highly skilled and experienced Database Reliability Engineer (DBRE) to join our team and play a crucial role in ensuring the performance, scalability, and high availability of our customer database services on the Tessell Platform. · Minimum Requirements : · year ...


  • Splunk Inc Hyderabad, India

    Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to ou ...


  • PURVIEW Hyderabad, India

    Role: Site Reliability Engineer · Location: Hyderabad · Job Type: Contract (Permanent to Purview Services) · NOTE: Client is looking for immediate joiners or 1 Month Notice Period candidates only. · Job Description: · Primary job responsibilities · Ability to operate and maintain various ...


  • Electronic Arts Hyderabad, India

    Pogo has been the leader in online casual games since 1998. Featuring a growing library of 60+ titles · spanning popular genres like Solitaire, Mahjong, Match 3, and more, Pogo exists to be the best · destination for online casual games. We strive to produce high-quality HTML5-po ...


  • Oriontek INC Hyderabad, India Full time

    The Role · You will be responsible for : · Gathering and evaluating user feedback. · Providing code documentation and other inputs to technical documents. · Supporting continuous improvement by investigating alternatives and new technologies and presenting these for architectur ...


  • Wipro Hyderabad, India

    Requirement- SRE · Experience: 6+Years · Location : Pan India · Key responsibilities · Review Monitoring & alerts to provide recommendations for enhancement towards 360° coverage · Create dashboards, setup synthetic and real user monitoring, visualize large data sets with inter ...


  • Alter Domus Hyderabad, India

    ABOUT US · We are Alter Domus. Meaning "The Other House" in Latin, Alter Domus is proud to be home to 85% of the top 30 asset managers in the alternatives industry, and more than 5,000 professionals across 23 countries. · With a deep understanding of what it takes to succeed in ...


  • IdeaHelix, Inc Hyderabad, India

    Requirements: · Proficiency in system management scripting languages (like Python , shell), · Strong experience with Kubernetes, Docker containers and Terraform. · Understanding of Linux administration. · Experience with automation/ configuration management tools (like Ansible, S ...


  • Oracle Hyderabad, India

    As a SRE for the State & Local GBU you will play a critical role in solving complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. As an SRE, you not only follow best practices, standards, and processes employed by the team, ...


  • SID Global Solutions Hyderabad, India

    Experience: 3-5 years · Experience: 8-11 years · • Experience in Cloud administration and troubleshooting (GCP is recommended) or AWS or AZURE · • Experience with Kubernetes or comparable technology. · • APIGEE OR API Management experience is Mandatory. · • Experience with CI/CD ...


  • Arcesium Hyderabad, India

    We are looking for an experienced Principal Engineer to implement a new monitoring tool for the firm. The ideal candidate will have a strong background in SRE principles and practices, and strong knowledge and experience in maintaining monitoring frameworks for large scale organi ...