Site Reliability Engineering - Hyderabad / Secunderabad, Telangana - confidential

    confidential
    confidential Hyderabad / Secunderabad, Telangana

    21 hours ago

    Full time Healthcare
    Description

    Summary:

    The SRE Manager at Tech Blocks India will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and cross-functional coordination.

    Experience Required:

    • 10+ years total experience, with 3+ years in a leadership role in SRE or Cloud Operations.

    Technical Knowledge and Skills:

    Mandatory:

    • Deep understanding of Kubernetes, GKE, Prometheus, Terraform
    • Cloud: Advanced GCP administration
    • CI/CD: Jenkins, Argo CD, GitHub Actions
    • Incident Management: Full lifecycle, tools like OpsGenie

    Nice to Have :

    • Knowledge of service mesh and observability stacks
    • Strong scripting skills (Python, Bash)
    • Big Query /Dataflow exposure for telemetry

    Scope:

    • Build and lead a team of SREs
    • Standardize practices for reliability, alerting, and response
    • Engage with Engineering and Product leaders

    Roles and Responsibilities:

    • Establish and lead the implementation of organizational reliability strategies, aligning SLAs, SLOs, and Error Budgets with business goals and customer expectations.
    • Develop and institutionalize incident response frameworks, including escalation policies, on-call scheduling, service ownership mapping, and RCA process governance.
    • Lead technical reviews for infrastructure reliability design, high-availability architectures, and resiliency patterns across distributed cloud services. Champion observability and monitoring culture by standardizing tooling, alert definitions, dashboard templates, and telemetry data schemas across all product teams.
    • Drive continuous improvement through operational maturity assessments, toil elimination initiatives, and SRE OKRs aligned with product objectives. Collaborate with cloud engineering and platform teams to introduce self-healing systems, capacity-aware autoscaling, and latency-optimized service mesh patterns.
    • Act as the principal escalation point for reliability-related concerns and ensure incident retrospectives lead to measurable improvements in uptime and MTTR.
    • Own runbook standardization, capacity planning, failure mode analysis, and production readiness reviews for new feature launches. Mentor and develop a high-performing SRE team, fostering a proactive ownership culture, encouraging cross-functional knowledge sharing, and establishing technical career pathways.
    • Collaborate with leadership, delivery, and customer stakeholders to define reliability goals, track performance, and demonstrate ROI on SRE investments

  • confidential Hyderabad / Secunderabad, Telangana Full time

    +Job summary · Design, build, and maintain scalable infrastructure. Develop automation tools. · +ResponsibilitiesDesign, build, and maintain scalable infrastructure. · Develop automation tools. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    The role involves designing and maintaining distributed tracing metrics logging using OpenTelemetry Prometheus Loki Tempo ensuring complete instrumentation of .NET Core applications implementing telemetry pipelines for application logs performance metrics traces monitoring alerti ...

  • Only for registered members Hyderabad/ Secunderabad

    Join us as we pursue our purpose to make the world work better for everyone. Drive immediate relief and provide a sustainable resolution to issues within the ServiceNow platform. · ...

  • Only for registered members Hyderabad/ Secunderabad

    The Site Reliability Engineer supports the reliability, performance, and operability of customer environments by contributing to routine change, incident and problem management, and continuous improvement of observability and automation across non-production and production. · Lea ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    +Job summary · Site Reliability Engineer +Supporting Java-based multi-tier applications with complex upstream downstream interactions. · Analysing application logs for investigating and troubleshooting issues. · Developing automated CI/CD capability for our application. · +Mimimu ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    We are looking for an intelligent, resourceful and highly skilled Senior Site Reliability Engineer (SRE) to join our Platform Site Reliability Engineering (PSRE) team . This team plays a critical role in ensuring the stability reliability and availability of mission-critical prod ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    We are looking for a Senior Site Reliability Engineer to lead incident management processes. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    A Senior Site Reliability Engineer will develop infrastructure automation solutions using tools like Terraform and Ansible as well as leverage expertise in AWS services such as EC2 S3 VPC RDS EKS ECS CloudFormation and more containerization technologies Docker orchestration platf ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    We are looking for Senior Software Engineers who are eager to build in a fast-paced, startup environment inside a stable, profitable company. · Rapidly build new applications on an existing, robust enterprise platform. · Build new cloud infrastructure from scratch following the b ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    ++The Lead Site Reliability Engineer will collaborate with development teams to define reliability standards. · Design highly available architectures · Troubleshoot complex issues throughout software stack.+++- Collaborate with development, operations, and product teams- Design i ...

  • Only for registered members Hyderabad/ Secunderabad

    Principal Service Reliability Engineer: design for telemetry, security, resiliency, scalability, and performance; lead sizing/architecture; drive service health reviews and process simplification. · End-to-end service ownership: design for telemetry, security, resiliency, scalabi ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    Design implement and manage security controls and practices in cloud environments. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    Zenoti is a cloud-based software solution for the beauty and wellness industry that helps businesses streamline their systems and reduce costs while improving customer retention and spending. · We are looking for a Lead Site Reliability Engineer to contribute to the adoption of D ...

  • Only for registered members Hyderabad/ Secunderabad

    Own and scale mission-critical ERP/SaaS services while building intelligent, cloud-native capabilities. This role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments. · End-to-end service owne ...

  • Only for registered members Hyderabad/ Secunderabad

    Own and scale mission-critical ERP/SaaS services while building intelligent, cloud-native capabilities. This role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    We are looking for a Senior Site Reliability Engineer to join our team of Phenom. In this position, you will work on our core product environment upgradations, production issues fixing and incident response. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    We are looking for a Senior Site Reliability Engineer to join our team of Phenom. In this position, you'll work on our core product environment upgradations and production issues fixing. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    Contribute in the adoption of DevOps as we'll as DevOps architecture and design for various services in the organization. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    Zenoti provides an all-in-one, cloud-based software solution for the beauty and wellness industry. · The Lead Database Administrator will work in the product Engineering team of Zenoti. · ...

  • confidential Hyderabad / Secunderabad, Telangana Full time

    SRE new headcount to assist with day-to-day activities supporting ST Application services related to deployment and incident management. · ...

Jobs
>
Site reliability engineer