- Define and measure the reliability of the service using SLI, SLOs and consider the risk minimization of service degradation.
- Enable the development team to bring new software or new features (Digital Offering) to production as quickly as possible, while also ensuring an agreedupon acceptable level of IT operations performance and error risk in line with the service level agreements (SLAs) agreed.
- Closely cooperate with different Product Owners, Site Reliability Engineers, and the Cloud Platform Teams to define processes to migrate between different Cloud Platforms while ensuring reliability for business offerings.
- Work with multiple Site Reliability Engineers for operations and system administration tasks analyzing logs, performance tuning, applying patches, testing production environments, identify opportunities and drive the design and implementation of endtoend observability, alerting, selfhealing and automation capabilities to improve service health, manageability, and reliability.
- Work with different stake holders (POs, SREs and Platform Team) to define Incident Management Process as required for responding to incidents, drive postmortems reviews for improving the service quality.
- Closely work with Dev and SRE team to select appropriate metrics related to observability and reliability as well as defining SLIs and SLOs
- Define and drive observability for selfdeveloped software and the managed cloud components by collecting appropriate observability data for insights and alerting including setting up proper alerting for critical components.
- Ensure availability and responsiveness of application by setting up and maintaining the required documentation method and tools. Building Playbooks for troubleshooting techniques to effectively identify and investigate issues that can be used by SREs.
- Handle resolution of blockers, escalation to stakeholders, and provisioning of resources.
- Own availability, performance, and supportability targets for the service.
- Author functional and technical documentation and remain current on relevant technologies and procedures.
- 812 years of relevant industry experience.
- Minimum of 3 years as a Site Reliability Engineering Lead.
- Minimum of 5 years' experience as a Site Reliability Engineer
- Minimum of 8 years' experience with cloud computing platforms like Azure and related services.
- Indepth knowledge of system architecture, networking, and microservice based distributed systems.
- Expertise in designing and implementing reliable, scalable, and faulttolerant systems using container Orchestration Technologies like Docker and Kubernetes.
- Proficiency in setting up and managing monitoring, alerting, and logging systems for early detection and resolution of issues for container orchestrators like Kubernetes using Tools like Prometheus, Grafana, Open Telemetry Collector or similar tools.
- Handson experience in incident management, including incident response, troubleshooting, and postmortem analysis.
- Proficiency in coding/scripting languages commonly used in infrastructure automation and monitoring (such as Terraform).
- Knowledge of best practices in disaster recovery planning and execution for cloudbased Systems.
- Ability to lead and mentor a team of SREs, providing guidance, support, and coaching.
- Capability to advocate for SRE best practices and principles within the organization and drive cultural changes as needed.
- Willingness to stay updated with the latest trends, tools, and technologies in the field of site reliability engineering.
- Strong communication skills to effectively collaborate with crossfunctional teams, including Software Developers, Product Owners, and Cloud Platform Engineers.
-
Lead Site Reliability Engineer
Found in: Talent IN 2A C2 - 2 days ago
HCLSoftware Bengaluru, IndiaPosition – SRE Architect/ Lead Site Reliability Engineer · Location – Pune/Bangalore/Chennai/Noida · Exp – 14+ · We are busy, growing quickly and have an incredible workforce who are committed to becoming the #1 Software company in the world. Come join HCL s fast-growing, $2B sof ...
-
Site Reliability Engineering Lead
Found in: Talent IN 2A C2 - 2 days ago
ZEISS Group Bengaluru, IndiaCARL ZEISS · Carl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss. · ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in busine ...
-
Site Reliability Engineering Lead
Found in: Appcast Linkedin IN C2 - 2 days ago
ZEISS Group Bengaluru, IndiaCARL ZEISS · Carl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss. · ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in busine ...
-
Lead Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
Yo HR Consultancy Bangalore, India permanentRole : Lead Site Reliability Engineer · Location : Bangalore, Karnataka, India · Experience : 8-12 Years · Must Have : Site Reliability Engineering · Skills : · - Troubleshooting · - On call support · - Linux · - Monitoring tools · - AWS services · - Scripting(Python/Shell/Bash) ...
-
Lead Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
The HRBPs Bangalore, India permanentLead Site Reliability Engineer - Bangalore · Experience - 8 to 12 years · Responsibilities : · - Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers. · - Providing cu ...
-
Observability/Site Reliability Lead
Found in: Talent IN 2A C2 - 4 days ago
Connectio IT Pvt Ltd Bangalore, India permanentJob Description : · The Observability/SRE Lead will be a key member of the Engineering team. · They will have to focus on driving reliability, scalability, performance, and observability of production systems, and ensure a consistent end-user experience across all products. · The ...
-
Lead Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
The HRBPs Bangalore, India permanentLead Site Reliability Engineer - Bangalore · Exp - 8 to 12 years · Responsibilities : · - Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers. · - Providing customers ...
-
Site Reliability Lead Engineer
Found in: Talent IN 2A C2 - 3 days ago
Voyager Partners Bangalore, India permanentRole : SRE Lead. · Client : KPMG. · Experience : 9 Years 12 Years. · Location : Bangalore. · Notice Period : Immediate to 15 Days Joiners. · C2H Role ( On Voyager Partners Payroll). · Job Description : · - Bachelor's degree in computer science, Information Technology, or related ...
-
Lead Site Reliability Engineer, CloudOps
Found in: Talent IN C2 - 2 days ago
ThousandEyes Bengaluru, IndiaLead Site Reliability Engineer, Cloudops · at Cisco ThousandEyes · Who We Are · The name ThousandEyes was born from two big ideas: the power to see things not ordinarily possible and the ability to collect insights from a multitude of vantage points. As the world continues it ...
-
Lead Site Reliability Engineer SRE
Found in: Talent IN C2 - 2 days ago
ConnectIO Bengaluru, IndiaOverview · The Lead SiteReliability Engineer (SRE) Observability KPI at Newrelic plays acrucial role in ensuring the reliability availability andperformance of Newrelics observability platform. This role isessential in maintaining and improving the observability keyperformance in ...
-
Litmus7 - Site Reliability Engineer Lead - ITSM/Monitoring Tools
Found in: Talent IN 2A C2 - 2 days ago
litmus7 Bengaluru, IndiaJob Description : · Job role : SRE Lead/Manager. · Prior experience in supporting JAVA based e-commerce application is MANDATORY. · Align and implement SRE principles, best practices based on ongoing Issues. · Closely work with Client Partners, Account Managers, and SRE Architec ...
-
Field Application Engineer-Reliability
Found in: Talent IN 2A C2 - 2 days ago
Trident Infosol Pvt Ltd Bengaluru, IndiaLooking for Reliability Engineer for our "Field Application Engineer or Team Lead" position for Bengaluru location. · Required skills: · Experience in one or more of the following standards will be an added advantage: MIL-HDBK-217F, IEC Standards, Telcordia, FIDES, NPRD, MIL -162 ...
-
Analog Circuit Design Lead
Found in: Talent IN 2A C2 - 2 days ago
Wipro Bengaluru, IndiaAnalog Circuit Design Engineers/Leads with 8+ years of experience to join our team · As an Analog Circuit Design Lead, you will be responsible for critical block designs such as Temperature sensor, PLL, ADC, DAC, LDO, Bandgap ckts, Ref Generators, Charge Pump, Current Mirrors, Co ...
-
Senior Site Reliability Engineer
Found in: Talent IN 2A C2 - 2 days ago
Zscaler Bengaluru, IndiaCompany Description · For over 10 years, Zscaler has been disrupting and transforming the security industry. Our 100% purpose built cloud platform delivers the entire gateway security stack as a service through 150 global data centers to securely connect users to their applicatio ...
-
Principal Site Reliability Engineering Manager
Found in: Talent IN 2A C2 - 2 days ago
Wipro Bangalore Urban, IndiaPrincipal Site Reliability Engineer · We are seeking a highly skilled and experienced Principal Site Reliability Engineer (SRE) to join Lab45 team in Wipro. As a Principal SRE, you will play a critical role in ensuring the reliability, availability, and performance of our systems ...
-
Engineering Manager, Platform Reliability Engineering
Found in: Talent IN 2A C2 - 2 days ago
Arcesium Bengaluru, IndiaWe are looking for an experienced Engineering Manager to lead our Site Reliability Engineering (SRE) team. The ideal candidate will have a strong background in SRE principles and practices, as well as experience managing and mentoring engineers. The SRE Manager will be responsibl ...
-
Staff Software Engineer
Found in: Talent IN 2A C2 - 2 days ago
Protoporos Staffing Services Private Limited Bengaluru, IndiaOpportunity with a leadingB2B SaaS product client specializing in cutting-edge data integration solutions · Position Overview: We are seeking a highly skilled and experienced Staff Engineer to join the Engineering team. As a Staff Engineer, you will play a crucial role in desig ...
-
Staff Site Reliability Engineer
Found in: Talent IN 2A C2 - 2 days ago
Protoporos Staffing Services Private Limited Bengaluru, IndiaOpportunity with a leadingB2B SaaS product client specializing in cutting-edge data integration solutions · Position Overview: We are seeking a highly skilled and experienced Staff Site Reliability Engineer to join our team. As a Staff SRE, you will play a critical role in ensur ...
-
Data Engineering Lead
Found in: Talent IN 2A C2 - 2 days ago
UNO Digital Bank Bengaluru, IndiaWe are seeking a highly motivated and experienced Data Engineering Lead to join our young team. The ideal candidate will possess a deep understanding of data engineering principles, a proven track record of leading data engineering initiatives, the ability to drive the developmen ...
-
Senior Backend Engineer
Found in: Talent IN 2A C2 - 2 days ago
Kredivo Group Bengaluru, IndiaResponsibilities · Successfully and independently deliver large-size projects, including scoping, planning, design, development, testing, rollout and maintenance. · Write clean, concise, modular and well-tested code. Review code from junior engineers and provide constant and con ...
Site Reliability Lead - Bengaluru, India - Domnic Lewis International
Description
Purpose: As a Site Reliability Engineering Lead, you will bridge the gap between Development, Cloud Platform Engineering Teams and Product Owners of different Digital Offerings. Defining and implementing the SRE-concepts with our teams, and aligning the service quality with the business objectives and user expectations will be at the core of your