- Lead and manage the SRE team in the design, implementation, and operation of our SRE practices and processes.
- Lead and manage a team of engineers, providing coaching, technical guidance, mentorship, goal (OKR) and performance management, and career management for their reports.
- Mentor and develop SRE engineers to ensure that they have the skills and knowledge necessary to be successful in their roles.
- Work with other engineering teams to ensure that our systems are designed and implemented in a way that is reliable, scalable, and secure.
- Represent the SRE team to other stakeholders within the company.
- Operations management
- Manage on-call rotations to provide 24 hours coverage
- Day to day support of dashboard, including responding to outages and triaging cases escalated by clients/internal teams
- Review various processes from time to time and drive continual improvement.
- Should have a flair for automation and seek opportunities to automate manual processes and service catalog items.
- Own operational success by continuously monitoring the stability and tech KPIs of the team and remediating any issues.
- Own the Incident management process
- Own end to end availability and performance of mission critical services and build automation to prevent problem recurrence
- 9+ years of experience in SRE or a related field
- Strong understanding of SRE principles and practices
- Experience with observability tools
- Experience with incident response and management
- Reliability: An exposure to Chaos Engineering and various reliability practices including disaster recovery will be good to have
- Experience with Cloud Computing like AWS
- Experience with Kubernetes
- Experience in Agile practices (Scrum)
- Excellent analytical, problem-solving and troubleshooting skills
- Excellent communication and presentation skills
- Experience managing and mentoring engineers
- Ability to work independently and as part of a team
- Ability to delegate, monitor and make progress
-
Reliability Engineer
Found in: Talent IN C2 - 3 hours ago
ATDXT Pvt. Ltd. Bengaluru, India Full timeRespond to the P1 incident quickly and assess its impact on the environment. · Investigation & leading technical troubleshooting of Infra/App environment · Strong knowledge on improving the reliability, availability, scalability and performance of the environment. · Proactively ...
-
Site Reliability Engineer
Found in: Appcast Linkedin IN C2 - 4 days ago
ViewSonic Bengaluru, IndiaJob Requirements: · Bachelor's degree in Computer Science, Engineering, or a related field. · 1+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. · Basic understanding of AWS solutions including ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 3 days ago
Ensono Bengaluru, IndiaAbout Role · Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
Integra Connect Bangalore Urban, IndiaAbout IntegraConnect · Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud plat ...
-
Database Reliability Engineer
Found in: Appcast Linkedin IN C2 - 4 days ago
MethodHub Bengaluru, IndiaDatabase Reliability Engineer (DBRE) · Location: Bengaluru, Noida · Looking for strong DB Reliability Engineering candidates (4-10 yrs band) · Must have strong skillset on MySQL DBA + Linux OS + Automation tools (Chef/Ansible/Shell scripting) · Hands on experience on High Availab ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
TERRAGIG LLP Bengaluru, IndiaRole : Site Reliability Engineer · Experience : 5+ Years · Work Model : Remote / Contract 3 years · Skills : · - Develop and provide operational support for full-stack software applications. · - Relevant industry certifications, such as through the Site Reliability Engineering ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
ViewSonic Bengaluru, IndiaJob Requirements: · Bachelor's degree in computer science, Engineering, or a related field. · 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role. · Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS. ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
Larsen & Toubro Bengaluru, IndiaEXP:- 5to 8 Years · Location- Pune,Bangalore,Hyderabad,Chennai · Primary Skills: · Site Reliability Engineering (SRE) · Application Support on Middleware tools like Apache, WebSphere, Tibco, JMS, RabbitMQ, etc. · Automation using tools like Ansible, Chef, etc.; familiarity with ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 5 days ago
Cyitechsearch Bengaluru, IndiaWe are hiring for Site Reliability Engineer · Skills : · - Develop and provide operational support for full-stack software applications. · - Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation. · - Five years' experience as a site ...
-
Site Reliability Engineer
Found in: Appcast Linkedin IN C2 - 4 days ago
PhonePe Bangalore Urban, IndiaJOB DESCRIPTION: We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production services. · Systems internals/security, Linux, Network, and Monitorin ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
Talent500 Bengaluru, IndiaJob Description : · Cloud Engineer - Site Reliability Engineering for Ford Credit Tech · Were passionate about building software that solves problems. We count on our Site Reliability Engineers (SREs) to empower our users with a rich feature set, high availability, and stellar pe ...
-
Database Reliability Engineer
Found in: Talent IN 2A C2 - 4 days ago
Zyoin group Bangalore/Hyderabad, India permanentWe are seeking a highly skilled and experienced Database Reliability Engineer (DBRE) to join our team and play a crucial role in ensuring the performance, scalability, and high availability of our customer database services on the Tessell Platform. · Minimum Requirements : · year ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 9 hours ago
PhonePe Bangalore Urban, IndiaSRE- Azure · Job Description: · We are looking for engineers who are passionate about reliability, performance, and efficiency, · and with experience in building tools, services, and automation to manage and improve · production services. · Role and Responsibilities: · ● Systems ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 2 days ago
Prudential Manpower Pvt Bangalore, India permanentNotice Period : Immediate to 30 Days · Minimum Requirements : · - 4 years of experience as a Site Reliability Engineer. · Experience with one or more of the following : · C++, Java, Python, Go, Perl and/or Ruby etc. · - Experience with Unix/Linux operating systems internals an ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 5 days ago
RENUZA TECHNOLOGIES PRIVATE LIMITED Bangalore, India permanentJob Description : · Site Reliability Engineers (SREs) are responsible for ensuring the reliability and performance of production systems at Renuza Technologies. · They wear many hats, encompassing troubleshooting, software development, system administration, infrastructure manag ...
-
Site Reliability Engineer
Found in: Adzuna IN C2 - 2 days ago
Prudential Manpower Pvt Bangalore, India permanentPosition : Site Reliability Engineer · Location : Bangalore · Notice Period : Immediate to 30 Days · Minimum Requirements : · - 4 years of experience as a Site Reliability Engineer. · - Experience with one or more of the following : C++, Java, Python, Go, Perl and/or Ruby etc. ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 5 days ago
Cyitechsearch Bangalore, India permanentAbout the job : · We are hiring for Site Reliability Engineer · Experience : 5+ Years · Work Model : Remote / Contract 3 years · Skills : · - Develop and provide operational support for full-stack software applications. · - Relevant industry certifications, such as through the ...
-
Quality Reliability Engineer
Found in: Talent IN C2 - 4 days ago
Ceragon Bengaluru, IndiaJob Description · Develop and maintain product quality and reliability processes · Provide reliability analysis (MTBF, Return Rate, Failure Rate and others) · Develop and maintain the company's quality production and repair procedures, suppliers KPIs · Manage product reliability ...
-
Site Reliability Engineer
Found in: Talent IN 2A C2 - 2 days ago
Solugenix Bengaluru, IndiaJob Title:SRE Cloud Engineer · Location : Hyderabad / Bangalore · Shifts : 24/5 · Exp : 5+ Years · Job Summary : Cloud Engineer is primarily responsible for working hands on various AWS services like EC2, RDS, EKS, ECS, S3, VPC, Route53, Lambda, Code pipeline etc., and should ha ...
-
Site Reliability Engineer
Found in: Talent IN C2 - 4 days ago
Signify Bengaluru, India Full timeJob Title · Site Reliability Engineer - AWS and AzureJob Description · Site Reliability Engineer · Signify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of li ...
Engineering Manager, Platform Reliability Engineering - Bengaluru, India - Arcesium
Description
We are looking for an experienced Engineering Manager to lead our Site Reliability Engineering (SRE) team. The ideal candidate will have a strong background in SRE principles and practices, as well as experience managing and mentoring engineers. The SRE Manager will be responsible for the overall success of the SRE team, including ensuring that our systems are reliable, scalable, and secure. The team is responsible for monitoring the stability and availability of mission critical production systems, managing incidents for quicker resolution, and establishing BAU. Team is also building tools/infra which to be used by all development teams to assist in monitoring and troubleshooting.
What you'll do (Responsibilities):
What you'll need (Qualifications):
The Company offers excellent benefits, an informal and collegial working environment, and an attractive compensation package.
Members of the Arcesium Company Group do not discriminate in employment matters on the basis of sex, race, color, caste, creed, religion, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other protected class.