- Design, develop, and implement software (including infrastructure as code) to enhance the stability, scalability, availability, and performance of our development environments.
- Support our customers from 2nd line on a daily basis to ensure higher satisfaction.
- Solve problems and incidents that impact customer experience, building solutions and automation to prevent recurrence (root cause analysis).
- Build, enhance, and maintain tooling and scripts to automate repetitive tasks.
- Proactively manage issues, monitor user experience, and identify opportunities to automate issue remediation.
- Design and create structural solutions instead of workarounds.
- Take ownership of one or more services, ensuring they remain operational.
- Develop effective monitoring to observe systems health and behavior, intervening during outages.
- (Co-)design and develop business-critical systems to meet user needs and anticipate technological advancements.
- Share the on-call rotation and act as an escalation contact for incidents.
- Bachelor's or master's degree in Software Engineering, Computer Science, or equivalent.
- 8-10 years of relevant experience.
- Experience with Site Reliability Engineering.
- Strong proficiency in languages like Python, Ruby, etc.
- Demonstrable proficiency in logging infrastructure and tools such as Splunk and Zabbix.
- Experience with Infrastructure as Code (Ansible, Puppet & Terraform).
- Experience maintaining scalable systems using Kubernetes.
- Networking, Security, and Storage expertise.
- Linux administration and troubleshooting skills.
- Experience working in a true Agile DevOps environment.
- Proven experience with building/maintaining CI/CD pipelines.
- Creativity and a willingness to step outside of your comfort zone.
- Highly motivated to become acquainted with new topics and technologies.
- Driven to improve yourself and keep learning.
- Fluent in English, both in speaking and writing.
- Design and implement software solutions.
- Support customers and solve problems effectively.
- Develop and maintain automation scripts.
- Manage issues proactively and monitor systems health.
- Design structural solutions and avoid workarounds.
- Take ownership of services and ensure their operational status.
- Develop effective monitoring and intervene during outages.
- (Co-)design and develop business-critical systems.
- Share on-call responsibilities and handle incident escalations.
- A collaborative and innovative workplace that encourages creativity and supports professional growth.
- Opportunities to lead and implement cutting-edge technology solutions in a robust environment.
- Competitive compensation and benefits, with opportunities for career advancement and continuous learning.
- A dynamic team environment where your skills and expertise will be valued and where you can make a significant impact on our technological direction.
-
Reliability Engineer
3 weeks ago
Cargill india, IndiaJob Purpose and Impact · The Reliability Engineer, will perform routine activities to deliver continuous improvement in process and asset reliability through the detection and elimination of defects. In this role, you will use your knowledge to fulfill reliability engineering s ...
-
Site Reliability Engineer
3 weeks ago
Exoscale india, IndiaJob Description · Exoscale is the leading Swiss/European cloud service provider. · With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order t ...
-
Site Reliability Engineer
3 days ago
IKAI Technology Solutions IndiaCompany Description · IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global ente ...
-
Senior reliability engineer
3 weeks ago
QuEST Global Services Pte. Ltd india, IndiaQuest Global is an organization at the forefront of innovation and one of the world's fastest growing engineering services firms with deep domain knowledge and recognized expertise in the top OEMs across seven industries. We are a twenty-five-year-old company on a journey to beco ...
-
Site Reliability Engineer
3 weeks ago
System Soft Technologies IndiaTitle: Site Reliability Engineer · 100% REMOTE · The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of r ...
-
Site Reliability Engineer
3 days ago
World Wide Technology IndiaWorld Wide Technology (WWT), a global technology integrator and supply chain solutions provider. WWT employs more than 7000 people worldwide and operates in more than 2 million square feet of state-of-the-art warehousing, distribution, and integration space strategically located ...
-
Senior Reliability Engineer
3 weeks ago
Elfonze Technologies Pvt Ltd india, IndiaJob Description Perform reliability evaluation of IC products, packages, and process technology with focus on suitability to end applications and conformance to industry standards. · Perform device level failure analysis for an in-depth understanding of IC device failures. · An ...
-
Site Reliability Engineer
2 weeks ago
Circles Life india, IndiaJob Description · Role: Site Reliability Engineer (SRE) · Title: Software Engineer II, SRE · Location: Bangalore · About Circles · Founded in 2014, Circles is a global technology company reimagining the telco industry with its SaaS platform - Circles X, helping telco op ...
-
Site Reliability Engineer
2 weeks ago
Serendipity Recruiting india, IndiaJob Description · As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. · Our client firmly believes that exceptiona ...
-
Site Reliability Engineer
3 weeks ago
Mobile Programming LLC india, IndiaLocation : Pune · NP : Immediate / Serving Notice Period · Years of Experience : 12+ · Role : Site Reliability Engineer · Mandatory Skill : Java, GCP, AWS, CICD · Job Description : · Requirements : · Minimum 12+ years experience as a Site Reliability engineer supporting diff ...
-
Senior reliability engineer
3 weeks ago
QuEST Global Services Pte. Ltd india, IndiaQuest Global is an organization at the forefront of innovation and one of the world's fastest growing engineering services firms with deep domain knowledge and recognized expertise in the top OEMs across seven industries. We are a twenty-five-year-old company on a journey to beco ...
-
Site Reliability Engineer
3 days ago
HuntingCube Recruitment Solutions IndiaPosition: SRE · Experience: 4-6 years · Qualification: B.tech/BE/MCA · Location: Remote · Notice period: Immediate/Serving/30 days · Key skills : Terraform, Jenkins, Kubernetes, Any cloud but AWS preferred, · Any Programming Language Like Python/Scala etc, Observability, SLI · Re ...
-
Site Reliability Engineer
3 weeks ago
Ideope Media india, IndiaWe are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production services. · About Inc42 Media · Inc42 is India's #1 startup media & intelligence platfo ...
-
Site Reliability Engineer
3 weeks ago
LivePerson, Inc india, IndiaOverview: · LivePerson is looking for a Site Reliability/DevOps Engineer for the GPT (Global Product & Technology) Division. You will be part of the LivePerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a st ...
-
Site Reliability Engineer
1 week ago
World Wide Technology IndiaResponsibilities · This role is part of a dedicated team of SREs that operate mission-critical IT Infrastructure and Cloud Management platforms. The role requires communications skills and patience to work with people as well as technology. We encourage our engineers to work wit ...
-
Site Reliability Engineer
3 weeks ago
Unilog india, IndiaJob Title : Site Reliability Engineer · Job Summary : · As a Site Reliability Engineer (SRE) specializing in Google Cloud Platform (GCP), you will be responsible for designing, implementing, and maintaining highly scalable and reliable systems. You will collaborate with developm ...
-
Site Reliability Engineer
3 weeks ago
Exoscale india, IndiaJob Description · Exoscale is the leading Swiss/European cloud service provider. · With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order t ...
-
Site Reliability Engineer
3 days ago
Aventurine Technologies Inc india, IndiaJob Description · SRE (Site Reliability Engineer) · Dallas, TX – Hybrid (F2F interview will be requested) · 6+ Mon Contract · Note: Look for candidates with over 9+ Years' experience. · Job Description (SRE) · • Collaborating closely with engineering teams on building and en ...
-
Site Reliability Engineer
2 weeks ago
Travash Software Solutions/Risk Resources Anywhere in India/Multiple Locations Full timeJob Description: · 10+ years of experience in SRE or a related field. · Proven experience in designing, developing, and implementing monitoring solutions. · - Deep understanding of monitoring technologies and tools, including Prometheus, Grafana, Loki, and Tempo · - Experience ...
-
Site Reliability Engineer
3 weeks ago
HCLSoftware india, IndiaThe Role: · HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a new product that will help keep our customers' end points secure. You will be a part of a team that leverages modern technological solutions to drive growth and efficiency. Your ...
Site Reliability Engineer - india, India - STAFIDE
![Default job background](https://contents.bebee.com/public/img/bg-user-ex-1.jpg)
Description
Job DescriptionAbout us:
Stafide is the premier destination for tech talent consulting, providing comprehensive employment services throughout Europe. Our mission is straightforward: to effortlessly connect job seekers with employers, focusing on the rapidly changing technology sector. Boasting unparalleled expertise and a steadfast commitment, we specialize in aligning elite tech talent with companies to meet their IT consulting requirements precisely. Be part of our journey as we redefine the landscape of tech recruitment.
As a Site Reliability Engineer (SRE), you will:
About us: Stafide is the premier destination for tech talent consulting, providing comprehensive employment services throughout Europe. Our mission is straightforward: to effortlessly connect job seekers with employers, focusing on the rapidly changing technology sector. Boasting unparalleled expertise and a steadfast commitment, we specialize in aligning elite tech talent with companies to meet their IT consulting requirements precisely. Be part of our journey as we redefine the landscape of tech recruitment. As a Site Reliability Engineer (SRE), you will: Design, develop, and implement software (including infrastructure as code) to enhance the stability, scalability, availability, and performance of our development environments. Support our customers from 2nd line on a daily basis to ensure higher satisfaction. Solve problems and incidents that impact customer experience, building solutions and automation to prevent recurrence (root cause analysis). Build, enhance, and maintain tooling and scripts to automate repetitive tasks. Proactively manage issues, monitor user experience, and identify opportunities to automate issue remediation. Design and create structural solutions instead of workarounds. Take ownership of one or more services, ensuring they remain operational. Develop effective monitoring to observe systems health and behavior, intervening during outages. (Co-)design and develop business-critical systems to meet user needs and anticipate technological advancements. Share the on-call rotation and act as an escalation contact for incidents. What You Bring to the Table: Bachelor's or master's degree in Software Engineering, Computer Science, or equivalent. 8-10 years of relevant experience. Experience with Site Reliability Engineering. Strong proficiency in languages like Python, Ruby, etc. Demonstrable proficiency in logging infrastructure and tools such as Splunk and Zabbix. Experience with Infrastructure as Code (Ansible, Puppet & Terraform). Experience maintaining scalable systems using Kubernetes. Networking, Security, and Storage expertise. Linux administration and troubleshooting skills. Experience working in a true Agile DevOps environment. Proven experience with building/maintaining CI/CD pipelines. Creativity and a willingness to step outside of your comfort zone. Highly motivated to become acquainted with new topics and technologies. Driven to improve yourself and keep learning. Fluent in English, both in speaking and writing. You should possess the ability to: Design and implement software solutions. Support customers and solve problems effectively. Develop and maintain automation scripts. Manage issues proactively and monitor systems health. Design structural solutions and avoid workarounds. Take ownership of services and ensure their operational status. Develop effective monitoring and intervene during outages. (Co-)design and develop business-critical systems. Share on-call responsibilities and handle incident escalations. What We Bring to the Table: A collaborative and innovative workplace that encourages creativity and supports professional growth. Opportunities to lead and implement cutting-edge technology solutions in a robust environment. Competitive compensation and benefits, with opportunities for career advancement and continuous learning. A dynamic team environment where your skills and expertise will be valued and where you can make a significant impact on our technological direction.