- Continually improve the lifecycle of microservices and architectural components from inception and design, through deployment, operation, and refinement.
- Participate in defining, evolving, and managing SLOs
- Write code and automation to reduce operational workload, increase efficiency, improve security posture, eliminate toil, and enable our developers to deliver features more rapidly.
- Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
- Facilitate blame-free root cause analysis meetings for incidents to learn and drive improvement
- Participate in and continually improve our global IRC (incident response coordination) for all products.
- Drive root cause identification and issue resolution with the various teams.
- Work inside of a fast-paced iterative environment.
- 5+ years of industry experience.
- Cloud native application development experience leveraging best practices and design patterns
- Strong debugging and troubleshooting skills across the entire technology stack
- Deep understanding of AWS Networking, Compute, Storage, and managed services.
- Competency with modern CI/CD tooling like Kubernetes, Terraform, Ansible & Jenkins
- Experience with full life cycle support of services, from creation to production support
- Versed in Infrastructure as Code practices using technologies like Terraform or CloudFormation
- Ability to author production-ready code in at least one of the following: Java, Scala, or Go.
- Experience with Linux systems and comfortable on the command line
- Understand and apply modern approaches to cloud-native software security
- Experienced with agile frameworks, such as Scrum and Kanban, and how to operate within these frameworks to continually deliver value.
- Flexible and willing to step into new roles and responsibilities
- Willingness to learn and use ourLogic products for solving reliability and security issues
- Experienced with planet-scale product development
- Expert-level proficiency running and operating SaaS products on AWS
- Experience with streaming technologies like Kafka, Kafka Streams, or KSQL
- Experience in one or more of: Java, Go, Scala, or Python
- Experience in one or more of: Terraform, Jenkins, Kubernetes
- Experience running and tuning JVM workloads at scale
Site Reliability Engineer - India - Andela
4 weeks ago
Description
About Andela
Andela exists to connect brilliance and opportunity. Since 2014, we have been dedicated to breaking down global barriers and accelerating the future of work for both technologists and organizations around the world. For technologists, Andela offers competitive long-term career opportunities with leading organizations, access to a global community of professionals, and educational opportunities with leading technology providers. At Andela, we're deeply passionate about creating long-lasting and transformative growth opportunities for all - and doing it in an E.P.I.C. way. We're excited to continue building our remote-first team with incredible people like you. After applying for this role, you will join our Andela Community of brilliant technologists by passing a technical screening and live interview. As a community member, you'll have access to a multitude of exclusive technologist roles. Join Andela today to access this opportunity and more in our global marketplace. Our roles are typically filled at lightning speed, so if you're considering applying, get your application in quickly.
Andela's Benefits:
-- 100% full-time
-- 100% long-term
-- 100% payment in USD
-- 100% remote
Responsibilities
MUST HAVES
NICE TO HAVES
Important: This is a fully Remote opportunity for one of our esteemed clients.
● Working days: Monday to Friday
● Working hours: 9 AM to 6 PM
● Contract Estimated Duration: 12 months (renewable)
At Andela, we know our strengths lie in our diverse community whose talents, perspectives, backgrounds, and orientations we take pride in. Andela is committed to nurturing a work environment where all individuals are treated with respect and dignity. Everyone has the right to work in a professional atmosphere that promotes equal employment opportunities and prohibits discriminatory practices. Andela provides equal employment opportunities to all employees and applicants without regard to factors including but not limited to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability, pregnancy (including breastfeeding), genetic information, HIV/AIDS or any other medical status, family or parental status, marital status, amnesty or status as a covered veteran in accordance with applicable federal, state and local laws. This commitment applies to all terms and conditions of employment, including but not limited to hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. Our policies expressly prohibit any form of harassment and/or discrimination, as stated above. Andela is home for all. Come as you are.