- Provide L2 support to production systems like application, database, middleware components, infrastructure and network components
- Manage production incidents end-to-end within defined SLAs, with a focus on resolution rather than on who caused them.
- Interact with various stakeholders such as release managers, program leads, service managers, and development and test leads
- Review operational readiness requirements such as monitoring and alerting, log rotation and resilience of the components and report the gaps
- Provide pre-implementation support with activities such as release notes review and implementation dry runs.
- Protect production components by running health checks, monitoring latency and memory utilization.
- Automate day-to-day activities and propose changes that improve reliability
- Participate in CAB and provide feedback on change requests
- Support the DevOps team in testing the promote pipelines and suggest automation of configuration items.
- Practice incident management best practices and perform RCA.
- Participate in disaster recovery tests and operational acceptance tests
- Analyze the technology stack that makes up the product and optimize recovery time objective.
- Work with team members spread across time zones
- Share knowledge, document improvements and mentor junior resources
- Use Jenkins to orchestrate builds as well as link to Sonar, Maven, etc. to build out the CI/CD pipeline.
- Support deployments of code into multiple lower environments. Support the current processes as needed, with an emphasis on automating everything as soon as possible.
- Design, implement, and enhance our deployment automation based on Chef. We need proven experience designing and implementing an overall release and deployment process.
- Design and implement a Git-based code management strategy that will support multiple environment deployments in parallel. Experience with automation for branch management, code promotions, and version management.
- Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement.
- Handle MTF/Prod deployments, maintenance items (including stop/start and disaster recovery-related activities), and change requests (CRs) for changes in MTF/Prod
- Tools -
- Log monitoring tool - Splunk
- Application monitoring tool - Dynatrace
- Ticketing and incident/problem management tool - Remedy
- DevOps basics - CI/CD basics, overview of Git, Bitbucket, SonarQube, Ansible/Chef, Artifactory
- Skills -
- Linux & Shell Scripting
- ITIL / ITSM
- PL/SQL
- Troubleshooting
- Jenkins - CI/CD, Groovy scripting/YAML
Lead System Reliability Engineer - Pune, India - Fulcrum Digital
Job Description
Who are we
Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and manufacturing.
The Role
Requirements