Jobs
>
Site Reliability Engineer
>
Hyderabad

    Site Reliability Engineer - Hyderabad, India - NCR Corporation

    NCR Corporation
    NCR Corporation Hyderabad, India

    Found in: Talent IN C2 - 2 days ago

    Default job background
    Full time
    Description

    About NCR VOYIX

    NCR VOYIX Corporation (NYSE: VYX) is a leading global provider of digital commerce solutions for the retail, restaurant and banking industries. NCR VOYIX is headquartered in Atlanta, Georgia, with approximately 16,000 employees in 35 countries across the globe. For nearly 140 years, we have been the global leader in consumer transaction technologies, turning everyday consumer interactions into meaningful moments. Today, NCR VOYIX transforms the stores, restaurants and digital banking experiences with cloud-based, platform-led SaaS and services capabilities.

    Not only are we the leader in the market segments we serve and the technology we deliver, but we create exceptional consumer experiences in partnership with the world's leading retailers, restaurants and financial institutions. We leverage our expertise, R&D capabilities and unique platform to help navigate, simplify and run our customers' technology systems.

    Our customers are at the center of everything we do. Our mission is to enable stores, restaurants and financial institutions to exceed their goals – from customer satisfaction to revenue growth, to operational excellence, to reduced costs and profit growth. Our solutions empower our customers to succeed in today's competitive landscape.

    Our unique perspective brings innovative, industry-leading tech to all the moving parts of business across industries. NCR VOYIX has earned the trust of businesses large and small — from the best-known brands around the world to your local favorite around the corner.

    TITLE : Site Reliability Engineer

    Job Role:

    We are looking for a Site Reliability Engineer (SRE) who will be part of our SRE team and help build scalable systems, using best practices around automation, that improve reliability, velocity and enable monitoring of the operational health of stacks throughout their life-cycle including metrics collection, aggregation, and visualization.

    As a member of the SRE team you will support NCR's Financial Services business unit, product and technology teams to improve the design and operation of systems, focusing on making them scalable, reliable, and efficient while ensuring performance and high availability of products/services primarily residing in the cloud. You will influence the development and implementation of reliable production systems and services to address emerging business needs (such as Cloud-based SaaS). SRE's pride themselves on the resiliency and stability of production systems, yet at the same time are committed to innovation and operational improvement through the application of software engineering practices to operations.

    The SRE will facilitate innovation and operational improvement through the application of software engineering practices to operations. You will make our products easier to adopt and use by making improvements to the product, tools, processes and documentation. You are someone who strives for six 9's or better for service availability

    Job Description:

  • You will be responsible for maintaining and scaling production services and servers for complex and high throughput cloud services.
  • You will bridge and own the union between development, quality, security and operations.
  • You will improve scalability, service reliability, capacity, and performance.
  • You will write automation code for provisioning and operating infrastructure at massive scale.
  • You are not an operator, you're an experienced software engineer focused on operations.
  • You will initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development.
  • You will use automation extensively to design, configure, manage, and monitor systems in support of our product development teams.
  • You will participate in disaster recovery planning and execution.
  • You will be responsible for maintaining / patching servers supporting SaaS products. This includes Windows Servers, Linux Servers running in in-house Datacenters and/or using cloud PaaS providers (Primarily GCP & Azure).
  • You'll work hand-in-hand with all teams to ship our code to production using Continuous Integration / Continuous Deployment (CI/CD) and AppSec tooling.
  • You will collaborate with development teams and use intuition, experience and understanding to create SLIs, SLOs, and SLAs.
  • You will provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems. (You will be on-call for periods of time.)
  • You will develop monitoring architecture, implementing monitoring agents, build dashboards, manage escalations and alerts
  • You will participate in incident management and driving root cause analysis (RCA) and risk management processes.
  • YOU HAVE:

  • BS degree in Computer Science or related technical field or 5 years prior relevant experience
  • Experience in a DevOps / SRE role with demonstrable experience in deploying and managing large scale production environments in GCP, AWS or Azure and Multi Datacenter environment.
  • Experience developing and debugging code (i.e. one or more of the following: Java, C, C++, .NET, Python, Ruby, Go, Shell, Perl, JavaScript)
  • 2+ years deploying and supporting high traffic, scalable web applications/services
  • Experience with Linux, Shell Scripting, PKI TLS/SSL, Network, firewalls, load balancers and backup
  • Experience with one or more CI tools GitHub, Jenkins, Artifactory
  • Experience with orchestration, automation, and configuration management tools like Terraform
  • Ansible and Helm (or related technology)
  • Experience with log management, including aggregation, alerting, and graphing (i.e Sensu/StackDriver/Prometheus/ELK/TICK stacks)
  • Excellent analysis, debugging, root-cause identification, and troubleshooting skills
  • YOU MIGHT ALSO HAVE:

  • 2+ years with cloud virtualization and PaaS
  • 2+ years with Docker, Kubernetes and early versions of OpenShift
  • Experience with Kubernetes, system virtualization, on-prem and/or hybrid cloud computing, cloud Identity and security system, cloud monitoring and logging, and/or local/cloud storage
  • Experience with application disaster recovery, migration, roll-back plans, expansion, routine deployments, and system upgrades
  • Experience hosting and solving problems with public-facing services securely in GCP, Azure or AWS
  • Experience in designing, analyzing and running large-scale distributed systems
  • Experience with Cassandra, Elasticsearch or Kafka
  • Experience with CI/AppSec tools – Sonar, Coverity, WhiteSource, Seeker, Aqua
  • Cloud certifications
  • Offers of employment are conditional upon passage of screening criteria applicable to the job


  • Wall Street Consulting Services LLC

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 1 day ago


    Wall Street Consulting Services LLC Hyderabad, India

    Role: SRE · Exp: 6+ years · Location: Pune, Bengaluru, Chennai · JOB DESCRIPTION: · The Role: · As a Site Reliability Engineer, you will be critical in ensuring our software products' reliability, scalability, and performance. You will be responsible for designing and implementin ...

  • Quiktrak, LLC

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 2 days ago


    Quiktrak, LLC Hyderabad, India

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer · Job Description: · Summary: · As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the ...

  • WaferWire Cloud Technologies

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 1 day ago


    WaferWire Cloud Technologies Hyderabad, India

    Hi, · This is Sundeep from Waferwire Technologies and we are hiring Site Reliability Engineer (SRE). · Role: Site Reliability Engineer (SRE) · Location: Hyderabad Office · Experience: 5 to 8 Years · Role Description: · Responsibilities: · Implement and maintain robust DevOps prac ...

  • DATAMTX LLC

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 2 days ago


    DATAMTX LLC Greater Hyderabad Area, India

    The Company · Datamtx / formerly Datamatics) established in 1993 and globally HQ'd in Atlanta has a stellar history supporting both Tier 1 and 2 ERP rollouts ranging from implementations, data cleanse, migrations, customization, hypercare and Day 1 support. We are also nationall ...

  • Coforge

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 2 days ago


    Coforge Hyderabad, India

    Role: Site Reliability Engineer · Location: Hyderabad · Work Mode: WFO · Experience: 6-10 yrs · Job Description: · Deep knowledge of version control. · Sound knowledge of operating Systems (like LINUX). · Should be aware of DevOps concepts and best practices. · CI/CD implementati ...

  • ValueLabs

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 3 days ago


    ValueLabs Hyderabad, India

    Experienced in SRE or Site Reliability Engineer · Design, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps. · Collaborate with cross-functional teams to optimize system performance, reliability, and scalability. · D ...

  • Banyan Cloud

    Service Reliability Engineer

    Found in: Talent IN 2A C2 - 1 day ago


    Banyan Cloud Hyderabad, India

    About US · Honest Data technologies Pvt Ltd, is a wholly owned subsidiary of Banyan Cloud, USA, the Cyber Security Product Company, headquartered in San Jose, California, USA, owning the SaaS product "Banyan Cloud", first of its kind Cyber Security CNAP Platform that simplifies t ...

  • Snaphunt

    Site Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    Snaphunt Hyderabad, India Full time

    The Offer · Work within a company with a solid track record of success · Great work environment · Attractive salary & benefits · The Job · You will be responsible for : · Gathering and evaluating user feedback. · Providing code documentation and other inputs to technical docume ...

  • Splunk Inc

    Database Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    Splunk Inc Hyderabad, India

    Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to ou ...

  • Shining Sheroes

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 3 days ago


    Shining Sheroes Hyderabad, India

    Primary job responsibilities : · - Ability to operate and maintain various OS platforms, with a focus on debugging, automation, availability, performance, and scale. · - Diagnose and troubleshoot complex distributed systems. · - Work and collaborate across teams, such as OS, Appl ...

  • Experian

    Site Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    Experian Hyderabad, India Full time

    Job Description · Experian is looking for a talented senior engineer to join our Site Reliability Engineering team. This team is focused on system performance, optimization, and keeping our AWS platform running reliably at scale. The ideal candidate should have an extensive back ...

  • Alter Domus

    Site Reliability Engineer

    Found in: beBee S2 IN - 2 days ago


    Alter Domus Hyderabad, India

    ABOUT US · We are Alter Domus. Meaning "The Other House" in Latin, Alter Domus is proud to be home to 85% of the top 30 asset managers in the alternatives industry, and more than 5,000 professionals across 23 countries. · With a deep understanding of what it takes to succeed in ...

  • Microsoft

    Site Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    Microsoft Hyderabad, India Full time

    Overview · Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing t ...

  • Maneva Consulting Pvt. Ltd

    Site Reliability Engineer

    Found in: Talent IN C2 - 3 days ago


    Maneva Consulting Pvt. Ltd Hyderabad, India

    GreetingsFromManeva · JobDescription · JobTitle Site ReliabilityEngineer · LocationBangalore/Hyderabad · Experience4 10Years · JobRequirement: · 1.Incident Management: · Lead incident management efforts coordinating with crossfunctionalteams to resolve service disruptions and mi ...

  • FedEx

    Site Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    FedEx Hyderabad, India Full time

    General SummaryNeed to play a key role in maintaining the reliability and performance of the systems and services in support operations. Also act as a player to bridge the gap between development/ implementation and support operations by addressing tasks typically handled by oper ...

  • IdeaHelix, Inc

    Site Reliability Engineer

    Found in: Talent IN 2A C2 - 1 day ago


    IdeaHelix, Inc Hyderabad, India

    Requirements: · Proficiency in system management scripting languages (like Python , shell), · Strong experience with Kubernetes, Docker containers and Terraform. · Understanding of Linux administration. · Experience with automation/ configuration management tools (like Ansible, S ...

  • Unison Consulting Pte Ltd

    Site Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    Unison Consulting Pte Ltd Hyderabad, India Full time

    Experience with supporting Java (J2EE/Spring Boot) based multi-tier applications with complex upstream downstream interactions having expertise in understanding the application request flow and analysing application logs for investigating and troubleshooting issues and applicatio ...

  • Medtronic

    Site Reliability Engineer

    Found in: Talent IN C2 - 1 day ago


    Medtronic Hyderabad, India

    Careers that Change Lives At Medtronic, we contribute to human welfare and wellbeing through biomedical engineering.Everyday we're involved in meaningful work to change people's lives and health for the better.Now is your chance to join a talented team of engineers focused on pr ...

  • Electronic Arts

    Site Reliability Engineer

    Found in: beBee S2 IN - 2 days ago


    Electronic Arts Hyderabad, India

    Pogo has been the leader in online casual games since 1998. Featuring a growing library of 60+ titles · spanning popular genres like Solitaire, Mahjong, Match 3, and more, Pogo exists to be the best · destination for online casual games. We strive to produce high-quality HTML5-po ...

  • Oriontek INC

    Site Reliability Engineer

    Found in: Talent IN C2 - 2 days ago


    Oriontek INC Hyderabad, India Full time

    The Role · You will be responsible for : · Gathering and evaluating user feedback. · Providing code documentation and other inputs to technical documents. · Supporting continuous improvement by investigating alternatives and new technologies and presenting these for architectur ...