Jobs

    Lead Site Reliability Engineer - Bangalore, India - GetHyr

    GetHyr
    GetHyr Bangalore, India

    4 days ago

    Default job background
    permanent Technology / Internet
    Description

    Job Description :

    Maintain services once they are live by measuring and monitoring availability, latency, and overall system reliability.

    • Work closely with team members to ensure best practices and strategic goals are incorporated into development work.
    • Collaborate with other engineering teams to identify and anticipate changing requirements and opportunities to improve the development environment.
    • Monitoring at scale with VictoriaMetrics and the like.
    • Orchestrating and managing with K8S and the like.
    • Implementing best practices, challenging the status quo, and tab on industry and technical trends, changes, and developments to ensure the team is always striving for bestinclass work.
    • Manage capacity, build security into every layer, and reduce cost.
    • Implement secure networking, key management, user management, access management, process management, and image management.
    • Effectively lead and manage team deliverable (short/long term) project planning and coaching, quarterly reviews, participation in the selection process for new hires, and technical and nontechnical guidance to the team.
    Requirements :


    • Proven experience in handling large infrastructure and distributed systems like Yarn, Kubernetes, Elasticsearch, Kafka, etc.
    • Familiarity with Pythonrelated technologies and frameworks like Falcon, Django, or Pyramid.
    • Experience with Unix/Linux operating systems internals and administration (e. g. filesystems, inodes, system calls, etc. ) or networking (e. g. TCP/IP, routing, network topologies, and hardware, SDN, etc. ).
    • Familiarity with the cloud computing infrastructure, preferably Azure.
    • Familiarity with task queue frameworks like Celery or Pika is a plus.
    • Source code management and Implementation of security best practices.
    • Deep understanding of modern software architectures, including loadbalancing, queueing, caching, distributed systems failure modes generally, microservices, and big data technologies.
    • Knowhow in gathering metrics across distributed systems (instances/container) and generating automated notifications, and reports.
    • Prowess in analyzing App bottlenecks, and performance degradation, and implementing automated processes/tools to detect such anomalies.
    Good understanding and implementation experience using 12-factor App principles.

    Mandatory Skills :

    years of Experience on the AWS/Azure platform.

    • Excellent programming (Python, Go, Ruby, or preferred scripting languages) and automation skills.
    • Deep understanding of container orchestration technologies
    • Kubernetes.
    • Should have had prior experience in migrating high throughput services to Kubernetes.
    • Expertise in any CI/CD tools build, artifact, packaging, and service discovery management tools. Gitops preferred.
    • Expertise in skillsets for centralized logging systems, metrics, and tooling frameworks such as ELK, Prometheus/VictoriaMetrics, and Grafana.
    • Great communication, interpersonal, and teamwork skills.
    • Experience with AWS/Azure cost explorer, billing analysis, and various cost optimization techniques.
    • Awareness of Cloud Security concepts.
    • Awareness of Information Security Concepts and Best Practices.
    Good to have :


    • AWS/Azure cloud certification preferred.
    • Certification in Kubernetes Administrator (CKA).
    • Certification in Kubernetes Application Developer (CKAD).
    • Experience with configuration management tools and strong code analysis skills in Python.
    • Experience in working with APMbased tools like New Relic.
    )


  • Waytogo Consultants Bangalore, India permanent

    Job Description : · As an SRE Lead (Site Reliability Engineering Lead), you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and services. · He/ She will lead a team of SREs (Site Reliability Engineers) and collaborate closely wit ...


  • Integra Connect Bangalore Urban, India

    About IntegraConnect · Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud plat ...


  • Jobeefie pvt ltd Bangalore, India permanent

    Responsibilities : · - Establish instrumentation to measure SLI (Service Level Indicators), define SLO (Service Level Objectives), Alerting mechanisms, review with Stakeholders · - Ensure the reliability, scalability and performance of our cloud-based systems and On-Prem Systems. ...


  • Zyoin group Bangalore, India permanent

    We are seeking a highly skilled and experienced Senior Reliability Engineer to join our team in Bangalore and play a crucial role in ensuring the reliability, scalability, and performance of our critical software systems. · Location : Bangalore (Hybrid) · Responsibilities : · - D ...


  • Zyoin group Bangalore, India permanent

    Responsibilities : · - Work with the Kubernetes, and Service Mesh team to manage our growing fleet of clusters globally, across multiple Cloud providers. · - Work with the Development Tools and Service Mesh teams to implement and measure SLAs, SLOs, and MTTD/R for our services fa ...


  • Signify Netherlands B.V. Bangalore, India Full time

    Site Reliability Engineer · Signify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of light for brighter lives and a better world. · We are proud to be ahead o ...


  • Qualcomm Bangalore, India Paid Work

    Company: · Qualcomm India Private Limited · Job Area: · Information Technology Group, Information Technology Group > IT Software Engineer · Qualcomm Overview: · Qualcomm is a company of inventors that unlocked 5G ushering in an age of rapid acceleration in connectivity and new po ...


  • Edge In Asia Recruitment Private Limited Bangalore, India permanent

    Our client is a global investment banking company having its headquarters in the US with employees globally and is looking for a SRE to join their Bangalore regional team. · Note : Looking for candidates who can join within 30days · Work Mode : Hybrid · Experience : 7 to 10 year ...


  • Magna International Inc. Bangalore, India

    Job Number: 65448 · Group: Magna Corporate · Division: Magna Corporate R&D India · Job Type: Permanent/Regular · Location: BANGALORE · Work Style: · About us · We see a future where everyone can live and move without limitations. That's why we are developing technologies, s ...


  • The HRBPs Bangalore, India permanent

    Lead Site Reliability Engineer - Bangalore · Exp - 8 to 12 years · Responsibilities : · - Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers. · - Providing customers ...


  • Dashhire Bangalore, India permanent

    Actively seeking a Senior Site Reliability Engineer (Senior SRE) to elevate the reliability, scalability, and performance of our cloud management platform. · You will be at the forefront of developing and maintaining sophisticated tools and systems to automate and optimize the ma ...


  • The HRBPs Bangalore, India permanent

    Lead Site Reliability Engineer - Bangalore · Experience - 8 to 12 years · Responsibilities : · - Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers. · - Providing cu ...


  • Orange Shark Bangalore, India permanent

    Position : Senior Site Reliability Engineer (SRE) · Experience : 5+ Years · Location : Bangalore (Hybrid) · Notice Period : Immediate to 30 Days Notice · Job Description : · Key Responsibilities : · - Design and implement sophisticated automation solutions for the management and ...


  • Meesho Bangalore, India permanent

    Site Reliability Engineer II · Bangalore, Karnataka Tech Infrastructure /Full Time Employee /On-Site · About the Team : · When 5% of Indian households shop with us, its important to build resilient systems to manage millions of orders every day. Weve done this with zero downtime ...


  • Smarsh Bangalore Urban, India

    Smarsh is the leader in communications compliance, archiving, and analytics. We provide compliance across the broadest set of communications channels with insights on what's being captured. Smarsh customers manage over 500 million daily conversations across 80 channels and growin ...


  • Winfort Services Pvt ltd Bangalore, India permanent

    Job Description : · Building and maintaining network monitoring, orchestration and automation solutions, including automated inventory reconciliation and remediation, · - Monitor the performance of our network infrastructure and develop automated solutions to address any issues. ...


  • Novopay Bangalore, India permanent

    About Trustt : · Trustt (formerly Novopay) was founded by Srikanth Nadhamuni (Founder CTO Aadhaar) and Gautam Bandyopadhyay (a FinTech industry veteran and former Head of Finacle Innovation Hub at Infosys). Vinod Khosla, the legendary silicon-valley venture capitalist, is our chi ...


  • Grizmo Labs Bangalore, India permanent

    Responsibilities : · - Architect and deploy scalable infrastructure and platform services (Monitoring, logging, etc) with a focus on simplicity and automation. · - Own the performance and reliability of backend services, data pipelines, platform services, etc, and work with devel ...


  • TalentXo Bangalore, India permanent

    Role & Responsibilities : · - As a member of the cloud engineering team you get to build the cloud infrastructure in which our Jobvite applications run · - You will get to build the tools that monitor, deploy and manage our web applications and backend systems · - Participate in ...


  • Wipro Bangalore Urban, India

    Principal Site Reliability Engineer · We are seeking a highly skilled and experienced Principal Site Reliability Engineer (SRE) to join Lab45 team in Wipro. As a Principal SRE, you will play a critical role in ensuring the reliability, availability, and performance of our systems ...