Jobs

    Site Reliability Engineer - india, India - HCLSoftware

    HCLSoftware
    HCLSoftware india, India

    3 weeks ago

    Default job background
    Description

    The Role:

    HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a new product that will help keep our customers' end points secure. You will be a part of a team that leverages modern technological solutions to drive growth and efficiency. Your daily responsibilities will be centered on HCL BigFix's cloud infrastructure, with daily tasks related to improving scalability, reliability, and observability.

    The ideal candidate will have a strong background in software engineering and system administration, with a proficiency in modern infrastructure tools (e.g., Kubernetes, Docker, AWS/GCP/Azure), with a passion for designing, implementing, and maintaining reliable and

    scalable systems. On-call duties are involved in this role.

    What You Do ?

    • Collaborate with development and operations teams to design, implement, and maintain scalable and reliable infrastructure solutions.
    • Implement and manage monitoring, alerting, and logging systems to ensure proactive identification and resolution of issues.
    • Work on the automation of infrastructure provisioning.
    • Perform regular system and application performance analysis, tuning, and capacity planning.
    • Ensure cost efficiency and efficacy of complex, multi-cloud product and tackle ongoing cost minimization efforts.
    • Ensure the availability of new and existing developer tools.
    • Drive the migration of large-scale, distributed diagnostics applications towards cloud-native microservices.
    • Analyze and plan for capacity management and lead infrastructure change management for cloud-based services.
    • Work with SWE counterparts to identify and mitigate production issues.
    • Document and implement failover/disaster recovery plans.
    • Participate in code reviews and contribute to technical architecture documents.
    • Participate in team on-call rotations.

    What You Bring:

    • BS in Computer Science or related technical field or proof of exceptional skills in related fields with practical software engineering experience.
    • Expert knowledge of cloud operating system internals, filesystems, disk/storage technologies, and storage protocols, and networking stack.
    • Experience leading troubleshooting and full-cycle incident response, including mitigation, correction, and prevention.
    • 3+ years of managing services in distributed systems.
    • 3+ years of experience with common containerization tools, such as Kubernetes or Docker.
    • Expert knowledge of at least one higher-level language such as Python or Go.
    • Expert knowledge of CI/CD tools, Jenkins or GitHub Actions.

    Candidate Data Privacy Notice


  • Cargill

    Reliability Engineer

    3 weeks ago


    Cargill india, India

    Job Purpose and Impact · The Reliability Engineer, will perform routine activities to deliver continuous improvement in process and asset reliability through the detection and elimination of defects. In this role, you will use your knowledge to fulfill reliability engineering s ...


  • IKAI Technology Solutions India

    Company Description · IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global ente ...


  • QuEST Global Services Pte. Ltd india, India

    Quest Global is an organization at the forefront of innovation and one of the world's fastest growing engineering services firms with deep domain knowledge and recognized expertise in the top OEMs across seven industries. We are a twenty-five-year-old company on a journey to beco ...


  • System Soft Technologies India

    Title: Site Reliability Engineer · 100% REMOTE · The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of r ...


  • Circles Life india, India

    Job Description · Role: Site Reliability Engineer (SRE) · Title: Software Engineer II, SRE · Location: Bangalore · About Circles · Founded in 2014, Circles is a global technology company reimagining the telco industry with its SaaS platform - Circles X, helping telco op ...


  • STAFIDE india, India

    Job Description · About us: · Stafide is the premier destination for tech talent consulting, providing comprehensive employment services throughout Europe. Our mission is straightforward: to effortlessly connect job seekers with employers, focusing on the rapidly changing techn ...


  • Exoscale india, India

    Job Description · Exoscale is the leading Swiss/European cloud service provider. · With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order t ...


  • Serendipity Recruiting india, India

    Job Description · As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. · Our client firmly believes that exceptiona ...


  • QuEST Global Services Pte. Ltd india, India

    Quest Global is an organization at the forefront of innovation and one of the world's fastest growing engineering services firms with deep domain knowledge and recognized expertise in the top OEMs across seven industries. We are a twenty-five-year-old company on a journey to beco ...


  • Mobile Programming LLC india, India

    Location : Pune · NP : Immediate / Serving Notice Period · Years of Experience : 12+ · Role : Site Reliability Engineer · Mandatory Skill : Java, GCP, AWS, CICD · Job Description : · Requirements : · Minimum 12+ years experience as a Site Reliability engineer supporting diff ...


  • Elfonze Technologies Pvt Ltd india, India

    Job Description Perform reliability evaluation of IC products, packages, and process technology with focus on suitability to end applications and conformance to industry standards. · Perform device level failure analysis for an in-depth understanding of IC device failures. · An ...


  • HuntingCube Recruitment Solutions India

    Position: SRE · Experience: 4-6 years · Qualification: B.tech/BE/MCA · Location: Remote · Notice period: Immediate/Serving/30 days · Key skills : Terraform, Jenkins, Kubernetes, Any cloud but AWS preferred, · Any Programming Language Like Python/Scala etc, Observability, SLI · Re ...


  • Ideope Media india, India

    We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production services. · About Inc42 Media · Inc42 is India's #1 startup media & intelligence platfo ...


  • World Wide Technology India

    World Wide Technology (WWT), a global technology integrator and supply chain solutions provider. WWT employs more than 7000 people worldwide and operates in more than 2 million square feet of state-of-the-art warehousing, distribution, and integration space strategically located ...


  • LivePerson, Inc india, India

    Overview: · LivePerson is looking for a Site Reliability/DevOps Engineer for the GPT (Global Product & Technology) Division. You will be part of the LivePerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a st ...


  • World Wide Technology India

    Responsibilities · This role is part of a dedicated team of SREs that operate mission-critical IT Infrastructure and Cloud Management platforms. The role requires communications skills and patience to work with people as well as technology. We encourage our engineers to work wit ...


  • Unilog india, India

    Job Title : Site Reliability Engineer · Job Summary : · As a Site Reliability Engineer (SRE) specializing in Google Cloud Platform (GCP), you will be responsible for designing, implementing, and maintaining highly scalable and reliable systems. You will collaborate with developm ...


  • Exoscale india, India

    Job Description · Exoscale is the leading Swiss/European cloud service provider. · With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order t ...


  • Aventurine Technologies Inc india, India

    Job Description · SRE (Site Reliability Engineer) · Dallas, TX – Hybrid (F2F interview will be requested) · 6+ Mon Contract · Note: Look for candidates with over 9+ Years' experience. · Job Description (SRE) · • Collaborating closely with engineering teams on building and en ...


  • Travash Software Solutions/Risk Resources Anywhere in India/Multiple Locations Full time

    Job Description: · 10+ years of experience in SRE or a related field. · Proven experience in designing, developing, and implementing monitoring solutions. · - Deep understanding of monitoring technologies and tools, including Prometheus, Grafana, Loki, and Tempo · - Experience ...