Current jobs related to MLOps Site Reliability Engineer - Chennai, India - KLA

  • Only for registered members Chennai Full time $100,000 - $150,000 per year

    Explore your next opportunity at a Fortune Global 500 organization. Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better every day. · Ensure the reliability and uptime of critical services and infrastructure ...

  • confidential Chennai Full time

    Join us as a Site Reliability Engineer · You'll be managing the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) · We'll look to you to identify and automate manual tasks and implement observab ...

  • confidential Chennai Full time

    Description · We are seeking a Site Reliability Engineer (SRE) to join our team in India. The ideal candidate will be responsible for maintaining and improving the reliability and performance of our systems, ensuring high availability and scalability. This role requires a mix of ...

  • Only for registered members Chennai Full time $80,000 - $120,000 per year

    We are seeking a motivated Site Reliability Engineer (SRE) Level 1 to enhance the infrastructure and operational reliability of our ERP product, specifically within Azure and Windows environments. · The ideal candidate will utilize SRE principles to ensure high system availabilit ...

  • Poshmark Chennai ₹900,000 - ₹1,200,000 per year

    We're looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our ...

  • Only for registered members Chennai Full time ₹800,000 - ₹1,200,000 per year

    About NIQ Activate is the leading provider of AI-powered customer analytics, personalization, and brand collaboration platform. Serving dozens of retailers and brands across the world using cutting edge big-data, real-time analytics, and data-science automation. Disrupting existi ...

  • confidential Chennai Full time

    3 to 6 years of experience · Skills: Adobe Authoring, SQL, HTML, CSS, scripting · Role & Responsibilities · Ensure the customer experience is seamless throughout the funnel by creating and maintaining product pages, category pages cross-sells, and more. · Leverage hands-on experi ...

  • confidential Chennai Full time

    In your day to day role you will · You would manage and maintain a comprehensive list of open incidents and follow up with relevant teams to facilitate driving the incident to closure. Create comprehensive corrective actions for recurring problems. You would be driving all curren ...

  • confidential Chennai Full time

    Job Overview: · We are seeking two highly skilled Cloud Operations Engineers to support the projected growth of B2W in the Trimble Cloud for 2025. · These engineers will play a critical role in onboarding new customers, migrating existing ones, and providing 24x7 infrastructure s ...

  • Alignity Chennai

    Alignity: Site Reliability Engineer for Datadog & Dynatrace · We are seeking a skilled Site Reliability Engineer (SRE) to maintain high system reliability, optimize monitoring strategies, and ensure observability across applications and infrastructure. · Key Responsibilities: · D ...

  • consulant Chennai

    Key Responsibilities : · Elevate the performance and dependability of GDI&A platforms and applications by participating in 24x7 on-call production support rotations, handling incident response to minimize disruptions, and continuously monitoring system health. · Regularly review ...

  • noon Chennai

    Site Reliability Engineer at Noon · At noon, we're on a mission to accelerate the digital economy of the Middle East by empowering regional talent and businesses to meet the full range of consumers' online needs. · We're aggressively ambitious, with a technology leader that's rev ...

  • Concentrix Catalyst Chennai ₹1,800,000 - ₹2,500,000 per year

    Transforming Business Outcomes through Digital Excellence · About the Role · We are seeking an accomplished Senior Site Reliability Engineer to drive our digital transformation agenda. As a key member of our technology team, you will be responsible for delivering SRE strategies a ...

  • Only for registered members Chennai Full time ₹3,500,000 - ₹4,500,000

    This role is for one of Weekday's clients · Salary range: INR 35-45 LPA · Min Experience: 10 years · Location: Chennai · JobType: full-time · Monitoring and Alerting: Setting up and maintaining monitoring systems to track performance metrics, detect anomalies, and trigger ...

  • Only for registered members Chennai Full time $90,000 - $120,000 per year

    Join us as a Site Reliability Engineer to manage stable, resilient, reliable applications and minimize disruption to Customer & Colleague Journeys. You'll collaborate with feature teams, participate in delivery activities, and address production issues. This is a great chance to ...

  • ADP Chennai

    ADP is hiring Senior Site Reliability Engineer. · We're building the next generation of technologies. Our mission is to create powerful solutions that are efficient, intuitive, beautiful, and responsive. As a Site Reliability Engineer, you will be responsible for ensuring availab ...

  • confidential Chennai Full time

    Collaborate with partners to ensure technology solutions are supported by the right architectures and operating models. · Lead projects in Network Services, focusing on operations, optimization, automation, and security. · Advocate for Next Gen Wireless Network, LAN/WAN, Cloud, S ...

  • KLA Chennai

    MLOps Site Reliability Engineer · KLA Overview: · At KLA, we're a global leader in diversified electronics for the semiconductor manufacturing ecosystem. We're responsible for producing technologies used in virtually every electronic device worldwide. · We invest heavily in innov ...

  • Only for registered members Chennai Full time $120,000 - $150,000 per year

    Join Zuora's high-impact Operations team, where you'll be instrumental in maintaining the reliability, scalability, and performance of our SaaS platform. This role involves proactive service monitoring, incident response, infrastructure service management, and ownership of intern ...

  • MLOps Site Reliability Engineer - Chennai, India - KLA

    KLA
    KLA Chennai, India

    1 month ago

    Full time
    Description

    Company Overview

    KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA places an above-average focus on innovation, and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world's leading technology providers to accelerate the delivery of tomorrow's electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.

    Group/Division

    With over 40 years of semiconductor process control experience, chipmakers around the globe rely on KLA to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. Enabling the movement towards advanced chip design, KLA's Global Products Group (GPG), which is responsible for creating all of KLA's metrology and inspection products, is looking for the best and the brightest research scientists, software engineers, application development engineers, and senior product technology process engineers. Central Engineering is KLA's largest engineering organization, comprising 9 Centers-of-Excellence (CoEs) in various disciplines applied across all product groups in the company. These CoEs include Handling & Automation, Precision Motion Control, Sensors & Image Acquisition, Platform Design, and Packaging Engineering, among others. Talent includes over 500 engineers across global centers in Israel, China, India, and the US. Each CoE contributes not just talent and deliverables per discipline toward product programs, but also subject matter expertise, best practices, roadmaps, specialized facilities, apparatus, models, and analytics. These differentiate KLA not only in WHAT we do, but also in HOW we do it.

    Job Description/Preferred Qualifications

    We are seeking a highly skilled and motivated MLOps Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our machine learning infrastructure. You will work closely with data scientists, machine learning engineers, and software developers to build and maintain robust and efficient systems that support our machine learning workflows. This position offers an exciting opportunity to work on cutting-edge technologies and make a significant impact on our organization's success.

    Responsibilities:

    • Design, implement, and maintain scalable and reliable machine learning infrastructure.
    • Collaborate with data scientists and machine learning engineers to deploy and manage machine learning models in production.
    • Develop and maintain CI/CD pipelines for machine learning workflows.
    • Monitor and optimize the performance of machine learning systems and infrastructure.
    • Implement and manage automated testing and validation processes for machine learning models.
    • Ensure the security and compliance of machine learning systems and data.
    • Troubleshoot and resolve issues related to machine learning infrastructure and workflows.
    • Document processes, procedures, and best practices for machine learning operations.
    • Stay up-to-date with the latest developments in MLOps and related technologies.

    Required Qualifications:

    • Bachelor's degree in Computer Science, Engineering, or a related field.
    • Proven experience as a Site Reliability Engineer (SRE) or in a similar role.
    • Strong knowledge of machine learning concepts and workflows.
    • Proficiency in programming languages such as Python, Java, or Go.
    • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
    • Familiarity with containerization technologies like Docker and Kubernetes.
    • Experience with CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
    • Strong problem-solving skills and the ability to troubleshoot complex issues.
    • Excellent communication and collaboration skills.

    Preferred Qualifications:

    • Master's degree in Computer Science, Engineering, or a related field.
    • Experience with machine learning frameworks such as TensorFlow, PyTorch, or Scikit-learn.
    • Knowledge of data engineering and data pipeline tools such as Apache Spark, Apache Kafka, or Airflow.
    • Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK stack.
    • Familiarity with infrastructure as code (IaC) tools like Terraform or Ansible.
    • Experience with automated testing frameworks for machine learning models.
    • Knowledge of security best practices for machine learning systems and data.

    Minimum Qualifications

    Master's-level degree, or Bachelor's-level degree with 2 years of related work experience

    We offer a competitive, family friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.

    KLA is proud to be an equal opportunity employer.

    Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons that are currently posing as KLA employees.  KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched KLA's Careers website for legitimate job postings.  KLA follows a recruiting process that involves multiple interviews in person or on video conferencing with our hiring managers.  If you are concerned that a communication, an interview, an offer of employment, or that an employee is not legitimate, please send an email to to confirm the person you are communicating with is an employee. We take your privacy very seriously and confidentially handle your information.

