Linux System Administrator - Chennai - SISL Global

    SISL Global
    SISL Global Chennai

    2 days ago

    Technology / Internet
    Description

    Job Description – HPC Engineer (HPC with SLURM, CPU & GPU Clusters)

    Position Overview

    We are seeking a skilled HPC Engineer to design, deploy, manage, and optimize our on premises High Performance Computing (HPC) environment, consisting of SLURM-managed CPU and GPU clusters. The ideal candidate will have a strong understanding of HPC architecture, Linux systems, job scheduling, and cluster operations. Experience with parallel file systems and enterprise storage solutions such as WekaFS or Scality is preferred but optional.

    Key Responsibilities

    1. HPC Infrastructure & Operations


    • Manage day to day operations of on prem HPC clusters including CPU and GPU compute nodes.


    • Monitor cluster health, performance, and utilization, ensuring high availability and efficiency.


    • Implement and maintain best practices for HPC operations, user management, and resource administration.


    • Troubleshoot cluster related issues including networking, node failures, job failures, and performance bottlenecks.


    • Support users in job submissions, resource usage, and HPC workflows.

    2. SLURM Workload Manager (Mandatory)


    • Configure, install, and manage SLURM workload manager across multiple clusters.


    • Handle queue creation, partition configuration, node allocation, fair share policies, and job prioritization.


    • Perform SLURM upgrades, migrations, and service maintenance with hands on expertise.


    • Work with SLURM APIs and integrations to support automation and custom workflows.


    • Optimize scheduling policies for mixed CPU/GPU workloads.

    3. Linux System Administration


    • Manage Linux-based compute nodes, head nodes, and administration servers.


    • Perform OS updates, package installations, security patching, and system tuning.


    • Knowledge of shell scripting (Bash/Python) for automation and HPC tooling workflows.

    4. Parallel Computing & Cluster Architecture


    • Understanding of parallel computing concepts: MPI, OpenMP, distributed execution.


    • Familiarity with HPC building blocks: interconnect networks (InfiniBand/100G), storage tiers, resource managers, monitoring tools.


    • Ability to analyze and troubleshoot performance issues in parallel workloads.

    5. Storage (Optional but Preferred)

    A. WEKA (WekaFS) – Optional


    • Knowledge of parallel file systems and performance tuning.


    • Diagnose and resolve issues related to WekaFS with minimal downtime.


    • Provide guidance to internal teams on WekaFS usage and best practices.


    • Stay updated with Weka ecosystem advancements and propose improvements.

    B. Scality – Optional


    • Troubleshoot and maintain Scality RING and ARTESCA environments.


    • Monitor, tune, and optimize Scality-based storage for high availability and reliability.


    • Create and maintain documentation for Scality configuration and SOPs.


    • Recommend performance improvements based on new Scality enhancements.

    Qualifications & Skills

    Mandatory Skills


    • Experience managing HPC clusters with SLURM in production environments.


    • Good understanding of Linux (RHEL) administration.


    • Knowledge of parallel computing concepts and HPC architecture.


    • Strong troubleshooting and diagnostic skills.


    • Ability to work in complex, multi-node distributed environments.

    Preferred/Optional Skills


    • Experience with WekaFS, Scality RING, or other parallel/distributed file systems.


    • Exposure to GPU computing (CUDA, NVIDIA drivers, GPU scheduling).


    • Familiarity with monitoring tools (Grafana, Prometheus).


  • Work in company

    System Administrator

    Only for registered members

    Position: System Administrator, Chennai · Department: Information Technology | Role: Full-time | Experience: 4 to 6 Years | Number of Positions: 1 | Location: Chennai · Skillset: · System Administrator, Scripting Language, AWS Services, Linux, Kubernetes, Docker, CI/CD, Good Engl ...

    Chennai, Tamil Nadu ₹400,000 - ₹1,200,000 (INR) per year

    6 days ago

  • Work in company

    System Administrator

    Only for registered members

    Job Description · We are looking for a passionate and detail-oriented System Administrator Trainee to join our IT team. The ideal candidate should possess strong technical knowledge in system and network administration, along with a proactive approach to troubleshooting and secur ...

    Chennai ₹400,000 - ₹1,200,000 (INR) per year

    5 days ago

  • Work in company

    System Administrator

    Only for registered members

    We are looking for a skilled · System Administrator to manage and maintain IT infrastructure, · servers networks and security systems. ...

    Chennai

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    We are looking for a skilled System Administrator with strong hands-on experience in system hardening endpoint security and network controls to manage and secure our company laptops desktops and internal IT infrastructureAntivirus · Perform OS & System Hardening (Windows/Linux/MA ...

    Chennai, Tamil Nadu

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    The job requires a system administrator to manage and maintain server hardware and software ensuring optimal performance. · ...

    Chennai

    1 week ago

  • Work in company

    System Administrator

    Only for registered members

    End-to-end management of HCI solution including VM creation, migration and optimization with minimal downtime. · ...

    Chennai

    4 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    Job summary: Provide technical expertise in system administration tasks. · ...

    Chennai

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    JiBe is a cloud based fully integrated ERP system for the shipping industry. · Strong troubleshooting skills on Windows platform & Network administration. · Address user tickets regarding hardware, software and networking · Installing and maintaining hardware and computer periphe ...

    Chennai, Tamil Nadu

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    MVH requires System Administrator For maintaining IT Infrastructure including hardware Networking servers security Backup and smooth operations through troubleshooting candidates Send your CV call Qual B.E IT M. · ...

    Chennai

    3 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    We are looking for an experienced IT Infrastructure professional with strong expertise in Desktop Support · L3/L4), Networking, IIS Administration, · and Basic MS SQL administration. · ...

    Chennai

    2 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    Alpha Group of Institutions is looking for a proactive and technically sound System Administrator / IT Administrator to manage campus-wide IT infrastructure, networking, ERP systems, and user support. · ...

    Chennai

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    The immediate joiner for System Administrator role will configure provision partners in IBM Sterling Integrator and File Gateway. · ...

    Chennai, Tamil Nadu

    4 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    This is a System Administrator role responsible for maintaining and upgrading Bentley ProjectWise and Autodesk Vault environments. · ...

    Chennai, India

    2 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    We are seeking a skilled System Administrator with experience in hardware, networking, and system support. The ideal candidate will maintain IT infrastructure, ensure network security and provide technical support across the organization. · Install configure and maintain servers ...

    Chennai

    1 week ago

  • Work in company

    System Administrator

    Only for registered members

    This is a System Administrator role responsible for managing and maintaining SAP Business One system landscape. · Manage and maintain SAP Business One (SAP B1) system landscape. · Provide end-user support for SAP B1. · ...

    Chennai

    2 months ago

  • Work in company

    System Administrator

    Only for registered members

    Support Single Sign-On using Active Directory Federation Services with Multi Factor Authentication. Perform scripting administration tasks and reporting using PowerShell. · ...

    Chennai

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    We are looking for an experienced System Administrator & Infrastructure Specialist with 4+ years of relevant experience to take full ownership of our on-premises and hybrid IT environments. · Take full ownership of our on-premises and hybrid IT environments. · ...

    Chennai

    3 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    Job Description: · We are seeking an experienced · System Administrator · with 8–10 years of expertise in managing and optimizing Microsoft SQL Server environments. The role involves ensuring high availability, performance, security, and reliability of database systems while supp ...

    Chennai, Tamil Nadu ₹400,000 - ₹1,200,000 (INR) per year

    4 days ago

  • Work in company

    It System Administrator

    Only for registered members

    Improve productivity, efficiency and safety levels, while reducing costs with JiBe ERP. · ...

    Chennai, Navi Mumbai

    2 weeks ago

  • Work in company

    System Administrator

    Only for registered members

    We are seeking a skilled · and proactive System Administrator with 4-5 years of hands-on experience in managing Linux, · Unix, · and Windows environments.Maintains responsibility accountability · and credibility in all work assignments. · IDentifies isolates resolves production ...

    Chennai

    1 month ago

  • Work in company

    System Administrator

    Only for registered members

    Install, configure and maintain desktop, server and network systems. Monitor system performance and troubleshoot issues related to hardware software and networking. · ...

    Chennai

    1 month ago

Jobs
>
Linux system administrator
>
Jobs for Linux system administrator in Chennai