HPC Engineer - Gujarat, India - Selby Jennings

    Selby Jennings
    Selby Jennings Gujarat, India

    Found in: Appcast Linkedin IN C2 - 1 week ago

    Default job background
    Description

    Job Description: HPC Reliability Engineer - High-Frequency Trading Firm

    Company Overview:

    Our client is a leading top-tiered high-frequency trading (HFT) firm headquartered in the United States. Their cutting-edge technology and advanced trading strategies allow them to capitalize on market opportunities with exceptional speed and precision. As part of their global team, you will have the opportunity to work on low latency applications and contribute to the development and maintenance of our high-performance computing (HPC) infrastructure.

    Position Overview:

    They are seeking a highly skilled and motivated HPC Reliability Engineer to join our global team. As an HPC Reliability Engineer, you will play a critical role in ensuring the stability, performance, and availability of their low latency applications and HPC infrastructure. You will collaborate with cross-functional teams, including traders, software developers, and system administrators, to design and implement reliable and efficient solutions to support our high-frequency trading operations.

    Responsibilities:

    • Implement and maintain a high-performance, low latency infrastructure that supports the firm's trading strategies.
    • Optimize and fine-tune the HPC infrastructure to achieve maximum performance and reliability.
    • Monitor, analyze, and troubleshoot HPC systems and applications to identify performance bottlenecks, latency issues, and potential failures.
    • Develop and implement robust monitoring and alerting mechanisms to proactively detect and mitigate system issues.
    • Collaborate with software developers to ensure the efficient utilization of hardware resources and the optimization of low latency applications.
    • Work closely with network engineers to minimize network latency and optimize data flow across the infrastructure.
    • Collaborate with system administrators to design and implement backup, recovery, and disaster recovery solutions.
    • Conduct performance analysis and capacity planning to anticipate future infrastructure needs and ensure scalability.
    • Stay up-to-date with industry trends, best practices, and emerging technologies related to HPC and low latency systems.
    • Document system configurations, procedures, and troubleshooting steps to facilitate knowledge sharing and ensure operational continuity.
    • Participate in on-call rotations and provide timely response and resolution to critical incidents.

    Qualifications:

    • Bachelor's or Master's degree in computer science, engineering, or a related field.
    • Strong experience in designing, implementing, and managing high-performance computing (HPC) infrastructure in a low latency, high-frequency trading environment.
    • In-depth knowledge of low latency architecture, including network optimization, kernel tuning, CPU affinity, and memory management.
    • Proficient in programming languages commonly used in HPC environments, such as C++, Python, and Java.
    • Experience with performance monitoring and analysis tools, such as Nagios, Ganglia, or similar.
    • Solid understanding of Linux/Unix systems and administration, including shell scripting and system-level troubleshooting.
    • Familiarity with storage technologies, such as NAS, SAN, and distributed file systems.
    • Strong problem-solving and analytical skills, with the ability to diagnose and resolve complex system issues under time pressure.
    • Excellent communication and collaboration skills, with the ability to work effectively in a global team environment.
    • Experience in the financial industry or high-frequency trading is a plus.

    Join their dynamic team and contribute to the success of a leading high-frequency trading firm. As an HPC Reliability Engineer, you will have the opportunity to work on cutting-edge technology, collaborate with talented professionals, and make a significant impact on our trading operations. They also offer a competitive compensation package and a stimulating work environment that encourages innovation and continuous learning.