Rajat Saini

MLOps and AI Infrastructure Engineering

Engineering / Architecture

Rewāri, Rewari

About Rajat Saini:

Expertise in architecting, developing, and operating scalable, GPU-accelerated AI systems (NLP and CV), with proficiency in Python and Java. Experienced in cloud and on-premises infrastructure, container orchestration (Kubernetes), MLOps (Kubeflow), distributed inference and optimization (vLLM, Ray, TensorRT-LLM, Triton, ONNX, DeepSpeed, NVIDIA Model Optimizer), parameter-efficient fine-tuning (PEFT with LoRA/QLoRA), and system observability.

Experience

AI Platform & Infrastructure Engineer | Senior Software Engineer

I design and manage large-scale AI infrastructure, with expertise in GPU scheduling and optimization, distributed model serving (vLLM + Ray), GPU-accelerated analytics, and production-grade deployment pipelines. I deliver full-lifecycle AI solutions, from model training and convergence to drift monitoring and high-availability production deployments, using Kubeflow, SageMaker, and custom MLOps pipelines. Skilled in fine-tuning large language models with LoRA and in optimizing training with frameworks such as DeepSpeed, on both resource-constrained setups and distributed multi-node clusters.

Specialized in containerization and orchestration with Docker and Kubernetes, architecting scalable, fault-tolerant systems. Experienced in building CI/CD pipelines with Jenkins and automating infrastructure provisioning with Terraform and Ansible.

Strong software engineering foundation across FastAPI (Python), Spring Boot (Java), and Laravel, with expertise in microservices (gRPC/REST) and high-throughput asynchronous processing (Kafka, RabbitMQ, Redis Streams). Versatile in deploying solutions across AWS, GCP, OpenStack, and on-premises environments with Proxmox virtualization.

Education

BTech, MTech, IIT Roorkee
