Principal Member of Technical Staff/Solutions Architect - Hyderabad, India - Oracle

    Oracle
    Description

    Oracle Cloud Infrastructure (OCI) is a pioneering force in cloud technology, merging the agility of startups with the robustness of an enterprise software leader.

    Within OCI, the Oracle Generative AI Service team spearheads innovative solutions at the convergence of artificial intelligence and cloud infrastructure.

    As part of this team, you'll contribute to large-scale cloud solutions utilizing cutting-edge machine learning technologies, aimed at addressing complex global challenges.

    Join us to create innovative solutions using top-notch machine learning technologies to solve global challenges.

    We're looking for an experienced Software Development Engineer (IC4) to join our OCI Gen AI Solutions for Strategic Customers team.

    In this role, you'll collaborate with applied scientists and product managers to design, develop, and deploy tailored Gen-AI solutions with an emphasis on Large Language Models (LLMs) and Retrieval Augmented Generation (RAG).

    Career Level - IC4

    As part of the OCI Gen AI Solutions for strategic customers team, you will be responsible for developing innovative data science services for our customers.

    As a Principal member of the technical staff, you'll lead the development of advanced Gen AI solutions using the latest ML technologies combined with Oracle's cloud expertise.

    Your work will significantly impact sectors like financial services, telecom, healthcare, and code generation by creating distributed, scalable, high-performance solutions for strategic customers.

    Work directly with key customers and accompany them on their Gen AI journey: understand their requirements, help them envision, design, and build the right solutions, and work with their ML engineering teams to remove blockers.

    You will dive deep into model structure to optimize model performance and scalability.
    You will build state-of-the-art solutions with brand-new technologies in this fast-evolving area.
    You will diagnose, troubleshoot, and resolve issues in AI model training and serving. You may also perform other duties as assigned.
    Build reusable solution patterns and reference solutions/showcases that can apply across multiple customers.
    Be an enthusiastic, self-motivated, and great collaborator.

    Be our product evangelist: engage directly with customers and partners, and participate in and present at external events and conferences.

    Qualifications and experience

    Bachelor's or master's degree in computer science or an equivalent technical field, with 10+ years of experience.
    Able to communicate technical ideas clearly, both verbally and in writing (technical proposals, design specs, architecture diagrams, and presentations).
    Demonstrated experience in designing and implementing scalable AI models and solutions for production, and relevant professional experience as an end-to-end solutions engineer or architect (data engineering, data science, and ML engineering is a plus), with evidence of close collaboration with PM and Dev teams.

    Experience with OpenSearch, Vector databases, PostgreSQL and Kafka Streaming.

    Practical experience with the latest technologies in LLM and generative AI, such as parameter-efficient fine-tuning, instruction fine-tuning, and advanced prompt engineering techniques like Tree-of-Thoughts.

    Hands-on experience with emerging LLM frameworks and plugins, such as LangChain, LlamaIndex, VectorStores and Retrievers, LLM Cache, LLMOps (MLflow), LMQL, and Guidance.

    Proven experience in designing data collection/annotation solutions and systematic evaluation necessary for developing and maintaining production systems.
    Strong publication record, including as a lead author or reviewer, in top-tier journals or conferences.
    Ability and passion to mentor and develop junior machine learning engineers.
    Proficient in Python and shell scripting tools.
    Preferred Qualifications:
    Master's or bachelor's degree in a related field, with 7+ years of relevant experience.
    Experience with RAG-based solutions architecture.
    Familiarity with OpenSearch and vector stores as a knowledge store.
    Knowledge of LLMs and experience delivering generative AI and agent models are a significant plus.

    Familiarity and experience with the latest advancements in computer vision and multimodal modeling is a plus.
    Experience with Large Language Model (LLM) serving technologies such as DeepSpeed and FasterTransformer.
    Experience working in a public cloud environment, and in-depth knowledge of the IaaS/PaaS industry and competitive capabilities.
    Experience with popular model training and serving frameworks such as KServe, Kubeflow, and Triton.
    Experience with LLM fine-tuning, especially the latest parameter-efficient fine-tuning technologies and multi-task serving technologies.
    Deep technical understanding of Machine Learning, Deep Learning architectures like Transformers, training methods, and optimizers.

    Experience with deep learning frameworks (such as PyTorch, JAX, or TensorFlow) and deep learning architectures (especially Transformers).
    Experience in diagnosing, fixing, and resolving issues in AI model training and serving.