Senior DevOps Engineer - Greater Bengaluru Area, India - Groww

    Groww
    Groww Greater Bengaluru Area, India

    2 weeks ago

    Default job background
    Accounting / Finance
    Description

    About Groww

    We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey.

    Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the customers' needs and convenience in mind.

    Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo.

    Are you as passionate about defying conventions and creating something extraordinary as we are? Let's chat.

    Our Vision

    Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services.

    Our long-term vision is to become the trusted financial partner for millions of Indians.

    Our Values

    Our culture enables us to be what we are — India's fastest-growing financial services company. It fosters an environment where collaboration, transparency, and open communication take center-stage and hierarchies fade away. There is space for every individual to be themselves and feel motivated to bring their best to the table, as well as craft a promising career for themselves.

    The values that form our foundation are:

    • Radical customer centricity
    • Ownership-driven culture
    • Keeping everything simple
    • Long-term thinking
    • Complete transparency

    We are looking for a DevOpsEngineer to support application development, infrastructure and security from the start by automating workflows to keep the DevOps workflow from slowing down. The ideal individual should have the mindset of DevOps with built-in security, not security that functions as a perimeter around apps and data. The candidate will work closely with Architect, quality assurance and development teams throughout the application lifecycle to achieve application availability, scalability and operational effectiveness in the most secure way aligned with automation. The position also offers to achieve & maintain the uptime, performance and scalability of very low latency and high traffic apps hosted on microservice platforms. The candidate will work with the cutting edge technology leveraging Containers, Kubernetes, Jenkins, Saltstack, Kafka, Redis, RabbitMQ, Elastic Search, GitLab, MySQL, Scylla, Service Mesh, Prometheus, Loki etc

    Roles and Responsibilities

    • Bridging the gaps b/w core infra, security, QA and development team.
    • Owning the end-to-end Availability, Performance, Capacity of applications and their infrastructure and creating/maintaining the respective observability with Prometheus/New Relic/ELK/Loki.
    • Providing 24X7 infra & app support, building processes and documenting "tribal" knowledge around the same time.
    • Mentor and train L1 engineers and continually improve app and infra support processes.
    • Managing application deployment & GKE platforms - automate and improve development and release processes.
    • Creating, managing and maintaining datastores & data platform infra using IaC.
    • Owning and onboarding new applications with the production readiness review process.
    • Managing the SLO/Error Budgets/Alerts and performing root cause analysis for production errors.
    • Working with Core Infra, Dev and Product teams to define SLO/Error Budgets/Alerts.
    • Working with the Dev team to have an in-depth understanding of the application architecture and its bottlenecks.
    • Identifying observability gaps in application & infrastructure and working with stakeholders to fix them.
    • Managing outages and doing detailed RCA with developers and identifying ways to avoid that situation.
    • Automate toil and repetitive work.

    Experience & Skills

    • 6 to 8 Years of experience in managing high traffic, large scale microservices and infrastructure with excellent troubleshooting skills.
    • Experience in troubleshooting, managing and deploying containerized environments using Docker/containerd, Kubernetes is a must.
    • Must be proficient with the helm with experience in service mesh like Istio, Linkerd.
    • Must be very hands-on in managing and troubleshooting the Kubernetes environment.
    • Extensive experience with Linux administration and a good understanding of the various Linux kernel subsystems (memory, storage, network etc).
    • Extensive experience in DNS, TCP/IP, UDP, GRPC, Routing and Load Balancing.
    • Expertise in GitOps, Infrastructure as a Code tool such as Terraform etc.. and Configuration Management Tools such as Chef, Puppet, Saltstack, Ansible.
    • Expertise in Google Cloud (GCP) and/or other relevant Cloud Infrastructure solutions like AWS or Azure.
    • Experience in building the CI/CD pipelines with tools such as Jenkins, GitLab, Spinnaker, Argo etc.
    • Experience with multiple datastores is a plus (Kafka/RabbitMQ, Redis, Elasticsearch).
    • Must be good in any of the DevOps scripting languages - python or go.
    • A collaborative spirit with the ability to work across disciplines to influence, learn and deliver.
    • A deep understanding of computer science, software development, and networking principles