SRE lead - Coimbatore, India - AppViewX



    Description

    Site Reliability Engineer:

    Experience: 7+ years

    Location: Coimbatore

    Who we are and What we do?

    AppViewX is trusted by the world's leading global organizations to reduce risk, ensure compliance, and increase visibility through machine identity management and application infrastructure security and orchestration. At AppViewX, you will get to work with our bleeding-edge Automation Platform, which facilitates digital transformation through streamlined workflows to prevent outages, reduce security incidents, and protect both an enterprise's reputation and its bottom line. AppViewX has also been certified as a Great Place to Work in India, cementing us as an employer of choice.

    Glimpse of our Team

    Are you passionate about solving problems and being a technical expert for enterprise-level customers? Well, join our team of expendables. We help customers with infrastructure design studies for product deployments, scalability and performance recommendations, product upgrades, hotfix patching, and troubleshooting and triaging issues with internal stakeholders. More importantly, we wear the customer hat to guide engineering and product teams in enhancing product quality and growth direction. Automation is a key growth area, and that's the icing on the cake.

    What will you be responsible for?

    • Provide leadership and guidance, act as subject matter expert, and foster the use of best practices.
    • Oversee the involvement of the SRE team in the SDLC to ensure the performance, scalability, and reliability of our services.
    • Work with engineering, QA, and other teams in the architecture and implementation of Internet-scale services.
    • Create and review requirements specifications; evaluate solutions and designs; assess implementations.
    • Lead SRE projects and tasks, including OAM tasks, with a focus on optimization.
    • Coach and mentor the SRE team to improve their knowledge, their expertise, and the quality of engineering deliverables, and provide recommendations to ensure the dependability of our services.
    • Build automation for large-scale OAM of our systems and services.
    • Prevent incidents that could impair the operational readiness of our systems and services.
    • Set up logging and monitoring systems that alert on symptoms instead of outages.
    • Improve the performance, scalability, reliability, quality, and time-to-market of our suite of software solutions.
    • Improve our operational processes (deployment, onboarding, upgrades, decommissioning, etc.) with automation, reducing human intervention to a minimum while still supporting it in rare cases.
    • Assist with the architecture and implementation of Internet-scale services.
    • Support the production SaaS environment by monitoring key performance indicators and taking a holistic view of system health.
    • Debug production issues across all tiers, layers, and components of our applications and services.
    • Document cases to reflect the actions taken, inform customers of problem status, and provide updates and solutions in a professional and timely fashion over the lifetime of the support request.
    • Take ownership of customer issues when escalated by customer management. Drive to resolve issues effectively, escalating cases to development teams where necessary.

    What do we require?

    • Bachelor's degree in Computer Science, Engineering, or relevant field.
    • 7+ years of experience covering DevOps, software development, and SRE.
    • Experienced in Linux System Administration and Networking.
    • Strong programming skills: Linux Shell and Python.
    • Experienced with configuration management systems/tools (Ansible), infrastructure as code (CloudFormation, Helm, Terraform), version control (Git), and CI/CD (Jenkins).
    • Solid experience in cloud computing (preferably AWS; also GCP and Azure).
    • Solid experience in container technology (Docker, containerd), container orchestration (Kubernetes), service meshes (Istio), observability (Prometheus, Jaeger), visualization (Grafana), and event logging and alert management (Splunk, Elasticsearch).
    • Strong understanding of IT and Security best practices, controls, regulations, standards, tools, etc.
    • Data mining experience.
    • Strong understanding of the WWW architecture and technologies.
    • Strong understanding of databases (SQL and NoSQL).
    • Experienced in the delivery of IaaS, and/or PaaS, and/or SaaS.
    • Project management background and experience leading large technology implementations.
    • Intense dedication to the performance, scalability, and reliability of applications and systems.
    • Significant experience in DevOps and SRE adoption and evolving practices.
    • Quick, effective, efficient.
    • Strong leadership skills.
    • Strong communicator, able to lead and facilitate discussions across many tiers including business, architecture/design, engineering, DevOps, operations, etc.
    • Experienced in 24x7 Operations.

    What do we desire?

    ● Exposure to Azure and other cloud platforms.

    ● Experience in technical engineering / design of SaaS environments is a plus.

    ● Experience in CI/CD technologies such as Jenkins is a plus.

    ● Hands-on coding abilities in Python and Terraform are a plus.

    What brings us together?

    What makes us stand out from the rest is our people: the people who push hard every day to be smarter and more passionate, resilient, creative, inclusive, and curious. Surrounding ourselves with diverse and talented minds has made us what we are, and we can't wait to see the positive impact it will have on you too.

    What's more in store?

    AppViewX is on par with leading global companies when it comes to the benefits it offers its employees, ranging from competitive incentives, health and wellness policies, and savings and investment schemes to time off/sabbatical eligibility and dedicated L&D.

    What we consider equally important is the flexibility we offer our employees to work remotely, define their own hours, and, more importantly, harmonize work and life. The more trust and accountability we place in our employees, the more they surpass our goals and expectations.

    Why AppViewX?

    AppViewX caters to a wide range of customers from Fortune 1000 companies, including six of the top ten global commercial banks, five of the top ten global media companies, and five of the top ten managed healthcare providers. Over the years, we have grown our diverse team, perfected our automation platform, and expanded our global footprint to India, North America, the United Kingdom, and Australia. Today, we are headquartered in New York City and have come a long way by making the most of opportunities to create lasting relationships with enterprises, gaining unshakable customer trust along the way.

    AppViewX is proud to be an Equal Employment Opportunity Employer. It is AppViewX's policy to afford equal employment opportunities to all employees regardless of race, color, national origin, ancestry, religion, citizenship status, gender, gender expression or identity, sexual orientation, age, marital status, military or veteran status, pregnancy, disability, genetic information, arrest record, or other protected class under state, federal, or local law.