company logo

DevOps Engineer (Apps)

Mumbai
Gurgaon
Full-Time
Senior: 7 to 12 years
Posted on Oct 21 2024

About the Job

Skills

Devops
Kubernetes
Cloud
Ansible
Terraform
Jenkins

DevOps Engineer (Apps)

   

Role overview

 

We are looking for a DevOps Engineer (Apps) to join our team with 7-10 years of experience, focusing on managing hyperscale AI Cloud-based applications. These applications are designed to support millions of users and handle hexa-bytes of data across distributed systems. The engineer will play a key role in ensuring availability, scalability, and high performance of one of the largest cloud-hosted infrastructures in the industry. This requires expertise in Kubernetes, automation, and infrastructure management to maintain seamless operations and uptime.

Our product allows users to store, sync, share, and stream content, including photos, videos, music, and documents across a wide array of devices, such as smartphones, tablets, TVs, gaming consoles, and even cars.

 

Key Responsibilities:


·      Hyperscale Application Support: Oversee and ensure scalability of our application infrastructure to manage millions of users and massive data loads. Expertise in Kubernetes and AI tools is critical to maintain microservice orchestration, high availability, and disaster recovery across multi-node environments.

Provides technical expertise and takes responsibility to enable the design, planning, implementation, integration, configuration and production support of infrastructure services across the workspace

·      Automation & Infrastructure Management: Develop and implement automation strategies using tools such as Terraform, Ansible, Jenkins, and scripting languages like Python, Go, Bash. These processes will streamline deployment pipelines and ensure swift recovery during incidents.

·      Kubernetes & Containerization: Manage containerized workloads and microservices in Kubernetes to handle the demands of a hyperscale infrastructure. Knowledge of tools such as Helm charts, Linkerd/Istio, and Docker is essential to facilitate continuous integration and delivery.

·      High Availability & Performance Optimization: Establish monitoring routines and automated failover mechanisms to ensure our infrastructure remains operational 24x7 with minimal downtime, even under high traffic. Experience in managing and scaling clusters in Kubernetes for high availability is a must.

·      Automation of Monitoring & Alerts: Utilize monitoring tools such as Prometheus, Grafana, and EFK Stack (ElasticSearch, Fluentd, Kibana) to create automated alerts and observability. Ensure seamless monitoring and troubleshooting of system issues in real time. Uses tools to determine infrastructure component’s utilization and performance statistics with regards to agreed SLA

·      Cloud & Virtualization: Deep experience in cloud infrastructure, particularly with OpenStack, Azure, or GCP, and working knowledge of setting up and managing virtualized environments at scale.


Skills & Requirements:

  • Strong experience in Kubernetes orchestration and management for large-scale, distributed applications.
  • Expertise in automation tools like Ansible, Terraform, Jenkins, and proficiency in scripting with Python, Go, Bash.
  • Solid understanding of cloud-native applications, microservices architecture, and infrastructure automation.
  • Experience in hyperscale system design and high availability architectures with 24x7 operational support.
  • Proficiency in middleware technologies such as Haproxy, Redis, RabbitMQ, Tomcat, and APIs for seamless integration between components.
  •  Ability to manage high-traffic, performance-sensitive applications and collaborate across teams to ensure consistent support and uptime.
  • Develop and maintain technical documentation.
  • Develop System Architecture and system design documentation.

Profile:

·      Education: B.Tech or MS degree in Computer Science or a related technical field.

·      Experience: Previous experience in managing hyperscale systems, Multi Cluster

Kubernetes, and cloud-based infrastructure at scale is critical for success in this role.


 

About the company

We are the force behind the meteoric rise of Indias leading telecom operator Jio with 400 Million+ customers. In Addition to this we have also powered an exhaustive list of digital apps & services that have delivered functionality, usability, engagement, scale and loyalty. We provide solutions for customers (B2C) and enterprise (B2B). We have an end to end 5G solution consisting of 5G Radio, a com ...Show More

Industry

Media & Telecommunication...

Company Size

51-200 Employees

Headquarter

Navi Mumbai, Maharashtra

Other open jobs from Jio