Sr. SRE Application Engg

Bengaluru
Hybrid
Senior: 7 to 10 years
22L - 35L (Per Year)
Posted on Jul 02 2024

About the Job

Skills

Scripting (Python, Bash)
Cloud Services (AWS, GCP, Azure)
Observability Tools
Linux Systems Administration
Monitoring and Alerting
Containerization (Docker, Kubernetes)
CI/CD Pipelines
Networking Fundamentals

Responsibilities:

  • Design and implement automation solutions for infrastructure provisioning and configuration management, promoting consistency and reliability across environments.
  • Maintain CI/CD pipelines using Jenkins, ensuring efficient deployment processes.
  • Manage applications using Docker and Kubernetes, focusing on scalability, efficiency, security, and integrated quality checks.
  • Design and maintain secure, scalable, and resilient cloud infrastructure on AWS, including performance tuning and cost optimization.
  • Conduct comprehensive Linux system administration, including performance tuning, security hardening, and troubleshooting.
  • Develop and maintain Java and Python code to automate tasks and integrate systems, enhancing operational efficiency.
  • Monitor system performance, identify bottlenecks, and implement solutions to ensure high availability and optimal user experience.
  • Lead incident response efforts, minimizing impact and conducting post-mortem analyses to prevent future occurrences.
  • Mentor junior team members and contribute to the development of best practices and standards within the SRE team.

Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or related field preferred.

Skills:

  • Must Have:
  • Minimum 8 years of hands-on experience in Site Reliability Engineering (SRE).
  • Extensive experience with AWS services: EC2, EKS, RDS, S3, Lambda, load balancers, IAM, VPC.
  • Configuration management and IaC tools: Ansible/Terraform.
  • Scripting: Java/Python/Shell.
  • Hands-on experience with CI/CD pipelines.
  • Incident management and issue debugging.
  • Good to Have:
  • AIOps Knowledge: Familiarity with AIOps concepts and tools such as machine learning, anomaly detection, and predictive analytics applied to infrastructure monitoring and management.
  • Telecom Domain Experience: Exposure to the telecommunications industry, including knowledge of networking protocols, telecommunications infrastructure, and service delivery platforms.
  • OTT (Over-the-Top) Domain Experience: Understanding of Over-the-Top services and platforms, including streaming media, content delivery networks (CDNs), and video-on-demand (VOD) services.

About the Role:

The Senior SRE will be responsible for leading initiatives to improve system reliability, automate operational processes, and ensure the scalability and security of our systems. The ideal candidate will have a strong background in Linux systems, cloud technologies, containerization, and automation, along with a proactive approach to problem-solving and a commitment to continuous improvement.


About the company

Taggd is a digital recruitment platform. We serve as a recruitment process outsourcer for a range of clients, working hands-on in their recruitment process from end to end. We offer candidates a faster route to updates, interview schedules, and offer releases.

Industry

Recruitment

Company Size

501-1,000 Employees

Headquarters

Bangalore
