Site Reliability Engineer (SRE)
Posted on February 13, 2025
Job Description
- Job Title: Site Reliability Engineer (SRE)
- Experience Required: 5+ years
- Location: Noida (Remote)
- We are looking for an experienced Infrastructure Site Reliability Engineer (SRE) to join our team. This role involves managing and optimizing infrastructure with a primary focus on Kafka, OpenSearch, and multi-cloud environments.
- Key Responsibilities:
- Oversee and scale Kafka clusters, ensuring high performance and reliability.
- Configure and optimize OpenSearch clusters to maintain availability and fault tolerance.
- Deploy and manage Kubernetes clusters on AWS EKS, ensuring security and efficiency.
- Design and maintain infrastructure across multi-cloud environments with a focus on security and cost-effectiveness.
- Automate CI/CD pipelines while adhering to industry best practices.
- Required Skills & Experience:
- Strong expertise as an SRE or DevOps Engineer, with a deep understanding of CI/CD principles.
- Hands-on experience with Jenkins for building and managing CI/CD pipelines, including blue-green deployment strategies.
- Solid experience with AWS and Azure cloud services such as AKS, EC2, and S3.
- Proficiency in containerization technologies, including Docker and Kubernetes.
- Experience working with microservices architecture, along with knowledge of associated challenges and best practices.
- Strong grasp of monitoring and observability tools and techniques.
- Excellent problem-solving and troubleshooting abilities.
- Proficiency in scripting languages like Bash and Python for automation and tooling.
Required Skills
ci/cd
aws
docker and kubernetes.