Senior Site Reliability Engineer
Posted on March 10, 2025
Job Description
- The Role:
- Night shift-6:30 - 3:30
- Looking for a Senior Site Reliability Engineer in Dublin to collaborate with Software Engineering teams located primarily in our Dublin office. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform, and have strong experience running and securing workloads at scale on Google�s Kubernetes Engine.
- The Team:
- Join our Site Reliability Engineering team and make an impact helping all engineering teams become as efficient and secure as possible.
- You will be part of a high performing and versatile SRE team that handles many aspects of modern cloud software deployment. We cooperate closely with Software Engineers to provide 3 main functions: improving the reliability of any service on our cloud, continually monitoring and advancing our security posture for these services, and increasing the efficiency of our deployment pipelines and infrastructure builds.
- you will work within our SRE team on team led initiatives, but also be a part of engineering pods that are working on revenue producing features, helping them get their code to production, and help it to perform at its best.
- Responsibilities:
- ? Join the efforts of our Site Reliability Engineering team, establish best practices, and help evolve the SRE culture
- ? Help our teams build and improve our Google Cloud infrastructure
- ? Collaborate closely with Software Engineers to get applications deployed, scaled properly, and to use the right tool for the job whether the solution should be serverless or containerized in a Kubernetes service
- ? Secure and instrument the Kubernetes cluster, the container, and the cloud resources utilized
- ? Enjoy a fast paced environment that challenges you to adapt to change quickly
- ? Focus on solving the problem with simple, concise, maintainable, transparent techniques
- ? Implement revenue producing and cost conserving features plus discover and contribute your own
- ? Ability to work independently, but also learn from and mentor other team members
- What you bring to the table:
- ? 5+ years of experience as a Site Reliability Engineer/Cloud Engineer/Software Engineer
- ? Extensive experience using cloud resources and building infrastructure on Google Cloud Platform using Terraform
- ? Experience configuring and deploying containerized workloads on Kubernetes, securing and monitoring them, and troubleshooting the issues that may arise
- ? Experience building and troubleshooting containers
- ? Fluent in Python, Shell, or Go
- ? Excellent grasp of CS fundamentals, the Linux operating system, and common GNU/Linux tools
- ? BS/MS in CS/EE or equivalent experience.
- Bonus Points:
- ? Experience working closely with software engineering teams in an effort to accelerate output
- ? Experience contributing to or writing and enforcing SOC2 policies
- ? Experience with distributed systems and highly parallel processes
- ? Experience building CI/CD pipelines for and with containers
- ? Experience deploying gRPC services and making them available securely for public consumption
- ? Experience with Datadog, Codefresh, or Jenkins
- Perks:
- ? Culture: Strong core business values, focus on teamwork, vibrant, social and fun environment
- ? Opportunities: Be part of a growing team with training and support to help you grow
- ? Half-Day, No Meeting Fridays: Get work done without interruption, then get a head start on your weekend
- ? Work-Life Balance: Unlimited vacation days and flexible working hours in a fairly decentralized workforce
- ? Give Back: We give 40 hours a year to volunteer and organize office volunteer programs with local organizations
- ? Competitive Localized Benefits
- ? Ownership: Lead creative and challenging projects
- ? IATA travel discount allowing discounts in hotel, resort, and airline reservations
Required Skills
experience with datadog
codefresh
or jenkins
ci/cd pipelines