
Senior Site Reliability Engineer, Environment Automation at GitLab
Job Description
Location: Remote, Australia
Overview
GitLab is seeking a Senior Site Reliability Engineer (Environment Automation) to keep user-facing services and production systems reliable, scalable, and efficient. This role focuses on automating the lifecycle of many tenant environments, ensuring they remain secure, consistent, and reliable at scale. You will combine software engineering practices with a pragmatic operations mindset to drive automation, reduce toil, and improve resilience across the platform.
What you will do
- Design and implement automation that provisions and manages multi-tenant GitLab environments using IaC patterns and tools such as Terraform, Ansible, and Kubernetes.
- Create and maintain deployment packages for GitLab, for example Helm charts and omnibus-gitlab bundles.
- Build and operate Dedicated GitLab instances integrated with cloud-native services on providers like GCP and AWS.
- Develop tools to orchestrate infrastructure-as-code workflows across multiple tenants and automate version upgrades, configuration rollouts, and provisioning pipelines.
- Deploy and manage microservices on Kubernetes clusters at scale and enhance observability using tools like Prometheus and ELK.
- Troubleshoot production issues across Kubernetes clusters and cloud services, lead incident response, and drive postmortem and remediation efforts.
- Architect automation and operational patterns that scale, and collaborate with engineering teams to improve platform resilience and production readiness.
What you'll bring
- Proven experience operating and troubleshooting production workloads across many tenants or environments, with deep knowledge of distributed system failure modes and resilience strategies.
- Hands-on mastery of Terraform or other infrastructure-as-code tooling, including state management and workspace strategies.
- Production Kubernetes experience, including debugging deployments, pod failures, scheduling issues, and rollout strategies.
- Ability to read and debug code in Go and/or Ruby, and contribute to infrastructure tooling and automation codebases.
- Experience working with cloud provider ecosystems, including IAM, networking, and storage services on AWS or GCP.
- Operational background supporting large-scale infrastructure and observability systems, with on-call and incident leadership experience.
- Strong collaboration skills to engage with internal and external customers, and to drive solutions across teams.
- Familiarity with GitLab as a platform for automation and collaboration is advantageous.
Preferred and complementary skills
- Experience with configuration management and templating tools, such as Ansible or Jsonnet.
- Background improving observability stacks and capacity planning using Prometheus, ELK, Grafana, or similar tools.
- Strong Linux skills and comfort operating in cloud-native environments.
Company and hiring notes
GitLab is an open-core company building an AI-powered DevSecOps platform used by thousands of organizations. The company hires globally and many roles are remote, though some positions carry location-based eligibility. Review the Recruitment Privacy Policy for details on how personal data is handled. GitLab is an equal opportunity employer and provides accommodations during recruitment on request.
How to apply
Apply via the job listing URL provided in the posting. The application form will accept standard candidate details and attachments. Recruitment and privacy notices apply.
About GitLab
GitLab is a fully remote company that provides a comprehensive DevSecOps platform, enabling organizations to deliver software faster and more securely. Founded in 2011, GitLab serves over 50 million registered users globally, including more than 50% of Fortune 100 companies.