Lead impactful DevOps projects in a remote role with equity benefits. Collaborate on cutting-edge cloud infrastructure solutions across multi-cloud environments. Enhance your expertise in Kubernetes, GitOps, and observability systems.
Senior Devops Engineer
in Information Technology PermanentJob Detail
Job Description
Overview
- Design, deploy, and maintain Kubernetes clusters, ensuring lifecycle management, performance optimization, and capacity planning.
- Build and manage cloud infrastructure across GCP and AWS using Terraform and Terragrunt, adhering to infrastructure-as-code principles.
- Develop and optimize CI/CD pipelines using GitHub Actions and Flux CD for reliable GitOps-driven deployments.
- Create Python-based tools and automation for infrastructure and platform operations support.
- Troubleshoot and resolve operational, networking, pipeline, and infrastructure issues in multi-cloud environments.
- Implement and maintain monitoring, alerting, and observability solutions using Prometheus and Grafana.
- Ensure compliance with security, governance, and regulatory requirements, including classified environments.
- Collaborate with teams to gather requirements and translate them into reliable infrastructure solutions.
- Promote cloud-native best practices, infrastructure-as-code principles, and GitOps workflows across the organization.
Key Responsibilities & Duties
- Own cluster lifecycle management, including upgrades, scaling, and performance tuning.
- Develop and maintain CI/CD pipelines to enable reliable deployments of containerized applications.
- Automate repetitive workflows to reduce operational burden on the engineering team.
- Implement observability systems and ensure comprehensive monitoring and alerting.
- Collaborate with engineering teams to architect scalable and cost-efficient solutions.
- Troubleshoot and resolve issues in multi-cloud environments.
- Ensure adherence to security and governance requirements.
- Document workflows and processes to enhance organizational knowledge.
- Support 24/7/365 production services and ensure operational reliability.
Job Requirements
- Bachelor of Science degree in a relevant field.
- Active U.S. security clearance is mandatory.
- 6+ years of professional experience in DevOps, SRE, or Platform Engineering.
- Proficiency in Kubernetes, GCP, AWS, Terraform, and Terragrunt.
- Hands-on experience with CI/CD pipelines using GitHub Actions and Flux CD.
- Strong scripting skills in Bash and Python.
- Experience with Prometheus and Grafana for monitoring and observability.
- Familiarity with service mesh technologies like Istio or Linkerd is preferred.
- Excellent organizational and documentation skills.
- ShareAustin: