[Remote] Software Development Engineer II, Cloud Platform
Note: The job is a remote job and is open to candidates in USA. Mapbox is the leading real-time location platform for a new generation of location-aware businesses. They are seeking a Software Development Engineer II to deliver cloud-native containerized infrastructure and deployment platforms, focusing on AWS resources and CI/CD pipelines.
Responsibilities
- Actively onboard AWS resources to the declarative gitops-based framework utilizing Terraform and Terragrunt
- Maintain and troubleshoot legacy cloud infrastructure in AWS that is deployed with Cloudformation/CDK and utilizes ECS, Lambda, EMR, etc
- Architect and promote Kubernetes deployments for new services
- Lead migration of deployment pipelines from ECS and Cloudformation to EKS and ArgoCD
- Architect a centralized CI pipelines framework utilizing GitHub Actions and Runs-on
- Broadly influence and lead the Mapbox Cloud Platform strategy around AWS architecture, open-source tools and frameworks
- Configure and maintain a comprehensive observability platform, such as Datadog or Observe, to enable real-time monitoring, alerting, and analytics
- Promote a culture of operational excellence by testing and monitoring our systems and code, and providing on-call support for the platform services
- Document your work and decision-making processes, and lead presentations and discussions in a way that is easy for others to understand
- Uphold a culture of collaboration, transparency, creativity, inclusion, and data-driven decisions
Skills
- 5+ years experience leveraging infrastructure-as-code frameworks to manage AWS infrastructure using Terraform, Terragrunt, Atlantis, CDK
- 4+ years experience orchestrating containerized workloads at scale using EKS, ECS
- 4+ years experience managing scalable CI/CD frameworks in a distributed engineering organization using Github Actions
- Strong expertise with Kubernetes, ArgoCD, Istio
- Proven ability to design and develop cost efficient, secure, and durable solutions on AWS using EKS, ECS, EC2, Lambda, Fargate, CloudFront, IAM, Route53, DynamoDB
- Proficient in at least one programming language, such as Python, Nodejs, GoLang
- Experience configuring and managing observability systems in a distributed large-scale environment using Datadog, CloudWatch, or similar
- Experience with incident response practices including blameless post-mortems and resilience engineering concepts
- A desire to share your expertise through documentation, mentorship, and both written and vocal discussion
- Ability to work asynchronously and independently with minimal supervision, lead by example, and make decisions based on priorities and business goals
Benefits
- Supportive health care
- Parental leave
- Flexibility for the things that come up in life
- Innovating on how we think about supporting our people
- An environment of teaching and learning to equip employees with the tools needed to be successful in their function and the company
Company Overview
Apply To This Job