Sr Cloud Infrastructure Engineer
Company | The Walt Disney Company |
---|---|
Location | Seattle, WA, USA, Orlando, FL, USA, Glendale, CA, USA, Anaheim, CA, USA |
Salary | $138800 – $195000 |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- Strong proficiency with cloud-based Kubernetes architectures on at least two of AWS EKS, Azure AKS, and GCP GKE
- Strong proficiency with networking and cloud network architecture including, HTTP, TCP/IP, DNS (AWS Route 53, Azure Resolver), subnetting, VPC, gateways, firewalls
- Demonstrated ability to create cloud architectures from greenfield through production, going through proof of concept, documentation, and building the entire lifecycle of an architecture
- 5+ years progressive experience building enterprise solutions to support modern, cloud-based products and services
- Strong interpersonal, organizational, and communication skills
- Able to resolve matters/issues in a positive manner
- Ability to create concise and accurate documentation for Level 1 and two staff for the resolution of simple to complex incidents/issues
- Demonstrated inclusive leadership that embraces diversity
- Ability to successfully operate in a highly matrixed organizational system where partnership and influence are key drivers of success
- Ability to initiate change and act with integrity when tough decisions must be made
- Demonstrated experience with software development lifecycle methodologies such as Agile/Scrum
- Proven experience with system analysis and design, development, and testing
- Demonstrated strong analytical and problem-solving skills to achieve business results
- Ability to manage and prioritize multiple projects simultaneously
- Excellent organizational, communication and time management skills
Responsibilities
- Lead the design, testing, and implementation of container orchestration platforms such as Kubernetes and AWS ECS
- Develop and implement infrastructure-as-code (IaC) using Terraform
- Design, testing, and implementation of load balancer solutions including setting up pools, VIPs, layer 7 routing, debugging
- Lead the design, testing, and implementation of OS images including performance monitoring, setup, configuration, tuning, and troubleshooting
- Use scripts and tools built by others, including the ability to troubleshoot or debug issues with these tools
- Interpret error messages from scripts, tools, and applications to identify root cause
- Author and update moderately complex scripts to automate repeatable production tasks (using scripting languages like Bash, PowerShell) and have basic skills in at least one or more programming language (e.g. Python, Go-lang, Java, JavaScript/TypeScript/Node.js)
- Independently troubleshoot complex issues and pass this knowledge on to operational teams
- Present issues to management as well as peers, both written and verbally, in a concise fashion
- Receive feedback in a constructive manner and consistently apply it to tasks
- Create system and production documentation, adhering to organization standards
- Lead the evaluation of technology solutions through research and lab work
- Engage with our customers to hear their needs, collect feedback, and feed that back into tangible solutions
- Drive adoption of solutions through relationship building, technical sharing, and collaboration
- Ensure deliverables across engineering teams are of high quality and clearly documented
- Challenge the status quo through intellectual curiosity and natural inquisitiveness to look beyond the obvious for continuous improvement opportunities backed with factual arguments
- Work collaboratively with local and remote team members
- Have ownership over your projects and provide appropriate status to leadership on progress and key decisions
- Provide thought leadership, problem solving and analytical skills to solve hard-to-solve production issues impeding the availability & performance of applications
- Provide level four escalation support to operations partners
Preferred Qualifications
- Demonstrated experience with IDP (Internal Developer Portal) deployment
- Demonstrated experience with service mesh deployed multi-cloud and multi-region, ideally Istio
- Project or team leadership