FedNow Senior Site Reliability Engineer
Company | The Federal Reserve System |
---|---|
Location | Boston, MA, USA |
Salary | Not provided |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- Strong communication and collaboration skills
- Extensive knowledge and understanding of working in AWS environments & services (EC2, EBS, EKS, RDS, Aurora, S3, Route 53, ELB, IAM, etc.)
- Experience with HashiCorp Terraform, Consul, and Vault, as well as Ansible
- Automation experience, preferably using GitLab
- Experience with scripting languages, preferably Python, for automating processes
- Experience working in Linux environments and with shell scripting
- Experience supporting infrastructure for large multi-service applications
- Experience working with continuous deployment in microservices architectures
- Experience working with Docker, containers, ECR, and EKS
- Observability – CloudWatch, OpenSearch, Dynatrace, Grafana, Prometheus
- Familiarity with fault injection tooling (e.g., AWS Fault Injection Simulator, Gremlin, Chaos Toolkit, Chaos Monkey)
- Automation mindset to enable consistency and dependability in common actions
Responsibilities
- Operate the production environment for the FedNow program
- Architect, implement, and leverage solution monitoring and tooling for capacity planning, utilization reporting, and scaling
- Support Engineering, DevOps, and DevSecOps tools, services, and solutions
- Design and develop CI/CD and IaC Pipeline automation
- Manage resiliency, disaster recovery (DR), and business continuity planning (BCP), including testing
- Interface with internal stakeholders and customers for planning, delivery, and service management
- Own ongoing ITIL processes and implement continuous improvement initiatives
- Work closely with Engineers and Architects of the FedNow program to maintain seamless automation across the entire platform
- Proactively identify suspected gaps in system architecture and design experiments to expose them
Preferred Qualifications
No preferred qualifications provided.