Ingénieur fiabilité de site – SRE – / Site Reliability Engineer – SRE
Company | RTX |
---|---|
Location | Longueuil, QC, Canada |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Mid Level, Senior |
Requirements
- Bachelor’s degree in computer science, Information Technology, or a related field.
- Several years of hands-on experience in a DevOps or related role.
- Proficiency with cloud platforms like AWS, Azure, or Google Cloud.
- Strong understanding of CI/CD pipelines and tools like Jenkins, GitLab, or CircleCI.
- Experience with containerization and orchestration technologies like Docker and Kubernetes.
- Strong scripting skills with languages such as Python, Bash, or PowerShell.
- Knowledge of infrastructure as code (IaC) tools like Terraform, Ansible, or CloudFormation.
- Experience with monitoring and alerting tools like Prometheus, Grafana, or Datadog.
Responsibilities
- Design, implement, and manage CI/CD pipelines to automate the deployment process.
- Collaborate with development teams to create streamlined release management processes.
- Monitor, troubleshoot, and optimize infrastructure performance and reliability.
- Implement and maintain infrastructure as code (IaC) using tools like Terraform or Ansible.
- Manage cloud infrastructure (AWS, Azure, GCP) and ensure high availability and scalability.
- Set up monitoring, logging, and alerting for production systems.
- Work closely with the security team to ensure that infrastructure is secure and compliant.
- Support disaster recovery and backup strategies for critical systems.
- Stay up to date with the latest DevOps trends, tools, and best practices.
Preferred Qualifications
- Excellent problem-solving abilities and a detail-oriented mindset.
- Strong collaboration and communication skills.
- Experience with microservices architecture.
- Familiarity with security best practices and tools.
- Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, or equivalent.