Skip to content

IT Ops Specialist – Google Cloud Platform Support
Company | TD Bank |
---|
Location | Toronto, ON, Canada |
---|
Salary | $91200 – $136800 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s, Master’s |
---|
Experience Level | Senior, Expert or higher |
---|
Requirements
- Current or prior experience supporting GKE (Google Kubernetes Engine) – MUST HAVE
- Current or prior experience supporting GCP (Google Cloud Platform) – MUST HAVE
- Strong to expert knowledge of supporting GCP including GKE workloads
- Familiar with supporting GCP services such as BigQuery, Cloud SQL (SQL/PostgreSQL), REDIS, Cassandra, BigTable, Cloud Filestore, Persistent Storage, Apigee, Kafka, Dataflow, GCS.
- Thorough problem determination skills to troubleshoot and resolve business application issues.
- Knowledge with OS technologies (Windows, RedHat Linux).
- Familiar with CI/CD tools such as Github, Jenkins, etc.
- DevOps and Agile understanding.
- Working knowledge of Local Area Networks (LAN) and Wide Area Networks (WAN).
- Knowledge of scripting languages and tools such as Python, JavaScript, Powershell, Bash.
- The successful applicant must have a solid understanding of incident, change, and problem management methodologies as well as solid experience in a large, high-performance production environment.
- Undergraduate degree or Technical Certificate
- Graduate degree, preferred
- 7+ years relevant experience
Responsibilities
- Provide 24/7 support of our Google Cloud Platform (GCP) environment, focusing on operational support and optimizing its health and performance.
- Oversee the management and security of our containerized applications.
- Ensure efficient resource allocation and adherence to best practices for container deployments.
- Monitor the performance and availability of GCP at a platform level, proactively identifying and resolving any potential issues.
- Responsible for DEV to PROD GCP Cloud PaaS/IaaS support and processes to ensure quality, performance, and availability of Public Cloud services (GCP).
- Provide planning, communication, and reporting of day-to-day ticket metrics and longer-term tactical objectives.
- Level 2 support of TD business line GCP Cloud infrastructure including PaaS/IaaS/Containers across all production and test environments.
- Manage non-standard/complex P1, P2 (major incidents), and P3 and P4 incidents and service requests.
- Ensure customer service satisfaction and enable continuous improvements.
- Oversee higher complexity operational and preventive maintenance tasks.
- Manage complex remedial and unscheduled urgent changes.
- Drive root cause analysis on repeatable incidents to help prevent issues in the future.
- Creation of support documentation and scripts.
- Oversee vendor’s service delivery and escalation.
- Provide operational consultancy for future-state technologies.
- Prioritize activities to align with compliance, regulatory requirements, and business objectives.
Preferred Qualifications
- Experience and knowledge supporting an Azure Public Cloud environment (while not necessary) would be valuable.