Posted in

Senior Manager Application Integration – SRE

Senior Manager Application Integration – SRE

CompanyDiscover
LocationHouston, TX, USA, Wheeling, IL, USA
Salary$129000 – $217400
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior

Requirements

  • Bachelors – Information Technology or related
  • 6+ Years – Payments and/or equivalent technology industry, or related experience
  • In lieu of education, 8+ Years – Payments and/or equivalent technology industry, or related experience

Responsibilities

  • Responsible for managing a team that oversees and executes the construction, integration, overall resiliency, release management, and ongoing operations for applications.
  • Manages application connectivity and data flows from customer entry point and surrounding technology components.
  • Develops and coach teams to be able to identify, manage, and escalate risk, and effectively manages risk within the teams you oversee.
  • Manages a technology team. Leads team and peers to ensure capacity and performance management is compliant.
  • Oversees application release management.
  • Manages change control for applications and platforms.
  • Manages disaster recovery and compliance plans.
  • Leads business and budget planning to align with product roadmaps.

Preferred Qualifications

  • 8+ Years – Payments and/or equivalent technology industry, or related experience
  • Strong understanding of distributed systems, cloud architecture and modern application development with experience in troubleshooting production issues.
  • Proven ability to lead major incident bridges, drive root cause analysis and coordinate cross functional teams to quickly restore service.
  • Hands on experience with monitoring, observability tools used in high availability production environments.
  • Expertise in 24×7 production support and ability to own uptime & performance SLA’s for large scale distributed systems
  • Ability to assess complex technical issues, identify risks, and implement effective solutions under pressure.
  • Ability to identify issues, drive post incident learnings and lead initiatives that enhance system reliability and streamline operations.
  • Expertise in one or more general purpose programming languages: Python, Go, shell scripting (Unix/Linux), Java
  • Expertise in container technology (OpenShift, Kubernetes), hybrid cloud and AWS.
  • Experience in monitoring tools and log analysis tools to manage operations