Posted in

Lead – Platform Health

Lead – Platform Health

CompanyKargo
LocationNew York, NY, USA
Salary$100000 – $160000
TypeFull-Time
Degrees
Experience LevelSenior, Expert or higher

Requirements

  • 8+ years of experience in technical support, operations, or customer success management
  • Proven track record of building and leading high-performing, globally distributed teams
  • Expertise in service-level management, incident triage, and root cause analysis
  • Strong understanding of cost optimization strategies and resource allocation models
  • Excellent problem-solving, analytical, and decision-making skills
  • Passion for talent development and building sustainable career paths
  • Experience navigating and influencing cross-functional stakeholders
  • Familiarity with the ad-tech industry and Kargo’s products and services is a plus

Responsibilities

  • Design and implement robust monitoring frameworks across critical platform metrics
  • Develop comprehensive dashboards and alerts for key performance indicators (KPIs) including: Response rates, Win rates, Margin performance, Revenue fluctuations, Latency and performance metrics
  • Create early warning systems to detect anomalies and potential performance degradations
  • Establish standardized protocols for investigating and responding to platform health incidents
  • Conduct deep-dive root cause analyses for performance drops or significant metric deviations
  • Develop and maintain incident response playbooks
  • Coordinate cross-functional teams during critical platform health events
  • Track, document, and report on incident patterns and resolution strategies
  • Define and document platform health standards and performance benchmarks
  • Create comprehensive guidelines for metric thresholds and acceptable performance ranges
  • Design automated and manual mechanisms for ongoing platform health assessment
  • Develop predictive models to anticipate potential performance issues before they escalate
  • Generate periodic platform health reports with actionable insights
  • Recommend architectural improvements, optimization strategies, and risk mitigation approaches
  • Collaborate with engineering, product, and business teams to align platform health initiatives
  • Develop simulation and stress testing methodologies to proactively assess platform resilience
  • Select, implement, and optimize monitoring and observability tools
  • Build custom scripts and integrations to enhance platform health tracking
  • Maintain and evolve the platform’s health monitoring infrastructure
  • Evaluate and introduce new technologies for improved platform visibility

Preferred Qualifications

  • Familiarity with the ad-tech industry and Kargo’s products and services is a plus