Skip to content

Lead – Platform Health
Company | Kargo |
---|
Location | New York, NY, USA |
---|
Salary | $100000 – $160000 |
---|
Type | Full-Time |
---|
Degrees | |
---|
Experience Level | Senior, Expert or higher |
---|
Requirements
- 8+ years of experience in technical support, operations, or customer success management
- Proven track record of building and leading high-performing, globally distributed teams
- Expertise in service-level management, incident triage, and root cause analysis
- Strong understanding of cost optimization strategies and resource allocation models
- Excellent problem-solving, analytical, and decision-making skills
- Passion for talent development and building sustainable career paths
- Experience navigating and influencing cross-functional stakeholders
- Familiarity with the ad-tech industry and Kargo’s products and services is a plus
Responsibilities
- Design and implement robust monitoring frameworks across critical platform metrics
- Develop comprehensive dashboards and alerts for key performance indicators (KPIs) including: Response rates, Win rates, Margin performance, Revenue fluctuations, Latency and performance metrics
- Create early warning systems to detect anomalies and potential performance degradations
- Establish standardized protocols for investigating and responding to platform health incidents
- Conduct deep-dive root cause analyses for performance drops or significant metric deviations
- Develop and maintain incident response playbooks
- Coordinate cross-functional teams during critical platform health events
- Track, document, and report on incident patterns and resolution strategies
- Define and document platform health standards and performance benchmarks
- Create comprehensive guidelines for metric thresholds and acceptable performance ranges
- Design automated and manual mechanisms for ongoing platform health assessment
- Develop predictive models to anticipate potential performance issues before they escalate
- Generate periodic platform health reports with actionable insights
- Recommend architectural improvements, optimization strategies, and risk mitigation approaches
- Collaborate with engineering, product, and business teams to align platform health initiatives
- Develop simulation and stress testing methodologies to proactively assess platform resilience
- Select, implement, and optimize monitoring and observability tools
- Build custom scripts and integrations to enhance platform health tracking
- Maintain and evolve the platform’s health monitoring infrastructure
- Evaluate and introduce new technologies for improved platform visibility
Preferred Qualifications
- Familiarity with the ad-tech industry and Kargo’s products and services is a plus