Posted in

Safeguards Policy Analyst

Safeguards Policy Analyst

CompanyAnthropic
LocationSan Francisco, CA, USA, New York, NY, USA
Salary$170000 – $200000
TypeFull-Time
Degrees
Experience LevelMid Level, Senior

Requirements

  • Experience establishing and scaling policy enforcement, and review workflows
  • Written and improved policies for tech products and platforms
  • Excellent written and verbal communication skills, with the ability to explain complex policy topics to various audiences
  • Used SQL and/or other data analysis tools to draw insights from large datasets
  • Identified emerging risks and threat actors, and provided feedback to a diverse sets of stakeholders, such as Product, Policy, Engineering, and Legal teams
  • Worked with generative AI products, including writing effective prompts for content review and enforcement
  • An understanding of the challenges that exist in implementing product policies at scale, including in the content moderation space
  • Experience as a trust & safety professional or subject matter expert working in one or more of the following focus areas: elections, influence operations, or fraud and abuse

Responsibilities

  • Design and architect automated enforcement systems and review workflows that scale effectively while maintaining high accuracy
  • Partner with Product, Engineering, and Data Science teams to optimize detection models for policy violations and automated enforcement systems
  • Review flagged content to drive enforcement and policy improvements
  • Work with external experts to gather feedback on policy, product interventions, and harm mitigations
  • Enforce usage policies with a focus on detecting and mitigating potential harmful use of AI systems
  • Support the Safeguards policy design team by providing detailed feedback on policy gaps based on real enforcement scenarios
  • Keep up to date with emerging AI policy enforcement best practices, and use these to inform our decision-making and workflows

Preferred Qualifications

    No preferred qualifications provided.