Safeguards Policy Analyst
Company | Anthropic |
---|---|
Location | San Francisco, CA, USA, New York, NY, USA |
Salary | $170000 – $200000 |
Type | Full-Time |
Degrees | |
Experience Level | Mid Level, Senior |
Requirements
- Experience establishing and scaling policy enforcement, and review workflows
- Written and improved policies for tech products and platforms
- Excellent written and verbal communication skills, with the ability to explain complex policy topics to various audiences
- Used SQL and/or other data analysis tools to draw insights from large datasets
- Identified emerging risks and threat actors, and provided feedback to a diverse sets of stakeholders, such as Product, Policy, Engineering, and Legal teams
- Worked with generative AI products, including writing effective prompts for content review and enforcement
- An understanding of the challenges that exist in implementing product policies at scale, including in the content moderation space
- Experience as a trust & safety professional or subject matter expert working in one or more of the following focus areas: elections, influence operations, or fraud and abuse
Responsibilities
- Design and architect automated enforcement systems and review workflows that scale effectively while maintaining high accuracy
- Partner with Product, Engineering, and Data Science teams to optimize detection models for policy violations and automated enforcement systems
- Review flagged content to drive enforcement and policy improvements
- Work with external experts to gather feedback on policy, product interventions, and harm mitigations
- Enforce usage policies with a focus on detecting and mitigating potential harmful use of AI systems
- Support the Safeguards policy design team by providing detailed feedback on policy gaps based on real enforcement scenarios
- Keep up to date with emerging AI policy enforcement best practices, and use these to inform our decision-making and workflows
Preferred Qualifications
-
No preferred qualifications provided.