Anthropic
Anthropic

Safeguards Policy Analyst, Fraud & Scams

contentfull-timeRemote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY
SALARY
Not specified
WORK TYPE
remote
JOB TYPE
full-time
INDUSTRY
ai
Apply for this position →
✦ AutoApply — Let us apply to roles like this on your behalf.
Learn more →

About the role

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. The team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

As a Safeguards Policy Analyst focused on Fraud & Scams, you will be responsible for designing, building, and executing enforcement workflows that detect and mitigate fraud and scam-related harms on Anthropic's products. You will serve as the subject matter expert on fraud typologies, scam ecosystems, and the threat actors who perpetrate them — translating that expertise into durable and scalable policies.

This role sits within the Integrity & Authenticity (I&A) team. You will function both as a policy owner and work closely with threat investigative and enforcement teams. You will develop the guidelines that power classifiers and serve as a point of contact for cross-functional workstreams.

Important context: In this position you may be exposed to and engage with explicit content spanning a range of topics, including those of a financial, psychological, or otherwise disturbing nature, including detailed fraud schemes and scam content.

Policy Design & Ownership

  • Draft, maintain, and iterate on Fraud & Scams policies governing Anthropic's products and APIs, with clarity for both model enforcement and human reviewers
  • Conduct regular structured policy reviews to identify gaps, ambiguities, and coverage failures, and lead the process to close them
  • Develop detailed threat models for fraud and scam vectors — including social engineering, financial fraud, impersonation scams, phishing, and AI-enabled fraud — and translate these into enforceable policy language
  • Stay current on the fraud and scam landscape, including emerging typologies, regulatory shifts, and threat actor tactics, techniques, and procedures (TTPs)

Enforcement Strategy & Operations

  • Design and architect automated enforcement systems and human review workflows that scale effectively while maintaining high precision and recall
  • Review flagged content to drive enforcement decisions and surface policy improvements grounded in real-world cases
  • Define and manage precision/recall tradeoffs in enforcement, working with data science teams to continuously tune classifiers and detection signals
  • Build and maintain an effective feedback loop between threat intelligence, policy, and enforcement operations to ensure timely response to novel and evolving fraud threats
✦ Let us apply for you
We find roles like this and apply on your behalf. Cover letter written for each one. $14.44/mo.
Start AutoApply →
Apply now →