Anthropic

Safeguards Policy Analyst, Fraud & Scams

contentfull-timeRemote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY

SALARY

Not listed

WORK TYPE

remote

JOB TYPE

full-time

INDUSTRY

Apply for this position

✦ AutoApply Let us apply to roles like this on your behalf.

Learn more

About the role

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. The team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

As a Safeguards Policy Analyst focused on Fraud & Scams, you will be responsible for designing, building, and executing enforcement workflows that detect and mitigate fraud and scam-related harms on Anthropic's products. You will serve as the subject matter expert on fraud typologies, scam ecosystems, and the threat actors who perpetrate them — translating that expertise into durable and scalable policies.

This role sits within the Integrity & Authenticity (I&A) team. You will function both as a policy owner and work closely with threat investigative and enforcement teams. You will develop the guidelines that power classifiers and serve as a point of contact for cross-functional workstreams.

Important context: In this position you may be exposed to and engage with explicit content spanning a range of topics, including those of a financial, psychological, or otherwise disturbing nature, including detailed fraud schemes and scam content.

Policy Design & Ownership

Draft, maintain, and iterate on Fraud & Scams policies governing Anthropic's products and APIs, with clarity for both model enforcement and human reviewers
Conduct regular structured policy reviews to identify gaps, ambiguities, and coverage failures, and lead the process to close them
Develop detailed threat models for fraud and scam vectors — including social engineering, financial fraud, impersonation scams, phishing, and AI-enabled fraud — and translate these into enforceable policy language
Stay current on the fraud and scam landscape, including emerging typologies, regulatory shifts, and threat actor tactics, techniques, and procedures (TTPs)

Enforcement Strategy & Operations

Design and architect automated enforcement systems and human review workflows that scale effectively while maintaining high precision and recall
Review flagged content to drive enforcement decisions and surface policy improvements grounded in real-world cases
Define and manage precision/recall tradeoffs in enforcement, working with data science teams to continuously tune classifiers and detection signals
Build and maintain an effective feedback loop between threat intelligence, policy, and enforcement operations to ensure timely response to novel and evolving fraud threats

✦ Let us apply for you

We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $15/mo. Cancel anytime.

Get AutoApply

Apply now