Nebius

Staff / Principal Applied AI Researcher (Agentic Search)

engineeringfull-timeAmsterdam, Netherlands; London, United Kingdom; Remote - Europe; Remote - United States

SALARY

Not listed

WORK TYPE

remote

JOB TYPE

full-time

INDUSTRY

Apply for this position

✦ AutoApply Let us apply to roles like this on your behalf.

Learn more

About the role

About Nebius

Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure.

Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI.

Role Overview

We are seeking a Staff or Principal Applied AI Researcher to join a fast growing team building an agent native search platform - the web access layer for AI systems.

You can think of this as Google for AI agents: a system designed for machines, not humans. We are building agentic search, where AI systems actively plan, retrieve, evaluate, and refine information rather than simply returning results. As AI becomes the primary interface to the web, this layer will replace the role of traditional search engines.

We are designing how AI agents - not humans - retrieve, evaluate, and reason over web data in real time, under strict latency and reliability constraints. This means solving retrieval and ranking under entirely new access patterns and at significant scale, with systems operating over constantly changing, unstructured data and serving tens of thousands of production workloads 24 by 7.

This role comes with ownership over key parts of our applied AI research direction and system design, with a strong expectation of defining new approaches and shipping measurable impact in production.

What you'll work on

Designing agent native retrieval systems optimised for machine consumption rather than human search UX
Building systems where LLMs iteratively plan, query, refine, and reason over results
Developing ranking and retrieval approaches for multi step, agent driven workflows under real world constraints

Your responsibilities

Drive applied research and technical direction across retrieval and ranking systems
Design and evolve multi stage retrieval architectures (query understanding, rewriting, reranking, iterative retrieval)
Develop methods for grounding LLMs in real time web data at scale
Define and implement new evaluation paradigms and metrics for agentic systems, where correctness is not reducible to clicks
Lead experimentation on modern retrieval approaches (embeddings, hybrid search, reranking) and bring them into production
Analyse trade-offs across relevance, latency, and cost at scale
Work closely with engineering to deploy systems in high throughput, low latency environments

✦ Let us apply for you

We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $15/mo. Cancel anytime.

Get AutoApply

Apply now