← Back to jobs
Nebius
Nebius

Senior ML Engineer (Token Factory)

engineeringfull-timeGermany; Israel; Netherlands; Prague, Czech Republic; Remote - Europe; United Kingdom
SALARY
Not listed
WORK TYPE
remote
JOB TYPE
full-time
INDUSTRY
ai
Apply for this position
✦ AutoApply Let us apply to roles like this on your behalf.
Learn more

About the role

The role

Token Factory is a part of Nebius Cloud, one of the world’s largest GPU clouds, running tens of thousands of GPUs. We are building an inference & fine-tuning platform that makes every kind of foundation model — text, vision, audio, and emerging multimodal architectures — fast, reliable, and effortless to train & deploy at massive scale.

Some directions we currently working on and which you can be a part of:

  • Advanced Fine-Tuning: Enhancing fine-tuning methodologies - both LoRA-based and full-parameter - for cutting-edge LLMs (e.g., GPT-OSS, Kimi K2.5, DeepSeek V3.1/V3.2, GLM-4.7), focusing on both model quality and training efficiency.

  • Inference Optimization: Identifying LLM inference bottlenecks to drive production speedups. This involves building model training and evaluation pipelines in JAX for speculative decoding, experimenting with architectures (dense/MoE, auto-regressive/parallel), and deriving scaling laws to guide resource allocation.
  • Low Precision Training & Inference: Investigating low-precision (FP8, NVFP4/MXFP4) methodologies for supervised fine-tuning and reinforcement learning - spanning both inference and training - optimized for modern hardware

We expect you to have:

  • A profound understanding of theoretical foundations of machine learning and reinforcement learning.

  • Deep expertise in modern deep learning for language processing and generation

  • Experience with training large models on multiple computational nodes

  • Reasonable understanding of performance aspects of large neural network training (sharding strategies, custom kernels, hardware features etc.)

  • Strong software engineering skills (we mostly use Python)

  • Deep experience with modern deep learning frameworks (we use JAX)

  • Proficiency in contemporary software engineering approaches, including CI/CD, version control and unit testing

  • Strong communication and leadership abilities

Nice to have:

✦ Let us apply for you
We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $14.99/mo. Cancel anytime.
Join waitlist
Apply now
Senior ML Engineer (Token Factory) at Nebius — Remote