Fireworksai

AI Field Engineer - Enterprise

engineeringfull-timeNew York, NY; Remote, USA; San Mateo, CA

SALARY

Not listed

WORK TYPE

remote

JOB TYPE

full-time

INDUSTRY

Apply for this position

✦ AutoApply Let us apply to roles like this on your behalf.

Learn more

About the role

About Us

At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.

In the last few months alone we launched Fireworks Training, partnered with Microsoft Azure Foundry, and published research straight from our production systems.

The Role

AI Field Engineers at Fireworks are the technical tip of the spear. You embed with our most ambitious customers and technology partners to turn complex AI problems into production systems, fast. The role sits at the intersection of engineering, product, and customer delivery. You are hands-on-keyboard building POCs, MVPs, and production integrations, while also holding your own in executive-level conversations about architecture, strategy, and business outcomes.

You spend most of your time building. You ship code, run benchmarks, debug production issues, and architect deployments. But you also lead discovery conversations, align stakeholders, and translate customer pain points into product improvements that compress the feedback loop from field to roadmap. This is a role for engineers who are comfortable on-site with customers, building the relationships and trust that happen in person, not just over a call.

The Segment

As a Field Engineer in the Enterprise track you will work with large organizations and digital-native companies adopting GenAI across the business. These engagements span more stakeholders and longer cycles, so you will manage executive relationships and align teams while staying hands-on in the code. The emphasis is on pairing strong technical delivery with the executive presence to earn trust across an org: discovery, solution design, POC execution, and the path to production at enterprise scale.

What You'll Work On

Technical Delivery and Deployment

Build end-to-end POCs and MVPs alongside customer engineering teams, working inside their codebases, infrastructure, and constraints.
For customers whose core product is built on GenAI, architect the inference foundations that capability depends on, and size deployments so they can scale in their market without infrastructure becoming the bottleneck.
Run load tests and establish latency, throughput, and cost baselines against realistic customer traffic profiles, and tune deployments to hit those targets.
Deploy and validate new model families on inference frameworks (vLLM, SGLang), determining optimal shapes, quantization configs, and serving patterns across workloads.

Model Strategy and Fine-Tuning

Guide customers on model selection, fine-tuning strategy (SFT, DPO, RFT), and evaluation methodology.
Build and run fine-tuned models for customer use cases.

✦ Let us apply for you

We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $15/mo. Cancel anytime.

Get AutoApply

Apply now