AI Field Engineer - Enterprise
About the role
About Us
At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.
In the last few months alone we launched Fireworks Training, partnered with Microsoft Azure Foundry, and published research straight from our production systems.
The Role
AI Field Engineers at Fireworks are the technical tip of the spear. You embed with our most ambitious customers and technology partners to turn complex AI problems into production systems, fast. The role sits at the intersection of engineering, product, and customer delivery. You are hands-on-keyboard building POCs, MVPs, and production integrations, while also holding your own in executive-level conversations about architecture, strategy, and business outcomes.
You spend most of your time building. You ship code, run benchmarks, debug production issues, and architect deployments. But you also lead discovery conversations, align stakeholders, and translate customer pain points into product improvements that compress the feedback loop from field to roadmap. This is a role for engineers who are comfortable on-site with customers, building the relationships and trust that happen in person, not just over a call.
The Segment
As a Field Engineer in the Enterprise track you will work with large organizations and digital-native companies adopting GenAI across the business. These engagements span more stakeholders and longer cycles, so you will manage executive relationships and align teams while staying hands-on in the code. The emphasis is on pairing strong technical delivery with the executive presence to earn trust across an org: discovery, solution design, POC execution, and the path to production at enterprise scale.
What You'll Work On
Technical Delivery and Deployment
- Build end-to-end POCs and MVPs alongside customer engineering teams, working inside their codebases, infrastructure, and constraints.
- For customers whose core product is built on GenAI, architect the inference foundations that capability depends on, and size deployments so they can scale in their market without infrastructure becoming the bottleneck.
- Run load tests and establish latency, throughput, and cost baselines against realistic customer traffic profiles, and tune deployments to hit those targets.
- Deploy and validate new model families on inference frameworks (vLLM, SGLang), determining optimal shapes, quantization configs, and serving patterns across workloads.
Model Strategy and Fine-Tuning
- Guide customers on model selection, fine-tuning strategy (SFT, DPO, RFT), and evaluation methodology.
- Build and run fine-tuned models for customer use cases.