Applied AI Engineer
Pareto.AI · Remote
📍 US Remote💰 $225K – $275K • Offers Equity • Offers Bonusvia ashbyPosted 2026-04-08
Apply on company site ↗
CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to Pareto.AI.
ABOUT PARETO
Humanity is in a virtuous cycle: human insight improves AI, and better AI expands what people can do. Sustaining it depends on the one input that can't be automated: expert human judgment https://pareto.ai/blog/debating-persuasive-llms-truthful-answers.
At Pareto, we build the platform that turns that judgment into the data https://pareto.ai/blog/community-driven-knowledge-resource-ai, evals https://pareto.ai/blog/introducing-attunebench, and RL environments frontier models https://pareto.ai/blog/llm-metacognition-shared-and-shallow learn from. We work with leading frontier labs like Anthropic and GDM, and we give skilled people everywhere a way to shape the future of AI and share in what it creates.
This RL environment and human-data infrastructure is already in production. Our job now is to scale it.
Responsibilities
- Design and build the pipelines that generate synthetic tasks and evaluation environments for AI model training — this is the factory floor of AI development, producing training fuel for next-generation models, not the models themselves
- Architect the workflows where AI and humans work together in the loop — deciding what gets automated, what requires human intervention, how state is preserved across handoffs, and how the whole system stays reliable at scale
- Own and lead the most complex system design discussions — produce one-page technical scoping documents that surface hidden risks before development begins, define technology stacks, and establish engineering guidelines that let the team move fast without breaking things
- Rapidly assess whether a technical idea is worth building — get early signal, align stakeholders, and kill or accelerate accordingly
- Partner closely with research, operations, and data teams — juggle multiple workstreams, make smart tradeoff decisions as priorities shift, and translate ambiguous business needs into concrete technical architecture
- Build reusable frameworks and engineering guidelines that raise the team's collective execution muscle
You may be a good fit if you have
- 8+ years of software engineering experience with a track record of owning complex systems end-to-end
- A software engineering foundation first — you think in systems, architecture, and engineering tradeoffs, not in models and experiments
- Production experience building and shipping agentic workflows, multi-agent orchestration, HITL pipelines, and LLM-powered applications with measurable business outcomes — RAG, vector stores, semantic search, and multi-model LLM stacks in production, not just demos
- Battle-tested context engineering practices — you reason clearly about the limits of AI and architect around them
- Experience with distributed systems architecture applied to AI or data platforms — reliable, observable, and scalable systems built in service of a product
- Daily proficiency with agentic coding tools (Claude Code, Cursor, or equivalent) — you use these to multiply your output, not pad it
- A track record of operating in ambiguity — shipping fast, pivoting when wrong, and moving on without ego
- Exceptional written and verbal English communication skills — you can lead a design discussion, push back on stakeholders, and document architecture clearly. Communication cannot be a bottleneck
Nice to Have
- Experience at an AI data company (Scale AI, Surge, Snorkel, Labelbox, or similar) — particularly building synthetic data pipelines, eval environments, or task generation systems. This is the dream background.
- Experience building human data labeling interfaces, annotation workflows, or data collection pipelines
- Familiarity with preference data and reward models used in AI model training (RLHF, RLVR, or similar)
- Proficiency with our stack: Python, TypeScript, AWS, GCP, Terraform, Temporal Cloud, containerization, LLM gateways, RAG frameworks, and data pipeline tooling
- Ability to employ data structures and algorithms when forming AI/LLM solutions
- Ability to reason about requirements with a bias for Essentialism
More Remote jobs
Remote jobs · Browse all locations