Talent.com
Senior Backend Engineer, Inference Platform
Senior Backend Engineer, Inference PlatformTogether • San Francisco, CA, United States
Senior Backend Engineer, Inference Platform

Senior Backend Engineer, Inference Platform

Together • San Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Senior Backend Engineer, Inference Platform

About the Team

Together AI is building the Inference Platform that brings the most advanced generative AI models to the world. Our platform powers multi‑tenant serverless workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal models, image, audio, video, and speech models at scale.

If you get a thrill from optimizing latency down to the last millisecond, this is your playground. You’ll work hands‑on with tens of thousands of GPUs (H100s, H200s, GB200s, and beyond), figuring out how to fully utilize every FLOP and every gigabyte of memory.

You’ll collaborate directly with research teams to bring frontier models into production, making breakthroughs usable in the real world. Our team also works closely with the open‑source community, contributing to and leveraging projects like SGLang, vLLM, and NVIDIA Dynamo to push the boundaries of inference performance and efficiency.

Some of What You’ll Work On

  • Build and optimize global and local request routing, ensuring low‑latency load balancing across data centers and model engine pods.
  • Develop auto‑scaling systems to dynamically allocate resources and meet strict SLOs across dozens of data centers.
  • Design systems for multi‑tenant traffic shaping, tuning both resource allocation and request handling — including smart rate limiting and regulation — to ensure fairness and consistent experience across all users.
  • Engineer trade‑offs between latency and throughput to serve diverse workloads efficiently.
  • Optimize prefix caching to reduce model compute and speed up responses.
  • Collaborate with ML researchers to bring new model architectures into production at scale.
  • Continuously profile and analyze system‑level performance to identify bottlenecks and implement optimizations.

What We’re Looking For

  • 5+ years of demonstrated experience building large‑scale, fault‑tolerant, distributed systems and API microservices.
  • Strong background in designing, analysing, and improving efficiency, scalability, and stability of complex systems.
  • Excellent understanding of low‑level OS concepts : multi‑threading, memory management, networking, and storage performance.
  • Expert‑level programming in one or more of : Rust, Go, Python, or TypeScript.
  • Knowledge of modern LLMs and generative models and how they are served in production is a plus.
  • Experience working with the open‑source ecosystem around inference is highly valuable; familiarity with SGLang, vLLM, or NVIDIA Dynamo will be especially handy.
  • Experience with Kubernetes or container orchestration is a strong plus.
  • Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand, NVLink, MPI) is a plus.
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field, or equivalent practical experience.
  • Why Join Us?

  • Shape the core inference backbone that powers Together AI’s frontier models.
  • Solve performance‑critical challenges in global request routing, load balancing, and large‑scale resource allocation.
  • Work with state‑of‑the‑art accelerators (H100s, H200s, GB200s) at global scale.
  • Partner with world‑class researchers to bring new model architectures into production.
  • Collaborate with and contribute to the open‑source community, shaping the tools that advance the industry.
  • Enjoy a culture of deep technical ownership and high impact — where your work makes models faster, cheaper, and more accessible.
  • Competitive compensation, equity, and benefits.
  • About Together AI

    Together AI is a research‑driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co‑designing software, hardware, algorithms, and models. We have contributed to leading open‑source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama.

    Compensation

    We offer competitive compensation, startup equity, health insurance, and other benefits. The US base salary range for this full‑time position is : $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job‑related knowledge.

    Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Senior Engineer Platform • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Senior Platform Backend Engineer - AI-Driven Platform

    Senior Platform Backend Engineer - AI-Driven Platform

    Sprig • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A tech company focused on AI-driven research is seeking a Senior Backend Engineer.You will design and maintain features that ensure platform reliability and performance, directly impacting customer...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    MSI Data • San Francisco, California, USA
    serp_jobs.job_card.full_time
    MSI Data is launching a new dedicated AI team with a singular mission : to disrupt the Field Service software category through AI-native solutions. We are moving beyond simple automation to create in...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    LlamaIndex • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Join us and help shape the future of AI by redefining document workflows with AI agents.LlamaIndex's backend application platform team is hiring for an experienced (4+ years) backend engineer excit...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Speak • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Our mission is to reinvent the way people learn, starting with language.We begin by teaching the next billion people English, Spanish, and French. English is the global language of business, culture...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Getfinvest • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    We are looking for a senior backend engineer to join our team in San Francisco.As one of the founding members of our team, you will play a critical role in shaping our product and engineering cultu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    David AI • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    This range is provided by David AI.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. At David AI, our engineers build the pipelines, platforms, an...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer in San Francisco - Skyfire

    Senior Backend Engineer in San Francisco - Skyfire

    WorksHub • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Skyfire is a payment platform designed for the emerging machine economy, where AI and machines need to transact seamlessly and autonomously. It addresses the limitations of current financial systems...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Skyfire • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    About Skyfire Skyfire is building the payment backbone for the emerging machine economy — a future where AI and machines transact seamlessly, instantly, and autonomously. Current financial systems a...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    TestBox • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    TestBox was founded with a bold mission : to fundamentally transform how software is bought and sold.We’re building an innovative platform that empowers buyers with transparent, interactive software...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Fal • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    This role is ideal for engineers who thrive on complex distributed systems and have deep experience with backend APIs, relational databases, and event-driven architectures.You’ll build high-perform...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer — AI-Driven Healthcare Platform

    Senior Backend Engineer — AI-Driven Healthcare Platform

    Recruiting From Scratch • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    An innovative firm is seeking a Senior Backend Engineer to drive AI integration in healthcare.You'll design scalable backend systems, ensuring compliance with security standards while optimizing op...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior Backend Engineer

    Senior Backend Engineer

    Cerebras • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Genies is an avatar technology company powering the next era of interactive digital identity through AI companions.With the Avatar Framework and intuitive creation tools, Genies enables developers,...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Macroscope Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Macroscope aims to be the source of truth of what's happening for any company that builds software.Our mission is to give leaders clarity and engineers time. We help leaders understand how their pro...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Chime • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Chime Engineering is growing rapidly as we scale to support the financial needs of our members.We are looking for driven engineers to join our team, where you’ll work on APIs that power our member ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Trust In SODA • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Soda is working with one of the most exciting companies in the Agentic AI space, a team that’s transforming how enterprises automate customer support with cutting-edge AI agents.They’re on track to...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Backend Engineer, AI Data Platform

    Senior Backend Engineer, AI Data Platform

    Labelbox • San Francisco, California, USA
    serp_jobs.job_card.full_time
    At Labelbox were building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018 weve been pioneering data-centric approaches that are fu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Backend Platform Engineer, Semgrep Analysis Foundations

    Senior Backend Platform Engineer, Semgrep Analysis Foundations

    Menlo Ventures • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Our mission is to make world-class software security available to everyone.This means building program analysis tools that are open source, easy to use, powerful, and fast.It also means building a ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Backend Engineer

    Senior Backend Engineer

    Doss Workflows • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    We’re building the “Terraform for operators”—an AI‑generated schema layer driving a white‑labeled data warehouse that lets physical‑goods companies evolve their systems of record as fast as their b...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted