Talent.com
ML Model Serving Engineer
SESAME • San Francisco, CA, United States
Job type: Full-time

Sesame Job Opportunity

Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of computer, focused on making voice companions part of our daily lives. Our team brings together founders from Oculus and Ubiquity6, alongside proven leaders from Meta, Google, and Apple, with deep expertise spanning hardware and software. Join us in shaping a future where computers truly come alive.

Responsibilities:

  • Turbocharge our serving layer, consisting of a variety of LLM, speech, and vision models.
  • Partner with ML infrastructure and training engineers to build a fast, cost-effective, accurate, and reliable serving layer to power a new consumer product category.
  • Modify and extend LLM serving frameworks like vLLM and SGLang to take advantage of the latest techniques in high-performance model serving.
  • Experiment with new compilers to support running models on a variety of hardware compute platforms.
  • Work with the training team to identify opportunities to produce faster models without sacrificing quality.
  • Use techniques like in-flight batching, caching, and custom kernels to speed up inference.
  • Find ways to reduce model initialization times without sacrificing quality.

Required Qualifications:

  • Expert in some differentiable array computing framework, preferably PyTorch.
  • Expert in optimizing machine learning models for serving reliably at high throughput, with low latency.
  • Significant systems programming experience, e.g., experience working on high-performance server systems; you'd be just as comfortable with the internals of vLLM as you would with a complex PyTorch codebase.
  • Significant performance engineering experience, e.g., bottleneck analysis in high-scale server systems or profiling low-level systems code.
  • Always up to date on the latest techniques for model serving optimization.

Preferred Qualifications:

  • Familiarity with high-performance LLM serving, e.g., experience with vLLM and SGLang deployment and internals.
  • Experience with a public cloud platform such as GCP, AWS, or Azure.
  • Experience deploying and scaling inference workloads in the cloud using Kubernetes, Ray, etc.
  • You like to ship and have a track record of leading complex multi-month projects without assistance.
  • You're excited to learn new things and work in a multitude of roles.

Sesame is committed to a workplace where everyone feels valued, respected, and empowered. We welcome all qualified applicants, embracing diversity in race, gender, identity, orientation, ability, and more. We provide reasonable accommodations for applicants with disabilities; contact careers@ for assistance.

Full-time Employee Benefits:

  • 401k matching
  • 100% employer-paid health, vision, and dental benefits
  • Unlimited PTO and sick time
  • Flexible spending account matching (medical FSA)
  • Benefits do not apply to contingent / contract workers
