Talent.com
Engineering Manager - Model Performance
Engineering Manager - Model PerformanceBaseten • San Francisco, CA, US
Engineering Manager - Model Performance

Engineering Manager - Model Performance

Baseten • San Francisco, CA, US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Join Our Dynamic Team at Baseten

Join our dynamic team at Baseten, where we're revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors such as IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading enterprises and AI-driven innovatorsincluding Descript, Bland.ai, Patreon, Writer, and Robust Intelligenceto deliver top-tier performance, security, and reliability for their production workloads. With our recent $75 million Series C funding, we're poised to accelerate our mission to make AI accessible across all products. If you're passionate about tackling impactful challenges and building transformative solutions from the ground up, we invite you to join us on this exciting journey!

The Role

Are you passionate about advancing the frontiers of artificial intelligence while leading a team of exceptional engineers? We are looking for a Tech Lead Manager focused on ML performance and inference. This role is ideal for someone with a strong engineering background who is eager to lead and mentor a team while remaining hands-on with technology. If you thrive in a fast-paced startup environment and are excited about both leadership and technical challenges, we want to hear from you.

Responsibilities

  • Lead, mentor, and manage a team of engineers focused on developing and optimizing ML model inference and performance.
  • Oversee technical strategy and architecture decisions, driving improvements across our engineering organization.
  • Collaborate with cross-functional teams to ensure seamless integration and scalability of ML models in production environments.
  • Dive into the codebase of frameworks like TensorRT, PyTorch, CUDA, and others to identify and solve complex performance bottlenecks.
  • Drive the development and deployment of large-scale optimization techniques for various ML models, especially large language models (LLMs).
  • Own the full lifecycle of projects from inception through delivery, including planning, execution, and resource management.
  • Foster a collaborative, inclusive team environment that encourages continuous learning and growth.

Requirements

  • Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or a related field.
  • 5+ years of professional experience in software engineering, with at least 2 years in a technical leadership role.
  • Proven experience managing and mentoring teams of engineers.
  • Expertise in one or more programming languages, such as Python, C++, or Go.
  • In-depth understanding of ML model performance optimization, especially using libraries such as PyTorch, TensorRT, and CUDA.
  • Strong knowledge of containerization (Docker) and orchestration systems (Kubernetes).
  • Experience with production-level AI / ML solutions, including scaling and deploying large models.
  • Ability to balance hands-on technical work with team leadership and project management.
  • Bonus Points

  • Experience enhancing the performance of large language models (LLMs) or similar AI systems.
  • Familiarity with LLM optimization techniques such as quantization, speculative decoding, or continuous batching.
  • Deep knowledge of GPU architecture and performance tuning.
  • Previous experience in a high-growth startup environment.
  • Benefits

  • Competitive compensation package (Unlimited PTO, 401k, covered healthcare premiums).
  • An opportunity to lead a talented engineering team at a rapidly growing startup in the machine learning space.
  • Inclusive and supportive work culture with ample opportunities for professional development.
  • Exposure to a wide range of ML use cases, offering unmatched learning and networking potential.
  • Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

    At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

    serp_jobs.job_alerts.create_a_job

    Manager Performance • San Francisco, CA, US

    Job_description.internal_linking.related_jobs
    Engineering Manager, Canvas

    Engineering Manager, Canvas

    Zapier • San Francisco, California, USA
    serp_jobs.job_card.full_time
    So if youre using AI tools while applying here - thats great! We just ask that you use them.How to Collaborate with AI During Zapiers Hiring Process. AI tools like ChatGPT Claude Gemini or others du...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Product Manager - AI Models

    Engineering Product Manager - AI Models

    Cisco Systems, Inc. • San Francisco, California, United States
    serp_jobs.job_card.full_time
    The application window is expected to close on : December 25th, 2025.NOTE : Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received.The Cis...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager - Autonomy

    Engineering Manager - Autonomy

    Booster • San Mateo, CA, United States
    serp_jobs.job_card.full_time
    Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial transportation. The Skydio team combines deep expertise in ar...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager, AI-Driven GTM Platform (Onsite SF / NYC)

    Engineering Manager, AI-Driven GTM Platform (Onsite SF / NYC)

    Unify • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A high-growth technology company in San Francisco is seeking an experienced Engineering Manager to lead a talented team.The ideal candidate will have over 6 years of engineering experience and a st...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager - MLOps & Analytics

    Engineering Manager - MLOps & Analytics

    Canonical • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Engineering Manager - MLOps & Analytics.Be among the first 25 applicants.Engineering Manager - MLOps & Analytics.Get AI-powered advice on this job and more exclusive features.The role of an Enginee...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Remote AI-Driven Engineering Manager

    Remote AI-Driven Engineering Manager

    Y-Axis • San Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    A leading technology company based in San Francisco is seeking an Engineering Manager to lead a talented team.This role involves designing engineering services for AI features, optimizing performan...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager, Machine Learning Infrastructure, Ads

    Engineering Manager, Machine Learning Infrastructure, Ads

    Roblox • San Mateo, California, USA
    serp_jobs.job_card.full_time
    With Roblox Ads business growing at a rapid rate we are building large scale ads machine learning infrastructure to deliver effective performance ads to our users and more business values to our ad...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager, ML Acceleration

    Engineering Manager, ML Acceleration

    Anthropic • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Engineering Manager, ML

    Engineering Manager, ML

    TwelveLabs • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    At TwelveLabs, we are pioneering the development of frontier multimodal foundation models that can see, hear and understand the world as humans do. Our models have redefined the standards in video-l...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Manager, Software Engineering Personalization and ML Enablement

    Senior Manager, Software Engineering Personalization and ML Enablement

    Upstart • San Mateo, California, USA
    serp_jobs.job_card.full_time
    Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstarts AI marketplace Upstart-powered banks and credit un...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager

    Engineering Manager

    Rillet • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Our customers are the financial brains of their companies.Our job is to help them run the numbers with impossible speed, accuracy, and insight. Rillet is an AI-native ERP that can drive a zero‑day c...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager - Machine Learning Infrastructure

    Engineering Manager - Machine Learning Infrastructure

    Plaid Inc • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top of Plaid’s vast financial data network.The Mac...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager, Desktop

    Engineering Manager, Desktop

    anthropic • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager, Host

    Engineering Manager, Host

    Turo • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    As an engineering manager for the Host product team, you’ll lead a cross‑functional team of Software Engineers that build features to support the supply side of Turo’s global marketplace.This team ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Engineering Manager, Merchant Data & ML Solutions

    Engineering Manager, Merchant Data & ML Solutions

    Grubhub • San Francisco, CA, US
    serp_jobs.job_card.full_time
    A leading food delivery platform headquartered in San Francisco is looking for an Engineering Manager to lead the Merchant engineering team. You will be responsible for driving the development of da...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Engineering Manager, AI Document Generation — Hybrid + Equity

    Engineering Manager, AI Document Generation — Hybrid + Equity

    EvenUp • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A technology-driven legal firm in San Francisco seeks an Engineering Manager for Document Generation.This hybrid role involves leading a team in developing AI workflows, ensuring high-quality softw...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Engineering Manager, Pricing & Revenue

    Senior ML Engineering Manager, Pricing & Revenue

    Opendoor • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A leading real estate platform in Seattle is seeking a Senior Manager, Machine Learning Engineering to lead a team of engineers in driving the machine learning ecosystem. Focused on optimizing ML sy...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Engineering Manager, Portal

    Engineering Manager, Portal

    Hayden AI • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Engineering Manager, Portal (Hayden AI).Join to apply for the Engineering Manager, Portal role at Hayden AI.At Hayden AI, we are on a mission to harness the power of computer vision to transform th...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted