Talent.com
Engineering Manager - Model Performance
Baseten • San Francisco, CA, US
Job type: Full-time

Join Our Dynamic Team at Baseten

Join our dynamic team at Baseten, where we're revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors such as IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading enterprises and AI-driven innovators, including Descript, Bland.ai, Patreon, Writer, and Robust Intelligence, to deliver top-tier performance, security, and reliability for their production workloads. With our recent $75 million Series C funding, we're poised to accelerate our mission to make AI accessible across all products. If you're passionate about tackling impactful challenges and building transformative solutions from the ground up, we invite you to join us on this exciting journey!

The Role

Are you passionate about advancing the frontiers of artificial intelligence while leading a team of exceptional engineers? We are looking for a Tech Lead Manager focused on ML performance and inference. This role is ideal for someone with a strong engineering background who is eager to lead and mentor a team while remaining hands-on with technology. If you thrive in a fast-paced startup environment and are excited about both leadership and technical challenges, we want to hear from you.

Responsibilities

  • Lead, mentor, and manage a team of engineers focused on developing and optimizing ML model inference and performance.
  • Oversee technical strategy and architecture decisions, driving improvements across our engineering organization.
  • Collaborate with cross-functional teams to ensure seamless integration and scalability of ML models in production environments.
  • Dive into the codebase of frameworks like TensorRT, PyTorch, CUDA, and others to identify and solve complex performance bottlenecks.
  • Drive the development and deployment of large-scale optimization techniques for various ML models, especially large language models (LLMs).
  • Own the full lifecycle of projects from inception through delivery, including planning, execution, and resource management.
  • Foster a collaborative, inclusive team environment that encourages continuous learning and growth.

Requirements

  • Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or a related field.
  • 5+ years of professional experience in software engineering, with at least 2 years in a technical leadership role.
  • Proven experience managing and mentoring teams of engineers.
  • Expertise in one or more programming languages, such as Python, C++, or Go.
  • In-depth understanding of ML model performance optimization, especially using libraries such as PyTorch, TensorRT, and CUDA.
  • Strong knowledge of containerization (Docker) and orchestration systems (Kubernetes).
  • Experience with production-level AI/ML solutions, including scaling and deploying large models.
  • Ability to balance hands-on technical work with team leadership and project management.

Bonus Points

  • Experience enhancing the performance of large language models (LLMs) or similar AI systems.
  • Familiarity with LLM optimization techniques such as quantization, speculative decoding, or continuous batching.
  • Deep knowledge of GPU architecture and performance tuning.
  • Previous experience in a high-growth startup environment.

Benefits

  • Competitive compensation package (unlimited PTO, 401(k), covered healthcare premiums).
  • An opportunity to lead a talented engineering team at a rapidly growing startup in the machine learning space.
  • Inclusive and supportive work culture with ample opportunities for professional development.
  • Exposure to a wide range of ML use cases, offering unmatched learning and networking potential.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
