Talent.com
Software Engineer - Model Performance
Software Engineer - Model PerformanceAI Fund • San Francisco, CA, United States
serp_jobs.error_messages.no_longer_accepting
Software Engineer - Model Performance

Software Engineer - Model Performance

AI Fund • San Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Overview

Join to apply for the Software Engineer - Model Performance role at AI Fund .

Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application.

About Baseten

Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast. Backed by top investors including

Responsibilities

  • Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure.
  • Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to debug ML performance issues.
  • Apply and scale optimization techniques across a wide range of ML models, particularly large language models.
  • Collaborate with a diverse team to design and implement innovative solutions.
  • Own projects from idea to production.

Qualifications

  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.
  • Experience with one or more general-purpose programming languages, such as Python or C++.
  • Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching).
  • Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.
  • Demonstrated interest and experience in LLMs.
  • Deep understanding of GPU architecture.
  • Bonus :
  • Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs).

  • Experience with CUDA or similar technologies.
  • Deep understanding of software engineering principles and a proven track record of developing and deploying AI / ML inference solutions.
  • Experience with Docker and Kubernetes.
  • Benefits

  • Competitive compensation package.
  • This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.
  • An inclusive and supportive work culture that fosters learning and growth.
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
  • Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

    At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

    Job Details

  • Seniority level : Entry level
  • Employment type : Full-time
  • Job function : Engineering and Information Technology
  • Industries : Venture Capital and Private Equity Principals
  • Location : San Francisco, CA

    Salary : $160,000.00-$180,000.00 per year

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Engineer Performance • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Senior Software Engineer - Infrastructure, Machine Learning

    Senior Software Engineer - Infrastructure, Machine Learning

    Baton • San Francisco, California, United States
    serp_jobs.job_card.full_time
    With $10B in freight under management, our technology reaches every part of the U.We design and ship category-defining software that enables Ryder and its 50,000+ customers—including some of the wo...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer, Machine Learning

    Senior Software Engineer, Machine Learning

    Planet Labs PBC • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer, Performance Optimization

    Software Engineer, Performance Optimization

    Fireworks Ai • Redwood City, California, United States
    serp_jobs.job_card.full_time
    Here at Fireworks, we’re building the future of generative AI infrastructure.Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference.We’...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Fullstack

    Software Engineer, Fullstack

    Ambient.ai • Redwood City, California, United States
    serp_jobs.job_card.full_time
    Build a safer world with us, one incident at a time.AI-powered physical security platform helping the world’s leading enterprises reduce risk, improve operational efficiency, and gain critical insi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Fullstack Software Engineer, Pricing Lifecycle

    Fullstack Software Engineer, Pricing Lifecycle

    Orb • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Orb is transforming how modern AI and software companies monetize at scale.We've built the next-generation billing infrastructure that turns complex usage-based pricing into competitive advantage.O...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer

    Software Engineer

    Confidential • Redwood City, California, United States
    serp_jobs.job_card.full_time
    C3 Energy is looking for data engineers to develop and implement the next generation of analytics for the smart grid.We are building a platform able to handle the extremely large amount of data gen...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AI Infrastructure Engineer, Model Serving Platform

    AI Infrastructure Engineer, Model Serving Platform

    Scale AI, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer - Machine Learning Platform

    Senior Software Engineer - Machine Learning Platform

    Snowflake • Menlo Park, California, United States
    serp_jobs.job_card.full_time
    The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their machine learning and deep learning workloads to Snowflake. Our customers want to build powerful models wi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer - Machine Learning- Publica by IAS

    Senior Software Engineer - Machine Learning- Publica by IAS

    Publica • San Francisco, California, United States
    serp_jobs.job_card.full_time
    At Publica, engineers have a unique opportunity to work on a platform that handles billions of requests per hour in one of the fastest growing areas in Ad Tech : Connected Television.Engineers at Pu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer (Fullstack)

    Software Engineer (Fullstack)

    Onecrew • San Francisco, California, United States
    serp_jobs.job_card.full_time
    OneCrew is the leading unified platform helping paving contractors estimate accurately, manage crews effectively, and track profitability in real-time. We eliminate the costly mistakes and wasted ti...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer (AI Performance)

    Software Engineer (AI Performance)

    Gimlet Labs • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Gimlet Labs is building the foundation for the next generation of AI applications.As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck.Gimlet is redefi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Machine Learning Infrastructure

    Software Engineer, Machine Learning Infrastructure

    Datologyai • Redwood City, California, United States
    serp_jobs.job_card.full_time
    Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Model Inference

    Software Engineer, Model Inference

    Openai • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Our team brings OpenAI’s most capable research and technology to the world through our products.We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI model...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Software Engineer, Machine Learning

    Staff Software Engineer, Machine Learning

    Mlabs • Burlingame, California, United States
    serp_jobs.job_card.full_time
    Staff Software Engineer, Machine Learning.Burlingame, CA (On-site, 4 days a week).We are a rapidly growing AI company applying. We're looking for a highly skilled and experienced.Staff Software Engi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer - Machine Learning

    Senior Software Engineer - Machine Learning

    Celonis • Redwood City, California, United States
    serp_jobs.job_card.full_time
    We're Celonis, the global leader in Process Intelligence technology and one of the world's fastest-growing SaaS firms.We believe there is a massive opportunity to unlock productivity by placing AI,...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer (Modeling), Support

    Senior Machine Learning Engineer (Modeling), Support

    Block • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Block is one company built from many blocks, all united by the same purpose of economic empowerment.The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Sec...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer (ML Platform)

    Software Engineer (ML Platform)

    Anyscale • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Ray in their tech stacks to accelerate the progress of AI applications out into the real world.With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can s...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Backend Engineer, Models

    Backend Engineer, Models

    Meter • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Networking is one of the most fundamental industries in all of technology.For the first time, Meter has unified the full networking stack. and now we are making it autonomous.Meter Zero is our neur...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted