Talent.com
Software Engineer (AI Performance)

Gimlet Labs • San Francisco, California, United States
Posted 30 days ago
Job type
  • Full-time
Job description

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.

Gimlet is spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware with previous successful exits.

Gimlet Labs is seeking a Software Engineer focused on AI Performance. You will be researching and implementing techniques to drive performance and quality optimizations across the latest AI models. You will implement techniques such as quantization, KV caching, and FlashAttention to enable inference efficiency. You will design parallelism strategies to distribute data and workloads across compute nodes at production scale. You will dive deep into GPU code and kernel optimizations to accelerate AI workloads.

Responsibilities:

  • Evaluating and implementing cutting-edge AI research for model performance and efficiency
  • Architecting infrastructure for distributed AI workloads across both the software stack and GPU kernel layers
  • Profiling, benchmarking, and analyzing system performance, identifying bottlenecks and optimization opportunities in execution runtimes targeting various hardware systems

Qualifications:

  • Bachelor’s degree in computer science, engineering, applied mathematics or comparable area of study
  • Experience with performance optimization

Preferred Qualifications:

  • Graduate degree in computer science, engineering, applied mathematics or comparable area of study
  • Familiarity with compilers and compiler frameworks such as MLIR
  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks
  • Software development experience with Python, C++, and CUDA


Related jobs
AI Software Engineer

unitQ • San Francisco, California, United States
Full-time
unitQ is a game-changing AI SaaS platform that empowers companies to build the world’s best products by leveraging real-time customer feedback to improve product quality and drive growth. unitQ’s leading AI...

Software Engineer (AI Performance)

Gimlet Labs, Inc • San Francisco, CA, United States
Full-time
Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefi...

Senior Software Engineer, AI Model serving - San Francisco, USA

Speechify • San Francisco, California, United States
Full-time
PLEASE APPLY THROUGH THIS LINK: . The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever ...

AI Software Engineer, Search

Nexus • San Francisco, California, United States
Full-time
Nexus is innovating at the intersection of artificial intelligence, blockchain, and zero-knowledge cryptography to build a Layer 1 for the AI era. Our team of world-leading experts is developing the...

Senior Software Engineer, AI

Peregrine Technologies • San Francisco, California, United States
Full-time
Backed by leading investors from Silicon Valley, Peregrine supports public safety agencies across the country, from Los Angeles to Louisville to Atlanta, empowering public servants to improve ope...

Senior Software Engineer, AI

Magical • San Francisco, California, United States
Full-time
At Magical, we empower organizations to automate the complex, manual workflows that are essential to their operations. We’re building a brand new product, and this is your chance to join as a foundi...

Software Engineer, Enterprise AI

Scale AI, Inc. • San Francisco, CA, United States
Full-time
Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...

Software Engineer (AI Agents)

Pylon • San Francisco, California, United States
Full-time
At Pylon, we're building the future of B2B Post Sales. We’re building the all-in-one B2B post-sales support platform powered by conversational data and layered with intelligence to help our customer...

Software Engineer, Analytics Platform

OpenAI • San Francisco, California, United States
Full-time
The Research Platform Analytics team designs, builds, and operates the critical foundational data and analytics infrastructure that enables research at OpenAI. Our goal is one, and one only: acceler...

AI Software Engineer

unitQ • San Francisco, CA, United States
Full-time
At unitQ, we leverage AI and advanced analytics to enable businesses to proactively monitor and improve product quality based on real‑time user feedback from both public and private channels. Backed...

Software Engineer, AI

Mlabs • San Francisco, California, United States
Remote
Full-time
Software Engineer, AI (0→1 Product Focus). Location: San Francisco Bay Area. We are a high-growth, well-funded technology company pioneering the Composable Customer Data Platform (CDP) and AI Decisio...

Software Engineer - AI

Unify • San Francisco, California, United States
Full-time
Unify was founded January 17th, 2023 by Austin Hughes and Connor Heggie. Connor was a machine learning research engineer at. The rest of our team comes from companies like. Our mission is to build the...

HPC / AI Data Performance Engineer

Lawrence Berkeley National Laboratory • Berkeley, CA, United States
Full-time +1
In this exciting role, you will serve as a Data Performance Engineer in NERSC's Application Performance Group, architecting HPC and AI data services that advance fundamental science. You'll optimize...

Senior Software Engineer, AI Products

CloudTrucks • San Francisco, CA, United States
Full-time
The trucking industry is the backbone of the global economy. Roughly 70 percent of what we consume in the U. Those trucks are powered by over 3. Trucking is a massive industry, but it is a traditional...

Software Engineer (Applied AI)

Column Tax • San Francisco, California, United States
Full-time
Software Engineer (Applied AI). At Column Tax, we’re building the next generation of tax software. Our mission is to make it possible for every taxpayer to file confidently in just one click. As the f...

Senior Software Engineer, AI Agentic Experience (Auth0)

Okta • San Francisco, CA, United States
Full-time
Senior Software Engineer, AI Agentic Experience (Auth0): join to apply for the Senior Software Engineer, AI Agentic Experience (Auth0) role at Okta. Design and Build Developer Tooling that helps de...

Software Engineer- AI / ML, AWS Neuron

Amazon • San Francisco, CA, United States
Full-time
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud‑scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This role is for a software engin...

AI Software Engineer

AI Fund • San Francisco, CA, United States
Full-time
Want to use your engineering skills to make an impact? AI is the new electricity. Founded by AI visionary Andrew Ng, DeepLearning.AI is on a mission to empower everyone to build with AI. As a FullSta...