Talent.com
CUDA Kernel Optimizer - ML Engineer

Mercor • San Francisco, California, United States
Job type
  • Full-time
  • Remote
Job description

1) Role Overview

Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while maintaining correctness and reproducibility.

2) Key Responsibilities

Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.

Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.

Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.

Report performance metrics, analyze speedups, and propose architectural improvements.

Collaborate asynchronously with PyTorch Operator Specialists to integrate kernels into production frameworks.

Produce well-documented, reproducible benchmarks and performance write-ups.

3) Ideal Qualifications

Deep expertise in CUDA programming, GPU architecture, and memory optimization.

Proven ability to achieve quantifiable performance improvements across hardware generations.

Proficiency with mixed precision, Tensor Core usage, and low-level numerical stability considerations.

Familiarity with frameworks like PyTorch, TensorFlow, or Triton (not required but beneficial).

Strong communication skills and independent problem-solving ability.

Demonstrated open-source, research, or performance benchmarking contributions.

4) More About the Opportunity

Ideal for independent contractors who thrive in performance-critical, systems-level work.

Engagements focus on measurable, high-impact kernel optimizations and scalability studies.

Work is fully remote and asynchronous; deliverables are outcome-driven.

Access to shared benchmarking infrastructure and reproducibility tooling via Mercor support resources.

5) Compensation & Contract Terms

Typical range: $120–$250/hour, depending on scope, specialization, and results achieved. Payments are based on accepted task output rather than a flat hourly rate.

Structured as a contract-based engagement, not an employment relationship.

Compensation tied to measurable deliverables or agreed milestones.

Confidentiality, IP, and NDA terms as defined per engagement.

6) Application Process

Submit a brief overview of prior CUDA optimization experience, profiling results, or performance reports.

Include links to relevant GitHub repos, papers, or benchmarks if available.

Indicate your hourly rate, time availability, and preferred engagement length.

Selected experts may complete a small, paid pilot kernel optimization project.

7) About Mercor

Mercor connects domain experts with top AI research and technology organizations through project-based contracts.

Contractors operate independently, with full flexibility over methods, timelines, and tools.

Our mission is to help top engineers and researchers access frontier technical work without rigid employment structures.
