Talent.com
CUDA Kernel Optimizer ML Engineer
CUDA Kernel Optimizer ML EngineerMercor • San Francisco, California, USA
CUDA Kernel Optimizer ML Engineer

CUDA Kernel Optimizer ML Engineer

Mercor • San Francisco, California, USA
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

1) Role Overview

Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization performance profiling and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while maintaining correctness and reproducibility

2) Key Responsibilities

Develop tune and benchmark CUDA kernels for tensor and operator workloads.

Optimize for occupancy memory coalescing instruction-level parallelism and warp scheduling.

Profile and diagnose performance bottlenecks using Nsight Systems Nsight Compute and comparable tools.

Report performance metrics analyze speedups and propose architectural improvements.

Collaborate asynchronously with PyTorch Operator Specialists to integrate kernels into production frameworks.

Produce well-documented reproducible benchmarks and performance write-ups.

3) Ideal Qualifications

Deep expertise in CUDA programming GPU architecture and memory optimization.

Proven ability to achieve quantifiable performance improvements across hardware generations.

Proficiency with mixed precision Tensor Core usage and low-level numerical stability considerations.

Familiarity with frameworks like PyTorch TensorFlow or Triton (not required but beneficial).

Strong communication skills and independent problem-solving ability.

Demonstrated open-source research or performance benchmarking contributions.

4) More About the Opportunity

Ideal for independent contractors who thrive in performance-critical systems-level work.

Engagements focus on measurable high-impact kernel optimizations and scalability studies.

Work is fully remote and asynchronous; deliverables are outcome-driven.

Access to shared benchmarking infrastructure and reproducibility tooling via Mercor support resources.

5) Compensation & Contract Terms

Typical range : $120$250 / hour depending on scope specialization and results achieved. Payments will be based on accepted task output over flat hourly.

Structured as a contract-based engagement not an employment relationship.

Compensation tied to measurable deliverables or agreed milestones.

Confidentiality IP and NDA terms as defined per engagement.

6) Application Process

Submit a brief overview of prior CUDA optimization experience profiling results or performance reports.

Include links to relevant GitHub repos papers or benchmarks if available.

Indicate your hourly rate time availability and preferred engagement length.

Selected experts may complete a small paid pilot kernel optimization project

7) About Mercor

Mercor connects domain experts with top AI research and technology organizations through project-based contracts.

Contractors operate independently with full flexibility over methods timelines and tools.

Our mission is to help top engineers and researchers access frontier technical work without rigid employment structures.

Key Skills

HVAC Engineering,Client Servicing,Access Control,Jpa,Gym,Air Compressors

Employment Type : Full Time

Experience : years

Vacancy : 1

serp_jobs.job_alerts.create_a_job

Ml Engineer • San Francisco, California, USA

Job_description.internal_linking.related_jobs
Performance ML Engineer : CUDA, GPU Systems

Performance ML Engineer : CUDA, GPU Systems

Relace • San Francisco, CA, United States
serp_jobs.job_card.full_time
A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance tuning and optimization.The ideal c...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Dev Ops Engineer

Dev Ops Engineer

Lawrence Berkeley National Laboratory • Berkeley, CA, United States
serp_jobs.job_card.full_time +1
Lawrence Berkeley National Lab's (.NERSC Division has an opening for a Dev Ops Engineer to join the team.In this exciting role, you will serve as a DevOps-oriented System Administrator / Software Eng...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Hardware Support Engineer

Hardware Support Engineer

Cognizant • Emerald Hills, CA, US
serp_jobs.job_card.full_time
Cognizant is a leading provider IT and BPO services, providing critical initiatives to a variety of global clients.The Hardware Operations team is a part of a high profile client project that provi...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_1_hour • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Imaging CT Tech

Imaging CT Tech

The US Oncology Network • Emeryville, CA, United States
serp_jobs.job_card.full_time
ANNUAL SALARY (DEPENDING ON SKILLS / EXPERIENCE) : $60.Open Positions in these Clinic Locations : Antioch, Dublin, Hayward, Emeryville, & Pleasant Hill. Perform computed tomography scan duties in compli...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Software Engineer - Observability

Software Engineer - Observability

Snowflake • Menlo Park, California, United States
serp_jobs.job_card.full_time
The Observability team at Snowflake is in charge of building an extensible, self-service Observability platform that reliably collects and serves telemetry data such as metrics, logs, traces to bot...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Software Engineer, Performance Optimization

Software Engineer, Performance Optimization

Fireworks Ai • Redwood City, California, United States
serp_jobs.job_card.full_time
Here at Fireworks, we’re building the future of generative AI infrastructure.Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference.We’...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Software Engineer

Software Engineer

Confidential • Redwood City, California, United States
serp_jobs.job_card.full_time
C3 Energy is looking for data engineers to develop and implement the next generation of analytics for the smart grid.We are building a platform able to handle the extremely large amount of data gen...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Product Development Engineer, Reagents

Product Development Engineer, Reagents

Bruker • Emeryville, CA, United States
serp_jobs.job_card.full_time +1
Product Development Engineer, Reagents.Bruker is enabling scientists to make breakthrough discoveries and develop new applications that improve the quality of human life. Bruker's high-performance s...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
CT Technologist

CT Technologist

Providence • Vallejo, CA, US
serp_jobs.job_card.full_time
Under the direction of a Radiologist, or Physician designee, and Imaging Leadership, and with latitude for independent judgment, performs all the professional duties involved in applying ionizing r...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_1_hour • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Senior Mechanical Engineer

Senior Mechanical Engineer

Bio-Rad Laboratories • Hercules, CA, United States
serp_jobs.job_card.full_time
Working within Bio-Rad's Clinical Diagnostics Group R&D organization as an Instrument Engineer with Mechanical Engineering background, you'll take engineering concepts and requirements and transfor...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
CUDA Kernel Optimizer - ML Engineer

CUDA Kernel Optimizer - ML Engineer

Mercor • San Francisco, California, United States
serp_jobs.filters.remote
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days
ML Research Engineer, ML Systems

ML Research Engineer, ML Systems

Scale AI, Inc. • San Francisco, CA, United States
serp_jobs.job_card.full_time
Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Software Engineer - ML Pricing

Software Engineer - ML Pricing

Opendoor • San Francisco, California, United States
serp_jobs.job_card.full_time
At Opendoor, pricing is at the core of our product — our models directly influence high-stakes decisions around real estate transactions across the country. We are looking for a mid-level.This is a ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Software Engineer, Supercomputing

Software Engineer, Supercomputing

Thinking Machines Lab • San Francisco, California, United States
serp_jobs.job_card.full_time
Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence.We're building a future where everyone has access to the knowledge and tools to make AI w...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Software Engineer, GPU Infrastructure

Software Engineer, GPU Infrastructure

Openai • San Francisco, California, United States
serp_jobs.job_card.full_time
This role will support the fleet infrastructure team at OpenAI.The fleet team focuses on running the world’s largest, most reliable, and frictionless GPU fleet to support OpenAI’s general purpose m...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Surgical Technologist I - General Specialty PMs - El Cerrito

Surgical Technologist I - General Specialty PMs - El Cerrito

Advocate Aurora • El Cerrito, CA, United States
serp_jobs.job_card.full_time +1
Surgical Technologist I - General Specialty PMs.Aurora St Lukes Medical Center - 2900 W Oklahoma Ave.Advocate Health offers a comprehensive suite of Total Rewards : benefits and well-being programs,...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Principal Software Engineer

Principal Software Engineer

Informatica LLC • Redwood City, CA, United States
serp_jobs.job_card.full_time
Build Your Career at Informatica.We seek innovative thinkers who believe in the power of data to drive meaningful change. At Informatica, we welcome adventurous, work-from-anywhere minds eager to so...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Software Engineer (ML Platform)

Software Engineer (ML Platform)

Anyscale • San Francisco, California, United States
serp_jobs.job_card.full_time
Ray in their tech stacks to accelerate the progress of AI applications out into the real world.With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can s...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted