Talent.com
Senior Software Engineer, AI Inference Platform
Senior Software Engineer, AI Inference PlatformCEREBRAS SYSTEMS INC. • Sunnyvale, CA, United States
serp_jobs.error_messages.no_longer_accepting
Senior Software Engineer, AI Inference Platform

Senior Software Engineer, AI Inference Platform

CEREBRAS SYSTEMS INC. • Sunnyvale, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About The Role

We are seeking a talented Platform Software Engineer to join the team building the Cerebras Inference Platform. You will be instrumental in designing, developing, and operating the core backend services and APIs that power the Inference platform. You'll build the software that allows customers to seamlessly deploy, manage, and serve inference workloads on dedicated Cerebras hardware.

Responsibilities :

  • Design, build, and maintain the core APIs for the Inference Platform, handling model catalog management, deployment of ML workloads, scaling, and status monitoring.
  • Focus on building platform capabilities that optimize for ease-of-use, robustness, and self-service access to inference models and serving.
  • Collaborate with infrastructure and ML engineering teams to ensure high reliability, uptime, and smooth user interactions with the inference service
  • Design and implement features like multi-tenant support, deployment automation, priority queuing, and caching strategies for user requests.
  • Build robust observability features by integrating with monitoring and telemetry tools (e.g., Prometheus, Grafana) to track system health, performance metrics, and request analytics.

Skills & Qualifications :

  • Bachelor's or Master's degree in computer science or related field, or equivalent practical experience.
  • 5+ years of experience in backend software development, with a focus on service APIs, orchestration platforms, or user-facing infrastructure.
  • Strong proficiency in Python (C++ is good to have).
  • Experience designing, building, and integrating with RESTful APIs and gRPC services
  • Solid understanding of distributed systems concepts such as concurrency, scalability, and fault tolerance
  • Hands-on experience with containerization (Docker) and orchestration frameworks (Kubernetes)
  • Experience with databases and caching systems (e.g., Postgres, Redis).
  • Experience with observability, telemetry pipelines, and system monitoring best practices.
  • Strong problem-solving and debugging abilities
  • Excellent communication and cross-functional collaboration skills
  • Why Join Cerebras

    People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we've reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras :

  • Build a breakthrough AI platform beyond the constraints of the GPU.
  • Publish and open source their cutting-edge AI research.
  • Work on one of the fastest AI supercomputers in the world.
  • Enjoy job stability with startup vitality.
  • Our simple, non-corporate work culture that respects individual beliefs.
  • Read our blog : Five Reasons to Join Cerebras in 2025.

    Apply today and become part of the forefront of groundbreaking advancements in AI!

    Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

    This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

    serp_jobs.job_alerts.create_a_job

    Senior Software Engineer Platform • Sunnyvale, CA, United States

    Job_description.internal_linking.related_jobs
    Senior Platform & AI Engineer

    Senior Platform & AI Engineer

    Adobe • San Jose, California, USA
    serp_jobs.job_card.full_time
    Changing the world through digital experiences is what Adobes all about.We give everyonefrom emerging artists to global brandseverything they need to design and deliver exceptional digital experien...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Software Engineer - AI / LLM Applications (26456)

    Sr. Software Engineer - AI / LLM Applications (26456)

    Supermicro • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer — Data Platform for Fintech AI

    Senior Software Engineer — Data Platform for Fintech AI

    Intuit Inc. • Mountain View, CA, United States
    serp_jobs.job_card.full_time
    A leading fintech company is looking for a Senior Software Engineer to join the Data Exchange team in Mountain View, California. In this role, you will develop scalable systems and improve developer...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer, AI Inference Platform

    Senior Software Engineer, AI Inference Platform

    Cerebras Systems • Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer, AI Inference Platform.Sunnyvale, CA or Toronto, Canada.Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture de...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer, AI for Quantum

    Senior Software Engineer, AI for Quantum

    PsiQuantum • Palo Alto, CA, United States
    serp_jobs.job_card.full_time
    PsiQuantum'smission is to build the first useful quantum computers-machines capable of delivering the breakthroughs the field has long promised. Since our founding in 2016, our singular focus has be...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Contract Senior AI Software Engineer

    Contract Senior AI Software Engineer

    Turing • San Jose, CA, United States
    serp_jobs.job_card.full_time
    A leading AI research accelerator is seeking a contractor with over 5 years of software engineering experience to evaluate AI-generated code and enhance coding solutions. Responsibilities include co...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Senior Generative AI Software Engineer

    Senior Generative AI Software Engineer

    NVIDIA Corporation • Santa Clara, CA, United States
    serp_jobs.job_card.full_time
    Senior Generative AI Software Engineer page is loaded## Senior Generative AI Software Engineerlocations : US, CA, Santa Clara : US, CA, Remotetime type : Full timeposted on : Posted Todayjob re...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer – Holoscan Sensor AI Platform

    Senior Software Engineer – Holoscan Sensor AI Platform

    NVIDIA Corporation • Santa Clara, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer – Holoscan Sensor AI Platform page is loaded## Senior Software Engineer – Holoscan Sensor AI Platformlocations : US, CA, Santa Claratime type : Full timeposted on : Post...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • Hayward, CA, US
    serp_jobs.job_card.full_time
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patients data, processing orders and prescr...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer, AI / ML, AIR

    Senior Software Engineer, AI / ML, AIR

    Google Inc. • Mountain View, CA, United States
    serp_jobs.job_card.full_time
    Google place Mountain View, CA, USA.Bachelor’s degree or equivalent practical experience.Natural Language Processing or Large Language Models. Master's degree or PhD in Computer Science or related t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior Backend Engineer, AI Inference Platform

    Senior Backend Engineer, AI Inference Platform

    Cerebras Systems • Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    A leading AI technology firm based in Sunnyvale is looking for a Senior Software Engineer to work on their AI Inference Platform. The role involves designing and developing backend services and APIs...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer – AI Platform

    Senior Software Engineer – AI Platform

    General Motors of Canada • Mountain View, CA, United States
    serp_jobs.job_card.full_time
    Join the team driving the future of AI innovation at General Motors! Our AI Core Infrastructure team, which sits directly under the Chief AI Officer, is a high-impact team powering GM’s enterprise-...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior AI Platform Engineer — API Leadership & Cloud

    Senior AI Platform Engineer — API Leadership & Cloud

    Cisco Systems, Inc. • San Jose, CA, United States
    serp_jobs.job_card.full_time
    A technological leader in San Jose is searching for a Senior Software Engineer to lead API development for their AI platform. The role requires deep knowledge of Kubernetes, cloud infrastructure, an...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI Software Engineer — Enterprise Workbench

    Senior AI Software Engineer — Enterprise Workbench

    KE Technology • Palo Alto, California, United States
    serp_jobs.job_card.full_time
    An early-stage AI startup in Palo Alto seeks a Senior Software Engineer (AI) to develop an innovative AI workbench that enhances enterprise workflows. This role involves building AI features, applyi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior AI Engineer

    Senior AI Engineer

    SAP • Palo Alto, California, USA
    serp_jobs.job_card.full_time +1
    At SAP we keep it simple : you bring your best to us and well bring out the best in you.Were builders touching over 20 industries and 80% of global commerce and we need your unique talents to help s...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer (Palo Alto)

    Senior Software Engineer (Palo Alto)

    Signify Technology • Palo Alto, CA, US
    serp_jobs.job_card.full_time +1
    Onsite, Palo Alto, CA (5 days per week).A fast-growing startup at the crossroads of.Their mission centers on responsible innovation, developing AI products that are not only powerful but trustworth...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer, AI Infra Compute

    Senior Software Engineer, AI Infra Compute

    TikTok • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer, AI Infra Compute.Be among the first 25 applicants.Design and implement prototypes of key technologies or products. Design and implement core feature improvements.Research a...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI Engineer

    Senior AI Engineer

    LinkedIn • Sunnyvale, California, USA
    serp_jobs.job_card.full_time
    This role will be based in Sunnyvale San Francisco Bellevue or New York City.At LinkedIn our approach to flexible work is centered on trust and optimized for culture connection clarity and the evol...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted