Talent.com
Sr. System Engineer/Rack Solution (27694)
Sr. System Engineer/Rack Solution (27694)Supermicro • San Jose, CA, United States
Sr. System Engineer / Rack Solution (27694)

Sr. System Engineer / Rack Solution (27694)

Supermicro • San Jose, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Job Req ID : 27694

About Supermicro :

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary :

As a Sr. System Engineer, you'll be the go-to person to roll out and maintain business critical applications and services for Supermicro. You are also responsible for resolving escalated service issues, coaching other engineers to resolutions, engineering and implementing complex projects. You will be a person who is independent with leadership to drive the technical development and with excellent communication skills.

Essential Duties and Responsibilities :

Includes the following essential duties and responsibilities (other duties may also be assigned) :

  • Execute comprehensive system-level rack tests on latest NVidia and AMD GPUs, ARM-based, Intel Xeon, and AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and reliability testing, leveraging proprietary in-house tools.
  • Establish expertise in HPC / AI applications and benchmarks, delivering impactful training sessions to customers and partners, while addressing complex customer support issues, demonstrating innovative problem-solving skills and building robust processes and procedures for HPC / AI solutions.
  • Conduct proof of concept design and testing, providing optimized benchmarks for HPC / AI applications in a timely manner. Fine-tune BIOS settings, optimize OS / network configurations, and develop diverse simulation configurations to enhance efficiency across various workloads.
  • Deliver on-site deployment services, ensuring customer acceptance verification and providing post-level 1&2 support. Create and maintain technical documentation, including technical notes, blogs, and diagrams, to facilitate knowledge dissemination.
  • Identify and document hardware and software quality issues and collaborate with Product Management and other Engineering teams to integrate customer feedback into future product enhancements.
  • Proactively engage in HPC roadmap development, planning software and hardware upgrades to sustain exceptional HPC infrastructure performance.
  • Document and analyze test plans, reports, logs, and actively contribute to the development of test utilities and automation scripts to streamline testing processes.

Qualifications :

  • BS / MS in Electrical Engineering, Computer Engineering or Computer Science
  • 8+ years of work-related experience in Deep Learning and Machine Learning
  • 8+ years of Linux / networking debugging / testing or relevant experience preferred
  • Experience with leading AI / ML frameworks such as PyTorch, TensorFlow, ONNX, etc.
  • Experience with DevOps or in cloud environments, including but not limited to Docker / Containers and Kubernetes
  • Hands-on experience with workload / scheduler Managers (Slurm) for rack / cluster
  • Familiar with MLPerf Training / Inference benchmark, LLM, HPL-AI or RCCL / NCCL
  • Programming experience with windows and Linux shell scripting
  • Strong sense of teamwork and good team player, strong communication skills
  • Familiar with Intel / AMD / NVIDIA development tool kits such as CUDA, oneAPI, ROCm is a plus
  • Experience with server / network hardware debugging and troubleshooting is a plus
  • CCNA, OpenStack, OpenShift, Azure or AWS is a plus
  • Please note that this position requires regular in-office attendance. The successful candidate is expected to be present in the office during standard working hours as determined by the company. In-office collaboration and participation in team meetings, training sessions, and other on-site activities are essential aspects of this role. Candidates should consider the commuting distance and be prepared to fulfill their responsibilities in the designated office location.

    Salary Range

    $137,000 - $156,000

    The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

    EEO Statement

    Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.

    serp_jobs.job_alerts.create_a_job

    Solution • San Jose, CA, United States

    Job_description.internal_linking.related_jobs
    Sr. Systems Engineer

    Sr. Systems Engineer

    Archer • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Archer is an aerospace company based in San Jose, California building an all-electric vertical takeoff and landing aircraft with a mission to advance the benefits of sustainable air mobility.We are...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. IT System Engineer (Contractor to Hire)

    Sr. IT System Engineer (Contractor to Hire)

    OPPO • Palo Alto, CA, United States
    serp_jobs.job_card.full_time
    IT System Engineer (Contractor to Hire).Be among the first 25 applicants.OPPO US Research Center is seeking a highly skilled and hands‑on IT System Engineer to support our routine business.Provide ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior R&D Engineer

    Senior R&D Engineer

    Synopsys • Mountain View, CA, United States
    serp_jobs.job_card.full_time
    The Senior R&D Engineer is responsible for the deployment and maintenance of cloud-based HPC infrastructure.In this role, the Senior R&D Engineer will use advanced technical and problem-solving ski...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior System Software Engineer - AI Performance and Efficiency Tools

    Senior System Software Engineer - AI Performance and Efficiency Tools

    NVIDIA • Santa Clara, CA, United States
    serp_jobs.job_card.full_time
    A key part of NVIDIA's strength is our sophisticated analysis / debugging tools that empower NVIDIA engineers to improve perf and power efficiency of our products and the running applications.We ar...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr Director, Systems Engineering - Strategic SME

    Sr Director, Systems Engineering - Strategic SME

    Celestica • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Americas, USA, California, San Jose.The role is for a highly accomplished and forward‑thinking Sr Director Technical Engineer for a leading role in a critical customer engagement.This role demands ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr. Solution Engineer

    Sr. Solution Engineer

    CommScope • Sunnyvale, California, USA
    serp_jobs.job_card.full_time
    In our hyper-connected world RUCKUS Networks is redefining how organizations connect communicate and collaborate.Were seeking a Senior Solution Engineer to join our dynamic Solution Engineering tea...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Systems Design Engineer

    Sr. Systems Design Engineer

    KLA • Milpitas, California, USA
    serp_jobs.job_card.full_time
    KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem.Virtually every electronic device in the world is produced using our technologies.No laptop smartphon...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    System Software Engineer, Sr. Staff

    System Software Engineer, Sr. Staff

    SK hynix America Inc. • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Job Title : System Software Engineer, Sr.At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Software Engineer, Healthcare, EHR Systems

    Sr. Software Engineer, Healthcare, EHR Systems

    Talent Search PRO • Pleasanton, CA, United States
    serp_jobs.job_card.full_time
    The Senior Software Engineer, Product Engineering is responsible for designing, developing, and maintaining cloud-based healthcare solutions that power next-generation Electronic Health Record (EHR...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Solution Manager / Rack Solution (27117)

    Sr. Solution Manager / Rack Solution (27117)

    Super Micro Computer • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Supermicro® is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customer...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. System Engineer - GPU Servers (27156)

    Sr. System Engineer - GPU Servers (27156)

    Super Micro Computer Spain, S.L. • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Location : San Jose, California, United States.Supermicro® is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Da...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI System Engineer, Sr. Staff

    AI System Engineer, Sr. Staff

    Sk Hynix America • San Jose, California, United States
    serp_jobs.job_card.full_time
    Job Title : AI System Engineer, Sr.At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data center...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr. Systems Engineer, PHY Standards (Wi-Fi / BT)

    Sr. Systems Engineer, PHY Standards (Wi-Fi / BT)

    Synaptics Inc. • San Jose, CA, US
    serp_jobs.job_card.full_time
    Synaptics is leading the charge in AI at the Edge, bringing AI closer to end users and transforming how we engage with intelligent connected devices, whether at home, at work, or on the move.As the...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30
    Sr. Software Engineer - Engineering Productivity (Fullstack)

    Sr. Software Engineer - Engineering Productivity (Fullstack)

    Reliable Robotics Corporation • Mountain View, CA, United States
    serp_jobs.job_card.permanent
    We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr. Software Engineer, Infra

    Sr. Software Engineer, Infra

    Jerry.ai • Palo Alto, California, USA
    serp_jobs.job_card.full_time +1
    Were building the first AI-powered.From insurance to repairs to road safety were connecting the entire car ownership experience into one mobile-first platform. Our revenue has grown 60x in the last ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. System Engineer

    Sr. System Engineer

    Support Revolution • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Select how often (in days) to receive an alert : Create Alert.San Jose, California, United States.Supermicro® is a Top Tier provider of advanced server, storage, and networking solutions for Data Ce...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior System Software Engineer

    Senior System Software Engineer

    ChargePoint • Campbell, CA, United States
    serp_jobs.job_card.full_time
    With electric vehicles expected to be nearly 30% of new vehicle sales by 2025 and more than 50% by 2040, electric mobility is becoming a reality. ChargePoint (NYSE : CHPT) is at the center of this re...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Engineer, Software

    Sr. Engineer, Software

    Alamar Biosciences, Inc. • Fremont, CA, United States
    serp_jobs.job_card.full_time
    At Alamar, we are passionate about enabling our customers to make scientific discoveries that translate into clinical outcomes and benefit patients. Our team is growing quickly as we develop innovat...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted