Talent.com
AI/ML Computing Cluster Engineer

SK hynix America • San Jose, California, United States
Job type: Full-time

Job Title: AI/ML Computing Cluster Engineer

Office Location: San Jose, CA

Work Model: Onsite

About SK hynix America

At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we advance mobile technology, empower cloud computing, and pioneer future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.

We're looking for innovative minds to join our mission of shaping the future of technology. At SK hynix America, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change – we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to be part of our journey to create the next generation of memory solutions that will define the future of computing.

Job Overview:

As the AI/ML Computing Cluster Engineer, you will develop and operate high-performance computing clusters that support AI/ML workloads. You will be responsible for the development, implementation, operation, and optimization of AI data center IT environments to ensure scalability, performance, reliability, and cost-effectiveness. This role requires collaboration with cross-functional teams to align computing infrastructure with the organization's strategic direction.

Responsibilities:

Computing Cluster Infrastructure Development

  • Design and implement distributed computing cluster infrastructure to support large-scale AI/ML model training and inference jobs, with a focus on transformer-based AI models.
  • Build and maintain distributed systems to ensure scalability, efficient resource allocation, and high throughput.
  • Optimize cluster performance through hardware selection, equipment configuration, network engineering, and performance analysis.
  • Deploy and operate data center networking infrastructure using software systems for automation, design validation, deployment, and operational support.
  • Implement tools and processes to maintain high uptime and ensure infrastructure reliability during both model training and inference phases.
  • Identify and resolve performance bottlenecks, improving overall system throughput and response times.

Team Leadership & Collaboration

  • Collaborate with cross-functional teams, including research, security, and benchmark test engineering teams, to integrate infrastructure with AI workflows, ensuring seamless deployment and operation.
  • Engage with technology vendors and partners to evaluate new solutions to drive innovation in AI computing infrastructure.

Qualifications:

  • Master’s degree or above in Computer Science, Electrical Engineering, or related fields.
  • 2+ years of experience in AI cluster engineering, MLOps, and benchmark testing, including GPU performance analysis, memory usage analysis, and energy/power monitoring tools.
  • Strong familiarity with AI computing architecture, AI/ML infrastructure requirements, memory architecture and usage in AI/ML, and AI algorithm trends and best practices.
  • Expertise in optimizing resource utilization, improving system throughput, and reducing latency in both training and inference.

Equal Employment Opportunity:

SKHYA is an Equal Employment Opportunity Employer. We provide equal employment opportunities to all qualified applicants and employees and prohibit discrimination and harassment of any type without regard to race, sex, pregnancy, sexual orientation, religion, age, gender identity, national origin, color, protected veteran or disability status, genetic information or any other status protected under federal, state, or local applicable laws.

Compensation:

Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. Pay within the provided range varies by work location and may also depend on job-related skills and experience. Your Recruiter can share more about the specific salary range for the job location during the hiring process.

Pay Range: $100,000 - $150,000 USD
