Talent.com
Software Engineer - Supercomputing Platform & Infrastructure
Software Engineer - Supercomputing Platform & InfrastructureMagic AI Corp. • New York, NY, United States
Software Engineer - Supercomputing Platform & Infrastructure

Software Engineer - Supercomputing Platform & Infrastructure

Magic AI Corp. • New York, NY, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Magic's mission is to build safe AGI that accelerates humanity's progress on the world's most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role :

As a Software Engineer on our Supercomputing Platform & Infrastructure team, you will design and build resilient and optimized solutions for AI workloads on massive Computing Clusters.

What you might work on :

  • Work closely with the training and inference teams to deliver high performance and reliability across storage, networking, and distributed computing designs.
  • Build the software stack to run massive-scale (thousands of GPUs), highly available supercomputing infrastructure
  • Troubleshoot and resolve complex issues across hardware accelerated devices, networking, storage subsystems (local NVMe / Block Storage / NFS), OS, drivers and cloud environments, and automate detection and recovery processes
  • Operate data-intensive workloads at petabyte-scale
  • Increase the ease-of-use and self-serviceability of the compute platforms at Magic through top-notch documentation and developer workflow design
  • Investigate and resolve incidents across security and availability

What we're looking for :

  • Experience working with production GPU deployments, data-intensive applications, large-scale model training and HPC
  • Strong understanding of networking-, storage- and data-related technologies
  • Experience with GCP, AWS, Azure, OCI or similar cloud platforms
  • Strong software engineering skills
  • Strong IaC knowledge with extensive experience in Terraform, Pulumi, AWS CDK / CloudFormation or similar
  • Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

    Our culture :

  • Integrity. Words and actions should be aligned
  • Hands-on. At Magic, everyone is building
  • Teamwork. We move as one team, not N individuals
  • Focus. Safely deploy AGI. Everything else is noise
  • Quality. Magic should feel like magic
  • Compensation, benefits and perks (US) :

  • Annual salary range : $225K - $550K
  • Equity is a significant part of total compensation, in addition to salary
  • 401(k) plan with 6% salary matching
  • Generous health, dental and vision insurance for you and your dependents
  • Unlimited paid time off
  • Visa sponsorship and relocation stipend to bring you to SF, if possible
  • A small, fast-paced, highly focused team
  • serp_jobs.job_alerts.create_a_job

    Software Engineer Infrastructure • New York, NY, United States

    Job_description.internal_linking.related_jobs
    Infrastructure Software Engineer, Public Sector

    Infrastructure Software Engineer, Public Sector

    Scale AI, Inc. • New York, NY, United States
    serp_jobs.job_card.full_time
    Scale AI is seeking a highly skilled and motivated.Software Engineer, AI Infrastructure & Security.Public Sector Engineering team. As a part of this team, you will play a critical role in delivering...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Platform Engineer - Elasticsearch or Solr / Fusion

    Platform Engineer - Elasticsearch or Solr / Fusion

    Galent DBA Diamond Pick • New York City, NY, US
    serp_jobs.job_card.full_time
    Role : Platform Engineer – Elasticsearch or Solr / Fusion Location : NYC, NY – Need to go onsite from Day 1 – Hybrid 3 Days / Week Contract Looking for : Platform Engineers who manage configure, Authent...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Infrastructure and DevOps Engineer

    Infrastructure and DevOps Engineer

    Axelon Services Corporation • Jersey City, NJ, US
    serp_jobs.job_card.full_time
    Global Financial Firm located in Jersey City, NJ has an immediate contract opportunity for an experienced professional.Infrastructure and DevOps Engineer. Hybrid (expected in the office weekly 3 day...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Cloud Infrastructure Engineer

    Sr. Cloud Infrastructure Engineer

    Align Communications • New York, New York, United States
    serp_jobs.job_card.full_time
    Is technology your passion? Do you want to work with smart, forward-thinking individuals? Do you want to grow in career you love?. At Align, our professionals are the key to our success.We don’t jus...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Infrastructure Engineer

    ML Infrastructure Engineer

    Vantai • New York, NY, United States
    serp_jobs.job_card.full_time
    VantAI pairs bleeding-edge machine learning techniques with deep systems biology expertise to build computational models that uncover hidden relationships between molecules, targets, and diseases.T...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    DevOps Engineer

    DevOps Engineer

    Constrafor • New York, New York, United States
    serp_jobs.job_card.full_time
    Constrafor is a SaaS and fintech platform purpose-built for construction.We are setting new standards of productivity and cost-efficiency for the way General Contractors and Subcontractors manage p...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    Knot • New York, New York, United States
    serp_jobs.job_card.full_time
    Knot’s mission is to empower consumers and businesses alike with connected merchant and banking experiences.Knot is like “Plaid for merchant connectivity. We are building the platform connecting mer...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr. Infrastructure Engineer

    Sr. Infrastructure Engineer

    Infotrack Us • New York, New York, United States
    serp_jobs.job_card.full_time
    InfoTrack is a platform that seamlessly connects law firms to the courts and to the services that they need to litigate successfully. We're global leaders in legal technology with unparalleled exper...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior / Staff Infrastructure Engineer

    Senior / Staff Infrastructure Engineer

    Privy • New York, New York, United States
    serp_jobs.job_card.full_time
    As our first infrastructure engineer at Privy, you will help define what it means to write and deploy code to tens of millions of users across thousands of customers, all on a rapidly growing engin...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Build & Release Infrastructure Engineer

    Build & Release Infrastructure Engineer

    Rokt • New York, New York, United States
    serp_jobs.job_card.full_time
    We are Rokt, a hyper-growth ecommerce leader.Rokt is the global leader in ecommerce, unlocking real-time relevance in the moment that matters most. Rokt’s AI Brain and ecommerce Network powers billi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Infrastructure

    Software Engineer, Infrastructure

    Opal Security • New York, New York, United States
    serp_jobs.job_card.full_time
    Opal is redefining identity security for modern enterprises.The concept of least privilege access is well understood in theory but very hard in practice. We've all felt the pain of not getting the a...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    Blacksmith • New York, New York, United States
    serp_jobs.job_card.full_time
    At Blacksmith, we provide cloud infra to help companies run their CI (GitHub Actions) substantially faster and cheaper.Our mission is to build a CI cloud. Our bet is that CI, as a class of workloads...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    Endex • New York, New York, United States
    serp_jobs.job_card.full_time
    Over the next few years, every financial institution will have teams of AI analysts working alongside their sharpest minds. At Endex, we're on a mission to bridge the present to the inevitable by bu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Forward Deployed DevOps / Infrastructure Engineer

    Senior Forward Deployed DevOps / Infrastructure Engineer

    TriEdge Investments • New York, New York, United States
    serp_jobs.job_card.full_time
    At TriEdge Investments, we build world-class technology to drive value creation across a portfolio of 30+ companies.Our approach is deliberate and staged : we start by delivering bespoke solutions t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Agents Infrastructure

    Software Engineer, Agents Infrastructure

    Anthropic • New York, New York, United States
    serp_jobs.job_card.full_time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Forward Deployed Infrastructure Engineer

    Forward Deployed Infrastructure Engineer

    Teleskope • New York, New York, United States
    serp_jobs.job_card.full_time
    Teleskope is redefining data security for the AI era with the only dedicated platform that combines precise visibility with automated remediation. Teleskope continuously scans, catalogs, and classif...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior IT & Cloud Infrastructure Engineer

    Senior IT & Cloud Infrastructure Engineer

    Drawbridge Partners • New York, New York, United States
    serp_jobs.job_card.full_time
    Senior IT & Cloud Infrastructure Engineer.At Drawbridge, we are committed to attracting and retaining the best individuals who enjoy working in a dynamic environment. You will be joining an agile te...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, AI Infrastructure

    Software Engineer, AI Infrastructure

    Fireworks Ai • New York, New York, United States
    serp_jobs.job_card.full_time
    At Fireworks, we’re building the future of generative AI infrastructure.Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry.We’ve been inde...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted