Talent.com
Senior HPC Cluster Systems Administrator
Senior HPC Cluster Systems AdministratorLawrence Berkeley National Laboratory • Berkeley, CA, United States
Senior HPC Cluster Systems Administrator

Senior HPC Cluster Systems Administrator

Lawrence Berkeley National Laboratory • Berkeley, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Berkeley Lab's ( LBNL ) Information Technology Division ( IT ) has an opening for a Senior HPC Cluster Systems Administrator to join their ScienceIT Team !

In this exciting role, you will support the Berkeley Lab research community by building, integrating, and maintaining Linux-based resources, high-performance computing cluster systems, and Kubernetes clusters. This role provides extensive expertise in High Performance Computing infrastructure and delivers advanced Linux solutions to further scientific endeavors at Berkeley Lab. The mission of Scientific Computing under ScienceIT is to facilitate groundbreaking fundamental research globally by providing essential computing tools, networks, and expertise to enable pioneering science.

This position has an anticipated start date of January 5, 2026.

We're here for the same mission, to bring science solutions to the world. Join our team and YOU will play a supporting role in our goal to address global challenges! Have a high level of impact and work for an organization associated with 17 Nobel Prizes!

Why join Berkeley Lab?

We invest in our employees by offering a total rewards package you can count on :

  • Exceptional health and retirement benefits , including pension or 401K-style plans
  • Opportunities to grow in your career - check out our Tuition Assistance Program
  • A culture where you'll belong - we are invested in our teams!
  • In addition to accruing vacation and sick time, we also have an annual Winter Holiday Shutdown
  • Parental bonding leave (for both mothers and fathers)
  • Pet insurance

What You Will Do :

  • Perform Linux system and HPC cluster maintenance and installations, operating system upgrades, system security hardening and intrusion detection, storage and file system management, system hardware, customization of user group working environment, troubleshooting, network monitoring, and crash recovery.
  • Design, deploy, and manage scalable applications using Kubernetes, ensuring the availability, performance, and readiness of the Kubernetes infrastructure.
  • Automate deployment, scaling, and management of containerized applications, and collaborating with DevOps and development teams to streamline CI / CD pipelines.
  • Design, deploy, and manage the global storage platform to ensure high performance, massive scalability, reliability, and future-proof solutions.
  • Support storage technologies such as Lustre, VAST, and networks.
  • Resolve I / O issues related to business applications, including diagnosing and resolving complex storage, Linux, and networking challenges in a fast-paced environment.
  • Research new storage management technologies, techniques, and provide recommendations.
  • Participate in developing system administration, security, and network policies, documentation, and tools oriented towards efficient systems management.
  • Participate in cluster support to staff and researchers, including initial installation, integration, and ongoing maintenance of Linux High-Performance Computing cluster systems. This includes travel to remote sites if as needed.
  • Co-leading technical efforts with other senior system administrators in areas of HPC technologies such as job schedulers, high-performance interconnects, parallel file systems, cybersecurity, cluster management, container orchestration, VM infrastructure, networking, performance tuning, or data center planning.
  • Co-leading group projects of small to medium size and complexity, to implement and deploy new computing technologies and associated services to the research community.
  • What We Are Looking For :

  • A Bachelor's Degree (or equivalent knowledge / training) in Computer Science, Engineering, or a related discipline, and a minimum of 12 years of relevant experience in Linux system administration within a large distributed computing environment, including experience providing systems and end-user support for multiple scientific or computational research groups or an equivalent combination of education and experience.
  • Demonstrated ability to manage large-scale, performance-critical environments, including capacity planning, scaling, and optimization.
  • Significant experience deploying, scaling, and managing Kubernetes clusters, with a strong understanding of its architecture (pods, deployments, services, ingress) and container orchestration. Proven proficiency with CI / CD tools like Jenkins or GitLab CI.
  • Proven experience with Red Hat derivatives (CentOS, Scientific Linux, Rocky Linux), Debian, Ubuntu, and large-scale system and configuration management tools (Kickstart, Ansible, Puppet, Chef, Warewulf). Expertise in supporting standard services (NFS, LDAP, SMB, MySQL, Apache / Nginx HTTPD).
  • Strong HPC expertise, including Linux, job schedulers, high-performance interconnects, parallel file systems, cybersecurity, container orchestration, cluster management, VM infrastructure, networking, performance tuning, scientific application support, and data center planning.
  • Proficiency in Python and Bash for building, optimizing, and debugging scientific codes (C, C++, Fortran, Java), including experience with compilers (GCC, Intel), debuggers, Makefiles, and version-control (git, Subversion).
  • Expertise in storage system design and optimization (Lustre, S3, VAST, Weka, Ceph, DDN), including a deep understanding of the storage stack (kernel to user space, including file systems, block storage, I / O schedulers, VFS), storage benchmarking, and performance tuning (throughput, latency, IOPS, workload-specific optimizations).
  • Excellent oral and written communication skills including experience organizing and presenting customer focused technical data, reports, and projects to audiences with varying degrees of technical expertise.
  • Strong interpersonal skills including experience with research facilitation and project management in a multidisciplinary team environment.
  • Desired Qualifications :

  • An Advanced Degree (or equivalent knowledge / training) in Computer Science, Engineering, or a related discipline.
  • Experience with software engineering and / or software development.
  • Familiarity with Kubernetes-related tools like Helm, Istio, and Prometheus.
  • Demonstrated experience supporting research at a National Lab and / or in an academic or research environment.
  • Additional Information :

  • Application Deadline : For full consideration, please apply with a resume and a cover letter describing your interest by December 19, 2025 .
  • Appointment type : This is a full-time, career appointment, exempt (monthly paid) from overtime pay.
  • Salary Information : This position is expected to pay $178,644 - $218,364 annually, which fits within the full salary range of $158,808 - $267,996 annually for job code C70.4. It is not typical for an individual to be offered a salary at or near the top of the range for a position. Salary for this position will be commensurate with the final candidate's qualification and experience, including skills, knowledge, relevant education, certifications, and aligned with the internal peer group.
  • Background Check : This position may be subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.
  • Work Modality : This position is eligible for a hybrid work schedule - a combination of teleworking and performing work on site at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA 94720. Work schedules are dependent on business needs. Individuals working a hybrid schedule must reside within 150 miles of Berkeley Lab. Starting May 7, a REAL ID or other acceptable form of identification is required to access Berkeley Lab sites (for more information click here ).
  • Relocation : This position is eligible for relocation assistance.
  • Work Authorization : Applicants must be legally authorized to work in the United States. Berkeley Lab does not provide visa sponsorship for this position.
  • Want to learn more about working at Berkeley Lab? Please visit : careers.lbl.gov

    Equal Employment Opportunity Employer : The foundation of Berkeley Lab is our Stewardship Values : Team Science, Service, Trust, Innovation, and Respect; and we strive to build community with these shared values and commitments. Berkeley Lab is an Equal Opportunity Employer. We heartily welcome applications from all who could contribute to the Lab's mission of leading scientific discovery, excellence, and professionalism. In support of our rich global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or other protected categories under State and Federal law.

    Berkeley Lab is a University of California employer. It is the policy of the University of California to undertake affirmative action and anti-discrimination efforts, consistent with its obligations as a Federal and State contractor.

    Misconduct Disclosure Requirement : As a condition of employment, the finalist will be required to disclose if they are subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct, are currently being investigated for misconduct, left a position during an investigation for alleged misconduct, or have filed an appeal with a previous employer.

    serp_jobs.job_alerts.create_a_job

    System Administrator • Berkeley, CA, United States

    Job_description.internal_linking.related_jobs
    Remote IT Systems Administrator : Cloud & Storage Expert

    Remote IT Systems Administrator : Cloud & Storage Expert

    Kimball Electronics • San Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    A global electronics company is seeking an IT Systems Administrator to manage infrastructure systems.Key responsibilities include administration, Level II support, and business collaboration.Candid...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    IT Systems Engineer - East

    IT Systems Engineer - East

    Omada Health • South San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Candidates must reside on the East Coast in the U.Omada Health is on a mission to inspire and engage people in lifelong health, one step at a time. As an IT Systems Engineer, you will play a critica...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Director, Data and AI Architecture Leader

    Senior Director, Data and AI Architecture Leader

    Dynavax Technologies • Emeryville, CA, United States
    serp_jobs.job_card.full_time
    This position can be 100% remote, but must be located in the United States.Dynavax is a commercial-stage biopharmaceutical company developing and commercializing novel vaccines to help protect the ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    FAR.AI • Berkeley, California, United States
    serp_jobs.job_card.full_time
    AI is a non-profit AI research institute dedicated to ensuring advanced AI is safe and beneficial for everyone.Our mission is to facilitate breakthrough AI safety research, advance global understan...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior HPC Cluster Systems Administrator

    Senior HPC Cluster Systems Administrator

    Lawrence Berkeley National Laboratory • Berkeley, CA, United States
    serp_jobs.job_card.full_time
    Information Technology Division (.Senior HPC Cluster Systems Administrator to join their.In this exciting role, you will support the Berkeley Lab research community by building, integrating, and ma...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Systems Administrator

    Systems Administrator

    Addison Group • South San Francisco, California, United States
    serp_jobs.job_card.full_time
    Systems Administrator – Biotech | South San Francisco (Onsite).Monday–Friday, 8 : 00 AM – 5 : 00 PM (flexible start by ±30 mins). This position is eligible for medical, dental, vision, and 401(k).A rapi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Staff Systems Engineer

    Staff Systems Engineer

    Bio-Rad Laboratories • Hercules, CA, United States
    serp_jobs.job_card.full_time
    Working within Bio-Rad's Life Science R&D Group as a Systems Engineer, you will take engineering concepts, requirements and transform them into functional prototypes and finished products that impr...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Director of Nursing ($5,000 Sign on Bonus)

    Director of Nursing ($5,000 Sign on Bonus)

    The Terraces at Los Altos - a HumanGood community • Sausalito, CA, US
    serp_jobs.job_card.full_time +1
    Terraces at Los Altos, a distinguished HumanGood life plan community, is seeking a Director of Nursing (DON) for its Health Center team. Under limited supervision, the DON plans, directs, organizes,...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Solution Director, Employee Experience Solutions - Healthcare Growth

    Solution Director, Employee Experience Solutions - Healthcare Growth

    PG Forsta • Emeryville, CA, United States
    serp_jobs.job_card.full_time
    PG Forsta is the leading experience measurement, data analytics, and insights provider for complex industries-a status we earned over decades of deep partnership with clients to help them understan...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Administrator of Business Operations - College of Pharmacy

    Administrator of Business Operations - College of Pharmacy

    Touro University • Vallejo, CA, United States
    serp_jobs.job_card.full_time
    The Administrator of Business Operations supports the College of Pharmacy's mission by managing day-to-day fiscal and administrative operations. Working independently and collaboratively with Colleg...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    System Administrator

    System Administrator

    Machaon Diagnostics • Berkeley, CA, United States
    serp_jobs.job_card.full_time
    Machaon Diagnostics is a clinical reference laboratory and contract research organization (CRO) that focuses on diagnosing, treating, and monitoring hemostatic and thrombotic conditions, complement...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Surgical Technologist II - Emeryville

    Surgical Technologist II - Emeryville

    Stanford Health Care • Emeryville, CA, United States
    serp_jobs.job_card.full_time
    If you're ready to be part of our legacy of hope and innovation, we encourage you to take the first step and explore our current job openings. Your best is waiting to be discovered.Rotating - 10 Hou...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior IT Systems Administrator - Desktop & Network Support

    Senior IT Systems Administrator - Desktop & Network Support

    Shiva IT Services • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    An established industry player is seeking a dedicated IT Support Specialist to join their dynamic team.In this role, you will provide Tier 1 and 2 desktop user support, ensuring smooth operations f...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Registry Data Systems Analyst- Remote - 136400

    Registry Data Systems Analyst- Remote - 136400

    UC San Diego Health • Richmond, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    This position is limited to California Residents and may require travel to Richmond and / or Sacramento, California.UCSD Layoff from Career Appointment. Apply by 8 / 27 / 2025 for consideration with prefe...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Relativity Senior Systems Administrator

    Senior Relativity Senior Systems Administrator

    CGS Federal (Contact Government Services) • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Senior Relativity Senior Systems Administrator.We are seeking a Senior Relativity Sr.Systems Administrator to join our team. You will handle a variety of projects to support and improve the organiza...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    IT Systems Engineer Manager

    IT Systems Engineer Manager

    Scale AI, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Scale AI is seeking an experienced IT Systems Engineering Manager to lead the design, development, and operation of our expanding SaaS and infrastructure ecosystem. In this role, you'll have the opp...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Facility Administrator

    Facility Administrator

    Davita Inc. • El Sobrante, CA, United States
    serp_jobs.job_card.full_time
    San Pablo Damn RdSuite C-D, El Sobrante, California, 94803-7218, United States of America.As a Healthcare Operations Manager (Facility Administrator) at DaVita, you'll be a part of a Team that valu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    IT Systems Administrator I

    IT Systems Administrator I

    Pinterest • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted