Network Production Engineer, Network Infrastructure
The Network Infrastructure team is responsible for designing, building and operating one of the largest networks in the world. Networking is at the core of all Meta products and experiences, and we are looking for Production Engineers who are interested in solving complex technical challenges in the Backbone or Datacenter Network domains. Production Network Engineers at Meta are hybrid software and network engineers who keep reliability and scalability in mind as they work on different parts of the lifecycle (designing, building, and operating our worldwide network). This role offers an opportunity to solve problems ranging from the scaling challenges of supporting billions of people using our family of apps to the cutting-edge challenges of AI workloads that power new Meta products.
Responsibilities
- Conceive, develop, and deploy systems and tools to keep the network running reliably and efficiently
- Establish and implement global best practices and contribute to the design of new scalable network solutions
- Partner with network hardware, software, and vendor teams to define and develop network platforms
- Participate in an on-call rotation to support the global network infrastructure, and analyze data to diagnose and identify root causes of network issues
- Lead projects to address hard technical challenges, contribute directly to roadmaps, and partner with the best engineers in the industry to develop reliable and scalable solutions
- Help onboard new team members while working with other technical leads and managers to make the team a success
Minimum Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- 7+ years of relevant experience developing scalable and reliable systems and/or networks
- Experience coding in higher-level languages (e.g., Python, C++, Go, Rust, etc.)
- Experience learning software, frameworks and APIs
- Engineering degree, or a related technical discipline or equivalent experience
- Experience in configuration and maintenance of network devices and NMS systems, or applications such as web servers, load balancers, relational databases, storage systems and messaging systems
- Experience developing and understanding network device configuration for at least one vendor (Juniper, Cisco, Arista, Brocade, etc.)
Preferred Qualifications
- Expert knowledge of TCP/IP and IPv6
- Protocol knowledge in one or more of BGP, MPLS, ISIS or similar routing protocols - knowledge in typical configurations and performance tuning
- Understanding of routing and switching - hardware design and knowledge of forwarding and data planes
- Understanding of RDMA congestion control mechanisms on IB and RoCE Networks
- Understanding of AI training workloads and demands they exert on networks
- Understanding of the latest artificial intelligence (AI) technologies