Browse All Jobs
Job Description

Scale AI is looking for a Network Engineer to join their Infrastructure team in San Francisco. The Network Engineer will be responsible for designing, securing, and scaling the network infrastructure that powers Scale AI’s global operations. He will own the architecture and implementation of scalable, redundant, and secure Layer 2 and Layer 3 networks to support massive data transfer, including multi-gigabit connectivity and direct cloud integrations. This role requires a deep understanding of datacenter best practices, structured cabling, secure routing, and network automation. He will work across teams to ensure the infrastructure remains performant, fault-tolerant, and secure.

The role involves:

  • Designing, building, and maintaining high-performance, multi-site network infrastructure supporting 10–100Gbps+ throughput.
  • Leading physical and logical network deployments in both enterprise and colocation environments.
  • Defining and enforcing secure network segmentation, routing policy, firewall zones, and access control models.
  • Configuring and managing Layer 2/3 connectivity, including VLANs, link aggregation (LAG/LACP), and dynamic routing protocols (BGP, OSPF, etc.).
  • Implementing telemetry and observability systems for real-time performance monitoring and alerting.
  • Managing the deployment and provisioning of switches, optics, cabling, and rack-level power infrastructure.
  • Collaborating with hardware and software teams to support scalable, fault-tolerant data capture and upload workflows.
  • Using Infrastructure-as-Code tools such as Terraform to manage cloud and network infrastructure.
  • Troubleshooting performance bottlenecks, carrier handoffs, and hardware-level issues using tools like iperf, ethtool, tcpdump, mtr, and nmap.
  • Interfacing with datacenter providers and carriers to coordinate cross-connects and bandwidth services.
  • Ensuring compliance with best practices in datacenter operations, structured cabling, airflow containment, etc.

Requirements:

  • Proven experience designing and managing high-throughput networks in office and datacenter environments.
  • Deep knowledge of switching, routing, and structured cabling best practices across L2/L3 protocols.
  • Experience with network hardware from multiple vendors (Cisco, Juniper, etc.); Junos or IOS familiarity preferred.
  • Strong understanding of network security principles: MACsec, VPNs, firewall policy, Zero Trust segmentation.
  • Hands-on experience with 10/25/40/100GbE hardware, SFP/QSFP transceivers, and fiber/copper cabling.
  • Experience automating network provisioning and configuration using scripting (e.g., Python) and version-controlled workflows.
  • Understanding of cloud networking and edge integrations across cloud providers; AWS experience is a plus.
  • Strong debugging and troubleshooting skills, from physical layer issues to protocol misconfigurations.
  • Familiarity with datacenter deployment and vendor environments (e.g., Equinix, Digital Realty, etc.).
  • Excellent documentation and communication skills with both technical and cross-functional teams.

Scale AI offers:

  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • A learning and development stipend
  • Generous PTO
Apply Manually

Scale AI

Scale AI accelerates the development of AI applications across industries. The company's products power advanced language models, generative models, and computer vision models. Scale AI serves generative AI companies, government agencies, and enterprises, assisting organizations in building and deploying AI. Committed to inclusivity and equal opportunity, Scale AI fosters professional growth, offering opportunities to contribute to cutting-edge AI projects and collaborate with experts in the field.

All Jobs at Scale AI (200)