The Runtime team at Sambanova is a seasoned engineering team with a proven track record of delivering cutting-edge system software solutions for AI and machine learning applications in the enterprise & commercial landscape.
Runtime is responsible for the lowest levels of the SambaNova stack, above the hardware. We handle all phases of software infrastructure to enable the higher level apps, including:
- OS interface/integration
- Data model manipulation for scaling
- Networking/communication intra and inter node
- Orchestration of partitioned workloads
- Error monitoring and general care and feeding of the hardware.
We build a high performance, distributed and scalable software execution environment for SambaNova DataScale & SambaSuite platform(s) to support data-flow applications e.g. ML training and inference, data processing operations like ETL, and HPC applications.
We are searching for experienced software engineer candidates that will work on all parts of the Runtime infrastructure. That will include drivers, kernel modules, and userspace libraries. The candidate will participate in building, testing and deploying next-generation high-performance compute systems for AI applications at scale. We expect the candidate to have a strong background in programming, building and testing software in distributed systems, performance tuning of large scale systems, and good teamwork and planning skills.
Likely Work Responsibilities
- Build and enhance infrastructure for high performance ML training and inference.
- System software (drivers and kernel) support for the next generation silicon.
- User-facing tools (analysis, job management, profiling etc) for Datascale systems.
- Virtualization, for isolation and ease of use in multi-tenant environments.
- Collaborate with other software teams; ML, Compiler, DevOps.
Preferred Skills & Qualifications
- Experience with operating system, kernel space driver and user space library
- Experience with communication fabrics, such as RDMA, PCIe, Infiniband and RoCE
- Experience with software bringup for custom hardware
- Good communication skills and enthusiasm to help colleagues
Annual Salary Range and Level
The base salary for this position ranges from $155,000/year up to $185,000/year. This range is based on role, level, and location and reflects the salary target for new hires in the US. Individual pay within the range will depend on a number of factors, including a candidate’s job-related qualifications, skills, competencies and experience, and location.
#LI-FK1
#LI-FK1