Working at SambaNova
The Runtime team at SambaNova is a seasoned engineering team with a proven track record of delivering cutting-edge system software solutions for AI and machine learning applications in the enterprise & commercial landscape.
We handle all phases of the software infrastructure that enables higher-level applications, including:
- OS interface/integration
- Data model manipulation for scaling
- Intra-node and inter-node networking and communication
- Orchestration of partitioned workloads
- Error monitoring and general care and feeding of the hardware
We build a high-performance, distributed, and scalable software execution environment for the SambaNova DataScale and SambaSuite platforms to support data-flow applications such as ML training and inference, data processing operations like ETL, and HPC applications.
We are searching for an experienced software engineer who will work on all parts of the runtime stacks for AI, ML, and scientific applications in high-performance distributed systems. You will participate in building, testing and deploying next-generation high-performance compute systems for AI applications at scale. We expect the candidate to have a strong background in programming, building and testing software in distributed systems, performance tuning of large scale systems, and good teamwork and planning skills.
Role Responsibilities
- Work on design and implementation of new and enhanced features of the runtime stack to support high performance and scalable ML training applications
- Work on design and implementation of new and enhanced features of the runtime stack to support low-latency ML inference applications
- Work on system software support for the next generation RDU system
- Provide tools and performance profilers for customers to configure and use the DataScale system
- Work on various virtualization technologies to provide isolation and ease of use to enterprise environments
- Collaborate with other software teams such as ML, Compiler, and DevOps
Basic Qualifications
- Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or an equivalent field, with 5-10 years of industry experience
- Proficiency in C/C++ and Python
Preferred Qualifications
- Experience with operating systems, kernel-space drivers, and user-space libraries
- Experience with different types of fabrics, such as PCIe, InfiniBand, and RoCE
- Experience with fast networking stacks, such as RDMA
- Experience with software bring-up for hardware accelerators
- Good communication skills and enthusiasm to help colleagues
Annual Salary Range and Level
The base salary for this position ranges from $165,000/year up to $210,000/year. This range is based on role, level, and location and reflects the salary target for new hires in the US. Individual pay within the range will depend on a number of factors, including a candidate’s job-related qualifications, skills, competencies and experience, and location.