Current Statistics
1,748,663 Total Jobs 393,294 Jobs Today 17,936 Cities 222,695 Job Seekers 146,729 Resumes |
|
|
|
|
|
|
Software Engineering Manager - Triton Inference Server - Santa Clara California
Company: Karkidi Location: Santa Clara, California
Posted On: 05/04/2024
We are looking for Software Engineering Manager to lead the development efforts for the Triton Inference Server team! Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to recommenders to large language models. We are a fast-paced, agile team building tools and software to make design and deployment of new deep learning models easier and accessible to more inference solution providers and data scientists.In this role, you will manage an engineering team designing, developing, and optimizing software that streamlines AI inferencing. Ideal candidates will not only have experience leading an agile, system software engineering team, but also motivated to push the boundaries of what is possible with AI inferencing on both CPUs and GPUs. If this sounds exciting, we would love to hear from you!What you'll be doing: - Lead, mentor, and grow the Triton engineering team and be responsible for planning and execution of projects as well as the quality and performance of the Triton Inference Server.
- Work closely with Product and Program Management to establish feature roadmaps and coordinate project dependencies; load-balance asynchronous requests across available resources; and collaborating on all feature designs.
- Engage with internal and external partners and costumers to understand their use cases and requirements.What we need to see:
- Masters or PhD or equivalent experience in Computer Science, computer architecture, or related field.
- 8+ years of overall experience in developing customer facing software.
- 3+ years of experience recruiting, training, and leading software engineering teams.
- Strong fundamentals in building and deploying cloud services using HTTP REST, gRPC, protobuf, and related technologies.
- Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design. Emphasis on clean and SOLID object-oriented programming principles are a plus.
- Experience running a large open source project - use of GitHub, bug tracking, branching and merging code, OSS licensing issues handling patches, etc.
- Experience with agile software development practices is a requirement, including familiarity with tools such as JIRA and AHA.Ways to stand out from the crowd:
- Experience working in a globally distributed organization.
- Experience with machine learning algorithms and frameworks. Especially experience frameworks such as TensorFlow, PyTorch, ONNX, TensorRT, OpenVino, and vLLM.
- Good knowledge of CPU and/or GPU hardware architecture.
|
|
|
|
|
|
|