Software Engineer V

Duration:  6 Months

Pay Rate: 95-100/HR. on W2

The team co-optimizes both software (e.g., algorithms, models and numerics) and hardware (e.g., platform and network) to come up with a balanced system design. Developing new systems requires understanding performance bottlenecks on existing systems. Therefore, the team also works on optimizing performance of AI models running in production. This has resulted in TCO wins for all key AI services.

The team makes impact by working on both pathfinding and productization: it builds prototypes to demonstrate the value of the new ideas, and it works closely with many partner teams to bring them into production.

  • Software libraries, models, and frameworks for training systems
  • GPU optimizations with CUDA
  • Model co-design for scalability and efficiency of production recommender and content understanding models. - Specifically Generative AI use cases like LargeLanguageModels and DiffusionModels.

Very specific skills that are a must:

  • GPU performance optimizations
  • CUDA programming
  • C++

*Must haves* Experience with some subset of the following HPC systems software: * Accelerator (GPU/ASIC) kernel development and optimization (e.g., NVIDIA, AMD, Intel, or other misc accelerator), *Extensive programming experience. Knowledge of C/C++ and Python

Nice to Have* * numerical libraries (e.g., mixed precision linear algebra, tensor-based frameworks), performance enablement, tracing, profiling and debugging. * Scientific computing or other forms of HPC with an AI/ML/DL emphasis

There will be 2 interviews:

  • Technical/Coding
  • Behavioral/team fit