Senior Kernel Developer
Posted 6 hours 9 minutes ago by Luxoft
Permanent
Full Time
Other
Not Specified, United Kingdom
Job Description
Project description
Luxoft is looking for an AI software development engineer to develop ML kernels in the Triton kernel language. We are looking for an engineer who is passionate about optimizing Machine Learning GPU kernels and improving the performance of key applications and benchmarks. What you do directly impacts the performance of AMD GPUs and enables us to become a competitive solution for generative AI. Become a part of our high-impact and incredibly talented Triton kernels team.
Responsibilities
- Develop ML kernels for matrix multiplication, Flash Attention and other ML operators
- Benchmark, perform competitive analysis and optimize kernels to improve performance
- Collaborate with the GPU architecture team to improve future generations
- Apply knowledge of software engineering best practices
Must have
- Proficiency in C/C++
- Proficiency in CUDA, HIP/ROCm, or OpenCL programming
- Solid understanding of parallel programming models and optimization techniques
- Strong problem-solving skills and the ability to work in a collaborative environment
Nice to have
- Familiarity with models like Llama, Mixtral and Gemma
- Knowledge of MLIR, LLVM, GPU assembly and GPU architecture
- Familiarity with PyTorch or JAX