Leave us your email address and we'll send you all the new jobs according to your preferences.

ML RL & Optimization Scientist for Scalable LLMs

Posted 20 hours 54 minutes ago by Huawei Technologies Research & Development (UK) Ltd

Permanent
Full Time
Other
London, United Kingdom
Job Description

Huawei Technologies Research & Development (UK) Ltd is looking for a skilled professional to research and develop advanced machine learning systems focused on scaling reinforcement learning and optimization infrastructure. You will design and execute RL workflows to enhance machine learning capabilities while managing large-scale distributed training efforts across GPU clusters.

The ideal candidate will have a strong background in PyTorch and RLHF methods, equipped to thrive in a dynamic environment where research intersects with production. Competitive benefits are included.

Email this Job