2026 - Intern AI Research Engineer - Networking & Inference Gateway

Posted 2 hours 30 minutes ago by Huawei Ireland Research Center

Permanent
Full Time
Apprenticeships & Internships Jobs
Dublin, Ireland
Job Description

Intern AI Research Engineer - Huawei Cloud Inference Serving Team

We are looking for an Intern AI Research Engineer to join the Huawei Cloud Inference Serving Team and help push the boundaries of distributed AI inference performance and reliability. In this role, you won't just be writing code; you will be solving some of the most complex and exciting challenges in distributed inference serving. Your work will directly impact the latency, efficiency, security, and stability of our cloud-based AI services.

Responsibilities
  • Develop and optimize core components of high-performance, low-latency AI inference gateway systems
  • Implement request routing and load balancing mechanisms to efficiently distribute traffic across available resources
  • Build networking optimizations and connection management systems to minimize latency and maximize throughput
  • Create observability and monitoring tools for real-time performance analysis and system health tracking
  • Develop REST and gRPC APIs that integrate seamlessly with existing ML infrastructure
  • Write well-tested, maintainable code and participate in code reviews with the team
Requirements
  • Currently pursuing a BS/MS in Computer Science, Computer Engineering, or a related field
  • Strong programming fundamentals with experience in systems languages (Rust, Go, C++, or Java)
  • Solid understanding of networking concepts including HTTP, TCP/IP, load balancing, and distributed systems
  • Experience with concurrent programming, async patterns, and multi-threaded applications
  • Knowledge of API development, web services, and service-oriented architectures
  • Understanding of performance optimization techniques and system profiling
Nice to Have
  • Understanding of transformer architectures and internals, ideally with hands-on experience in model serving infrastructure, quantization techniques, or building APIs around large language models
  • Hands-on experience with the Rust programming language and its async ecosystem
  • Familiarity with gRPC, Protocol Buffers, and high-performance RPC frameworks
What You'll Gain

This internship provides deep hands-on experience with modern systems programming and large-scale distributed networking infrastructure. You'll work closely with experienced engineers who will guide you through complex technical challenges in building production AI serving systems. The role offers valuable exposure to cutting-edge technologies like Rust async programming, high-performance networking, and observability systems, giving you skills that are highly valued across the industry. You'll contribute to real systems that handle production traffic while learning best practices in reliability, performance, and scalable system design.