System Development Engineer, AGI Infrastructure
Posted 2 hours 57 minutes ago by Amazon
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive engineer to play a pivotal role in the development and maintenance of industry-leading multi-modal and multi-lingual large language models (LLMs). AGI's mission is to leverage our hyper-scalable, general-purpose large model training and inference systems to develop and deploy cutting-edge sensory AI foundational models that revolutionize machine perception, interpretation, and interaction with humans and the physical world.
We believe in "Work Hard. Have Fun. Make History" by focusing on sharing learning experiences from the front line with the development teams. The options for people in the team are vast: master a domain, juggle multiple tasks during incidents, love process and methodical improvement, or keep your head down, headphones on, and bash out code to support the team. You will be required to deeply understand technology landscapes, evaluate new technologies, influence within your team, work with peers and senior leaders to define and revise operational excellence standards, tackle abstract issues spanning multiple functional areas, and drive improvements that can scale across other teams, services, and platforms.
Key Responsibilities
- Identify performance bottlenecks in compute infrastructure and propose solutions.
- Mentor junior members of the team to deliver results.
- Provide support for cluster and node management, ensuring smooth operation of GenAI infrastructure.
- Participate in design and code reviews and identify bottlenecks.
- Troubleshoot and research root causes thoroughly and fix defects.
- Continuously improve and automate cluster, capacity, and maintenance upgrades.
Qualifications
- 3+ years of administrative experience in networking, storage systems, operating systems, and hands-on systems engineering.
- Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, or Rust.
- Experience with Linux/Unix.
- Experience with CI/CD pipelines and build processes.
- Well versed in core AWS services, including EC2, Lambda, EKS, etc.
- Experience setting up and managing CI/CD pipelines using AWS CodePipeline, GitHub Actions, or similar platforms.
- Familiarity with Infrastructure as Code (IaC) tools like AWS CloudFormation, Terraform, or the AWS CDK.
- Understanding of networking concepts such as VPC, subnets, security groups, Load Balancers, and Route 53.
- Hands-on experience with Kubernetes.
- Experience with distributed systems at scale.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.