Leave us your email address and we'll send you all the new jobs according to your preferences.
Principal Machine Learning Engineer - Production Systems
Posted 2 days 3 hours ago by SoftInWay Inc
£100,000 - £125,000 Annual
Permanent
Full Time
Other
Gloucestershire, Bristol, United Kingdom, BS153
Job Description
Senior/Principal ML Systems Architect (TensorFlow + Python) Overview 
We are seeking a highly experienced ML Systems Architect to design and implement a scalable, production-grade architecture for our machine learning solver. This role bridges research prototypes and commercial deployment, ensuring reliability, maintainability, and performance in a mixed technology stack.
Responsibilities- Architect the ML Solver Platform:
- Define modular architecture for data preprocessing, model execution, and post-processing.
- Establish clear API contracts between Python/TensorFlow and C# services.
- Convert research code into robust, testable, and observable services.
- Implement CI/CD pipelines, automated testing, and reproducibility standards.
- Design REST/gRPC endpoints for cross-language communication.
- Ensure compatibility with C#/.NET services.
- Performance & Scalability:
- Optimize GPU/CPU utilization, batching strategies, and memory management.
- Plan for multi-model and multi-tenant scenarios.
- MLOps & Lifecycle Management:
- Implement model versioning, artifact registries, and deployment workflows.
- Set up monitoring, logging, and alerting for solver performance.
- Security & Compliance:
- Apply best practices for secrets management, dependency scanning, and secure artifact storage.
- ML Frameworks: Expert in TensorFlow (TF2/Keras), experience with ONNX Runtime for inference.
- Programming: Advanced Python for ML; strong understanding of packaging, type checking, and performance profiling.
- APIs: Proficiency in gRPC/Protobuf and REST for cross-language integration.
- Performance Optimization: GPU acceleration (CUDA/cuDNN), mixed precision, XLA, profiling.
- Observability: Metrics, tracing, structured logging, dashboards.
- Security: SBOM, image signing, role-based access, vulnerability scanning.
- Experience with ONNX Runtime Training, PyTorch, or hybrid ML architectures.
- Familiarity with distributed training strategies and multi-GPU setups.
- Knowledge of feature stores and data validation frameworks.
- Exposure to regulated environments and compliance frameworks.
- ML: TensorFlow, ONNX Runtime, tf2onnx.
- APIs: FastAPI, gRPC.
- Work on cutting-edge ML solutions integrated into commercial engineering software.
- Define architecture that scales across global deployments.
- Collaborate with a team of experts in ML, software engineering, and UI development.
SoftInWay Inc
Related Jobs
Verkäufer Frischetheke (m/w/d)
- Thüringen, Battgendorf, Germany, 99625
Verkäufer Frischetheke (m/w/d)
- Sachsen-Anhalt, Albersroda, Germany, 06268
Verkäufer Als Fachkraft / Quereinsteiger Frischetheke (m/w/d)
- Thüringen, Weimar, Germany, 99427
Verkäufer Als Fachkraft / Quereinsteiger Frischetheke (m/w/d)
- Sachsen-Anhalt, Magdeburg, Germany, 39118
Verkäufer Als Fachkraft / Quereinsteiger Frischetheke (m/w/d)
- Brandenburg, Nauen, Germany, 14641