AI Inference Engineer
Posted 6 days 18 hours ago by Pantera Capital
We are seeking an AI Inference Engineer to join our expanding team. Our current technology stack includes Python, C++, TensorRT-LLM, and Kubernetes. This role offers the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Responsibilities
- Develop APIs for AI inference used by internal and external clients
- Benchmark and optimize bottlenecks in our inference stack
- Enhance system reliability and observability, and manage system outages
- Research and implement optimizations for LLM inference
Qualifications
- Experience with ML systems and deep learning frameworks such as PyTorch, TensorFlow, ONNX
- Knowledge of LLM architectures and inference optimization techniques (e.g., batching, quantization)
- Experience deploying scalable, reliable, real-time model serving systems
- (Optional) GPU architecture understanding or CUDA programming experience
The compensation range for this role is $190,000 - $240,000.
At Perplexity, we have experienced significant growth since launching the world's first conversational answer engine over a year ago. Our AI-powered search assistant now has 10 million monthly active users, with over 1 million app installs across iOS and Android. In 2023, we processed over 500 million user queries globally.
We have raised substantial funding, including a $73.6 million Series B in January 2024 led by IVP with participation from NVIDIA, Jeff Bezos' fund, NEA, Databricks, and others. In April 2024, we closed a $62.7 million Series B1 led by Daniel Gross, valuing Perplexity at over $1 billion.
Our investor network includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, among others.
Final offer amounts depend on experience and expertise and may vary from listed figures.
Equity is included as part of the total compensation package, in addition to base salary.
Benefits include comprehensive health, dental, and vision insurance for employees and dependents, along with a 401(k) plan.