Senior AI Developer for Inference Optimization

Drive the future of AI systems as a Senior Developer specializing in inference optimization. Bring your 3+ years of experience to a fast-paced startup environment focused on innovation and quality.

In this role, you will be responsible for fine-tuning our LLM inference engine. Your work will ensure it is both scalable and cost-efficient while you collaborate with a skilled team. High standards and proactive engagement are essential as you elevate our AI capabilities.

Key Responsibilities: • Optimize performance of our LLM inference backend • Collaborate on implementing innovative AI solutions • Ensure scalability and cost-effectiveness of operations • Maintain high standards of quality and reliability • Foster a culture of proactive engagement and teamwork

Requirements: • 3+ years in software engineering required • Familiarity with vLLM, quantization, and optimization techniques • Bachelor’s or Master’s degree from a reputable institution • Strong coding abilities in Python and Kubernetes • Innovative mindset with attention to detail

Join a rapidly expanding AI startup committed to crafting beautiful and efficient AI systems.

Back to blog

Other Jobs To Apply

No other job posts for this day.