Munich, Germany



Enhance our ML/LLMOps and AI Agent infrastructure for model evaluation, testing, and deployment within an AWS environment. Evaluate and implement different LLM frameworks, AI agent architectures, and 3rd party APIs. Design and experiment with AI agent evaluation systems to measure reasoning quality, reliability, and consistency. Support the development and automation of AI agent systems and Retrieval-Augmented Generation (RAG) pipelines. Document your findings in our internal knowledge base Contribute to the development, improvement, and automation of our machine learning and data pipelines Gain experience in industry critical infrastructure technologies and apply them productively