Tallinn, Estonia

We are seeking highly motivated MSc or PhD students as interns to work on video generation and multimodal video foundation models. Interns will focus on one or more components of the foundation model lifecycle and are encouraged to propose creative, research-driven ideas that advance the state of the art. You will contribute to the development and improvement of open-source video foundation models, analyze their limitations, and design scalable solutions. This is a research-focused internship with opportunities to publish at top-tier computer vision and machine learning conferences and to work with petabyte-scale video datasets on distributed clusters of thousands of GPUs.