Multimodal AI Intern (Audio-Visual AI) - PhD Level
Staines-upon-Thames
Join a leading AI research team driving innovation in next-generation mobile technologies.
We're looking for a PhD-level intern or recent graduate to work on cutting-edge audio-visual AI solutions that will shape the future of smart, on-device intelligence.
You'll collaborate with world-class researchers and engineers, helping turn novel machine learning concepts into production-ready software for intelligent mobile platforms.
What You'll Do:
Develop and prototype innovative solutions in multimodal on-device AI (audio + video).
Research and implement methods such as contrastive learning, model compression, or multimodal LLMs.
Tackle real-world challenges with efficient, scalable code using PyTorch or TensorFlow.
Work within a high-impact team and contribute to research publications and internal reports.
What We're Looking For:
PhD student or recent graduate in ML/AI, Computer Science, Engineering, or a related field.
First-author publications in top AI/ML venues (CVPR, NeurIPS, ICML, ICLR, etc.).
Strong skills in Python and/or C/C++, and hands-on experience with modern ML frameworks.
Familiarity with Git and sound software engineering practices.
Excellent communication and problem-solving abilities.
Bonus Points For:
Experience in emotion recognition, foundational face models, or deception detection.
Knowledge of multi-task learning, embedded AI, or distributed ML systems.
Contributions to open-source ML libraries.
Expertise in AI pipeline optimization and profiling.