Call reference: 20260619_MDU
Closing date: 28/06/2026
We are seeking a postdoc to join the Vision, Language and Reading group at the Computer Vision Center (CVC), in Barcelona, Spain.
The position is initially for 1 year and linked to the project “Multimodal LLMs for Document Undestanding” (MuDocU), financed by the Spanish Ministry of Science.
Proyecto PID2023-146426NB-100 financiado por MICIU/AEI /10.13039/501100011033 y por FEDER, UE.
The successful candidate is expected to participate in large-scale training efforts, research on multimodal pre-training and finetuning methods, and applications on the specific use case of Document Understanding.
KEY DUTIES
- Lead and contribute to specific Work Packages (WPs) within the project, ensuring timely delivery of objectives and milestones.
- Conduct cutting-edge research on multimodal large language models (LLMs), and manage the publication of research findings in top-tier conferences and journals
- Prepare and submit proposals for high-performance computing (HPC) resources and manage allocated compute efficiently.
- Optimize training pipelines for scalability and reproducibility.
- Contribute to group mentoring activities such as reading groups or internal seminars.
- Actively collaborate with internal group members and external project partners
CANDIDATE ’S PROFILE
The candidate should possess a PhD in machine learning or computer vision, or be in the final stage of their PhD with a scheduled or imminent thesis defense. A strong publication record is required. We are looking for candidates who have publications in top conferences like ICDAR, CVPR, ECCV, ICCV, AAAI, NeurIPS.
The candidate should have a strong background in Large Language Models and experience in the document image analysis field. Experience in industry will be considered a strong asset.
The applicants are expected to be fluent in both oral and written communication in English. They should work well in a team while demonstrating initiative and independence. The candidate is expected to co-supervise PhD students.
CONDITIONS
- Location: Computer Vision Center (Campus Universitat Autònoma de Barcelona)
- Gross annual salary: 30.000 €
- Starting date: July or September 2026
THE COMPUTER VISION CENTER
The selected candidate will work in the Computer Vision Centre (CVC), Barcelona, a research institute comprising more than 150 researchers and support staff, dedicated to computer vision research and knowledge transfer. With a strong international projection and links to the industry, the Computer Vision Centre offers an exciting environment for scientific career development. The Computer Vision Centre has a plan for expansion of its permanent research staff base and has received the “HR Excellence in Research” award as a provider and supporter of a stimulating and favourable working environment.
Barcelona is a vibrant city and an important Artificial Intelligence hub. The high quality of life is combined with an open and international looking character of the city. Barcelona is very well connected by air, sea and ground transportation. The region of Catalonia boosts its own AI strategy, in which the CVC is a key player.
APPLICATION PROCESS
All applications must be sent through the online form indicating the offer code 20260619_MDU.
OTM-R Principles for Recruitment Processes
CVC is committed to the principles of Open, Transparent and Merit-based Recruitment (OTM-R) across all its recruitment processes. In 2015, we were awarded the HR Excellence in Research seal by the European Commission.