About BSC
The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, was a founding and hosting member of the former European HPC infrastructure PRACE (Partnership for Advanced Computing in Europe), and is now hosting entity for EuroHPC JU, the Joint Undertaking that leads large-scale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 1000 staff from 60 countries.
Look at the BSC experience:
BSC-CNS YouTube Channel
Let's stay connected with BSC Folks!
We are particularly interested for this role in the strengths and lived experiences of women and underrepresented groups to help us avoid perpetuating biases and oversights in science and IT research. In instances of equal merit, the incorporation of the under-represented sex will be favoured.
We promote Equity, Diversity and Inclusion, fostering an environment where each and every one of us is appreciated for who we are, regardless of our differences.
If you consider that you do not meet all the requirements, we encourage you to continue applying for the job offer. We value diversity of experiences and skills, and you could bring unique perspectives to our team.
Context And Mission
The Language Modeling Team at the newly created AI Institute hosted at BSC has consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan governments with the mission to develop fundamental open- source resources and technologies for Spanish and Catalan. In connection with this, the LM team is currently in charge of two flagship projects at the national and regional level: the ALIA project, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department. In addition, the Unit participates in various EU funded international projects. In this context, the newly created AI Institute is looking for talent to help design and improve its operations and infrastructure.
The Language Modeling Team is looking for candidates with a background in computational linguistics with experience in Language Technologies, specifically in Deep Learning and large language model building, and possibly in other areas of Natural Language and Speech Processing.
The successful candidate will work in a highly sophisticated HPC environment, have access to state-of-the-art systems and computational infrastructures, and establish collaborations with experts in different areas at the local and international levels.
The researcher will implement innovative techniques for language modelling and evaluation in the HPC environment.
Key Duties
- Execute post-training experiments using Instruction Tuning and modern Preference Alignment (DPO, GSPO, DAPO, CISPO) focused on Latin American Spanish variants, Brazilian Portuguese and indigenous languages.
- Implement training workflows with frameworks such as TRL, FastChat, and NeMo-RL to adapt models to low-resource and regional linguistic contexts.
- Develop cultural and linguistic benchmarks to evaluate model performance across different LAC regions including Brazil; ensuring the capture of regional idioms and social contexts.
- Curate and preprocess datasets for indigenous languages and regional variants, implementing strategies for effective learning in low-resource scenarios.
- Collaborate with the senior team to ensure that global models maintain previous capabilities while improving local linguistic and cultural accuracy.
Requirements
- Education
- Degree in Computer Science, Mathematics or related.
- Essential Knowledge and Professional Experience
- Solid experience in Fine-tuning and Post-training of Large Language Models.
- Proficiency in Python and Deep Learning libraries such as PyTorch.
- Hands-on experience with modern alignment frameworks like TRL, FastChat, or NeMo-RL.
- Familiarity with NLP evaluation metrics and benchmarking methodologies.
- Excellent level of English (written and spoken).
- Additional Knowledge and Professional Experience
- Experience in working with Brazilian Portuguese, dialectal variations of Spanish, or low-resource languages.
- Knowledge of advanced preference alignment methods such as GSPO, DAPO, or CISPO.
- Understanding of cultural bias evaluation and sociolinguistics in the LAC context.
- Fluency in Spanish is highly valued.
- Knowledge of any indigenous language of the Americas or co-official languages of Spain is a significant plus.
- Competences
- High sensitivity to cultural and linguistic diversity and its technical implications.
- Strong analytical skills for identifying performance gaps in non-hegemonic linguistic variants.
- Ability to work effectively in a collaborative and international research environment.
- Proactive attitude towards solving challenges related to data scarcity.
Conditions
- The position will be located at BSC within the Directors Department
- We offer a full-time contract (35h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance, support to the relocation procedures
- Duration: Open-ended contract due to technical and scientific activities linked to the project and budget duration
- Holidays: 22 days of holidays + 6 personal days + 24th and 31st of December per our collective agreement
- Salary: we offer a competitive salary commensurate with the qualifications and experience of the candidate and according to the cost of living in Barcelona
- Starting date: ASAP
Applications procedure and process
All applications must be submitted via the BSC website and contain:
- A full CV in English including contact details
- A cover/motivation letter with a statement of interest in English, clearly specifying for which specific area and topics the applicant wishes to be considered. Additionally, two references for further contacts must be included. Applications without this document will not be considered.
Development of the recruitment process
The selection will be carried out through a competitive examination system ("Concurso-Oposición"). The recruitment process consists of two phases:
- Curriculum Analysis: Evaluation of previous experience and/or scientific history, degree, training, and other professional information relevant to the position. - 40 points
- Interview phase: The highest-rated candidates at the curriculum level will be invited to the interview phase, conducted by the corresponding department and Human Resources. In this phase, technical competencies, knowledge, skills, and professional experience related to the position, as well as the required personal competencies, will be evaluated. - 60 points. A minimum of 30 points out of 60 must be obtained to be eligible for the position.
The recruitment panel will be composed of at least three people, ensuring at least 25% representation of women.
In accordance with OTM-R principles, a gender-balanced recruitment panel is formed for each vacancy at the beginning of the process. After reviewing the content of the applications, the panel will begin the interviews, with at least one technical and one administrative interview. At a minimum, a personality questionnaire as well as a technical exercise will be conducted during the process.
The panel will make a final decision, and all individuals who participated in the interview phase will receive feedback with details on the acceptance or rejection of their profile.
At BSC, we seek continuous improvement in our recruitment processes. For any suggestions or comments/complaints about our recruitment processes, please contact recruitment [at] bsc [dot] es.
For more information, please follow this link.
Deadline
The vacancy will remain open until a suitable candidate has been hired. Applications will be regularly reviewed and potential candidates will be contacted.
OTM-R principles for selection processes
BSC-CNS is committed to the principles of the Code of Conduct for the Recruitment of Researchers of the European Commission and the Open, Transparent and Merit-based Recruitment principles (OTM-R). This is applied for any potential candidate in all our processes, for example by creating gender-balanced recruitment panels and recognizing career breaks etc.
BSC-CNS is an equal opportunity employer committed to diversity and inclusion. We are pleased to consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or any other basis protected by applicable state or local law.
For more information follow this link