hi, i am raphaelavalos

> i do research, mostly rl, planning & llms

> currently fine-tuning llms @cohere

> education = {
phd candidate @ailab_vub @fwo,
meng cs @telecom,
msc applied math @mva_ens,
msc data science @eurecom
}

> tools = [linux, vim (not a master yet), python, jax]

> workshop_organization = [EWRL23, ALA23, ALA24]

> full_resume

selected_papers

[ICLR24] The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphael Avalos*, Florent Delgrange*, Ann Nowé, Guillermo Perez, Diederik M Roijers
[ pdf ] [ bibtex ]

[RLC24] Online Planning in POMDPs with State-Requests
Raphael Avalos, Eugenio Bargiacchi, Ann Nowé, Diederik M Roijers, Frans Oliehoek
[ pdf ] [ bibtex ]

[TMLR23] Local Advantage Networks for Multi-Agent Reinforcement Learning in Dec-POMDPs
Raphael Avalos, Mathieu Reymond, Ann Nowé, Diederik M Roijers
[ pdf ] [ bibtex ]

news

[2025-03-10]
new website
[2024-11-18]
started fine-tuning @cohere

contact_me

firstname@lastname.fr