Raphael Avalos

[NEW PAPER]

dymo paper (combining policy and world modeling to predict action success for agentic llms) on arxiv

[NEW PAPER]

shiq paper (value-based alg for llms) on arxiv

> i do research, mostly rl, planning & llms

> currently loading thesis manuscript...

> previously worked on agentic llms @cohere

> education = {
phd candidate @ailab_vub @fwo,
meng cs @telecom,
msc applied math @mva_ens,
msc data science @eurecom
}

> tools = [linux, python, jax, vim (not mastered yet)]

> workshop_organization = [EWRL23, ALA24, ALA25]

> full_resume

selected_papers

[ICLR24] The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphael Avalos*, Florent Delgrange*, Ann Nowé, Guillermo Perez, Diederik M Roijers
[ pdf ] [ bibtex ]

[RLC24] Online Planning in POMDPs with State-Requests
Raphael Avalos, Eugenio Bargiacchi, Ann Nowé, Diederik M Roijers, Frans Oliehoek
[ pdf ] [ bibtex ]

[TMLR23] Local Advantage Networks for Multi-Agent Reinforcement Learning in Dec-POMDPs
Raphael Avalos, Mathieu Reymond, Ann Nowé, Diederik M Roijers
[ pdf ] [ bibtex ]

news

[2025-05-20]

shiq paper (value-based alg for llms) on arxiv

[2025-04-01]

command a technical report published on arxiv

[2025-03-10]

new website

[2024-11-18]

started fine-tuning @cohere

contact_me

firstname@lastname.fr

hi, i am raphaelavalos

> looking for work on rl, llms or other cool ml projects
