Skip to content
View ameroyer's full-sized avatar
🥐
🥐

Block or report ameroyer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ameroyer/README.md

Amélie Royer  🥐

Deep Learning Research Engineer · Kyutai Labs · Paris, France

Building open-science AI at kyutai.org 🇫🇷 · Previously Qualcomm AI Research 🇳🇱 · PhD from IST Austria (2020) 🇦🇹

Website Twitter Google Scholar


🔬 Research

My work sits at the intersection of computer vision, multimodal learning, and neural network efficiency. I care about building models that are both powerful and practical. I'm very interested in learning from multiple modalities and how to best make use of raw data that is inherently multimodal.

Project Higlights Venue What it is about
🏠 CASA -- Cross-Attention Strikes Back arXiv, 2026 Vision-language model with cross-attention for scalable streaming inference
M👁️shiVis -- Kyutai with an "eye" CVPR 2026 Adds visual understanding to the Moshi speech model with a data-efficient training pipeline
MSViT -- Mixed-Scale Tokenization ICCV NViT Workshop, 2023 Dynamic token scaling for Vision Transformers based on image content
Scalarization for Multi-Task Learning NeurIPS, 2023 Large-scale study of multi-task/domain training dynamics + population-based optimization
Knowledge Distillation: A good teacher is patient and consistent (oral) CVPR, 2022 Patient & consistent teacher = surprisingly strong distillation recipe

→ Full list on my website


📓 Side Projects & Snippets

Personal experiments, small tools, and one-off builds:

  • 🍬 PeperNoten -- A small script to auto-generate a "skimmed-through" Obsidian note in markdown from an arxiv paper.
  • 🎄 Advent of Code -- A list of my past Advent of Code participations
  • 🕵️ Codenames Solo -- solo mode for the Codenames boardgame, powered by GPT and Streamlit
  • 🌊 Glow JAX -- clean JAX/Flax implementation of the Glow generative model

🛠 Tech stack

Python JAX PyTorch Jupyter

Pinned Loading

  1. ameroyer.github.io ameroyer.github.io Public

    Personal websites (CV, publications and blog posts)

    Jupyter Notebook 3

  2. kyutai-labs/moshivis kyutai-labs/moshivis Public

    Kyutai with an "eye"

    Python 241 30

  3. kyutai-labs/casa kyutai-labs/casa Public

    A vision-language model with an improved cross-attention mechanism for scalable streaming inference

    Python 28 3

  4. glow_jax glow_jax Public

    An implementation of the Glow generative model in jax and flax

    Jupyter Notebook 8 1

  5. codenames_solo codenames_solo Public

    A solo mode for Codenames (the boardgame) using streamlit and the OpenAI API

    Python

  6. advent_of_code_2023 advent_of_code_2023 Public

    Advent of Code 2023 - Python solutions

    Jupyter Notebook