Foundation models

Foundation models break free from predefined and restricted ontologies, enabling them to perceive a wider range of elements in their surroundings. By pretraining these foundation models in an unsupervised way, we can adapt them to a diverse set of tasks.

Selected publications

  1. 2026

  2. Driving on Registers
    Ellington Kirby, Alexandre Boulch, Yihong Xu, Yuan Yin, Gilles Puy, Éloi Zablocki, Andrei Bursuc, Spyros Gidaris, Renaud Marlet, Florent Bartoccioni, Anh-Quan Cao, Nermin Samet, Tuan-Hung VU, Matthieu Cord
    CVPR 2026
  3. Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
    Shashanka Venkataramanan, Valentinos Pariza, Mohammadreza Salehi, Lukas Knobel, Spyros Gidaris, Elias Ramzi, Andrei Bursuc, Yuki M. Asano
    CVPR 2026
  4. MAD: Motion Appearance Decoupling for efficient Driving World Models
    Ahmad Rahimi, Valentin Gerard, Éloi Zablocki, Matthieu Cord, Alexandre Alahi
    CVPR 2026
  5. NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering
    Loick Chambon, Paul Couairon, Éloi Zablocki, Alexandre Boulch, Nicolas Thome, Matthieu Cord
    CVPR 2026Highlight
  6. Boosting Visual Instruction Tuning with Self-Supervised Guidance
    Sophia Sirko-Galouchenko, Monika Wysoczanska, Andrei Bursuc, Nicolas Thome, Spyros Gidaris
    preprint 2026
  7. CLIP’s Visual Embedding Projector is a Few-shot Cornucopia
    Mohammad Fahes, Tuan-Hung Vu, Andrei Bursuc, Patrick Pérez, Raoul de Charette
    WACV 2026
  8. GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
    Éloi Zablocki*, Valentin Gerard*, Amaia Cardiel, Eric Gaussier, Matthieu Cord, Eduardo Valle
    TMLR 2026Featured Certification
  9. 2025

  10. DINO-Foresight: Looking into the Future with DINO
    Efstathios Karypidis, Ioannis Kakogeorgiou, Spyros Gidaris, and Nikos Komodakis
    NeurIPS 2025
  11. JAFAR: Jack up Any Feature at Any Resolution
    Paul Couairon*, Loick Chambon*, Louis Serrano, Jean-Emmanuel Haugeard, Matthieu Cord, Nicolas Thome
    NeurIPS 2025
  12. Learning to Steer: Input-dependent Steering for Multimodal LLMs
    Jayneel Parekh, Pegah Khayatan, Mustafa Shukor, Arnaud Dapogny, Alasdair Newson, Matthieu Cord
    NeurIPS 2025
  13. FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
    Yasser Benigmim, Mohammad Fahes, Tuan-Hung Vu, Andrei Bursuc, Raoul de Charette
    ICCV 2025
  14. MoSiC: Optimal-Transport Motion Trajectories for Dense Self-Supervised Learning
    Mohammadreza Salehi*, Shashanka Venkataramanan*, Ioana Simion, Efstratios Gavves, Cees G. M. Snoek, Yuki M Asano (* equal contribution)
    ICCV 2025
  15. VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
    Florent Bartoccioni, Elias Ramzi, Victor Besnier, Shashanka Venkataramanan, Tuan-Hung Vu, Yihong Xu, Loick Chambon, Spyros Gidaris, Serkan Odabas, David Hurych, Renaud Marlet, Alexandre Boulch, Mickael Chen, Éloi Zablocki, Andrei Bursuc, Eduardo Valle, Matthieu Cord
    CoRL Workshop 2025
  16. Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
    Efstathios Karypidis, Ioannis Kakogeorgiou, Spyros Gidaris, and Nikos Komodakis
    CVPR 2025
  17. Test-Time Contrastive Concepts for Open-World Semantic Segmentation
    Monika Wysoczańska, Antonin Vobecky, Amaia Cardiel, Tomasz Trzciński, Renaud Marlet, Andrei Bursuc, Oriane Siméoni
    TMLR 2025
  18. LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension
    Amaia Cardiel, Éloi Zablocki, Elias Ramzi, Oriane Siméoni, Matthieu Cord
    ICLR 2025
  19. MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
    Spyros Gidaris, Andrei Bursuc, Oriane Siméoni, Nikos Komodakis, Matthieu Cord, Patrick Pérez
    TMLR 2024 and ICLR 2025
  20. 2024

  21. A Concept-Based Explainability Framework for Large Multimodal Models
    Jayneel Parekh, Pegah Khayatan, Mustafa Shukor, Alasdair Newson, Matthieu Cord
    NeurIPS 2024
  22. DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut
    Paul Couairon, Mustafa Shukor, Jean-Emmanuel Haugeard, Matthieu Cord, Nicolas Thome
    NeurIPS 2024
  23. No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
    Walter Simoncini, Spyros Gidaris, Andrei Bursuc, Yuki M. Asano
    NeurIPS 2024
  24. The BRAVO Semantic Segmentation Challenge Results in UNCV2024
    Tuan-Hung Vu, Eduardo Valle, Andrei Bursuc, Tommie Kerssies, Daan de Geus, Gijs Dubbelman, Long Qian, Bingke Zhu, Yingying Chen, Ming Tang, Jinqiao Wang, Tomáš Vojíř, Jan Šochman, Jiří Matas, Michael Smith, Frank Ferrie, Shamik Basu, Christos Sakaridis, Luc Van Gool
    ECCV 2024
  25. Reliability in Semantic Segmentation: Can We Use Synthetic Data?
    Thibaut Loiseau, Tuan-Hung Vu, Mickael Chen, Patrick Pérez, Matthieu Cord
    ECCV 2024
  26. UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction
    Lan Feng, Mohammadhossein Bahari, Kaouther Messaoud Ben Amor, Éloi Zablocki, Matthieu Cord, Alexandre Alahi
    ECCV 2024
  27. A Simple Recipe for Language-guided Domain Generalized Segmentation
    Mohammad Fahes, Tuan-Hung Vu, Andrei Bursuc, Patrick Pérez, Raoul de Charette
    CVPR 2024
  28. OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
    Sophia Sirko-Galouchenko, Alexandre Boulch, Spyros Gidaris, Andrei Bursuc, Antonin Vobecky, Patrick Pérez, Renaud Marlet
    CVPR Workshop WAD 2024
  29. Three Pillars improving Vision Foundation Model Distillation for Lidar
    Gilles Puy, Spyros Gidaris, Alexandre Boulch, Oriane Siméoni, Corentin Sautier, Patrick Pérez, Andrei Bursuc, Renaud Marlet
    CVPR 2024
  30. BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds
    Corentin Sautier, Gilles Puy, Alexandre Boulch, Renaud Marlet, Vincent Lepetit
    3DV 2024
  31. 2023

  32. POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
    Antonin Vobecky, Oriane Siméoni, David Hurych, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic
    NeurIPS 2023
  33. PØDA: Prompt-driven Zero-shot Domain Adaptation
    Mohammad Fahes, Tuan-Hung Vu, Andrei Bursuc, Patrick Pérez, Raoul de Charette
    ICCV 2023
  34. ALSO: Automotive Lidar Self-supervision by Occupancy estimation
    Alexandre Boulch, Corentin Sautier, Björn Michele, Gilles Puy, Renaud Marlet
    CVPR 2023
  35. 2022

  36. What to Hide from Your Students: Attention-Guided Masked Image Modeling
    Ioannis Kakogeorgiou, Spyros Gidaris, Bill Psomas, Yannis Avrithis, Andrei Bursuc, Konstantinos Karantzalos, and Nikos Komodakis
    ECCV 2022
  37. 2021

  38. Online Bag-of-Visual-Words Generation for Unsupervised Representation Learning
    Spyros Gidaris, Andrei Bursuc, Gilles Puy, Nikos Komodakis, Patrick Pérez, and Matthieu Cord
    CVPR 2021
  39. 2020

  40. Learning Representations by Predicting Bags of Visual Words
    Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, and Matthieu Cord
    CVPR 2020