Dheeraj Varghese
AI Researcher, VIS Lab

I’m currently working on combining discrete diffusion and autoregression for multilingual multimodal models with Mohammad Mahdi Derakhshani and Cees Sneok at the VIS lab. Previously, for my thesis, I explored curriculum learning in Visual Language Models, supervised by Yuki Asano.
My experience with multimodality has made me curious to explore a few roads to improve efficiency in current models. I’d love to explore ‘Long context adaptation’ as a means to bring test and train closer; briding the gap and reducing the computational overhead of transformers. Also, I find hippocampal-cortical interactions very interesting 🧐
news
Mar 11, 2025 | Co-organized a hackathon for the First Workshop on Structure & Generalization in Multimodal Language Understanding (SAGE-MLU 2025) |
---|---|
May 22, 2024 | Gave a lecture on Vision-Language Models ✌ |
Mar 21, 2024 | Will be a Teaching Assistant for Natural Language Processing at VU! |
Mar 15, 2024 | Attended the ELLIS Winter School on Foundation Models |
Nov 10, 2023 | Will be a Teaching Assistant for Learning Machines at VU 🤖 |
latest posts
Mar 28, 2023 | ClipCap Evolved |
---|