Updates

News

Highlights from the Vision Lab and INSAIT — papers, releases, media, and group news.

2026

INSAIT announces 17 papers at CVPR 2026, its strongest computer-vision presence so far. The accepted work spans robotics, spatial reasoning, egocentric vision, earth observation, visual agents, and agentic vision.

Read coverage

Nikolay Nikolov presents INSAIT’s latest Physical AI work at Google Cloud Day Sofia, focusing on systems that scale beyond robots trained only from demonstrations.

Read coverage

Bulgarian government ministers visit INSAIT and meet the Computer Vision group leadership, including Luc Van Gool, to discuss the institute’s next stage of development and strategic role for Bulgaria.

Read coverage

Asen Nachkov discusses autonomous-vehicle research and DiffSim Trinity in Bulgarian media. The work studies differentiable simulation as a path toward safer, more data-efficient autonomous driving.

Read coverage

Lorenzo Venturoli joins INSAIT for a Master’s thesis in 3D computer vision, bringing experience in 3D semantics, SLAM, and multimodal benchmarks.

Read coverage

EgoNight is released at ICLR 2026 as a benchmark for nighttime egocentric vision. The project introduces aligned day-night videos, nighttime VQA, depth estimation, and cross-illumination retrieval to test vision systems beyond daytime conditions.

Read coverage

Anna-Maria Halacheva discusses Articulate3D, robotics, and her research path on the Superhuman podcast with Georgi Nenov.

Read coverage

VOID is launched in collaboration with Netflix as an open model for video object and interaction deletion. The model removes objects while reconstructing physically plausible scene dynamics rather than only filling pixels.

Read coverage

Hongyu An joins INSAIT as a PhD student in Computer Vision with Luc Van Gool and Jinjin Gu. His work focuses on image/video super-resolution, restoration, and generative diffusion models.

Read coverage

Mengshun Hu joins INSAIT as a postdoctoral researcher in Computer Vision. His research focuses on video restoration, space-time video super-resolution, and emerging VideoAgent systems.

Read coverage

INSAIT announces 8 accepted ICLR 2026 papers, placing Bulgaria first in Eastern Europe by accepted publications at the conference. Several of the accepted works connect to vision, multimodal AI, and embodied reasoning.

Read coverage

Taewoo Kim joins INSAIT as a postdoctoral researcher in Computer Vision and computational imaging. His work connects event-based cameras, low-light enhancement, video generation/editing, and agentic AI.

Read coverage

INSAIT launches the CVPR 2026 Workshop on Agentic AI for Visual Media, led by Jinjin Gu and Lei Sun in collaboration with Snap and Adobe. The workshop focuses on AI systems that reason, plan, and execute multi-step visual-media workflows.

Read coverage

Dannong Xu joins INSAIT as a PhD student in Computer Vision, working on multimodal learning, large language models, and visual reasoning.

Read coverage

Asen Nachkov discusses DiffSim Trinity and safer autonomous driving on Bloomberg TV Bulgaria, explaining how differentiable simulation supports more reliable planning.

Read coverage

Asen Nachkov explains autonomous vehicles and real-world decision-making on Bulgarian National Radio, highlighting the role of simulation and prediction in driving systems.

Read coverage

Ruibo Ming joins INSAIT as a PhD student in the Computer Vision group, working on video generation, autonomous driving, 3D reconstruction, and multimodal systems.

Read coverage

2025

Anna-Maria Halacheva discusses INSAIT’s CSRankings success and AI talent development on bTV, following the institute’s rise in European computer-science rankings.

Read coverage

INSAIT reaches 13th in Europe and 1st in Eastern Europe in CSRankings, with computer vision ranked 6th in Europe among the institute’s tracked research areas.

Read coverage

Nikola Popovic presents INSAIT’s large-scale 3D dataset work in a Bulgarian National Radio interview, explaining why open 3D data matters for language-aware scene understanding.

Read coverage

INSAIT releases SceneSplat-49k, the largest open-source 3D Gaussian Splatting scene collection, together with a benchmark for language-aware 3D AI systems.

Read coverage

INSAIT attends NeurIPS 2025 with 7 accepted papers and multiple workshop activities, including work connected to world models, video AI, and interactive environments.

Read coverage

Prof. Angela Yao visits INSAIT and meets the Computer Vision group, sharing perspectives from her work in human-centric and visual AI research.

Read coverage

StateSpaceDiffuser is accepted to NeurIPS 2025 as a diffusion-based world model for long-context spatial consistency. The work is led by Nedko Savov with collaborators including Deheng Zhang, Danda Paudel, and Luc Van Gool.

Read coverage

Cheng Tang joins INSAIT as a Computer Vision intern with Luc Van Gool, focusing on RL-driven multimodal reasoning enhancement.

Read coverage

Letian Shi joins INSAIT as a doctoral student in the Computer Vision group, working with Luc Van Gool and Danda Paudel.

Read coverage

INSAIT presents 3 accepted robotics papers at IROS 2025, including work on autonomous-vehicle policies trained with differentiable simulation.

Read coverage

Nikolay Nikolov presents SPEAR-1 and robotic foundation models on bTV, describing how 3D understanding can improve robot behavior in the physical world.

Read coverage

SPEAR-1 is featured on Bulgarian National Television as a robotics foundation model built around spatial reasoning and 3D understanding.

Read coverage

Jan-Nico Zaech gives an ICCV 2025 workshop keynote on robotic AI and industrial transfer, highlighting how foundation data can support more capable robots.

Read coverage

Stefan Ailuro joins INSAIT as a PhD student in Computer Vision, working with Luc Van Gool and Danda Paudel.

Read coverage

INSAIT announces 13 ICCV 2025 papers and a major computer-vision presence in Hawaii, including work across 3D vision, robotics, perception, and multimodal AI.

Read coverage

INSAIT co-organizes the OpenSUN3D challenges at ICCV 2025 with Stanford, ETH Zurich, Google, NVIDIA, Microsoft, and TUM, advancing open-world 3D scene understanding.

Read coverage

INSAIT’s robotics team presents at CoRL 2025 and qualifies for the Real Robot Challenge, bringing the group’s embodied-AI work into a leading robotics venue.

Read coverage

Generalist Robot Manipulation Beyond Action Labeled Data is accepted to CoRL 2025. The work studies how robot policies can learn from broader visual and 3D data, not only action-labeled demonstrations.

Read coverage

GAIA 2025 takes place in Sofia, bringing together researchers and industry around geospatial AI, foundation models, and Earth observation.

Read coverage

INSAIT announces GAIA 2025, the first international symposium on Geospatial AI and Applications with Foundation Models, hosted in Sofia.

Read coverage

Xiaoye Wang joins INSAIT for a long-term research visit in 3D computer vision, working with Luc Van Gool and Danda Paudel.

Read coverage

INSAIT opens Google co-funded PhD positions in egocentric vision and multimodal LLMs, supporting research on first-person perception, streaming multimodal models, and real-world visual understanding.

Read coverage

Yuanqi Yao joins INSAIT as a PhD student in Computer Vision and Robotics, focusing on embodied AI, robotic manipulation, and vision-language-action models.

Read coverage

Google gives INSAIT $150,000 to support multimodal LLM research under Luc Van Gool, extending the institute’s work in vision-language and egocentric AI.

Read coverage

Berke Gokmen joins INSAIT for a long-term research visit in Computer Vision and AI, working on 3D scene generation and generative vision models.

Read coverage

INSAIT co-organizes the OpenSUN3D Workshop at ICCV 2025, focused on open-vocabulary and open-world 3D scene understanding.

Read coverage

INSAIT co-organizes the Physics-IQ Challenge at ICCV 2025, a benchmark for evaluating whether generative models produce physically plausible videos.

Read coverage

Prof. Richard Hartley visits INSAIT and meets the Computer Vision group, giving a talk on modeling probability distributions in data manifolds using diffusion.

Read coverage

INSAIT co-organizes the NeurIPS 2025 Workshop on Embodied World Models for Decision-Making, connecting simulation, video-language-action models, robotics, and autonomous driving.

Read coverage

GaussianVLM is released as a 3D vision-language model for understanding immersive Gaussian-splat scenes reconstructed from ordinary video.

Read coverage

Zhendong Li joins INSAIT as a doctoral student in Computer Vision, working with Luc Van Gool and Danda Paudel.

Read coverage

INSAIT welcomes the 2025 AI Summer Research Fellows, selected from more than 4,000 applicants, with projects spanning computer vision, robotics, foundational models, and trustworthy AI.

Read coverage

INSAIT starts a joint research program with MIT CSAIL, expanding collaboration across frontier AI, systems, and related research directions.

Read coverage

ObjectRelator wins 2nd place in the Ego-Exo4D Challenge at the CVPR 2025 EgoVis Workshop, advancing cross-view object relation understanding for egocentric and exocentric video.

Read coverage

INSAIT’s work on multi-modal semantic segmentation under sensor failures wins Best Paper at the CVPR 2025 TMM-OpenWorld Workshop. The paper studies robustness when sensors are missing or noisy in real-world multimodal systems.

Read coverage

INSAIT presents 7 accepted papers at CVPR 2025, strengthening Bulgaria’s presence at the leading computer-vision conference.

Read coverage

SceneSplat is released for instant 3D object discovery from text in photorealistic Gaussian-splat scenes. The system enables open-vocabulary interaction directly in 3D space.

Read coverage

EarthMind is released as a foundation model for multimodal Earth observation and geospatial AI, combining visual, radar, and multi-spectral satellite data.

Read coverage

Exploration-Driven Generative Interactive Environments is accepted to CVPR 2025. The work introduces an auto-exploration pipeline for training generative interactive environments without human demonstrations.

Read coverage

News

2026

2025

Research

Group

Resources