Famous or recent Multimodal Large Language Models and related works.
Research about foundation models in robotics.
Recent research on the causal inference in MLLM / LLMs.
Recent research on visual reasoning, especially in the 3D world.
Famous and recent paradigm of multimodal learning, especially leveraging unpaired data.