Tsinghua University

dongyh20/README.md

Hi there 👋

🔭 I’m currently working on visual perception, and my long-term goal is to build general foundation models.

⚡ Recently I've been focusing on vision-language models and unified visual models.

📫 If you are also interested in these topics, feel free to chat with me!

Pinned

  1. Oryx-mllm/Oryx (Public)

    MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

    Python · 294 stars · 14 forks

  2. Octopus (Public)

    🐙 Octopus, an embodied vision-language model trained with RLEF that excels at embodied visual planning and programming.

    Python · 274 stars · 19 forks

  3. EvolvingLMMs-Lab/lmms-eval (Public)

    Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.

    Python · 2.2k stars · 178 forks

  4. Insight-V (Public)

    Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

    Python · 122 stars · 4 forks

  5. Chain-of-Spot (Public)

    Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

    Python · 89 stars · 6 forks