Skip to content
Change the repository type filter

All

    Repositories list

    • SEED-Story: Multimodal Long Story Generation with Large Language Model
      Python
      Other
      5571170Updated Sep 29, 2024Sep 29, 2024
    • Open-MAGVIT2: Democratizing Autoregressive Visual Generation
      Python
      Apache License 2.0
      2462740Updated Sep 27, 2024Sep 27, 2024
    • Official Code for MotionCtrl [SIGGRAPH 2024]
      Python
      Apache License 2.0
      711.3k240Updated Sep 20, 2024Sep 20, 2024
    • ST-LLM

      Public
      [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
      Python
      Apache License 2.0
      411290Updated Sep 10, 2024Sep 10, 2024
    • mllm-npu

      Public
      mllm-npu: training multimodal large language models on Ascend NPUs
      Python
      Apache License 2.0
      27820Updated Aug 29, 2024Aug 29, 2024
    • MasaCtrl

      Public
      [ICCV 2023] Consistent Image Synthesis and Editing
      Python
      Apache License 2.0
      26717212Updated Aug 19, 2024Aug 19, 2024
    • Plot2Code

      Public
      Python
      21500Updated Aug 17, 2024Aug 17, 2024
    • PhotoMaker [CVPR 2024]
      Jupyter Notebook
      Other
      7489.4k1403Updated Aug 15, 2024Aug 15, 2024
    • GFPGAN

      Public
      GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
      Python
      Other
      5.9k36k34623Updated Jul 26, 2024Jul 26, 2024
    • CustomNet

      Public
      Python
      Apache License 2.0
      926261Updated Jul 22, 2024Jul 22, 2024
    • BrushNet

      Public
      [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
      Python
      Other
      1141.4k431Updated Jul 17, 2024Jul 17, 2024
    • ViT-Lens

      Public
      [CVPR 2024] ViT-Lens: Towards Omni-modal Representations
      Python
      Other
      1015530Updated Jul 2, 2024Jul 2, 2024
    • T2I-Adapter
      Python
      2013.4k836Updated Jun 21, 2024Jun 21, 2024
    • SmartEdit

      Public
      Official code of SmartEdit [CVPR-2024 Highlight]
      Python
      823780Updated Jun 21, 2024Jun 21, 2024
    • InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
      Python
      Apache License 2.0
      3243.1k962Updated Jun 20, 2024Jun 20, 2024
    • LLaMA-Pro

      Public
      [ACL 2024] Progressive LLaMA with Block Expansion.
      Python
      Apache License 2.0
      34470230Updated May 20, 2024May 20, 2024
    • NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
      Python
      Other
      1838771Updated May 14, 2024May 14, 2024
    • BTS

      Public
      BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
      Other
      02540Updated Apr 16, 2024Apr 16, 2024
    • UMT

      Public
      UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
      Python
      Other
      1818910Updated Apr 15, 2024Apr 15, 2024
    • BEBR

      Public
      Official code for "Binary embedding based retrieval at Tencent"
      Python
      Apache License 2.0
      14220Updated Mar 7, 2024Mar 7, 2024
    • DeSRA

      Public
      Official codes for DeSRA (ICML 2023)
      Python
      012350Updated Feb 2, 2024Feb 2, 2024
    • ViSFT

      Public
      Python
      Apache License 2.0
      23310Updated Jan 20, 2024Jan 20, 2024
    • MM-RealSR

      Public
      Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"
      Python
      BSD 3-Clause "New" or "Revised" License
      12154100Updated Jan 16, 2024Jan 16, 2024
    • HOSNeRF

      Public
      HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
      Python
      Apache License 2.0
      76521Updated Dec 12, 2023Dec 12, 2023
    • VTLayout

      Public
      0310Updated Oct 23, 2023Oct 23, 2023
    • TVTS

      Public
      Turning to Video for Transcript Sorting
      Jupyter Notebook
      Other
      24400Updated Aug 27, 2023Aug 27, 2023
    • AnimeSR

      Public
      Codes for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"
      Python
      Other
      3433181Updated Aug 18, 2023Aug 18, 2023
    • pi-Tuning

      Public
      Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
      Python
      Other
      13220Updated Jul 21, 2023Jul 21, 2023
    • SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
      Apache License 2.0
      67640Updated Jul 10, 2023Jul 10, 2023
    • GVT

      Public
      Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".
      Python
      Apache License 2.0
      05550Updated Jun 27, 2023Jun 27, 2023