Skip to content
View hulianyuyy's full-sized avatar

Block or report hulianyuyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hulianyuyy/README.md

Hi👋, i'm a PhD candidate (2021.09-now) in Tianjin University, China. My major interests include video understanding, sign language understanding and multi-modal learning. I'd like to let the people benefit more from general computer vision techniques. For more information, please visit www.hulianyu.top. Feel free to contact me via hly2021@tju.edu.cn.

✉ News:

  • We release iLLaVA, an efficient method for large vision language models by merging visual tokens. It could achieve about throughput and 1.7× - 2× memory reduction with comparable performance through merging redundant visual tokens in some certain layers.

  • We release Deep Correletaed Prompting, which tackles the missing-modality scenarios by proposing three different types of prompting approaches, largely improving the robustness of large vision-language models.

  • We release CorrNet+, an unified model with superior performance on both continuous sign language recognition and sign language translation tasks by using only RGB inputs.

  • We release DSTA-SLR, which performs sign language recognition (SLR) with pure skeleton inputs but ahcieves comparable accuracy and much faster speed than recognition with RGB inputs.

Anurag's GitHub stats

Pinned Loading

  1. STGAT STGAT Public

    Skeleton-Based Action Recognition with Local Dynamic Spatial-Temporal Aggregation (Expert Systems with Applications 2023) (Previous name: Spatial Temporal Graph Attention Network for Skeleton-Based…

    Python 38 7

  2. CorrNet CorrNet Public

    Continuous Sign Language Recognition with Correlation Network (CVPR 2023)

    Python 105 20

  3. DSTA-SLR DSTA-SLR Public

    Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition (COLING2024)

    Python 7 2

  4. CorrNet_Plus CorrNet_Plus Public

    CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation

    Python 17 3

  5. Deep_Correlated_Prompting Deep_Correlated_Prompting Public

    Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)

    Python 12

  6. iLLaVA iLLaVA Public

    iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models

    Python 12 1