Skip to content
View Hxyou's full-sized avatar
🌊
🌊

Block or report Hxyou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models

Python 37 Updated Mar 19, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,238 227 Updated Mar 24, 2025
Python 30 4 Updated Mar 13, 2024

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,621 64 Updated Jan 30, 2025
Python 15 1 Updated Nov 10, 2023

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 855 45 Updated Nov 23, 2024
6 1 Updated Nov 9, 2023
Python 8,605 508 Updated Oct 9, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,427 927 Updated Mar 21, 2025

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

1,113 58 Updated Jan 4, 2024

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Python 486 24 Updated Apr 4, 2024

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

766 24 Updated Jul 20, 2023

Awesome-LLM: a curated list of Large Language Model

22,307 1,834 Updated Mar 24, 2025

Official Code of IdealGPT

Python 34 8 Updated Oct 13, 2023

A collection of resources and papers on Diffusion Models

HTML 11,567 968 Updated Aug 1, 2024

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Python 805 55 Updated Apr 28, 2023

Official Repository of ChatCaptioner

Jupyter Notebook 462 30 Updated Apr 13, 2023

Official Code of ECCV 2022 paper MS-CLIP

Python 88 3 Updated Jul 27, 2022

Repo for external large-scale work

Python 6,522 726 Updated Apr 27, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,142 1,044 Updated Mar 24, 2025

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

1,183 57 Updated Jun 28, 2024

Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER adversarial training part

Python 119 14 Updated Jan 13, 2021
Python 15 Updated Jan 20, 2022

[ICLR 2022 poster] Official PyTorch implementation of "Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework"

Python 522 68 Updated Feb 2, 2025

Grounded Language-Image Pre-training

Python 2,357 202 Updated Jan 24, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,966 2,611 Updated Mar 4, 2025

VOLO: Vision Outlooker for Visual Recognition

Jupyter Notebook 941 96 Updated Sep 18, 2022

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Python 1,134 136 Updated Aug 22, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,511 2,107 Updated Jul 24, 2024

Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition

Jupyter Notebook 564 88 Updated Oct 31, 2021
Next
Showing results