Stars
A fork to add multimodal model training to open-r1
Witness the aha moment of VLM with less than $3.
Frontier Multimodal Foundation Models for Image and Video Understanding
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
A list of awesome papers and resources on recommender systems based on large language models (LLMs).
Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
A generalist foundation model for healthcare capable of handling diverse medical data modalities.
Neural Code Intelligence Survey 2024; Reading lists and resources
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources for Medical LLMs (Medical LLM Tree, Tables, and Papers)
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs
Code for EMNLP 2024 paper "DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering"
[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering
Lightweight tool to identify data contamination in LLM evaluation
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
✨ Light and fast AI assistant. Supports Web | iOS | macOS | Android | Linux | Windows
[NeurIPS'24 Spotlight, ICLR'25] To speed up long-context LLM inference, approximates the attention with dynamic sparse computation, reducing inference latency by up to 10x for pre-filling on an …
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LoFiT: Localized Fine-tuning on LLM Representations
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
The original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.