Skip to content
View 26hzhang's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report 26hzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A fork to add multimodal model training to open-r1

Python 570 30 Updated Feb 8, 2025

Witness the aha moment of VLM with less than $3.

Python 2,128 161 Updated Feb 11, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 416 27 Updated Feb 10, 2025
Python 389 37 Updated Feb 11, 2025

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript 4,916 497 Updated Jan 27, 2025
Python 2,107 145 Updated Feb 10, 2025

A list of awesome papers and resources of recommender system on large language model (LLM).

1,588 130 Updated Aug 15, 2024

Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 93 5 Updated Jan 24, 2025

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

Jupyter Notebook 26 1 Updated Feb 11, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 173 11 Updated Feb 8, 2025

A generalist foundation model for healthcare capable of handling diverse medical data modalities.

Python 63 3 Updated Apr 25, 2024

Neural Code Intelligence Survey 2024; Reading lists and resources

239 11 Updated Feb 2, 2025

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,353 124 Updated Jan 24, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,420 358 Updated Feb 8, 2025

[ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 412 29 Updated Jan 23, 2025

Code for EMNLP 2024 paper "DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering"

Python 3 Updated Nov 28, 2024
Python 13 1 Updated Aug 27, 2024

[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering

Python 136 14 Updated Oct 27, 2024

Lightweight tool to identify Data Contamination in LLMs evaluation

Python 46 1 Updated Mar 8, 2024

[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"

Python 19 1 Updated Oct 2, 2024

This the implementation of LeCo

Python 30 1 Updated Jan 20, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 80,385 60,640 Updated Feb 11, 2025

[NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an …

Python 906 44 Updated Jan 31, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 1,747 142 Updated Feb 11, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,367 2,352 Updated Aug 12, 2024

LoFiT: Localized Fine-tuning on LLM Representations

Python 32 5 Updated Jan 15, 2025

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Python 496 40 Updated Jan 28, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,289 1,310 Updated Feb 11, 2025

This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.

Jupyter Notebook 58 10 Updated Oct 9, 2024
Next