OpenBMB / MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
See what the GitHub community is most excited about today.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Build Conversational AI in minutes ⚡️
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A collective list of free APIs
A high-performance algorithmic trading platform and event-driven backtester
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Robust Speech Recognition via Large-Scale Weak Supervision
Low code web framework for real world applications, in Python and Javascript
PDF to Markdown with vision models