Chat & pretrained large audio language model proposed by Alibaba Cloud
LongBench v2 and LongBench (ACL 25'&24')
Towards Human-Level Text-to-Speech through Style Diffusion
Solve end to end problems using Llama model family
Multi-tool for semantic search
A Python library for extracting structured information
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Seamlessly integrate LLMs into scikit-learn
Virtual AI anchor that combines state-of-the-art technology
Shared repository for open-sourced projects from the Google AI Lang
Get Alerts from your Docker Container Logs
A Python package for segmenting geospatial data with the SAM
Unified Multimodal Understanding and Generation Models
Terminal-based CPU stress and monitoring utility
OpenRecall is a fully open-source, privacy-first alternative
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Large Multimodal Models for Video Understanding and Editing
AI-Powered Data Processing: Use LOTUS to process all of your datasets
A Model Context Protocol server for searching and analyzing arXiv
Python crawler for collecting and downloading Sina Weibo user data
ChatGPT extension for scientific research work
Pushing the Limits of Mathematical Reasoning in Open Language Models
LLM-based agent for general purpose software engineering tasks
An interactive program for statistical analysis of texts