Fast State-of-the-Art Static Embeddings
Evaluate and compare LLM outputs, catch regressions, improve prompts
SGLang is a fast serving framework for large language models
A long-running autonomous coding agent powered by the Claude Agent
PyTorch library of curated Transformer models and their components
Terminal-based LLM chat tool with multi-model and local support
Low code tool to rapidly build and coordinate multi-agent teams
llama and other large language models on iOS and MacOS offline
AI tool for detecting complex vulnerabilities in Python codebases
Building Mixture-of-Experts from LLaMA with Continual Pre-training
Pruna is a model optimization framework built for developers
chat web app for teams, sass with user management and ratelimit
Meta Agents Research Environments is a comprehensive platform
An MCP client for Neovim that seamlessly integrates MCP servers
Private chat with local GPT with document, images, video, etc.
Agents-Flex is an elegant LLM Application Framework like LangChain
Fastest, smallest, and fully autonomous AI assistant infrastructure
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
Speech-AI-Forge is a project developed around TTS generation model
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Code to accompany "A Method for Animating Children's Drawings"
On the Structural Pruning of Large Language Models
Traditional Mandarin LLMs for Taiwan
Local CLI Copilot, powered by Ollama