Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
AI agents running research on single-GPU nanochat training
Use Microsoft Edge's online text-to-speech service from Python
Ready-to-use OCR with 80+ supported languages
Multimodal-Driven Architecture for Customized Video Generation
A ranked list of awesome machine learning Python libraries
JAX-based neural network library
NLP Cloud serves high performance pre-trained or custom models for NER
Speech recognition module for Python
End-to-End Library for Continual Learning based on PyTorch
State-of-the-art 2D and 3D Face Analysis Project
Qlib is an AI-oriented quantitative investment platform
GUI for a Vocal Remover that uses Deep Neural Networks
User toolkit for analyzing and interfacing with Large Language Models
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Deep Research framework, combining language models with tools
Open Source Differentiable Computer Vision Library
TTS with kokoro and onnx runtime
A python library that makes AMR parsing, generation and visualization
Probabilistic time series modeling in Python
Module for automatic summarization of text documents and HTML pages
A Simple and Universal Swarm Intelligence Engine
A python module to repair invalid JSON from LLMs
The official Python SDK for Model Context Protocol servers and clients
Real time face swap and one-click video deepfake