MII makes low-latency and high-throughput inference possible
The standard data-centric AI package for data quality and ML
ktrain is a Python library that makes deep learning AI more accessible
LISA: Reasoning Segmentation via Large Language Model
Skywork-R1V is an advanced multimodal AI model series
Refer and Ground Anything Anywhere at Any Granularity
Gemma open-weight LLM library, from Google DeepMind
Language modeling in a sentence representation space
High-Resolution Image Synthesis with Latent Diffusion Models
Build cross-modal and multimodal applications on the cloud
Plug-n-play module turning text-to-image models into animation
Run GGUF models easily with a UI or API. One File. Zero Install.
Guiding Instruction-based Image Editing via Multimodal Large Language
A Python application to add watermarks (text or image) to PDF files
Mice speech to text with MX Cinnamon OS ISO
AI-powered tool to quickly remove watermarks from images flawlessly
Overcoming Data Limitations for High-Quality Video Diffusion Models
A library for transfer learning by reusing parts of TensorFlow models
Ainee - AI Notetaking and Learning Companion
Multi-Voice and Prompt-Controlled TTS Engine
Official code for Style Aligned Image Generation via Shared Attention
Embed images and sentences into fixed-length vectors
Generate 3D objects conditioned on text or images
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Convert an image to text to spot intelligible words.