Multilingual Document Layout Parsing in a Single Vision-Language Model
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
No-code LLM Platform to launch APIs and ETL Pipelines
Progressbar 2 - A progress bar for Python 2 and Python 3
Code for Cicero, an AI agent that plays the game of Diplomacy
AI tool that converts GitHub repositories into interactive diagrams
ChatGPT extension for scientific research work
Open source software calculating industrial noise in the environment
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
"Big Model" trains a visual multimodal VLM with 26M parameters
A NumPy-compatible array library accelerated by CUDA
A little word cloud generator in Python
wxPython's Project Phoenix. A new implementation of wxPython
A Coverage-Guided, Native Python Fuzzer
About 24 Lessons, 12 Weeks, Get Started as a Web Developer
The book "Performance Analysis and Tuning on Modern CPU"
matplotlib: plotting with Python
nsync is a C library that exports various synchronization primitives
Inference script for Oasis 500M
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Let agents classify your bank transactions
A library for accelerating Transformer models on NVIDIA GPUs
Python package for AutoML on Tabular Data with Feature Engineering
LLM training in simple, raw C/CUDA
OCR expert VLM powered by Hunyuan's native multimodal architecture