Search Results for "object recognition"

Showing 70 open source projects for "object recognition"

1

Exclusively Dark Image Dataset

ExDARK dataset is the largest collection of low-light images

...It contains 7,363 images captured across ten different low-light scenarios, ranging from extremely dark environments to twilight. Each image is annotated with both image-level labels and object-level bounding boxes for 12 object categories, making it suitable for detection and classification tasks. The dataset was created to address the lack of large-scale low-light datasets available for research in object detection, recognition, and enhancement. It has been widely used in studies of low-light image enhancement, deep learning approaches, and domain adaptation for vision models. ...

Downloads: 14 This Week

Last Update: 1 day ago
See Project
2

OpenCV

Open Source Computer Vision Library

OpenCV (Open Source Computer Vision Library) is a comprehensive open-source library for computer vision, machine learning, and image processing. It enables developers to build real-time vision applications ranging from facial recognition to object tracking. OpenCV supports a wide range of programming languages including C++, Python, and Java, and is optimized for both CPU and GPU operations.

Downloads: 49 This Week

Last Update: 2025-12-31
See Project
3

Transformers

State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX

...Text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets, and then share them with the community on our model hub. At the same time, each Python module defining an architecture is fully standalone and can be modified to enable quick research experiments.

Downloads: 23 This Week

Last Update: 13 hours ago
See Project
4

hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

(Golang) Go bindings for the Hugging Face Inference API. Directly call any model available in the Model Hub. An API key is required for authorized access. To get one, create a Hugging Face profile.

Downloads: 7 This Week

Last Update: 2025-11-06
See Project
5

CutLER

Code release for Cut and Learn for Unsupervised Object Detection

CutLER is an approach for unsupervised object detection and instance segmentation that trains detectors without human-annotated labels, and the repo also includes VideoCutLER for unsupervised video instance segmentation. The method follows a “Cut-and-LEaRn” recipe: bootstrap object proposals, refine them iteratively, and train detection/segmentation heads to discover objects across diverse datasets.

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
6

Google AI Edge Gallery

A gallery that showcases on-device ML/GenAI use cases

...The project bundles runnable samples that show how to run TensorFlow Lite/Edge TPU models (and similar lightweight runtimes) on mobile and embedded platforms, demonstrating common tasks like image classification, object detection, audio recognition, and pose estimation. Each sample is intended to be both a learning aid and a practical starting point: code is organized to show model loading, pre/post-processing, performance measurement, and common optimization knobs (quantization, NNAPI/Delegate usage, and hardware accelerators). The repo also collects small, well-documented models and conversion scripts so developers can reproduce a pipeline from a full-size model down to a device-friendly artifact.

Downloads: 1,582 This Week

Last Update: 2026-04-02
See Project
7

Paper2GUI

Convert AI papers to GUI

...It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame interpolation, video super-resolution, object detection, image stylization, OCR, and other fields. It supports Windows, Mac, and Linux systems.

Downloads: 7 This Week

Last Update: 2024-09-20
See Project
8

Interactive Machine Learning Experiments

Interactive Machine Learning experiments

...The project combines Jupyter or Colab notebooks with browser-based visual demos that allow users to see trained models operating in real time. Many experiments involve tasks such as image classification, object detection, gesture recognition, and simple generative models. The models are typically trained in Python using TensorFlow and then exported for interactive demonstrations in a web environment using JavaScript and TensorFlow.js. Because the project focuses on experimentation rather than production systems, it acts as a sandbox where developers can explore machine learning concepts and observe model behavior. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
9

VideoPipe

A cross-platform video structuring (video analysis) framework

VideoPipe is an open-source C++ framework designed for building modular video analysis pipelines that process and structure video data using computer vision models. It operates using a pipeline architecture where independent nodes can be combined flexibly to create customized workflows for tasks such as object detection, face recognition, and behavior analysis. The framework is designed to be lightweight and portable, with minimal dependencies compared to other video processing systems, making it easier to deploy across different environments. It supports multiple inference backends, including OpenCV DNN, TensorRT, PaddleInference, and ONNXRuntime, allowing developers to choose the most suitable runtime for their performance and hardware requirements. ...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
10

Airtest

UI Automation Framework for Games and Apps

...AirtestIDE is an out-of-the-box GUI tool that helps to create and run cases in a user-friendly way. AirtestIDE supports a complete automation workflow. Poco adds the ability to directly access the object (UI widget) hierarchy across the major platforms and game engines. It allows writing instructions in Python to achieve more advanced automation.

Downloads: 9 This Week

Last Update: 2025-12-04
See Project
11

Qwen3-VL

Qwen3-VL, the multimodal large language model series by Alibaba Cloud

...Qwen3-VL is built for complex tasks such as GUI automation, multimodal coding (converting images or videos into HTML, CSS, JS, or Draw.io diagrams), long-context reasoning with support up to 1M tokens, and comprehensive video understanding. It also brings advanced perception capabilities, including spatial grounding, object recognition, OCR across 32 languages, and robust handling of challenging inputs like low-light or distorted text.

Downloads: 8 This Week

Last Update: 3 days ago
See Project
12

Open Model Zoo

Pre-trained Deep Learning models and demos

Open Model Zoo is a large repository of high-quality pre-trained deep learning models and demonstration applications designed to work with the OpenVINO™ toolkit, offering a comprehensive starting point for a wide range of AI and computer vision workloads. It includes hundreds of models covering object detection, classification, segmentation, pose estimation, speech recognition, text-to-speech, and more, many of which are already converted into formats optimized for inference on CPUs, GPUs, VPUs, and other accelerators supported by OpenVINO. In addition to model files, Open Model Zoo provides demo applications that show realistic usage patterns and help developers quickly prototype and understand inference pipelines in C++, Python, or via the OpenCV Graph API. ...

Downloads: 1 This Week

Last Update: 2026-01-10
See Project
13

ANTLR

Parser generator to read, process, or translate structured text

...Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.

Downloads: 6 This Week

Last Update: 2024-08-03
See Project
14

MediaPipe Solutions

Cross-platform, customizable ML solutions

...These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. The framework includes prebuilt solutions that developers can quickly integrate into applications as well as lower-level APIs that allow custom pipeline construction.

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
15

Lingvo

Framework for building neural networks

...It has been used to implement state of the art architectures such as recurrent neural networks, Transformer models, variational autoencoder hybrids, and multi task systems. Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
16

SmartVision

Free video surveillance software compatible with Windows

...You can remotely manage multiple IP cameras to perform various tasks. In emergencies, it automatically initiates recording, preserving crucial video footage as evidence. The system offers features such as motion detection, object detection, face recognition, automatic license plate recognition (ALPR), and fire and dust detection, and is integrated with cloud services.

2 Reviews

Downloads: 0 This Week

Last Update: 2024-10-09
See Project
17

MediaPipe Face Detection

Detect faces in an image

The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.

Downloads: 4 This Week

Last Update: 2025-03-19
See Project
18

MMDetection

An open source object detection toolbox based on PyTorch

MMDetection is an open source object detection toolbox that's part of the OpenMMLab project developed by Multimedia Laboratory, CUHK. It stems from the codebase developed by the MMDet team, who won the COCO Detection Challenge in 2018. Since that win, this toolbox has been continuously developed and improved. MMDetection detects various objects within a given image with high efficiency. Its training speed is comparable to or even faster than that of other codebases like Detectron2 and...

Downloads: 0 This Week

Last Update: 2024-01-05
See Project
19

Blazeface

Blazeface is a lightweight model that detects faces in images

Blazeface is a lightweight, high-performance face detection model designed for mobile and embedded devices, developed by TensorFlow. It is optimized for real-time face detection tasks and runs efficiently on mobile CPUs, ensuring minimal latency and power consumption. Blazeface is based on a fast architecture and uses deep learning techniques to detect faces with high accuracy, even in challenging conditions. It supports multiple face detection in varying lighting and poses, and is designed...

Downloads: 4 This Week

Last Update: 2025-03-19
See Project
20

3DPass

The Implementation of The Ledger of Things Node. Layer 1 decentralized

3DPass is an open-source Layer 1 blockchain written in Rust and built on Substrate. It introduces a novel consensus mechanism called Proof of Scan, where miners validate by running recognition algorithms on 3D objects. Each object produces a reproducible cryptographic fingerprint (HASH ID) that is stored on-chain. If the same object is submitted again, the network rejects it, ensuring that only one-of-a-kind assets can be registered. This approach enforces authenticity at the content level, something traditional file storage and NFT systems cannot achieve. ...

Downloads: 10 This Week

Last Update: 2025-09-15
See Project
21

ZenoTest

Automated UI testing for Windows apps – fast, stable, simple.

...It supports a wide range of technologies including WPF, WinForms, Qt, and native applications, without requiring plugins or framework-specific integrations. With its intelligent object recognition, ZenoTest reliably identifies UI elements even when layouts change, ensuring stable and maintainable test cases. The built-in recorder and C-like scripting language make it easy to create, customize, and reuse automated tests. ZenoTest also generates clear HTML reports, providing full transparency into test execution and results. ...

Downloads: 0 This Week

Last Update: 2026-03-28
See Project
22

NKTgLaw

Core library & API for the NKTg Law (Nguyen Khanh Tung). Includes core

Core library & API for the NKTg Law (Nguyen Khanh Tung). Includes core implementation, REST/gRPC API, and 150+ client wrappers

Downloads: 0 This Week

Last Update: 2 days ago
See Project
23

LifeAI

LifeAI is an artificial intelligence system that can be applied to robotics, games, or business. It simulates key processes of our minds, such as organizing data into concepts and categories, planning actions based on their predicted outcome, and communication. LifeAI was designed to be simple, but powerful and flexible enough to have many applications.

Downloads: 0 This Week

Last Update: 2023-05-20
See Project
24

ImageAI

A python library built to empower developers

ImageAI is an easy-to-use Computer Vision Python library that empowers developers to easily integrate state-of-the-art Artificial Intelligence features into their new and existing applications and systems. It is used by thousands of developers, students, researchers, tutors and experts in corporate organizations around the world. You will find features supported, links to official documentation as well as articles on ImageAI. ImageAI is widely used around the world by professionals,...

Downloads: 37 This Week

Last Update: 2022-12-21
See Project
25

Darknet

Convolutional Neural Networks

Darknet is an open source neural network framework written in C and CUDA, developed by Joseph Redmon. It is best known as the original implementation of the YOLO (You Only Look Once) real-time object detection system. Darknet is lightweight, fast, and easy to compile, making it suitable for research and production use. The repository provides pre-trained models, configuration files, and tools for training custom object detection models. With GPU acceleration via CUDA and OpenCV integration, it achieves high performance in image recognition tasks. ...

Downloads: 26 This Week

Last Update: 3 days ago
See Project