Search Results for "object recognition"

Showing 53 open source projects for "object recognition"

1

Exclusively Dark Image Dataset

ExDARK dataset is the largest collection of low-light images

...It contains 7,363 images captured across ten different low-light scenarios, ranging from extremely dark environments to twilight. Each image is annotated with both image-level labels and object-level bounding boxes for 12 object categories, making it suitable for detection and classification tasks. The dataset was created to address the lack of large-scale low-light datasets available for research in object detection, recognition, and enhancement. It has been widely used in studies of low-light image enhancement, deep learning approaches, and domain adaptation for vision models. ...

Downloads: 5 This Week

Last Update: 17 hours ago
2

OpenCV

Open Source Computer Vision Library

OpenCV (Open Source Computer Vision Library) is a comprehensive open-source library for computer vision, machine learning, and image processing. It enables developers to build real-time vision applications ranging from facial recognition to object tracking. OpenCV supports a wide range of programming languages including C++, Python, and Java, and is optimized for both CPU and GPU operations.

Downloads: 34 This Week

Last Update: 2025-12-31
3

Transformers

State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX

...Text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets and then share them with the community on our model hub. At the same time, each Python module defining an architecture is fully standalone and can be modified to enable quick research experiments.

Downloads: 7 This Week

Last Update: 2026-04-13
4

CutLER

Code release for Cut and Learn for Unsupervised Object Detection

CutLER is an approach for unsupervised object detection and instance segmentation that trains detectors without human-annotated labels, and the repo also includes VideoCutLER for unsupervised video instance segmentation. The method follows a “Cut-and-LEaRn” recipe: bootstrap object proposals, refine them iteratively, and train detection/segmentation heads to discover objects across diverse datasets.

Downloads: 0 This Week

Last Update: 2025-10-09
5

Interactive Machine Learning Experiments

Interactive Machine Learning experiments

...The project combines Jupyter or Colab notebooks with browser-based visual demos that allow users to see trained models operating in real time. Many experiments involve tasks such as image classification, object detection, gesture recognition, and simple generative models. The models are typically trained in Python using TensorFlow and then exported for interactive demonstrations in a web environment using JavaScript and TensorFlow.js. Because the project focuses on experimentation rather than production systems, it acts as a sandbox where developers can explore machine learning concepts and observe model behavior. ...

Downloads: 0 This Week

Last Update: 2026-03-12
6

Paper2GUI

Convert AI papers to GUI

...It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame interpolation, video super-resolution, object detection, image stylization, OCR, and other fields. It supports Windows, Mac, and Linux systems. Paper2GUI: an AI desktop app toolbox for general users that works out of the box without installation.

Downloads: 3 This Week

Last Update: 2024-09-20
7

hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

(Golang) Go bindings for the Hugging Face Inference API. Directly call any model available in the Model Hub. An API key is required for authorized access. To get one, create a Hugging Face profile.

Downloads: 1 This Week

Last Update: 2025-11-06
8

Airtest

UI Automation Framework for Games and Apps

...AirtestIDE is an out-of-the-box GUI tool that helps to create and run cases in a user-friendly way. AirtestIDE supports a complete automation workflow. Poco adds the ability to directly access the object (UI widget) hierarchy across the major platforms and game engines. It allows writing instructions in Python to achieve more advanced automation.

Downloads: 4 This Week

Last Update: 2025-12-04
9

Qwen3-VL

Qwen3-VL, the multimodal large language model series by Alibaba Cloud

...Qwen3-VL is built for complex tasks such as GUI automation, multimodal coding (converting images or videos into HTML, CSS, JS, or Draw.io diagrams), long-context reasoning with support up to 1M tokens, and comprehensive video understanding. It also brings advanced perception capabilities, including spatial grounding, object recognition, OCR across 32 languages, and robust handling of challenging inputs like low-light or distorted text.

Downloads: 4 This Week

Last Update: 3 days ago
10

MediaPipe Solutions

Cross-platform, customizable ML solutions

...These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. The framework includes prebuilt solutions that developers can quickly integrate into applications as well as lower-level APIs that allow custom pipeline construction.

Downloads: 1 This Week

Last Update: 2026-03-15
11

Open Model Zoo

Pre-trained Deep Learning models and demos

Open Model Zoo is a large repository of high-quality pre-trained deep learning models and demonstration applications designed to work with the OpenVINO™ toolkit, offering a comprehensive starting point for a wide range of AI and computer vision workloads. It includes hundreds of models covering object detection, classification, segmentation, pose estimation, speech recognition, text-to-speech, and more, many of which are already converted into formats optimized for inference on CPUs, GPUs, VPUs, and other accelerators supported by OpenVINO. In addition to model files, Open Model Zoo provides demo applications that show realistic usage patterns and help developers quickly prototype and understand inference pipelines in C++, Python, or via the OpenCV Graph API. ...

Downloads: 1 This Week

Last Update: 2026-01-10
12

ANTLR

Parser generator to read, process, or translate structured text

...Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.

Downloads: 7 This Week

Last Update: 2024-08-03
13

Lingvo

Framework for building neural networks

...It has been used to implement state-of-the-art architectures such as recurrent neural networks, Transformer models, variational autoencoder hybrids, and multi-task systems. Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.

Downloads: 0 This Week

Last Update: 2025-11-28
14

MediaPipe Face Detection

Detect faces in an image

The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.

Downloads: 4 This Week

Last Update: 2025-03-19
15

Blazeface

Blazeface is a lightweight model that detects faces in images

Blazeface is a lightweight, high-performance face detection model designed for mobile and embedded devices, developed by TensorFlow. It is optimized for real-time face detection tasks and runs efficiently on mobile CPUs, ensuring minimal latency and power consumption. Blazeface is based on a fast architecture and uses deep learning techniques to detect faces with high accuracy, even in challenging conditions. It supports multiple face detection in varying lighting and poses, and is designed...

Downloads: 5 This Week

Last Update: 2025-03-19
16

MMDetection

An open source object detection toolbox based on PyTorch

MMDetection is an open source object detection toolbox that's part of the OpenMMLab project developed by Multimedia Laboratory, CUHK. It stems from the codebase developed by the MMDet team, who won the COCO Detection Challenge in 2018. Since that win, this toolbox has been continuously developed and improved. MMDetection detects various objects within a given image with high efficiency. Its training speed is comparable to or even faster than that of other codebases like Detectron2 and...

Downloads: 0 This Week

Last Update: 2024-01-05
17

3DPass

The Implementation of The Ledger of Things Node. Layer 1 decentralized

3DPass is an open-source Layer 1 blockchain written in Rust and built on Substrate. It introduces a novel consensus mechanism called Proof of Scan, in which miners validate blocks by running recognition algorithms on 3D objects. Each object produces a reproducible cryptographic fingerprint (HASH ID) that is stored on-chain. If the same object is submitted again, the network rejects it, ensuring that only one-of-a-kind assets can be registered. This approach enforces authenticity at the content level, something traditional file storage and NFT systems cannot achieve. ...

Downloads: 1 This Week

Last Update: 2025-09-15
18

NKTgLaw

Core library & API for the NKTg Law (Nguyen Khanh Tung). Includes core

Core library & API for the NKTg Law (Nguyen Khanh Tung). Includes core implementation, REST/gRPC API, and 150+ client wrappers

Downloads: 5 This Week

Last Update: 2026-04-12
19

LifeAI

LifeAI is an artificial intelligence system that can be applied to robotics, games, or business. It simulates key processes of our minds, such as organizing data into concepts and categories, planning actions based on their predicted outcome, and communication. LifeAI was designed to be simple, but powerful and flexible enough to have many applications.

Downloads: 0 This Week

Last Update: 2023-05-20
20

ImageAI

A python library built to empower developers

ImageAI is an easy-to-use Computer Vision Python library that empowers developers to easily integrate state-of-the-art Artificial Intelligence features into their new and existing applications and systems. It is used by thousands of developers, students, researchers, tutors and experts in corporate organizations around the world. You will find features supported, links to official documentation as well as articles on ImageAI. ImageAI is widely used around the world by professionals,...

Downloads: 37 This Week

Last Update: 2022-12-21
21

Darknet

Convolutional Neural Networks

Darknet is an open source neural network framework written in C and CUDA, developed by Joseph Redmon. It is best known as the original implementation of the YOLO (You Only Look Once) real-time object detection system. Darknet is lightweight, fast, and easy to compile, making it suitable for research and production use. The repository provides pre-trained models, configuration files, and tools for training custom object detection models. With GPU acceleration via CUDA and OpenCV integration, it achieves high performance in image recognition tasks. ...

Downloads: 30 This Week

Last Update: 3 days ago
22

Hello AI World

Guide to deploying deep-learning inference networks

Hello AI World is a great way to start using Jetson and experiencing the power of AI. In just a couple of hours, you can have a set of deep learning inference demos up and running for realtime image classification and object detection on your Jetson Developer Kit with JetPack SDK and NVIDIA TensorRT. The tutorial focuses on networks related to computer vision, and includes the use of live cameras. You’ll also get to code your own easy-to-follow recognition program in Python or C++, and train your own DNN models onboard Jetson with PyTorch. Ready to dive into deep learning? ...

Downloads: 0 This Week

Last Update: 2022-08-03
23

Flashlight library

A C++ standalone library for machine learning

Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms while remaining unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across...

Downloads: 0 This Week

Last Update: 2022-05-27
24

Tensorflow Transformers

State-of-the-art, faster Transformer with TensorFlow 2.0

...These models can be applied to text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Features include faster autoregressive decoding, TFLite support, simple TFRecord creation, auto-batching of tf.data.Dataset or tf.ragged tensors, dictionary-based inputs and outputs, multiple mask modes (causal, user-defined, prefix), and tensorflow-text tokenizer support. ...

Downloads: 0 This Week

Last Update: 2023-03-23
25

DeepStack

The World's Leading Cross Platform AI Engine for Edge Devices

DeepStack is an AI API engine that serves pre-built and custom models on multiple edge devices, either locally or on your private cloud. DeepStack runs completely offline and independent of the cloud. You can also install and run DeepStack on any cloud VM with Docker installed to serve as your private, state-of-the-art, real-time AI server.

Downloads: 12 This Week

Last Update: 2024-09-04