Open Source Computer Vision Libraries - Page 8

  • 1

    Portable Robotics Eye Vergence Control

    Eye-movement control that is portable across different robotic stereo heads

    This project provides a software module for controlling the binocular coordination of a robotic stereo head, based on a bio-inspired algorithm. The project is now available for the iCub platform running on YARP: https://github.com/stino78/vergence-control/ The algorithm works on top of a distributed representation of binocular disparity supplied by a population of binocular energy-model neural units. It enables robust control and adaptive binocular coordination across different robot stereo platforms. Reference publications: Gibaldi, A., Vanegas, M., Canessa, A., & Sabatini, S. P. (2017). A portable bio-inspired architecture for efficient robotic vergence control. International Journal of Computer Vision. Gibaldi, A., Canessa, A., Chessa, M., Sabatini, S. P., & Solari, F. (2011, October). A neuromorphic control module for real-time vergence eye movements on the iCub robot head. In Humanoid Robots (Humanoids), 2011
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    ProximityForest

    Efficient Approximate Nearest Neighbors for General Metric Spaces

    A proximity forest is a data structure that allows for efficient computation of approximate nearest neighbors of arbitrary data elements in a metric space. See: O'Hara and Draper, "Are You Using the Right Approximate Nearest Neighbor Algorithm?", WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A. Draper, "Scalable Action Recognition with a Subspace Forest," IEEE Conference on Computer Vision and Pattern Recognition, 2012. This source code is provided without warranty and is available under the GPL license. More commercially-friendly licenses may be available. Please contact Stephen O'Hara for license options. Please view the wiki on this site for installation instructions and examples on reproducing the results of the papers.
    Downloads: 0 This Week
    Last Update:
    See Project
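The ProximityForest description above can be illustrated with a small sketch. This is not the authors' code: the class names, the random-pivot/median-distance split rule, and the `leaf_size` and `n_trees` parameters are assumptions chosen for illustration. The only requirement on the data is a distance function, which is what makes the idea applicable to general metric spaces.

```python
import random
from typing import Any, Callable, List, Optional

Metric = Callable[[Any, Any], float]


class ProximityTree:
    """One randomized tree: each node splits on distance to a random pivot."""

    def __init__(self, metric: Metric, leaf_size: int, rng: random.Random):
        self.metric = metric
        self.leaf_size = leaf_size
        self.rng = rng
        self.items: List[Any] = []          # filled only at leaves
        self.pivot: Optional[Any] = None    # None marks a leaf
        self.radius = 0.0
        self.near: Optional["ProximityTree"] = None
        self.far: Optional["ProximityTree"] = None

    def build(self, items: List[Any]) -> "ProximityTree":
        if len(items) <= self.leaf_size:
            self.items = list(items)
            return self
        self.pivot = self.rng.choice(items)
        scored = [(self.metric(self.pivot, x), x) for x in items]
        # Median distance to the pivot is the split threshold (an assumption).
        self.radius = sorted(d for d, _ in scored)[len(scored) // 2]
        near = [x for d, x in scored if d < self.radius]
        far = [x for d, x in scored if d >= self.radius]
        if not near:  # degenerate split caused by ties: keep this node as a leaf
            self.pivot = None
            self.items = list(items)
            return self
        self.near = ProximityTree(self.metric, self.leaf_size, self.rng).build(near)
        self.far = ProximityTree(self.metric, self.leaf_size, self.rng).build(far)
        return self

    def candidates(self, query: Any) -> List[Any]:
        """Descend to a single leaf and return its items."""
        node = self
        while node.pivot is not None:
            closer = node.metric(node.pivot, query) < node.radius
            node = node.near if closer else node.far
        return node.items


class ProximityForest:
    """Several randomized trees; their leaf candidates are pooled and ranked."""

    def __init__(self, items, metric: Metric, n_trees: int = 8,
                 leaf_size: int = 4, seed: int = 0):
        rng = random.Random(seed)
        self.metric = metric
        self.trees = [ProximityTree(metric, leaf_size, rng).build(list(items))
                      for _ in range(n_trees)]

    def nearest(self, query: Any, k: int = 1) -> List[Any]:
        pool: List[Any] = []
        for tree in self.trees:
            for item in tree.candidates(query):
                if item not in pool:  # equality check only, no hashing required
                    pool.append(item)
        return sorted(pool, key=lambda x: self.metric(query, x))[:k]
```

For example, `ProximityForest(range(0, 100, 3), lambda a, b: abs(a - b)).nearest(43, k=3)` returns up to three candidates ordered by distance. The result is approximate: only items that share a leaf with the query in at least one tree are considered, so adding trees trades query time for recall.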
  • 3

    PyArmadillo

    linear algebra library for Python

    PyArmadillo - streamlined linear algebra library for Python, with emphasis on ease of use. Alternative to NumPy / SciPy. * Main page: https://pyarma.sourceforge.io * Documentation: https://pyarma.sourceforge.io/docs.html * Bug reports: https://pyarma.sourceforge.io/faq.html * Git repo: https://gitlab.com/jason-rumengan/pyarma
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PyCV is a Python package of modules useful for computer vision tasks. Its current focus is on boosting techniques, Haar-like features, and face detection. PyCV provides the world's fastest method for training a face detector, in a few hours.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 5

    PyCls

    Codebase for Image Classification Research, written in PyTorch

    pycls is a focused PyTorch codebase for image classification research that emphasizes reproducibility and strong, transparent baselines. It popularized families like RegNet and supports classic architectures (ResNet, ResNeXt) with clean implementations and consistent training recipes. The repository includes highly tuned schedules, augmentations, and regularization settings that make it straightforward to match reported accuracy without guesswork. Distributed training and mixed precision are first-class, enabling fast experiments on multi-GPU setups with simple, declarative configs. Model definitions are concise and modular, making it easy to prototype new blocks or swap backbones while keeping the rest of the pipeline unchanged. Pretrained weights and evaluation scripts cover common datasets, and the logging/metric stack is designed for quick comparison across runs. Practitioners use pycls both as a baseline factory and as a scaffold for new classification backbones.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    PyTorch SimCLR

    PyTorch implementation of SimCLR: A Simple Framework

    We have known about the benefits of transfer learning in Computer Vision (CV) applications for quite some time. Nowadays, pre-trained Deep Convolutional Neural Networks (DCNNs) are the go-to starting point for learning a new task. These large models are trained on huge supervised corpora, such as ImageNet. Most importantly, their features are known to adapt well to new problems. This is particularly useful when annotated training data is scarce. In such situations, we take a model's pre-trained weights, append a new classifier layer on top, and retrain the network. This is called transfer learning, and it is one of the most widely used techniques in CV. Aside from a few tricks for fine-tuning (where applicable), it has been shown many times that, when training for a new task, models initialized with pre-trained weights tend to learn faster and be more accurate than models trained from scratch with random initialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    PyVision Computer Vision Toolkit

    A Python computer vision library

    PyVision is an object-oriented Computer Vision Toolkit for researchers that contains vision and machine learning algorithms and algorithm analysis tools, and interfaces easily with scipy/numpy, PIL, opencv, and other computer vision and machine learning libraries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    The Python Computer Vision Framework is an open project designed for all those interested in computer vision. It aims to make computer vision easier, more structured, and MATLAB-free. It may also be used in other artistic and scientific areas.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    R-FCN

    R-FCN: Object Detection via Region-based Fully Convolutional Networks

    R-FCN (“Region-based Fully Convolutional Networks”) is an object detection framework that makes almost all computation fully convolutional and shared across the image, unlike prior region-based approaches (e.g. Faster R-CNN) which run per-region sub-networks. The repository provides an implementation (in Python) supporting end-to-end training and inference of R-FCN models on standard datasets. The authors propose position-sensitive score maps to reconcile the need for translation variance (in detection) and translation invariance (in classification). R-FCN is efficient (low per-region overhead) and competitive in accuracy (e.g. with ResNet backbones). Its position-sensitive score maps enable per-region classification without expensive per-region convolutions, and an optional “deformable R-FCN” extension is available for improved performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 10

    R1-V

    Witness the aha moment of VLM with less than $3

    R1-V is an initiative aimed at enhancing the generalization capabilities of Vision-Language Models (VLMs) through Reinforcement Learning in Visual Reasoning (RLVR). The project focuses on building a comprehensive framework that emphasizes algorithm enhancement, efficiency optimization, and task diversity to achieve general vision-language intelligence and visual/GUI agents. The team's long-term goal is to contribute impactful open-source research in this domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Raster Vision

    Open source framework for deep learning satellite and aerial imagery

    Raster Vision is an open source framework for Python developers building computer vision models on satellite, aerial, and other large imagery sets (including oblique drone imagery). There is built-in support for chip classification, object detection, and semantic segmentation using PyTorch. Raster Vision allows engineers to quickly and repeatably configure pipelines that go through core components of a machine learning workflow: analyzing training data, creating training chips, training models, creating predictions, evaluating models, and bundling the model files and configuration for easy deployment. The input to a Raster Vision pipeline is a set of images and training data, optionally with Areas of Interest (AOIs) that describe where the images are labeled. The output of a Raster Vision pipeline is a model bundle that allows you to easily utilize models in various deployment scenarios.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Rcnn

    R-CNN: Regions with Convolutional Neural Network Features

    This repository contains the original MATLAB implementation of R-CNN (Regions with Convolutional Neural Networks), a pioneering deep learning-based object detection framework. Developed by Ross Girshick, R-CNN combines region proposals with convolutional neural networks to detect objects in images. It was one of the first approaches to significantly improve performance on object detection benchmarks like PASCAL VOC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Computer vision video game controller for the Call of Duty game series.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The Microsoft Kinect is known to be an affordable commercial sensor. As an introduction to computer vision, we developed a small program to show how it can be used in machine learning, more specifically in the field of object recognition using support vector machines (SVM). Demo: http://www.youtube.com/watch?v=DDdShv1pQTg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Facis is a computer vision project based on OpenCV. It enables face detection and identification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    RoboRealm Extensions are plugins to the RoboRealm application that allow you to extend RoboRealm in whatever way you need. RoboRealm is a powerful computer vision based application for use in machine vision, image analysis, and image processing systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Robogathers

    A simple opensource 3d network game

    Robogathers is a 3D game developed mainly for academic purposes: to teach future software engineers 3D graphics processing, mathematical modelling, networking, and multithreading. In the longer term, we also want to use this project as a base for scientific research in visual odometry, simultaneous localization and mapping, and computer vision. The game consists of driving your robot to gather good prizes while avoiding bad ones. It is multiplayer: you can compete over the network with other people as well as with AI robots.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    SAHI

    A lightweight vision library for performing large object detection

    A lightweight vision library for performing large-scale object detection & instance segmentation. Object detection and instance segmentation are among the most important application areas in Computer Vision. However, detecting small objects and running inference on large images remain major issues in practice. SAHI helps developers overcome these real-world problems with many vision utilities. Detecting small objects and objects far away in the scene is a major challenge in surveillance applications. Such objects are represented by a small number of pixels in the image and lack sufficient detail, making them difficult to detect with conventional detectors. In this work, an open-source framework called Slicing Aided Hyper Inference (SAHI) is proposed that provides a generic slicing-aided inference and fine-tuning pipeline for small object detection.
    Downloads: 0 This Week
    Last Update:
    See Project
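The slicing idea behind SAHI can be sketched in a few lines: cut a large image into overlapping windows, run a detector on each window, and shift the detections back into full-image coordinates. The sketch below is not SAHI's API. The function names, the `overlap` parameter, and the rule of shifting the last window back inside the image are assumptions made for illustration.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max)


def slice_boxes(image_w: int, image_h: int, slice_w: int, slice_h: int,
                overlap: float = 0.2) -> List[Box]:
    """Return overlapping windows that together cover the whole image."""
    if not 0.0 <= overlap < 1.0:
        raise ValueError("overlap must be in [0, 1)")
    step_x = max(1, int(slice_w * (1.0 - overlap)))
    step_y = max(1, int(slice_h * (1.0 - overlap)))
    boxes: List[Box] = []
    y = 0
    while True:
        y_max = min(y + slice_h, image_h)
        y_min = max(0, y_max - slice_h)  # keep the last row inside the image
        x = 0
        while True:
            x_max = min(x + slice_w, image_w)
            x_min = max(0, x_max - slice_w)  # keep the last column inside too
            boxes.append((x_min, y_min, x_max, y_max))
            if x_max >= image_w:
                break
            x += step_x
        if y_max >= image_h:
            break
        y += step_y
    return boxes


def to_full_image(detection: Box, window: Box) -> Box:
    """Shift a detection from window coordinates to full-image coordinates."""
    dx, dy = window[0], window[1]
    return (detection[0] + dx, detection[1] + dy,
            detection[2] + dx, detection[3] + dy)
```

A small object that covers only a few pixels of the full image covers a much larger share of each window, which is why slicing helps conventional detectors. Detections from neighbouring windows overlap, so a real pipeline would still need a merging step such as non-maximum suppression, which is omitted here.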
  • 20
    Low-power approximate adders provide basic building blocks for approximate computing hardware that have shown remarkable energy efficiency for error-resilient applications (like image/video processing, computer vision, etc.), especially for battery-driven portable systems. In this paper, we present a novel scalable, fast yet accurate analytical method to evaluate the output error probability of multi-bit low-power adders for a predetermined probability of input bits. Our method recursively computes the error probability by considering the accurate cases only, which are considerably fewer than the erroneous ones. Our method can handle the error analysis of a wider range of adders with negligible computational overhead. To ensure its rapid adoption in industry and academia, we have open-sourced our LabVIEW and MATLAB libraries. Lab Web Page: http://save.seecs.nust.edu.pk/projects/SEALPAA/ Emails: 14mseemayub@seecs.edu.pk, osman.hasan@seecs.edu.pk, muhammad.shafique@tuwien.ac.at
    Downloads: 0 This Week
    Last Update:
    See Project
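To make "output error probability for a predetermined probability of input bits" concrete, here is a brute-force baseline. It is not the analytical method described above. It enumerates every input pair for one assumed approximate design, a lower-part OR adder in which the `k_approx` low bits are ORed instead of added. The adder choice and all function names are assumptions for illustration.

```python
from itertools import product


def loa_add(a: int, b: int, k_approx: int) -> int:
    """Lower-part OR adder: OR the k low bits, add the upper bits exactly."""
    if k_approx == 0:
        return a + b
    mask = (1 << k_approx) - 1
    low = (a | b) & mask
    # Carry into the exact part is the AND of the top approximate bits.
    carry_in = (a >> (k_approx - 1)) & (b >> (k_approx - 1)) & 1
    high = (a >> k_approx) + (b >> k_approx) + carry_in
    return (high << k_approx) | low


def pattern_probability(value: int, n_bits: int, p_one: float) -> float:
    """Probability of one bit pattern when each bit is 1 with probability p_one."""
    ones = bin(value).count("1")
    return p_one ** ones * (1.0 - p_one) ** (n_bits - ones)


def error_probability(n_bits: int, k_approx: int, p_one: float = 0.5) -> float:
    """Sum the probability of every input pair whose approximate sum is wrong."""
    total = 0.0
    for a, b in product(range(1 << n_bits), repeat=2):
        if loa_add(a, b, k_approx) != a + b:
            total += (pattern_probability(a, n_bits, p_one)
                      * pattern_probability(b, n_bits, p_one))
    return total
```

Exhaustive enumeration costs 4^n evaluations for n-bit operands, which is the scaling problem an analytical method avoids. For this particular adder the result can be checked by hand: the output is exact only when no approximated bit position has both inputs set, so the error probability is 1 - (1 - p^2)^k.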
  • 21

    SOD

    An Embedded Computer Vision & Machine Learning Library

    SOD is an embedded, modern, cross-platform computer vision and machine learning software library that exposes a set of APIs for deep learning and advanced media analysis and processing, including real-time, multi-class object detection and model training on embedded systems with limited computational resources and on IoT devices. SOD was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in open source as well as commercial products. SOD implements state-of-the-art computer vision algorithms found to be mandatory in real-world application areas. It provides the Sobel operator, Otsu's binarization, and over 100 image/frame processing & analysis interfaces. It is designed for computational efficiency, with a strong focus on real-time applications. SOD includes a comprehensive set of both classic and state-of-the-art deep neural networks with their pre-trained models.
    Downloads: 0 This Week
    Last Update:
    See Project
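Otsu's binarization, one of the operations named in the SOD description, picks the grey-level threshold that maximizes the variance between the background and foreground classes. The sketch below is plain Python, not SOD's C API. The function names and the flat list-of-pixels input are assumptions for illustration.

```python
from typing import List, Sequence


def otsu_threshold(pixels: Sequence[int], levels: int = 256) -> int:
    """Return the threshold t that maximizes between-class variance.

    Pixels with value <= t form the background class.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0      # number of background pixels so far
    sum_bg = 0    # sum of background intensities so far
    for t in range(levels):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue          # no background pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break             # every pixel is background from here on
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Proportional to the between-class variance (constant factor dropped).
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(pixels: Sequence[int], threshold: int) -> List[int]:
    """Map each pixel to 1 (foreground) or 0 (background)."""
    return [1 if p > threshold else 0 for p in pixels]
```

Because the search runs over the histogram rather than the pixels, its cost after the single counting pass depends only on the number of grey levels, which suits the limited-resource devices the description mentions.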
  • 22

    SURF-nanodots

    Very basic computer vision program

    This was my first big programming project. I'd like to clean it up and improve the back-end design when time permits. See the wiki for known issues and more information. SURF-nanodots is a computer vision program for analyzing magnetic data collected by a scanning probe microscope. It originated in summer 2007 as a collection of C files compiled for Matlab (MEX) and was eventually ported to a standalone C++ application with a GUI created in Qt. The program takes atomic and magnetic force microscope (AFM/MFM) image pairs as input and uses threshold segmentation to identify magnetic nanodots by intensity in the AFM image. These are then used to assess the magnetic states of those dots in the MFM image. Attribution: "C++ GUI Programming with Qt 4" by Blanchette and Summerfield was helpful in getting me started on the GUI.
    Downloads: 0 This Week
    Last Update:
    See Project
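The two-step analysis described for SURF-nanodots (segment dots by intensity in one image, then read the same pixels in a second image) can be sketched as follows. This is not the project's C++ code. The function names, the 4-connectivity rule, and the use of a mean over each region are assumptions for illustration, and the two images are assumed to be registered pixel for pixel.

```python
from collections import deque
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int]  # (row, col)


def segment_dots(image: Sequence[Sequence[float]],
                 threshold: float) -> List[List[Pixel]]:
    """Return one pixel list per 4-connected region brighter than threshold."""
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    regions: List[List[Pixel]] = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or image[r][c] <= threshold:
                continue
            # Breadth-first flood fill from this unvisited bright pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            region: List[Pixel] = []
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx] and image[ny][nx] > threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            regions.append(region)
    return regions


def mean_signal(region: Sequence[Pixel],
                other_image: Sequence[Sequence[float]]) -> float:
    """Average a second, registered image over the pixels of one region."""
    return sum(other_image[y][x] for y, x in region) / len(region)
```

With an AFM-like height map passed to `segment_dots` and the matching MFM-like map passed to `mean_signal`, the sign of each region's mean gives a crude per-dot magnetic state. A real analysis would also need noise filtering and a minimum region size.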
  • 23

    Savant

    Python Computer Vision & Video Analytics Framework With Batteries Incl

    Savant is an open-source, high-level framework for building real-time, streaming, highly efficient multimedia AI applications on the Nvidia stack. It helps to develop dynamic, fault-tolerant inference pipelines that utilize the best Nvidia approaches for data center and edge accelerators. Savant is built on DeepStream and provides a high-level abstraction layer for building inference pipelines. It is designed to be easy to use, flexible, and scalable. It is a great choice for building smart CV and video analytics applications for cities, retail, manufacturing, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Scilab Image Processing Toolbox

    Advanced image processing toolbox for Scilab on Unix/Linux/Mac OS

    SIP is the image processing and computer vision package for SciLab, a free Matlab-like programming environment. SIP reads/writes images in formats like JPEG, PNG, and BMP. It does filtering, segmentation, edge detection, morphology, and shape analysis. Download from Git http://siptoolbox.sourceforge.net/devel
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Self-learning-Computer-Science

    Resources to learn computer science in your spare time

    Self-learning Computer Science is a curated, open-source guide repository designed to help learners independently study computer science topics using high-quality university-level resources. The author (an undergraduate CS student) assembled links to courses from institutions like MIT, UC Berkeley, Stanford, etc., covering mathematics, programming, data structures/algorithms, computer architecture, machine learning, software engineering and more. It’s aimed at learners who find traditional course structures restrictive and want a flexible, self-paced path through CS, with a focus on building depth and breadth rather than shortcut exam skills. The repository provides a roadmap, references, teaching materials, and sometimes the author’s own project examples, offering both guidance and community support. Because the CS field is broad, the structure helps learners allocate study time, avoid duplication, and benefit from “best in class” resources instead of randomly browsing.
    Downloads: 0 This Week
    Last Update:
    See Project