Open Source Computer Vision Libraries - Page 9

  • 1
    Show Facebook Computer Vision Tags

    Chrome Extension that displays automated image tags from Facebook

    Show Facebook Computer Vision Tags is a Chrome (and Firefox) browser extension created to expose and overlay the automatically generated image tags that Facebook applies to photos in users’ feeds. Since Facebook uses a computer-vision model to analyse user-uploaded images and generate alt-text tags for accessibility (e.g., “Image may contain: golf, grass, outdoor and nature”), this extension surfaces those hidden tags directly in the UI—revealing what kind of information Facebook infers about images (objects present, activities being done, environment). The purpose is educational and somewhat cautionary: to help users understand the scope of visual inference and privacy issues. Once installed, the extension overlays those tags on images in the timeline, making visible what is typically hidden metadata. The project is relatively lightweight but has garnered attention due to its privacy transparency angle.
    Downloads: 0 This Week
  • 2
    Computer Vision Application.
    Downloads: 0 This Week
  • 3
    Software that implements existing stereo matching algorithms in computer vision, from the simple sum of squared differences (SSD) to more recent algorithms.
    Downloads: 0 This Week
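As an illustration of the SSD approach named in the entry above, here is a minimal sketch of block matching along a single scanline. It is not taken from the listed project; the function names `ssd` and `disparity_scanline` and the `window` and `max_disp` parameters are hypothetical, and real stereo matchers work on rectified 2D images with many refinements.

```python
def ssd(a, b):
    """Sum of squared differences between two equal-length patches."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def disparity_scanline(left, right, window=3, max_disp=4):
    """Estimate per-pixel disparity on one scanline by SSD block matching.

    For each pixel in the left row, compare its patch against patches in the
    right row shifted left by d = 0..max_disp and keep the d with lowest SSD.
    Border pixels without a full window keep disparity 0.
    """
    half = window // 2
    n = len(left)
    disp = [0] * n
    for x in range(half, n - half):
        patch = left[x - half : x + half + 1]
        best_cost, best_d = None, 0
        for d in range(max_disp + 1):
            xr = x - d
            if xr - half < 0:
                break  # candidate window would fall outside the right row
            cost = ssd(patch, right[xr - half : xr + half + 1])
            if best_cost is None or cost < best_cost:
                best_cost, best_d = cost, d
        disp[x] = best_d
    return disp
```

Calling `disparity_scanline(left_row, right_row)` on two rows of intensities returns one integer disparity per pixel; ties are resolved in favor of the smallest shift.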
  • 4
    T81 558

    Applications of Deep Neural Networks

    Deep learning is a group of exciting new technologies for neural networks. Through a combination of advanced training techniques and neural network architectural components, it is now possible to create neural networks that can handle tabular data, images, text, and audio as both input and output. Deep learning allows a neural network to learn hierarchies of information in a way that is like the function of the human brain. This course will introduce the student to classic neural network structures, Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), Generative Adversarial Networks (GAN) and reinforcement learning. Application of these architectures to computer vision, time series, security, natural language processing (NLP), and data generation will be covered. High-Performance Computing (HPC) aspects will demonstrate how deep learning can be leveraged both on graphics processing units (GPUs) and on grids.
    Downloads: 0 This Week
  • 5
    The STAIR Vision Library (SVL), originally developed to support the STanford AI Robot, provides software infrastructure for computer vision, machine learning, and probabilistic graphical models.
    Downloads: 0 This Week
  • 6

    Time Adaptive Self-Organizing Map

    An Artificial Neural Network for Clustering, Classification, etc

    This project aims to collect Time Adaptive Self-Organizing Map (TASOM) implementations for solving Computational Intelligence problems such as Pattern Recognition, Computer Vision, Clustering, Active Contour Modeling, and the like. The TASOM was originally introduced for adaptive and changing environments. Several versions of TASOM networks have been introduced. Some of them can change the number of neurons based on the problem at hand. Moreover, a binary tree version of the TASOM has been introduced for faster performance.
    Downloads: 0 This Week
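For readers unfamiliar with self-organizing maps, the sketch below shows one training step of a plain SOM on a 1-D chain of neurons. It is an assumption-laden illustration, not the TASOM itself: the time-adaptive learning rates and neighborhood sizes that distinguish TASOM are replaced here by fixed, hypothetical `lr` and `radius` parameters, and the function names are invented for this example.

```python
def best_matching_unit(weights, x):
    """Index of the neuron whose weight vector is closest to x (squared Euclidean)."""
    return min(
        range(len(weights)),
        key=lambda i: sum((w - v) ** 2 for w, v in zip(weights[i], x)),
    )


def som_step(weights, x, lr=0.5, radius=1):
    """One update of a basic SOM on a 1-D chain of neurons (modifies weights in place).

    The winning neuron and its chain neighbors within `radius` move toward the
    input x by a fraction `lr`. A TASOM would adapt lr and radius over time;
    here both are fixed for simplicity.
    """
    bmu = best_matching_unit(weights, x)
    for i in range(len(weights)):
        if abs(i - bmu) <= radius:
            weights[i] = [w + lr * (v - w) for w, v in zip(weights[i], x)]
    return bmu
```

Repeating `som_step` over a stream of input vectors gradually pulls the chain of weight vectors toward the regions where the inputs lie.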
  • 7
    Tiny

    Tiny Face Detector, CVPR 2017

    This repository implements the Tiny Face Detector (from Hu & Ramanan, CVPR 2017) in MATLAB (using MatConvNet). The method is designed to detect tiny faces (i.e. very small-scale faces) by combining multi-scale context modeling, foveal descriptors, and scale enumeration strategies; the foveal descriptors incorporate context for low-resolution faces. It provides training/testing scripts, a demo (tiny_face_detector.m), model loading, evaluation on WIDER FACE, and supporting utilities (e.g. cnn_widerface_eval.m). A pretrained model is provided (ResNet101-based, plus alternatives). The code depends on MatConvNet, which must be compiled (with GPU / CUDA / cuDNN support) for full performance.
    Downloads: 0 This Week
  • 8
    Toolbox

    Piotr's Image & Video Matlab Toolbox

    Piotr’s Image & Video MATLAB Toolbox is a general-purpose MATLAB toolbox for image and video processing and vision tasks, offering utilities, filters, detection, feature extraction, and algorithm building blocks. It augments MATLAB’s native capabilities (not replacing the Image Processing Toolbox) by providing efficient, reusable wrappers and optimized routines. It includes example and demo scripts (e.g. acfReadme, detector readmes) and supports compilation / mex (for speed) and cross-platform compatibility.
    Downloads: 0 This Week
  • 9
    C++ library that implements basic data structures and algorithms for computer vision and image processing applications
    Downloads: 0 This Week
  • 10
    TorchIO

    Medical imaging toolkit for deep learning

    TorchIO is an open-source Python library for efficiently reading, preprocessing, sampling, augmenting, and writing 3D medical images in deep learning applications written in PyTorch, whose design it follows. It supports patch-based sampling and includes multiple intensity and spatial transforms for data augmentation and preprocessing. These transforms include typical computer vision operations such as random affine transformations and also domain-specific ones such as simulation of intensity artifacts due to MRI magnetic field inhomogeneity (bias) or k-space motion artifacts.
    Downloads: 0 This Week
  • 11
    Travis

    Travis is a small library for computer vision in robots

    Travis contains several algorithms for computer vision in robotics. There are tracking, motion detection and reactive navigation functions.
    Downloads: 0 This Week
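The motion detection mentioned in the Travis entry can be illustrated with simple frame differencing. This sketch does not use or reproduce the Travis API; `motion_mask`, `motion_fraction`, and the `threshold` value are hypothetical names and settings chosen for the example, and frames are assumed to be grayscale images stored as lists of rows.

```python
def motion_mask(prev, curr, threshold=20):
    """Mark pixels whose intensity changed by more than `threshold` between two frames."""
    return [
        [abs(c - p) > threshold for p, c in zip(row_prev, row_curr)]
        for row_prev, row_curr in zip(prev, curr)
    ]


def motion_fraction(mask):
    """Fraction of pixels flagged as moving; 0.0 for an empty mask."""
    total = sum(len(row) for row in mask)
    if total == 0:
        return 0.0
    return sum(1 for row in mask for flagged in row if flagged) / total
```

A robot controller could, for instance, treat a frame pair as containing motion when `motion_fraction` exceeds some small cutoff, which filters out isolated noisy pixels.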
  • 12

    UAPD

    UAlbany Pedestrian Detection Project

    UAlbany Pedestrian Detection Project (Computer Vision)
    Downloads: 0 This Week
  • 13
    UoMASM

    A C++ Active Shape Model Library from the University of Manchester

    A library and associated tools for building and searching with Active Shape Models (ASM). It builds on the VXL computer vision libraries. Includes a GUI tool to search new images with the ASM. Depends on: VXL, Qt (Version 4), UoMqVXL
    Downloads: 0 This Week
  • 14
    UoMqVXL

    Qt based GUI classes for the VXL Computer Vision Libraries

    Qt based GUI classes for VXL from the University of Manchester. Includes libraries to display VXL images and graphics, including shape model manipulation. Also includes tools to display images and points, and to annotate images with points.
    Downloads: 0 This Week
  • 15
    VGGT

    [CVPR 2025 Best Paper Award] VGGT

    VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. The repo provides inference pipelines to estimate geometry from monocular inputs, stereo pairs, or brief sequences, together with evaluation harnesses for common geometry benchmarks. Training utilities highlight data curation and augmentations that preserve geometric cues while improving generalization across scenes and cameras.
    Downloads: 0 This Week
  • 16
    VOSM2DExplorer

    VOSM 2D Explorer

    An Eclipse SWT Java GUI based application to demonstrate 2D statistical models in computer vision, for face representation.
    Downloads: 0 This Week
  • 17
    VRN

    Code for "Large Pose 3D Face Reconstruction

    The VRN (Volumetric Regression Network) repository implements the “Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression” method. Instead of explicitly fitting a 3D model via landmark estimation and deformation, VRN treats the reconstruction task as volumetric segmentation: it trains a CNN to regress a 3D volume aligned to the input image, and then extracts a mesh from that volume as an isosurface. The network is unguided (it uses no 2D landmarks as an intermediate step). The mesh surfaces can be textured (in the MATLAB branch) and colored. A Docker container is provided for easy CPU deployment.
    Downloads: 0 This Week
  • 18
    Computer vision workshop project: real-time video processing and simulation of musical instruments.
    Downloads: 0 This Week
  • 19
    Vehicular Traffic Variables Acquisition
    Software for acquiring vehicular traffic variables using the computer vision library OpenCV and Qt.
    Downloads: 0 This Week
  • 20
    ViAmI-Server

    Pattern recognition for ADL events

    This software uses computer vision algorithms to mine sequence data from telemonitoring data with CBRs. The approach detects changes in behavior, observed through sensor/video fusion and occurring at radically different time scales, using a CBR with two levels: low and high. The system continuously updates its database with the daily data.
    Downloads: 0 This Week
  • 21
    ViKi (Virtual Interactive keyboard Interface) is a global framework that enables contactless human-machine interaction using computer vision techniques. A simple webcam is sufficient to emulate traditional devices such as a mouse and keyboard.
    Downloads: 0 This Week
  • 22
    VideoMan Library

    C++ library for image acquisition and visualization

    Library for capturing video from cameras, 3d sensors, frame-grabbers, video files and image sequences. It can also display multiple images using OpenGL with different layouts. Easy integration with OpenCV, CUDA... Perfect for computer vision. Keywords: video capture, computer vision, machine vision, opencv, opengl, cameras, video input devices, firewire, usb, gige
    Downloads: 0 This Week
  • 23
    The Vision Analysis Toolkit provides a framework for the development of computer vision algorithms using arbitrary named channel containers, flexible data types, and 1/2/3-D buffers.
    Downloads: 0 This Week
  • 24
    A C# library for use in image processing and computer vision research.
    Downloads: 0 This Week
  • 25
    The webcam computer vision (WCCV) project focuses on computer vision algorithms and systems that follow the web cam paradigm: cheap, robust and efficient. The project comprises designs, patterns, code, tutorials and examples.
    Downloads: 0 This Week