Showing 116 open source projects for "files"

View related business solutions
  • Award-winning proxy networks, AI-powered web scrapers, and business-ready datasets for download.
 Icon
    Award-winning proxy networks, AI-powered web scrapers, and business-ready datasets for download.


    How the world collects public web data

    Bright Data is a leading data collection platform, enabling businesses to collect crucial structured and unstructured data from millions of websites through our proprietary technology. Our proxy networks give you access to sophisticated target sites using precise geo-targeting. You can also use our tools to unblock tough target sites, accomplish SERP-specific data collection tasks, manage and optimize your proxy performance as well as automating all of your data collection needs.
    Learn More
  • PageDNA: Web-to-Print eCommerce Software Icon
    PageDNA: Web-to-Print eCommerce Software

    eCommerce for Print, Signs and Fulfillment Trusted by In‑Plants and Commercial Print Leaders

    PageDNA enables successful eCommerce strategies for commercial print sales organizations, internal print shops, and brand owners. PageDNA’s online ordering platform increases print volume while decreasing touch costs for all stakeholders: clientele, print operations, and the organizations they support.
    Learn More
  • 1
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 102 This Week
    Last Update:
    See Project
  • 2
    pdfly

    pdfly

    CLI tool to extract (meta)data from PDF and manipulate PDF files

    A Python library designed for manipulating PDF files with functionalities for extraction, transformation, and document generation.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    PackageDev

    PackageDev

    Tools to ease the creation of snippets, syntax definitions, etc.

    PackageDev provides syntax highlighting and other helpful utility for Sublime Text resource files. Resource files are ways of configuring the Sublime Text text editor to various extends, including but not limited to: custom syntax definitions, context menus (and the main menu), and key bindings. Thus, this package is ideal for package developers, but even normal users of Sublime Text who want to configure it to their liking should find it very useful.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    TikZ

    TikZ

    TikZ figures for concepts in physics/chemistry/ML

    Collection of 111 standalone TikZ figures for illustrating concepts in physics, chemistry, and machine learning. Check out janosh.github.io to search, sort, open in Overleaf, and download figures (PDF/SVG/PNG) from this collection.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Silverware is an enterprise-grade hospitality platform built for hotels, resorts, and complex multi-venue operations. Icon
    Silverware is an enterprise-grade hospitality platform built for hotels, resorts, and complex multi-venue operations.

    Silverware powers high-end hospitality environments

    Silverware is built for hotel, resort, and multi-venue hospitality operators who need enterprise-grade control, deep integrations, and always-on reliability to run complex operations at scale.
    Learn More
  • 5
    Extract TOTP/HOTP secrets

    Extract TOTP/HOTP secrets

    Extract one time password (OTP) secrets from QR codes

    The Python script extract_otp_secrets.py extracts one-time password (OTP) secrets from QR codes exported by two-factor authentication (2FA) apps such as "Google Authenticator".
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    WriteTeX

    WriteTeX

    An Inkscape extension: Latex/Tex editor for Inkscape

    Due to an incompatible change of the Inkscape extension API, this extension has to split into two versions. For Inkscape versions lower than 1.0, users should use the files in the 0.9.x folder, the other users should use files in the 1.0.x folder.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Nano PDF Editor

    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 8
    python-bibtexparser v2

    python-bibtexparser v2

    Bibtex parser for Python 3

    Welcome to python-bibtexparser, a parser for .bib files with a long history and wide adaption. Bibtexparser is available in two versions: V1 and V2. For new projects, we recommend using v2 which, in the long run, will provide an overall more robust and faster experience. For now, however, note that v2 is an early beta, and does not contain all features of v1.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    FreeTAKServer

    FreeTAKServer

    Situational Awareness Server compatible with TAK clients

    ...It's free and open source (released under the Eclipse Public License. FTS allows you to connect ATAK clients to share geo-information, to chat with all the connected clients, exchange files and more. It intends to support all the major use cases of the original TAK server.
    Downloads: 9 This Week
    Last Update:
    See Project
  • Evertune | Improve Your Brand's Visibility in AI Search Icon
    Evertune | Improve Your Brand's Visibility in AI Search

    For enterprise marketing teams looking for a platform to understand and influence how AI models like ChatGPT recommend their products or services.

    Evertune is the Generative Engine Optimization (GEO) platform that helps brands improve visibility in AI search across ChatGPT, AI Overview, Gemini, Claude and more.
    Learn More
  • 10
    RenderCV

    RenderCV

    LaTeX CV generator from a YAML/JSON input file

    RenderCV is a LaTeX CV/resume framework. It allows you to create a high-quality CV as a PDF from a YAML file with full Markdown syntax support and complete control over the LaTeX code. RenderCV offers built-in LaTeX and Markdown templates ready to produce high-quality CVs. However, the templates are entirely arbitrary and can easily be updated to leverage RenderCV's capabilities with your custom CV themes.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    srsly

    srsly

    Modern high-performance serialization utilities for Python

    ...Currently supports JSON, JSONL, MessagePack, Pickle, and YAML. Serialization is hard, especially across Python versions and multiple platforms. After dealing with many subtle bugs over the years (encodings, locales, large files) our libraries like spaCy and Prodigy have steadily grown a number of utility functions to wrap the multiple serialization formats we need to support (especially json, msgpack and pickle). These wrapping functions ended up duplicated across our codebases, so we wanted to put them in one place.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12
    autopep8

    autopep8

    A tool that automatically formats Python code to conform to the PEP 8

    autopep8 automatically formats Python code to conform to the PEP 8 style guide. It uses the pycodestyle utility to determine what parts of the code need to be formatted. autopep8 is capable of fixing most of the formatting issues that can be reported by pycodestyle. Correct deprecated or non-idiomatic Python code (via lib2to3). Use this for making Python 2.7 code more compatible with Python 3. Put a blank line between a class docstring and its first method declaration. Remove blank lines...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    adstex

    adstex

    Automated generation of NASA ADS bibtex entries directly from citation

    adstex automatically identifies all citation keys (e.g., identifiers, author+year) in your TeX source files and uses NASA's Astrophysics Data System (ADS) to generate corresponding bibtex entries. Write your papers without worrying about the bibtex entries. Simply put down arXiv IDs, ADS bibcodes, DOIs, or first author & year citation keys in your \cite commands, and then use adstex to automatically generate the bibtex file for you.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    RSS to Telegram Bot

    RSS to Telegram Bot

    A Telegram RSS bot that cares about your reading experience

    A Telegram RSS bot that cares about your reading experience.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    py-pdf-parser

    py-pdf-parser

    A Python tool to help extracting information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    wikmd

    wikmd

    A file based wiki that uses markdown

    ...Instead of storing the data in a database I chose to have a file-based system. The advantage of this system is that every file is directly readable inside a terminal etc. Also when you have direct access to the system you can export the files to anything you like.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    jello

    jello

    CLI tool to filter JSON and JSON Lines data with Python syntax

    Filter JSON and JSON Lines data with Python syntax. jello is similar to jq in that it processes JSON and JSON Lines data except jello uses standard python dict and list syntax. JSON or JSON Lines can be piped into jello via STDIN or can be loaded from a JSON file or JSON Lines files (JSON Lines are automatically slurped into a list of dictionaries). Once loaded, the data is available as a python list or dictionary object named '_'. Processed data can be output as JSON, JSON Lines, bash array lines, or a grep-able schema.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    arxiv_latex_cleaner

    arxiv_latex_cleaner

    arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper

    This tool allows you to easily clean the LaTeX code of your paper to submit to arXiv. From a folder containing all your code, e.g. /path/to/latex/, it creates a new folder /path/to/latex_arXiv/, that is ready to ZIP and upload to arXiv.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    LovelyPlots

    LovelyPlots

    Matplotlib style sheets to nicely format figures for scientific papers

    LovelyPlots is a repository containing matplotlib style sheets to nicely format figures for scientific papers, theses, and presentations while keeping them fully editable in Adobe Illustrator. Additionally, .svg export options allow figures to automatically adapt their font to your document's font. For example, .svg figures imported in a .tex file will automatically be generated with the text font used in your .tex file.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    PdfBooklet
    PdfBooklet is a Python Gtk application which allows to make books or booklets from existing pdf files. It can also adjust margins, rotate, scale, merge files or extract pages.
    Leader badge
    Downloads: 191 This Week
    Last Update:
    See Project
  • 21
    asammdf

    asammdf

    Fast Python reader and editor for ASAM MDF / MF4 (Measurement Format)

    *asammdf* is a fast Python parser and editor for ASAM (Associtation for Standardisation of Automation and Measuring Systems) MDF / MF4 (Measurement Data Format) files. It supports MDF versions 2 (.dat), 3 (.mdf) and 4 (.mf4). *asammdf* works on Python 2.7, and Python >= 3.4
    Leader badge
    Downloads: 103 This Week
    Last Update:
    See Project
  • 22
    Echo HTML Viewer

    Echo HTML Viewer

    Fast offline HTML viewer for opening local HTML files on Windows

    Echo HTML Viewer is a lightweight desktop app for viewing local HTML files without a browser or internet connection. Designed for simplicity and privacy, it lets you open saved web pages, documentation, and archived content in a clean, distraction-free interface. Key features: • Open HTML files instantly • Drag & drop support • Fast startup and low resource usage • Fully offline — no telemetry, no tracking • No background services Use cases: • View saved websites offline • Read HTML documentation • Preview local HTML files quickly FREE version includes core functionality...
    Leader badge
    Downloads: 60 This Week
    Last Update:
    See Project
  • 23
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files.
    Leader badge
    Downloads: 100 This Week
    Last Update:
    See Project
  • 24

    realwatermark

    A Python application to add watermarks (text or image) to PDF files

    A Python application to add watermarks (text or image) to PDF files, converts them into image and back to PDF with options for OCR and compression.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    pypandoc

    pypandoc

    Thin wrapper for "pandoc" (MIT)

    Pypandoc provides a thin wrapper for pandoc, a universal document converter. Pypandoc uses pandoc, so it needs an available installation of pandoc. Pypandoc provides 2 packages, "pypandoc" and "pypandoc_binary", with the second one including pandoc out of the box. The 2 packages are identical, with the only difference being that one includes pandoc, while the other don't. If pandoc is already installed (i.e. pandoc is in the PATH), pypandoc uses the version with the higher version number,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB