Search Results for "python web crawler" - Page 3

Showing 342 open source projects for "python web crawler"

  • 1

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can set up a multi-threaded web crawler in a few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. The shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 0 This Week
  • 2

    Holarse

    Website software for Holarse

    HolaCMS 3 source code, which will power the new Holarse website.
    Downloads: 0 This Week
  • 3

    Apache GUI

    A free, open source Apache HTTP Server GUI developed with Java.

    This project provides a free, Java-based, open source GUI for the Apache HTTP Server. The solution is deployed as a web application that is accessible through a web browser. It is designed to be compatible with modern Windows and Unix-like operating systems. Source code for this project is hosted on GitHub: https://github.com/jrossi227/ApacheGUI
    Downloads: 51 This Week
  • 4

    Gecco

    Lightweight Java web crawler framework with jQuery-style extraction

    Gecco is a lightweight web crawler framework written in Java that simplifies the process of building web scraping applications. It is designed to make crawler development straightforward by allowing developers to extract page elements using jQuery-style selectors rather than complex parsing logic. It integrates several well-known Java libraries and frameworks, including tools for HTTP requests, HTML parsing, JSON processing, and application development. ...
    Downloads: 3 This Week
  • 5

    Kingpin

    NEXT GENERATION BUSINESS PROCESS AND DOCUMENT MANAGEMENT SYSTEM

    KINGPIN - A New Paradigm of Working in Action: Connecting People, Process, System and Technology. Kingpin can understand all forms of collaborative tasks and interact naturally with teams, systems, and people at scale. Say goodbye to siloed solutions. Make your work effortless and efficient with a single solution on a connected architecture that takes care of all your management needs. Kingpin facilitates communication between different departments, external vendors and service providers to...
    Downloads: 0 This Week
  • 6

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is free, open source software for statistical analysis, data visualization, text analysis, and predictive analytics. A newer version with a smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straightforward and easy to use, and familiar to SPSS users. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. Of course you may specify...
    Downloads: 0 This Week
  • 7
    phoneutria
    A Java web crawler: multi-threaded, scalable, high-performance, extensible and polite. It can be used to crawl and index any web or enterprise domain and is configurable through an XML configuration file.
    Downloads: 0 This Week
  • 8

    H2O-3

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning

    ...It supports many machine learning algorithms including generalized linear models, gradient boosting machines, deep learning networks, and ensemble techniques. The platform provides interfaces for multiple programming languages such as Python, R, Java, and Scala, making it accessible to a wide range of developers and data scientists. H2O-3 integrates with big data technologies such as Hadoop and Apache Spark, enabling organizations to run machine learning workflows on large-scale data infrastructure. The platform also includes a web-based interface called Flow that allows users to build models interactively through notebooks and visual tools.
    Downloads: 0 This Week
  • 9
    SIGAR (System Information Gatherer and Reporter) is a cross-platform, cross-language library and command-line tool for accessing operating system and hardware level information in Java, Perl and .NET.
    Downloads: 33 This Week
  • 10

    sourcegreed

    A Java-based crawler.
    Downloads: 0 This Week
  • 11

    YoMo

    You Only Meter Once - The open hardware smart metering system

    Research has shown that immediate feedback can increase energy saving dramatically. YoMo aims to empower people using low-cost open hardware smart meters in their household. The metering system consists of a coordination unit (a Raspberry Pi) and several smart meters (Arduino boards equipped with communication shields and our self-designed YoMo metering shield). The smart meters are attached to household devices and monitor their energy consumption. The gathered data is timestamped and...
    Downloads: 0 This Week
  • 12
    This SourceForge project page is obsolete. Please visit http://www.musicpd.org/
    Downloads: 7 This Week
  • 13

    Platform Course

    Open source framework for creating business web applications

    Platform Course 5.0.0 is an open source framework for easily developing custom solutions for unique business processes. Main advantages: cross-domain auth, LDAP integration, cross-browser GWT-based UI, big table handling, charts and geo maps, input forms on XForms. Tested with MSSQL, PostgreSQL and Oracle. The current SVN repository for this product is located at https://share.curs.ru/svn/showcase/branches/stable/ Login: reader Password: reader If you wish to know further...
    Downloads: 0 This Week
  • 14

    Scalaris

    Distributed, transactional key-value store

    Scalaris is a scalable, transactional, distributed key-value store. It was the first NoSQL database to support the ACID properties for multi-key transactions. It can be used for building scalable Web 2.0 services. Scalaris uses a structured overlay with a non-blocking Paxos commit protocol for transaction processing with strong consistency over replicas. Scalaris is implemented in Erlang.
    Downloads: 0 This Week
  • 15
    ChannelFinder Directory Service
    Generic directory service (for the EPICS tool set). Directory entries are channels with properties and tags. Applications can query a web service to retrieve channels based on wildcard matching on properties and tags, and manipulate the directory data. Code has been moved to GitHub at https://github.com/ChannelFinder
    Downloads: 0 This Week
  • 16
    openModeller is a complete C++ framework for species potential distribution modelling. The project also includes a graphical user interface, a web service interface and an API for Python.
    Downloads: 12 This Week
  • 17

    WebCollector

    WebCollector is an open source web crawler framework based on Java.

    WebCollector is an open source web crawler framework based on Java. It provides some simple interfaces for crawling the Web, so you can set up a multi-threaded web crawler in less than 5 minutes. GitHub: https://github.com/CrawlScript/WebCollector Demo: https://github.com/CrawlScript/WebCollector/blob/master/YahooCrawler.java
    Downloads: 0 This Week
  • 18

    COAR-DMS

    DMS for Linux: C++ library, server, web UI, SOAP

    COAR-DMS is a document management system for 32/64-bit Linux. It acts as a library, server and tools. Library features: - storage management, free page recycling - transaction log - indexing: full text, tags, metadata, document attributes - inverted index - versioning, collaboration - document trees, tree versioning - folders - plugins for auth (PAM, LDAP), db, file type plugins - tags - metadata (key-value pairs) - object-level security, folder and document ACLs - unix...
    Downloads: 0 This Week
  • 19
    Java-based framework for decoupling back-end services and front-end interfaces. Browse and interact with a database, a class library, a network, a log file, or any live Java object as though it were a filesystem. (It works with filesystems too!)
    Downloads: 0 This Week
  • 20

    scipion-xmipp

    Image processing framework to integrate EM software packages.

    Scipion is an image processing framework to obtain 3D models of macromolecular complexes using Electron Microscopy (3DEM). It integrates several software packages and presents a unified interface for both biologists and developers. Scipion allows users to execute workflows combining different software tools, while taking care of formats and conversions. Additionally, all steps are tracked and can be reproduced later on. Xmipp is a well-known package in EM image processing. It is integrated...
    Downloads: 0 This Week
  • 21

    OVNI

    Open Virtualization Nodes Infrastructure

    OVNI is, first of all, an AJAX web application to create and manage virtual machines on KVM nodes. It is developed with WaveMaker and relies on Libvirt to be compatible with other tools such as virsh. In the future, the project aims to provide a complete virtualization environment.
    Downloads: 0 This Week
  • 22

    openSEQ

    NGS compute distro preloaded with pipeline analysis software

    ...Software included (location - name):
      path - biobambam
      path - bwa
      path - bowtie2
      path - bowtie
      path - cufflinks
      path - bcftools
      path - tophat
      path - scalpel
      path - samtools
      path - sra-toolkit
      path - bamtools
      path - delly
      path - delly-parallel
      path - picard-tools
      path - fastahack
      path - twoBitToFa
      ~/prog/java/ - mutect
      ~/prog/python/ - platypus
      ~/prog/java/ - gatk
    Downloads: 0 This Week
  • 23

    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/ The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
  • 24

    helenos

    Web-based GUI tool to manage your data stored in Apache Cassandra

    Helenos is a free web-based environment that simplifies data exploration and schema management with the Apache Cassandra database.
    Downloads: 0 This Week
  • 25
    The goal of this project is to make it possible to access a Progress database from any external program that can use sockets. The server components (broker and agents) are written in Progress 4GL and make use of the socket capabilities of Progress V9.
    Downloads: 0 This Week