Showing 54 open source projects for "web crawler source code"

  • 1
    Java JWT JSON

    Java JWT: JSON Web Token for Java and Android

    JJWT aims to be the easiest-to-use and understand library for creating and verifying JSON Web Tokens (JWTs) and JSON Web Keys (JWKs) on the JVM and Android. JJWT is a pure Java implementation based exclusively on the JOSE Working Group RFC specifications.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Fronsetia

    A web application for testing webservices

    Fronsetia (Free Online Service Testing Application) is a Java web application that allows testing webservices (REST and SOAP). No end-user installations required, just deploy the application on your server and pass the link so that users can use their usual browsers to access Fronsetia. You can think of Fronsetia as a simple version of SoapUI on the web, or a simple and free and open-source version of the "wls_utc" utility. See the project homepage https://fronsetia.sourceforge.io and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    A generic platform for fast and flexible development and delivery of a wide variety of immersive scenario-based games that enable professional skills acquisition. For further information see http://emergo.ou.nl/emergo/community/EN/emergo.htm. The platform is based on ZK framework (http://www.zkoss.org, ZK ZOL license: https://www.zkoss.org/license#peee) and Spring framework (http://www.springsource.org/spring-framework). NOTE: as of 2024, latest (mavenized) code base can be found at...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java that provides a simple interface for crawling the Web. With it, you can set up a multi-threaded web crawler in a few minutes. You create a crawler class that extends WebCrawler; this class decides which URLs should be crawled and handles each downloaded page. Its shouldVisit function decides whether a given URL should be crawled.
    Downloads: 0 This Week
    Last Update:
    See Project
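The URL-filtering decision described for crawler4j can be sketched without the library itself. The following is a minimal, standalone illustration of the kind of logic a shouldVisit function might contain; the class name `ShouldVisitSketch`, the seed prefix `https://example.com/`, and the extension list are all assumptions made for this example. In crawler4j the check would live in an override inside a class extending WebCrawler, whose exact method signature depends on the library version and is not reproduced here.

```java
import java.util.Locale;
import java.util.regex.Pattern;

public class ShouldVisitSketch {
    // Hypothetical seed prefix: only URLs under this site are crawled.
    private static final String ALLOWED_PREFIX = "https://example.com/";

    // Hypothetical list of extensions for resources that are not worth parsing as pages.
    private static final Pattern SKIPPED_EXTENSIONS =
            Pattern.compile(".*\\.(css|js|gif|jpe?g|png|mp3|mp4|zip|gz)$");

    // Returns true when the URL is on the allowed site and does not end in a skipped extension.
    public static boolean shouldVisit(String url) {
        String href = url.toLowerCase(Locale.ROOT);
        return href.startsWith(ALLOWED_PREFIX)
                && !SKIPPED_EXTENSIONS.matcher(href).matches();
    }

    public static void main(String[] args) {
        String[] candidates = {
            "https://example.com/docs/index.html",
            "https://example.com/logo.png",
            "https://other.example.org/page.html"
        };
        for (String candidate : candidates) {
            System.out.println(candidate + " -> " + shouldVisit(candidate));
        }
    }
}
```

Keeping the decision in a single pure function makes it easy to unit-test the crawl scope before wiring it into a real crawler.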
  • 5
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    SmartSlog (Smart Space Ontology) is a software/application development kit (SDK or ADK) for programming Smart-M3 agents (Knowledge Processors, KPs) that consume and produce smart space content according to its high-level ontological representation. SmartSlog applies a code generation approach: given an OWL ontology description, it produces an ontology programming library. That library provides an API to access the smart space via a Smart-M3 Semantic Information Broker (SIB) and data structures...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    YADS

    Yet Another DPWS Stack, a WS4D JMEDS fork (discontinued)

    Note: This project has been discontinued due to the lack of quality of the original project's code base. I implemented a new DPWS stack from scratch for other projects, which was less work... YADS (Yet Another DPWS Stack) is a fork of the WS4D JMEDS webservice stack. YADS tries to integrate more standard Java SE features (type-safe template containers, concurrency, logging and modern Java language features) and high level HTTP components using non-blocking connections. YADS depends on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    SoapUI
    The SoapUI download has moved to https://www.soapui.org/downloads/soapui/source-forge.html. With more than 9 million downloads, SoapUI is the de facto standard for REST and SOAP API functional, security and performance testing.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 10
    EXMARaLDA
    EXMARaLDA stands for "Extensible Markup Language for Discourse Annotation". It's a system of concepts, data formats and tools for the computer assisted transcription and annotation of spoken language, and the analysis of spoken language corpora. This project's source code has moved to https://github.com/Exmaralda-Org/exmaralda
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    xorlisp

    Bit level lambda continuations and nothing else - Queue automata

    Not working yet. To deal with the Halting Problem, computing and data are navigated using debugger ops: linearForward and treeForward, which navigate an astronomically large bit string where 1 is ( and 0 is ). All pairs are derived from (). For example, true is represented as ((()())()), and false is (()(()())). It appears related to the church encoding of lambda where T chooses first parameter and F chooses second, of a pair. Continuations are nearly finished code and are represented as a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    NexusDataLink

    Connect, monitor and control your (embedded) systems remotely. m2m/IoT

    Connect, monitor and control your systems or embedded devices remotely (m2m/IoT) - for example your Raspberry Pi. The communication interface is defined in XML automatically providing a REST interface. NexusDataLink integrates smoothly in existing software or firmware and significantly reduces connection- or communication-related source code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has moved to GitHub: https://github.com/tomschr/dbcookbook/. The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets, written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    FM-Classic provides an easy way to get data from Java servlets into Web pages, and helps you keep graphic design separate from application logic. FM-Classic is a continuation of the FreeMarker 1.x code base.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15

    fiskalizacija

    Java code for signing and handling Croatian fiscal service requests

    Java code for signing and handling requests for the Croatian fiscal service. Includes unsigned and signed XML samples. For completeness, my personal testing key is included. Unit tests with sample data are provided for ease of use. The whole Eclipse project is committed to SourceForge Git. Commercial support for integration into other Java applications is available upon request. Enjoy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    NextBus Public Feed Adapter for Java

    A Java adapter for the NextBus Public XML Feed Service

    RESTful web service adapter for the NextBus prediction and transit network reporting service. This implementation uses XSD Schema, JAXB and a domain object model to decouple the consuming application from the over-the-wire representation of the data. NextBus currently provides real-time bus data for several major metro transit agencies including, but not limited to, MBTA (Boston), Toronto, Seattle, Brooklyn and Staten Island, as well as a number of university towns. In addition to the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Framework (scripts, configuration, code) to build free and public services around travel and leisure data. The project makes extensive use of existing data sources such as Geonames and dbPedia, and adds some glue around them (e.g., links).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    earmark
    *** IMPORTANT NOTICE *** The source code of the earliest version of the EARMARK Data Structure is now available at http://www.github.com/essepuntato/EarmarkDataStructure Even if the SourceForge repository is still active, it concerns old versions of the API and it is not maintained anymore. *** SERVICE DESCRIPTION *** Extremely Annotational RDF Markup (EARMARK) is an ontological approach to the specification of markup structures on text content. It allows not only documents...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    *** IMPORTANT NOTICE *** The source code of the earliest version of LODE is now available at http://www.github.com/essepuntato/LODE Even if the SourceForge repository is still active, it concerns old versions of the tool and it is not maintained anymore. *** SERVICE DESCRIPTION *** LODE is a service that extracts axioms and annotations from an OWL ontology, and renders them as ordered lists, together with their textual definitions, in a human-readable HTML page designed for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    essential is a lightweight REST framework for Java with focus on processing XML and JSON out of the box.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Service Invocation Tool generates Java code to implement a declared service facade by calling operations of another service and mapping data structures between them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Given an XSD schema and an XSA.xml configuration file, XML Skeleton Annotations (XSA) generates JSF forms UI to create XML records compliant to the XSD and following the XML skeleton defined in XSA.xml, still keeping everything under your control.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    WalkingToolsGpx

    Web editor for locative media GPS projects, deploys to Android App

    NOTE: There is no longer a downloadable desktop app. The project is in the cloud now. See hipergeo.net and google play->hipergeo, still very beta, but up and running. The Walking Tools Reference Platform APIs provide support for GPS and GPX data processing for locative art practices (such as geo-annotation, drawing and narrative), in addition to telematic geospatial performances and other experimental practices with mobile phones. Among the resources are flexible, functional Android...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MOLGENIS can be used to generate databases for life science experiments (microarray, mass spectrometry, genomics) with a web user interface, a CSV exchange format, and programmatic interfaces (web services, REST, and R-project). See NatRevGen 8.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Simple Remoting
    This project is retired and has moved into the Corn Gate project; see https://sourceforge.net/projects/corn-gate.corn.p/ for the current version. Simple Remoting is an alternative SOA library that uses JSON instead of XML as its messaging format. Popular Java object types such as POJOs, Spring beans and EJBs can be converted to services within minutes, without any code change to existing classes.
    Downloads: 0 This Week
    Last Update:
    See Project