Showing 1600 open source projects for "java html parser"

View related business solutions
  • PeerGFS PEER Software - File Sharing and Collaboration Icon
    PeerGFS PEER Software - File Sharing and Collaboration

    One Solution to Simplify File Management and Orchestration Across Edge, Data Center, and Cloud Storage

    PeerGFS is a software-only solution developed to solve file management/file replication challenges in multi-site, multi-platform, and hybrid multi-cloud environments.
    Learn More
  • Striven | All In One Business Management Software Icon
    Striven | All In One Business Management Software

    Striven is an all-in-one business management software suite with everything your organization needs for success.

    Striven is the all-in-one business management software that lowers your costs, improves your operations, and makes work easier. Make your company’s data coherent, connected, and relevant.
    Learn More
  • 1
    Java Tablesaw

    Java Tablesaw

    Java dataframe and visualization library

    Tablesaw is a dataframe and visualization library that supports loading, cleaning, transforming, filtering, and summarizing data. If you work with data in Java, it may save you time and effort. Tablesaw also supports descriptive statistics and can be used to prepare data for working with machine learning libraries like Smile, Tribuo, H20.ai, DL4J. Import data from RDBMS, Excel, CSV, TSV, JSON, HTML, or Fixed Width text files, whether they are local or remote (http, S3, etc.) Tablesaw supports data visualization by providing a wrapper for the Plot.ly JavaScript plotting library. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    commonmark-java

    commonmark-java

    Java library for parsing and rendering CommonMark (Markdown)

    Java library for parsing and rendering Markdown text according to the CommonMark specification (and some extensions). Provides classes for parsing input to an abstract syntax tree of nodes (AST), visiting and manipulating nodes, and rendering to HTML. It started out as a port of commonmark.js, but has since evolved into a full library with a nice API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees. It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Inspections+ Mobile forms for Dynamics 365 - Resco.net Icon
    Inspections+ Mobile forms for Dynamics 365 - Resco.net

    Start collecting field data without the hassles of complicated development thanks to resco.Inspections' native integration with Dynamics 365.

    Equip your frontline teams with a robust digital solution to simplify data collection and reporting. Handle inspections and audits effortlessly, even in remote locations, and create comprehensive reports on the spot, all integrated with Dynamics 365.
    Learn More
  • 5
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    JavaParser

    JavaParser

    Java 1-17 Parser and Abstract Syntax Tree for Java

    This project contains a set of libraries implementing a Java 1.0 - Java 17 Parser with advanced analysis functionalities. The project binaries are available in Maven Central. We strongly advise users to adopt Maven, Gradle or another build system for their projects. If you are not familiar with them we suggest taking a look at the maven quickstart projects. Since Version 3.5.10, the JavaParser project includes the JavaSymbolSolver.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    JSON Schema Validator

    JSON Schema Validator

    A fast Java JSON schema validator that supports draft V4, V6, V7, V201

    This is a Java implementation of the JSON Schema Core Draft v4, v6, v7, v2019-09 and v2020-12(partial) specification for JSON schema validation. In addition, it also works for OpenAPI 3.0 request/response validation with some configuration flags. For users who want to collect information from a JSON node based on the schema, the walkers can help.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 8
    JSqlParser

    JSqlParser

    JSqlParser parses an SQL statement and translate it into a hierarchy

    JSqlParser is a SQL statement parser. It translates SQLs in a traversable hierarchy of Java classes. JSqlParser is not limited to one database but provides support for a lot of specials of Oracle, SqlServer, MySQL, PostgreSQL. To name some, it has support for Oracles join syntax using (+), PostgreSQLs cast syntax using ::, relational operators like != and so on. Then the result can be accessed in a structured way.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    jackson-core

    jackson-core

    Core part of Jackson that defines Streaming API

    This project contains core low-level incremental ("streaming") parser and generator abstractions used by Jackson Data Processor. It also includes the default implementation of handler types (parser, generator) that handle JSON format. The core abstractions are not JSON specific, although naming does contain 'JSON' in many places, due to historical reasons. Only packages that specifically contain the word 'json' are JSON-specific. This package is the base on which Jackson's data-binding...
    Downloads: 15 This Week
    Last Update:
    See Project
  • The AI coach for teams, built on validated assessments. Icon
    The AI coach for teams, built on validated assessments.

    Cloverleaf is an assessment-backed AI Coach that fully understands your people and the context of their workday.

    Give managers and teams proactive, contextual coaching to lead effectively, communicate clearly, and navigate real work situations as they happen.
    Learn More
  • 10
    Apache Baremaps

    Apache Baremaps

    Create custom vector tiles from OpenStreetMap

    Baremaps is an Apache Incubator project that provides tools and a Java-based pipeline for building and rendering vector tiles from OpenStreetMap (OSM) data. It’s designed for fast map generation, serving tiles, and supporting real-time updates, making it a powerful backend for map-based applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Hawtio

    Hawtio

    Hawtio web console helps you manage your JVM stuff and stay cool

    Hawtio is a lightweight and modular Web console for managing Java applications. Hawtio has plugins such as: Apache Camel and JMX (Logs, Spring Boot, Quartz, and more will be provided soon). You can dynamically extend Hawtio with your own plugins or automatically discover plugins inside the JVM. The only server-side dependency (other than the static HTML/CSS/JS/images) is the excellent Jolokia library which has a small footprint (around 300KB) and is available as a JVM agent or comes embedded as a servlet inside the hawtio-default.war.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    JBake

    JBake

    Java based open source static site/blog generator for developers

    JBake is a Java-based, open source, static site/blog generator for developers & designers. The project uses Gradle 4.9+ as the build system. We configured the gradle check style Plugin to run with the check task. It does not break the build if convention violations are found. But prints a warning and generates a report. Source available on GitHub, licensed under MIT License.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor and Toolchain written with JavaFX 19

    Asciidoc FX is a WYSIWYG editor for the Asciidoc markup language. You can build PDF, Epub, and HTML books, documents, and slides. Supported Operating Systems and Builds shows the list of available builds with links for reference. If you are looking for the very latest version, visit the link in the note above to be guaranteed of downloading the latest and greatest version of AsciidocFX. AsciidocFX converts documents via the AsciidoctorJ library.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    WebMagic

    WebMagic

    A scalable web crawler framework for Java

    WebMagic is a scalable crawler framework. It covers the whole lifecycle of crawler, downloading, url management, content extraction and persistent. It can simplify the development of a specific crawler. WebMagic is a simple but scalable crawler framework. You can develop a crawler easily based on it. WebMagic has a simple core with high flexibility, a simple API for html extracting. It also provides annotation with POJO to customize a crawler, and no configuration is needed. Some other...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    Acode

    Acode

    A powerful text/code editor for Android

    ...Acode lets you build and run websites right in your browser, debug with ease using the built-in console, and edit a wide range of source files from Python and CSS to Java, JavaScript, Dart, and more.
    Downloads: 58 This Week
    Last Update:
    See Project
  • 16
    Joern

    Joern

    Open-source code analysis platform for C/C++/Java/Binary/Javascript

    Joern is a platform for analyzing source code, bytecode, and binary executables. It generates code property graphs (CPGs), a graph representation of code for cross-language code analysis. Code property graphs are stored in a custom graph database. This allows code to be mined using search queries formulated in a Scala-based domain-specific query language. Joern is developed with the goal of providing a useful tool for vulnerability discovery and research in static program analysis.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 17
    thymeleaf

    thymeleaf

    Thymeleaf is a modern server-side Java template engine for web

    HTML templates written in Thymeleaf still look and work like HTML, letting the actual templates that are run in your application keep working as useful design artifacts. Eclipse, IntelliJ IDEA, Spring, Play, even the up-and-coming Model-View-Controller API for Java EE 8. Write Thymeleaf in your favourite tools, using your favourite web-development framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    EasyExcel

    EasyExcel

    Lightweight Java library developed by Alibaba for reading and writing

    EasyExcel is a Java library focused on reading and writing Excel files with very low memory usage, making it suitable for large datasets that overwhelm traditional APIs. It uses streaming/event-driven parsing to avoid loading entire workbooks into memory, and it maps rows to Java objects via simple annotations. Writers support multiple sheets, custom styles, merged cells, and template-based filling so production reports remain maintainable. The API emphasizes developer ergonomics: callbacks...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 19
    TeXtidote

    TeXtidote

    Spelling, grammar and style checking on LaTeX documents

    If so, you probably know that the process is far from simple. Since LaTeX documents contain special commands and keywords (the so-called "markup") that are not part of the "real" text, you cannot run a grammar checker directly on these files: it cannot tell the difference between markup and text. The other option is to remove all this markup, leaving only the "clear" text; however, when a grammar tool points to a problem at a specific line in this clear text, it becomes hard to retrace that...
    Downloads: 119 This Week
    Last Update:
    See Project
  • 20
    Kryo

    Kryo

    Java binary serialization and cloning, fast, efficient, automatic

    Kryo is a fast and efficient binary object graph serialization framework for Java. The goals of the project are high speed, low size, and an easy-to-use API. The project is useful any time objects need to be persisted, whether to a file, database or over the network. Kryo can also perform automatic deep and shallow copying/cloning. This is direct copying from object to object, not object to bytes to object. Kryo has three sets of methods for reading and writing objects. If the concrete class...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    Agents-Flex

    Agents-Flex

    Agents-Flex is an elegant LLM Application Framework like LangChain

    Agents-Flex includes a variety of network protocols for connecting LLMs, such as HTTP, SSE and WS. Its simple and flexible design allows developers to easily connect to various LLMs, including OpenAI, LLama, and other AI. Agents-Flex provides a rich set of development templates and Prompt Frameworks, including FEW-SHOT, CRISPE, BROKE, and ICIO. Developers can also customize their own unique prompt templates. Agents-Flex has a very flexible Function Calling component. It supports local method...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 22
    Apache Log4j

    Apache Log4j

    Apache Log4j 2 is a versatile, feature-rich, efficient logging API

    Apache Log4j is a versatile, industrial-grade Java logging framework composed of an API, its implementation, and components to assist the deployment for various use cases. Log4j is used by 8% of the Maven ecosystem and listed as one of the top 100 critical open source software projects. The project is actively maintained by a team of several volunteers and supported by a big community.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 23
    Byte Buddy

    Byte Buddy

    Runtime code generation for the Java virtual machine

    Byte Buddy is a code generation and manipulation library for creating and modifying Java classes during the runtime of a Java application and without the help of a compiler. Other than the code generation utilities that ship with the Java Class Library, Byte Buddy allows the creation of arbitrary classes and is not limited to implementing interfaces for the creation of runtime proxies. Furthermore, Byte Buddy offers a convenient API for changing classes either manually, using a Java agent or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    cordova-plugin-camera-preview

    cordova-plugin-camera-preview

    Cordova plugin that allows camera interaction from HTML code

    Cordova Camera Preview is a plugin that allows developers to add real-time camera preview functionality to Cordova-based mobile apps. It provides direct access to the device's camera feed with customizable overlays and controls, enabling the creation of augmented reality, scanning, and camera-based interaction apps.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Portfolio Performance

    Portfolio Performance

    Track and evaluate the performance of your investment portfolio

    Portfolio Performance: Track and evaluate the performance of your investment portfolio across stocks, cryptocurrencies, and other assets. Performance indicators like that True-Time Weighted Rate of Return or the internal rate of return ( Internal rate of return ) are calculated. Historical courses are automatically run by Yahoo Finance loaded or can be extracted from any HTML pages. Due to the open file format, all data is available as XML available or can be used as CSV be exported. Support...
    Downloads: 30 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB