DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets.
Modern Data Catalog and Metadata Platform
Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support.
Learn More
Electronic Lab Notebook (ELN) Software
Ideal for any lab. Whether you’re just starting up, a small or large academic institution, or a globally operating company.
eLabJournal is an all-in-one Electronic Lab Notebook (ELN) software that includes sample tracking and protocol management modules.
A program for organizing and commenting textual citations from texts (books, articles or other published works) for use in producing scientific or academic publications. Works together with EndNote, RefWorks and BibTeX. Exports to PDF and Word (RTF).
A Database for managing all your multimedia content with only this tool. You can have an overview about which movies you have lend to someone else. It's easy, it's portable and it's free.
This project helps JDBC programmers from creating DB connection url by hand coding; by using this package we only need to add a simple property file and initialize it with a single utility class. Currently this is developed with Apache Derby in mind.
Migratool is a software tool, which aims to easy the geospatial (and non-geospatial) data migration among both distributed and heterogeneous data sources. This tool is based on a three tiers architecture and it has been implemented using the J2EE archite
Tokyo Cabinet is the successor of QDBM, a high performance database library similar to the DBM family. It also supports hash and B-tree databases and does not require any server process. The overall speed is improved compared to QDBM.
HShreX addresses hybrid storage of XML, combining relational and native solutions. It enables quick design and evaluation of alternative choices by annotating the XML schema and is an extension of a previous tool ShreX (shrex.sourceforge.net).
Utility to load Excel files to Database(MySQL/Oracle) with automatic creation of tables and columns. Utility supports batch processing of multiple files.
MailArchiva is a powerful, full featured email archiving (email archiver) and compliance solution for mail systems such as Microsoft Exchange. It stores all incoming, outgoing and internal emails for long term storage. A web based user interface is avail
This tool is a small tool to upload a csv file to a database. This tool can analyse a CSV file and it can determine the field lenght and the column name.
Decima is a database that was designed to support time-series data mining. It consists of PostgreSQL custom type definition, implementation of GiST index for that type and snowflake database schema.
This is a project to wrap databases in order to use with java. The objective is to make the work with databases as easy as possible. In order to do this, every table will be wrap with a class of java.
Now part of Apache's Mahout machine learning project at http://mahout.apache.org/-- please see there for latest info and code and releases and support!
The Semantic Web implementation using native xml database as backend storage. A SPARQL java compiler to XQuery using Jena. There are XQuery scripts for native xml database Sedna(http://modis.ispras.ru/sedna/).
JLaufprogramm is a free tool offering the possibility to manage a complete running-event. Over time it has replaced its commercial predecessor program "Lauf".