MarDRe is a de novo MapReduce-based parallel tool that removes duplicate and near-duplicate DNA reads by clustering single-end and paired-end sequences from FASTQ/FASTA datasets. It lets bioinformaticians avoid analyzing unnecessary reads, which reduces the runtime of subsequent processing of the dataset.

MarDRe is the Big Data counterpart of ParDRe (link above), which uses HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. MarDRe instead takes advantage of the MapReduce programming model to significantly improve on ParDRe's performance on distributed systems, especially cloud-based infrastructures. Written in pure Java to maximize cross-platform compatibility, MarDRe is built on the open-source Apache Hadoop project, a widely used distributed computing framework for Big Data processing.
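
To make the idea of clustering reads in a map/reduce style more concrete, here is a minimal single-process sketch in plain Java. It is not MarDRe's actual implementation and does not use the Hadoop API. It assumes, purely for illustration, that reads are grouped by a fixed-length prefix (the "map" step) and that within each group a read counts as a near-duplicate when it differs from an already kept read in at most a given number of positions (the "reduce" step). The class name `NearDuplicateSketch`, the method names, and the parameters `prefixLen` and `maxMismatches` are all hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class NearDuplicateSketch {

    // Count mismatching positions between two reads.
    // Reads of different length are treated as distinct (assumption of this sketch).
    static int mismatches(String a, String b) {
        if (a.length() != b.length()) return Integer.MAX_VALUE;
        int count = 0;
        for (int i = 0; i < a.length(); i++) {
            if (a.charAt(i) != b.charAt(i)) count++;
        }
        return count;
    }

    // "Map" step: key each read by its first prefixLen bases so that
    // candidate duplicates end up in the same group.
    static Map<String, List<String>> mapByPrefix(List<String> reads, int prefixLen) {
        Map<String, List<String>> groups = new LinkedHashMap<>();
        for (String read : reads) {
            String key = read.substring(0, Math.min(prefixLen, read.length()));
            groups.computeIfAbsent(key, k -> new ArrayList<>()).add(read);
        }
        return groups;
    }

    // "Reduce" step: within one group, keep a read only if it differs from
    // every previously kept read by more than maxMismatches positions.
    static List<String> reduceGroup(List<String> group, int maxMismatches) {
        List<String> kept = new ArrayList<>();
        for (String read : group) {
            boolean duplicate = false;
            for (String representative : kept) {
                if (mismatches(read, representative) <= maxMismatches) {
                    duplicate = true;
                    break;
                }
            }
            if (!duplicate) kept.add(read);
        }
        return kept;
    }

    // Run both steps in sequence and concatenate the per-group results.
    static List<String> dedupe(List<String> reads, int prefixLen, int maxMismatches) {
        List<String> result = new ArrayList<>();
        for (List<String> group : mapByPrefix(reads, prefixLen).values()) {
            result.addAll(reduceGroup(group, maxMismatches));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> reads = List.of(
                "ACGTAAAA", "ACGTAAAT", "ACGTCCCC", "TTTTGGGG", "ACGTAAAA");
        System.out.println(dedupe(reads, 4, 1));
    }
}
```

With a prefix length of 4 and one allowed mismatch, the sample input keeps `ACGTAAAA`, `ACGTCCCC` and `TTTTGGGG`: the read `ACGTAAAT` and the second copy of `ACGTAAAA` fall within one mismatch of a kept read. In a real Hadoop job the grouping would be done by the framework's shuffle across nodes rather than by an in-memory map.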

License

GNU General Public License version 3.0 (GPLv3)

Additional Project Details

Operating Systems

Linux

Intended Audience

Healthcare Industry, Information Technology, Science/Research

User Interface

Command-line, Console/Terminal

Programming Language

Java

Related Categories

Java Bio-Informatics Software, Java Big Data Tool

Registered

2017-01-30