Dynamic Hadoop Fair Scheduler (DHFS) is an optimized Hadoop Fair Scheduler that improves Hadoop performance by maximizing slot utilization while guaranteeing fairness across pools. It is based on the observation that, as a job proceeds from the map phase to the reduce phase, map (or reduce) slots may sit idle at different periods of time. DHFS breaks the implicit assumption that map tasks run only on map slots and reduce tasks run only on reduce slots: unused map slots can be given to overloaded reduce tasks, and vice versa, to improve the performance of the MapReduce workload. For example, at the beginning of a MapReduce workload there are only map tasks to compute and no reduce tasks, i.e., all of the computation lies on the map side. In that case, the idle reduce slots can be used to run map tasks. Two variants of DHFS are provided for users to choose from: Pool-independent DHFS (PI-DHFS) and Pool-dependent DHFS (PD-DHFS).
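
The slot-borrowing idea described above can be illustrated with a small sketch. This is not the DHFS implementation: the names `Allocation`, `allocate_static`, and `allocate_dynamic` are hypothetical, and the model assumes a single pool with a fixed number of map and reduce slots, so it leaves out cross-pool fairness and the difference between PI-DHFS and PD-DHFS.

```python
from dataclasses import dataclass


@dataclass
class Allocation:
    """How many tasks of each kind run on each kind of slot (hypothetical model)."""
    maps_on_map_slots: int
    maps_on_reduce_slots: int
    reduces_on_reduce_slots: int
    reduces_on_map_slots: int

    @property
    def running_maps(self) -> int:
        return self.maps_on_map_slots + self.maps_on_reduce_slots

    @property
    def running_reduces(self) -> int:
        return self.reduces_on_reduce_slots + self.reduces_on_map_slots


def allocate_static(map_slots: int, reduce_slots: int,
                    pending_maps: int, pending_reduces: int) -> Allocation:
    """Static assumption: map tasks use only map slots, reduce tasks only reduce slots."""
    return Allocation(
        maps_on_map_slots=min(pending_maps, map_slots),
        maps_on_reduce_slots=0,
        reduces_on_reduce_slots=min(pending_reduces, reduce_slots),
        reduces_on_map_slots=0,
    )


def allocate_dynamic(map_slots: int, reduce_slots: int,
                     pending_maps: int, pending_reduces: int) -> Allocation:
    """Dynamic assumption: each task type fills its own slots first, then
    borrows whatever slots of the other type are still idle."""
    maps_own = min(pending_maps, map_slots)
    reduces_own = min(pending_reduces, reduce_slots)

    idle_map_slots = map_slots - maps_own
    idle_reduce_slots = reduce_slots - reduces_own

    # Overflow map tasks borrow idle reduce slots, and vice versa.
    maps_borrowed = min(pending_maps - maps_own, idle_reduce_slots)
    reduces_borrowed = min(pending_reduces - reduces_own, idle_map_slots)

    return Allocation(maps_own, maps_borrowed, reduces_own, reduces_borrowed)


if __name__ == "__main__":
    # Start of a workload: 10 map tasks waiting, no reduce tasks yet.
    print(allocate_static(4, 4, pending_maps=10, pending_reduces=0))
    print(allocate_dynamic(4, 4, pending_maps=10, pending_reduces=0))
```

With 4 map slots and 4 reduce slots, the static policy runs 4 of the 10 waiting map tasks and leaves all reduce slots idle, while the dynamic policy runs 8 map tasks by borrowing the 4 idle reduce slots. In this model only one task type can borrow at a time, because a type has overflow only when its own slots are full.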

Additional Project Details

Registered

2013-05-16