Skip to main content

Advances, Systems and Applications

Table 1 Workloads based on Hadoop framework: System Resource Utilization

From: Hadoop and memcached: Performance and power characterization and analysis

Workloads

System resource utilization

Wordsort

Sort Phase: IO-bound, Reduce Phase: Communication-bound.

Word Count

CPU-bound

Terasort

Map Stage: CPU-Bound

Reduce stage: IO-bound

NutchIndexing

IO-bound with high CPU utilizations in map stage. This workload is mainly used for web searching.

Kmeans

CPU-bound in iteration, IO-bound in clustering. It is used for machine learning and data mining.