Fig. 5From: A dockerized framework for hierarchical frequency-based document clustering on cloud computing infrastructuresThe binary tree of the NYTimes dataset. The most similar leaf clusters that was discovered during the graph construction module are presented using the same colorsBack to article page