Skip to main content

Advances, Systems and Applications

Table 7 Average values of the internal clustering evaluation metrics per tree level after the application of the branch breaking algorithm on the NYTimes dataset

From: A dockerized framework for hierarchical frequency-based document clustering on cloud computing infrastructures

Level

#C

I

IS

HS

BS

TS

0

1

0.000

0.000

0.32

58.072

NA

1

2

0.000

2.500

0.238

69.462

NA

2

4

25.000

30.000

0.152

79.915

NA

3

7

36.429

47.857

0.121

84.922

NA

4

13

48.462

62.692

0.084

90.207

0.184

5

21

55.238

71.667

0.068

92.481

0.196

6

30

55.167

74.000

0.065

93.397

0.196

7

40

57.875

77.250

0.057

94.202

0.204

8

46

61.957

79.565

0.051

94.636

0.204

9

47

60.638

78.830

0.052

94.432

0.204

10

48

59.375

78.229

0.055

94.223

0.204

11

49

58.163

77.755

0.056

94.071

0.204

12

50

57.000

77.400

0.058

93.913

0.204

13

51

55.882

77.157

0.059

93.794

0.204

14

52

54.808

77.019

0.061

93.692

0.204

15

53

53.774

76.981

0.062

93.619

0.204

16

54

52.778

77.037

0.062

93.575

0.204

17

55

51.818

77.182

0.063

93.554

0.211

18

56

50.893

77.411

0.063

93.554

0.211

19

57

50.000

77.719

0.062

93.575

0.270

20

58

49.138

78.103

0.061

93.596

0.326