Skip to main content

Advances, Systems and Applications

Table 6 Average values of the internal clustering evaluation metrics per tree level after the application of the binary tree construction algorithm on the NYTimes dataset

From: A dockerized framework for hierarchical frequency-based document clustering on cloud computing infrastructures

Level

#C

I

IS

HS

BS

TS

0

1

0.000

0.000

0.320

58.072

NA

1

2

0.000

2.500

0.238

69.462

1.000

2

4

25.000

30.000

0.152

79.915

1.000

3

7

36.429

47.857

0.121

84.922

0.814

4

13

48.462

62.692

0.084

90.207

0.539

5

21

55.238

71.667

0.068

92.481

0.524

6

32

58.125

75.938

0.060

93.833

0.468

7

49

61.837

80.204

0.048

95.010

0.422

8

72

66.181

83.819

0.041

95.926

0.393

9

101

68.911

86.584

0.034

96.594

0.367

10

138

71.377

88.659

0.029

97.071

0.350

11

184

72.962

90.272

0.025

97.417

0.335

12

235

73.617

91.447

0.022

97.626

0.314

13

298

74.295

92.416

0.020

97.814

0.298

14

374

75.013

93.583

0.018

97.993

0.286

15

460

75.304

94.543

0.016

98.127

0.278

16

561

75.383

95.463

0.015

98.240

0.275

17

681

75.206

96.300

0.013

98.335

0.263

18

827

75.224

97.056

0.011

98.432

0.256

19

997

74.493

97.723

0.010

98.484

0.249

20

1206

73.669

98.184

0.008

98.521

0.254

21

1462

72.302

98.683

0.005

98.519

0.269

22

1753

71.714

99.395

0.001

98.549

0.279

23

1965

74.221

100.000

0.000

98.711

0.287