Table 4. Focused crawler effectiveness and efficiency when the second group of hyperlink selection methods is employed

From: Hybrid focused crawling on the Surface and the Dark Web

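| Method | Metric | t = 0.5 | t = 0.6 | t = 0.7 | t = 0.8 | t = 0.9 |
|---|---|---|---|---|---|---|
| 3 | Precision | 0.742 | 0.741 | 0.777 | 0.805 | 0.914 |
| 3 | Recall | 0.619 | 0.538 | 0.523 | 0.482 | 0.269 |
| 3 | F-measure | 0.675 | 0.624 | 0.625 | 0.603 | 0.416 |
| 3 | Surface Web hits | 203 | 192 | 176 | 173 | 49 |
| 3 | Dark Web hits | 132 | 122 | 111 | 106 | 69 |
| 3 | Total hits | 335 | 314 | 287 | 279 | 118 |
| 4 | Precision | 0.768 | 0.770 | 0.766 | 0.779 | 0.874 |
| 4 | Recall | 0.596 | 0.579 | 0.563 | 0.510 | 0.282 |
| 4 | F-measure | 0.671 | 0.661 | 0.649 | 0.617 | 0.426 |
| 4 | Surface Web hits | 498 | 496 | 496 | 464 | 227 |
| 4 | Dark Web hits | 138 | 136 | 136 | 131 | 94 |
| 4 | Total hits | 636 | 632 | 632 | 595 | 321 |
| 5 | Precision | 0.743 | 0.740 | 0.777 | 0.804 | 0.913 |
| 5 | Recall | 0.617 | 0.536 | 0.520 | 0.480 | 0.266 |
| 5 | F-measure | 0.674 | 0.622 | 0.623 | 0.601 | 0.413 |
| 5 | Surface Web hits | 203 | 192 | 176 | 173 | 49 |
| 5 | Dark Web hits | 86 | 77 | 66 | 61 | 24 |
| 5 | Total hits | 289 | 269 | 242 | 234 | 73 |
| 6 | Precision | 0.698 | 0.740 | 0.777 | 0.804 | 0.913 |
| 6 | Recall | 0.617 | 0.837 | 0.520 | 0.480 | 0.266 |
| 6 | F-measure | 0.655 | 0.786 | 0.623 | 0.601 | 0.413 |
| 6 | Surface Web hits | 203 | 192 | 176 | 173 | 49 |
| 6 | Dark Web hits | 111 | 77 | 66 | 61 | 24 |
| 6 | Total hits | 314 | 269 | 242 | 234 | 73 |
| 7 | Precision | 0.687 | 0.687 | 0.777 | 0.804 | 0.913 |
| 7 | Recall | 0.624 | 0.536 | 0.520 | 0.480 | 0.266 |
| 7 | F-measure | 0.654 | 0.602 | 0.623 | 0.601 | 0.413 |
| 7 | Surface Web hits | 513 | 486 | 470 | 451 | 231 |
| 7 | Dark Web hits | 127 | 67 | 62 | 62 | 25 |
| 7 | Total hits | 640 | 537 | 513 | 513 | 256 |
| 8 | Precision | 0.674 | 0.672 | 0.765 | 0.791 | 0.913 |
| 8 | Recall | 0.624 | 0.536 | 0.520 | 0.480 | 0.266 |
| 8 | F-measure | 0.648 | 0.596 | 0.619 | 0.597 | 0.413 |
| 8 | Surface Web hits | 211 | 200 | 176 | 173 | 49 |
| 8 | Dark Web hits | 126 | 103 | 66 | 61 | 24 |
| 8 | Total hits | 337 | 303 | 242 | 234 | 73 |

Columns give the value of threshold t.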
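
The F-measure values in the table appear consistent with the standard harmonic mean of precision and recall, and the total hits with the sum of Surface Web and Dark Web hits. The sketch below illustrates that arithmetic for method 3 at t = 0.5. It assumes the balanced F1 definition, which the table itself does not state, and the function name `f_measure` is hypothetical.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    Assumption: the table reports the balanced F1 score; the source
    does not spell out the formula.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * precision * recall / (precision + recall)


# Method 3 at threshold t = 0.5, values copied from the table.
precision, recall = 0.742, 0.619
print(round(f_measure(precision, recall), 3))  # 0.675

# Total hits as the sum of Surface Web and Dark Web hits.
surface_hits, dark_hits = 203, 132
print(surface_hits + dark_hits)  # 335
```

Because the published precision and recall are rounded to three decimals, recomputed F-measure values can differ from the tabulated ones in the last digit.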