Logrank: a clickstream-based web page importance metric for web crawlers
Information extraction of the Web and more precise ranking methods of Web pages are among the open issues in search engines' research area due to the ever-growing and dynamic nature of the World Wide Web. Therefore, proposing novel approaches or performing any enhancement to the existed algorit...
Saved in:
Main Authors: | , |
---|---|
格式: | Article |
出版: |
2012
|
主题: | |
在线阅读: | http://eprints.utm.my/id/eprint/47162/ |
标签: |
添加标签
没有标签, 成为第一个标记此记录!
|
id |
my.utm.47162 |
---|---|
record_format |
eprints |
spelling |
my.utm.471622020-02-29T13:07:22Z http://eprints.utm.my/id/eprint/47162/ Logrank: a clickstream-based web page importance metric for web crawlers Ahmadi Abkenari, F. Selamat, Ali TK Electrical engineering. Electronics Nuclear engineering Information extraction of the Web and more precise ranking methods of Web pages are among the open issues in search engines' research area due to the ever-growing and dynamic nature of the World Wide Web. Therefore, proposing novel approaches or performing any enhancement to the existed algorithms is the concern of many researchers in this field. Since the performance of any Web crawler is highly dependent to the applied Web page importance metric and regarding the obstacles of existed link-dependent or context-based metrics, the innovative heuristics that guarantees the accuracy of search results and better employment of resources is highly on demand. This paper introduces a novel link independent clickstream-based Web page importance metric, illustrates the metric's effectiveness through experimentally testing it over the UTM University Web domain and evaluates the results with information retrieval evaluation measures. 2012 Article PeerReviewed Ahmadi Abkenari, F. and Selamat, Ali (2012) Logrank: a clickstream-based web page importance metric for web crawlers. International Journal Of Digital Content Technology And Its Applications, 6 (1). pp. 200-207. ISSN 1975-9339 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
TK Electrical engineering. Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering. Electronics Nuclear engineering Ahmadi Abkenari, F. Selamat, Ali Logrank: a clickstream-based web page importance metric for web crawlers |
description |
Information extraction of the Web and more precise ranking methods of Web pages are among the open issues in search engines' research area due to the ever-growing and dynamic nature of the World Wide Web. Therefore, proposing novel approaches or performing any enhancement to the existed algorithms is the concern of many researchers in this field. Since the performance of any Web crawler is highly dependent to the applied Web page importance metric and regarding the obstacles of existed link-dependent or context-based metrics, the innovative heuristics that guarantees the accuracy of search results and better employment of resources is highly on demand. This paper introduces a novel link independent clickstream-based Web page importance metric, illustrates the metric's effectiveness through experimentally testing it over the UTM University Web domain and evaluates the results with information retrieval evaluation measures. |
format |
Article |
author |
Ahmadi Abkenari, F. Selamat, Ali |
author_facet |
Ahmadi Abkenari, F. Selamat, Ali |
author_sort |
Ahmadi Abkenari, F. |
title |
Logrank: a clickstream-based web page importance metric for web crawlers |
title_short |
Logrank: a clickstream-based web page importance metric for web crawlers |
title_full |
Logrank: a clickstream-based web page importance metric for web crawlers |
title_fullStr |
Logrank: a clickstream-based web page importance metric for web crawlers |
title_full_unstemmed |
Logrank: a clickstream-based web page importance metric for web crawlers |
title_sort |
logrank: a clickstream-based web page importance metric for web crawlers |
publishDate |
2012 |
url |
http://eprints.utm.my/id/eprint/47162/ |
_version_ |
1662754248053489664 |
score |
13.252575 |