Intelligent Web Navigation

Virtual integration systems retrieve information according to the user’s interests. The information comes from several web applications but is presented to the user uniformly, in an online process, so response time is a significant factor. An essential part of any information retrieval system is navigation through pages. Web pages usually contain a large number of links; some lead to interesting information, but most serve other purposes, such as advertising or internal site navigation. Traditional crawlers follow every link on each page in order to analyze the target page and classify it as interesting or irrelevant. This means retrieving, analyzing and classifying thousands of pages for every single site, which is costly. The problem can be solved by combining a web page classifier, which distinguishes between interesting and irrelevant pages, with a link classifier, which automatically identifies links leading to interesting pages. This kind of navigation is more efficient and less costly than traditional crawling. Moreover, the navigation model is extracted automatically from the site instead of being handcrafted, which reduces the supervision required from the user.
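The contrast between exhaustive crawling and classifier-guided navigation can be illustrated with a minimal sketch. It is not the author's implementation: the in-memory site `SITE`, the page classifier `is_interesting_page`, and the link classifier `is_promising_link` are all hypothetical stand-ins, and the keyword rules below replace the automatically learned classifiers the abstract refers to.

```python
from collections import deque

# Hypothetical in-memory site: URL -> (page text, [(anchor text, target URL), ...]).
SITE = {
    "/": ("home", [("Products", "/products"),
                   ("Advertise with us", "/ads"),
                   ("About", "/about")]),
    "/products": ("product list", [("Laptop X details", "/products/1"),
                                   ("Phone Y details", "/products/2"),
                                   ("Home", "/")]),
    "/products/1": ("product: Laptop X", []),
    "/products/2": ("product: Phone Y", []),
    "/ads": ("sponsored banners", [("Partner offer", "/ads/1")]),
    "/ads/1": ("sponsored offer", []),
    "/about": ("company history", []),
}


def is_interesting_page(text):
    """Toy page classifier (assumption): detail pages start with 'product:'."""
    return text.startswith("product:")


def is_promising_link(anchor):
    """Toy link classifier (assumption): judge a link by its anchor text only."""
    anchor = anchor.lower()
    return "product" in anchor or "details" in anchor


def crawl(start, link_filter=None):
    """Breadth-first crawl from `start`.

    With link_filter=None every link is followed (traditional crawler).
    Otherwise only links accepted by link_filter are followed.
    Returns (fetched URLs, URLs classified as interesting).
    """
    fetched, interesting = [], []
    queue, seen = deque([start]), {start}
    while queue:
        url = queue.popleft()
        text, links = SITE[url]  # stands in for an HTTP request
        fetched.append(url)
        if is_interesting_page(text):
            interesting.append(url)
        for anchor, target in links:
            if target in seen:
                continue
            if link_filter is not None and not link_filter(anchor):
                continue  # skip links predicted to lead nowhere useful
            seen.add(target)
            queue.append(target)
    return fetched, interesting


all_fetched, all_hits = crawl("/")
guided_fetched, guided_hits = crawl("/", link_filter=is_promising_link)
print(len(all_fetched), len(guided_fetched), guided_hits == all_hits)
```

On this toy site the exhaustive crawl fetches all seven pages, while the guided crawl fetches four and still reaches both interesting pages. The saving comes entirely from rejecting links before their targets are downloaded, so in a real setting it depends on how accurate the link classifier is.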

Author and article information

Contributors

Inma Hernández

Conference

Publication date: September 2009

Publication date (Print): September 2009

Pages: 117-124

Affiliations

Departamento de Lenguajes y Sistemas Informáticos

Universidad de Sevilla

Avda. Reina Mercedes s/n

41012 Sevilla Spain

Article

DOI: 10.14236/ewic/FDIA2009.19

SO-VID: f950ad2d-4212-436c-8822-1991b1d545e2

License:

This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Conference name: Third BCS-IRSG Symposium on Future Directions in Information Access (FDIA 2009)

Conference acronym: FDIA

Conference number: 3

Conference location: Padua, Italy

Conference date: 1 September 2009

Conference sponsor: Electronic Workshops in Computing (eWiC)

Conference theme: Computers XXIII Celebrating People and Technology

Product

1477-9358 BCS Learning & Development

Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/FDIA2009.19

Self URI (journal page): https://ewic.bcs.org/
