PARCAHYD: An Architecture of a Parallel Crawler based on Augmented Hypertext Documents

A. K. Sharma; J.P. Gupta; D. P. Agarwal

Абстрактный

PARCAHYD: An Architecture of a Parallel Crawler based on Augmented Hypertext Documents

A. K. Sharma, J.P. Gupta, D. P. Agarwal

Search engines use web crawlers to collect documents for storage, indexing and analysis of information. Due to the phenomenal growth of web, it becomes vital to create high performance crawling systems. Augmentations to hypertext documents were proposed [6] so that the documents become suitable for parallel crawlers. PARCAHYD is an on going project aimed at designing of a Parallel Crawler based on Augmented Hypertext Documents. In this paper, the architecture of this parallel crawler is presented.

Отказ от ответственности: Этот тезис был переведен с использованием инструментов искусственного интеллекта и еще не прошел рецензирование или проверку.

Международный журнал достижений в области технологийОткрытый доступ

Абстрактный

PARCAHYD: An Architecture of a Parallel Crawler based on Augmented Hypertext Documents

Международный журнал достижений в области технологий
Открытый доступ