Abstract
Web crawler is the core component of WWW search engine and information retrieval systems. This paper discussed the architecture of a distributed Web crawler and the design ideas about the Web crawler data structure, system modules and related algorithms. The key problems encountered in the design and implementations were also commented, and the solutions to those problems were presented.
| Original language | English |
|---|---|
| Pages (from-to) | 59-61 |
| Number of pages | 3 |
| Journal | Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University |
| Volume | 38 |
| Issue number | 1 |
| State | Published - Jan 2004 |
| Externally published | Yes |
Keywords
- Distributed system
- Java
- Search engine
- Web crawler
Fingerprint
Dive into the research topics of 'Design and implementation of a distributed high-performance web crawler'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver