Содержание
- 2. Introducing BlogSpider is a website project that allows user to crawl pages find on them RSS
- 3. Project structure Project consists of four main parts: Lighthouse Crawler Tracker Web application
- 4. Base crawling alghoritm Web crawler is an essential component of search engines, data mining and other
- 5. Base concept of crawler cluster Here you can see basic roles wich must be in crawler
- 6. What is lighthouse? Lighthouse is a dedicated seed nodes tool for our cluster. It only has
- 7. Let`s look how it work
- 9. Скачать презентацию