Contents
- 2. Introduction. BlogSpider is a website project that lets users crawl pages and find RSS on them
- 3. Project structure. The project consists of four main parts: Lighthouse, Crawler, Tracker, and Web application
- 4. Base crawling algorithm. Here you can see the basic idea of the crawling algorithm
- 5. Base concept of the crawler cluster. Here you can see the basic roles which must be in crawler
- 6. What is Lighthouse? Lighthouse is a dedicated seed-node tool for our cluster. It only has
- 7. Let's look at how it works