PELCRA WebLign crawler
WEBLIGN
ID: 511
A customizable, site-specific crawler for multilingual websites. The tool provides a general crawling infrastructure and several site-specific parsers. Crawling results are stored in a simple relational database; the database schema is provided along with the code.
Requirements:
- Java Runtime Environment
- MySQL Server