PELCRA WebLign crawler

WEBLIGN

ID:

511

A customizable site-specific crawler for multilingual websites. The tool provides a general crawling infrastructure and several site-specific parsers. The crawling results are stored in a simple relational database (the database schema is provided along with the code.)

You don’t have the permission to edit this resource.
  • Java Runtime Environment
  • MySQL Server