The www/p5-WWW-Robot port
p5-WWW-Robot-0.026p0 – configurable web traversal engine (cvsweb github mirror)
Description
This module implements a configurable web traversal engine, for a robot
or other web agent. Given an initial web page (URL), the Robot will get
the contents of that page, and extract all links on the page, adding
them to a list of URLs to visit.
Features of the Robot module include:
* Follows the Robot Exclusion Protocol.
* Supports the META element proposed extensions to the Protocol.
* Implements many of the Guidelines for Robot Writers.
* Configurable.
* Builds on standard Perl 5 modules for WWW, HTTP, HTML, etc.
WWW: https://metacpan.org/release/WWW-Robot
Maintainer
The OpenBSD ports mailing-list
Categories
Build dependencies
Run dependencies
Test dependencies
Files
- /usr/local/libdata/perl5/site_perl/WWW/Robot.pm
- /usr/local/man/man3p/WWW::Robot.3p
- /usr/local/share/examples/p5-WWW-Robot/
- /usr/local/share/examples/p5-WWW-Robot/poacher