mod_oai is an Apache module that lets web crawlers efficiently discover new, modified, and deleted resources on a web server by using OAI-PMH (the Open Archives Initiative Protocol for Metadata Harvesting), a protocol widely used in the digital library community. mod_oai also lets harvesters obtain "archive-ready" resources from a web server.
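
As a rough illustration of how a crawler might ask an OAI-PMH endpoint for resources changed since a given date, the sketch below builds a `ListIdentifiers` request URL. The verb and the `from` and `metadataPrefix` arguments come from the OAI-PMH specification; the base URL `http://www.example.org/modoai` and the helper name `build_list_identifiers_url` are hypothetical and chosen only for this example, not taken from the mod_oai documentation.

```python
from urllib.parse import urlencode


def build_list_identifiers_url(base_url, since, metadata_prefix="oai_dc"):
    """Build an OAI-PMH ListIdentifiers URL for records changed since `since`.

    `base_url` is a hypothetical endpoint; `since` is a UTC date (YYYY-MM-DD).
    `oai_dc` is the metadata format every OAI-PMH repository must support.
    """
    params = {
        "verb": "ListIdentifiers",          # OAI-PMH verb: list record headers only
        "metadataPrefix": metadata_prefix,  # required argument for this verb
        "from": since,                      # selective harvesting: lower date bound
    }
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical base URL, used purely for illustration.
    print(build_list_identifiers_url("http://www.example.org/modoai", "2006-01-01"))
```

In OAI-PMH, a repository that tracks deletions marks a removed record with `status="deleted"` in its header, so a harvester that issues date-bounded requests like this one can learn about deletions as well as additions and changes.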

The mod_oai project is based at Old Dominion University under the direction of Michael L. Nelson. mod_oai is released under the GNU General Public License (GPL) and is distributed free of charge.

References

  • Michael L. Nelson; Joan A. Smith; Ignacio Garcia del Campo; Herbert Van de Sompel; Xiaoming Liu (2006). "Efficient, Automatic Web Resource Harvesting" (PDF). Proceedings of the 8th ACM International Workshop on Web Information and Data Management (WIDM 2006). doi:10.1145/1183550.1183560.