This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)(Learn how and when to remove this template message)
A Gnutella crawler is a software program used to gather statistic information on the gnutella file sharing network, such as the number of users, the market share of different clients and the geographical distribution of the userbase.
Early crawlers used the Ping/Pong messages to discover hosts connected to the network. Although this method is still usable, it is too slow to capture enough data for a topological overview of the gnutella network as it requires initiating full gnutella connections; this involves several roundtrips to perform the header processing. An extension has been added to the gnutella protocol to allow crawlers to quickly access ultrapeers. Right now, there is no publicly accessible crawler online on the gnutella network, since the last one hosted by Lime Wire LLC has been taken down.
Gnutella2 (G2) also supports crawlers for the gain of statistical data such as the network size or the network composition (clients, versions, usernames and usercountries). Right now, there is only one crawler existing on the G2 network, called g2paranha. It is written and maintained by dcat and licensed under the GPL.
- Daniel Stutzbach and Reza Rejaie, "Capturing Accurate Snapshots of the Gnutella Network", the Global Internet Symposium, March, 2005.
- "Gnutella2 network crawler (g2panthera)". dcat. Retrieved 2013-08-11.
- The Gnutella Crawler Protocol - An old, but still valid document on Gnutella network crawling.
- The GDF:Communicating Network Topology Information
- Ion Sampler - a statistical crawler that estimates the number of nodes in the Gnutella overlay.
- Cruiser - a distributed crawler that gives some of the best estimates of Gnutella's size.
|This network-related software article is a stub. You can help Wikipedia by expanding it.|