Home - NUTCH - Apache Software Foundation
https://cwiki.apache.org/confluence/display/NUTCH/Home
WEBApache Nutch is a highly extensible and scalable open source web crawler software project. Stemming from Apache Lucene , the project comprises two codebases, namely: Nutch 1.x ( ACTIVE ): A well matured, production ready crawler. 1.x enables fine grained configuration, relying on Apache Hadoop data structures, which are great for batch …
DA: 21 PA: 68 MOZ Rank: 61