PMC Member and Committership Invitation from Apache Nutch

Apache Nutch

Apache Nutch is a highly extensible and scalable open source web crawler software project. Stemming from Apache Lucene, the project has diversified and now comprises two codebases, namely:

Nutch 1.x: A well matured, production ready crawler. 1.x enables fine grained configuration, relying on Apache Hadoop data structures, which are great for batch processing.

Nutch 2.x: An emerging alternative taking direct inspiration from 1.x, but which differs in one key area; storage is abstracted away from any specific underlying data store by using Apache Gora for handling object to persistent mappings. This means we can implement an extremely flexible model/stack for storing everything (fetch time, status, content, parsed text, outlinks, inlinks, etc.) into a number of NoSQL storage solutions.

I’ve received an invitation from Apache Nutch for being a PMC Member and Committer and accepted it. I am glad to be an official member of Apache Nutch project.

kamaci

2 Comments

  1. Hi Kamaci, what’s the requirement to become a PMC member of Apache Nutch project?

    • Keep your passion to contribute. Try to answer questions at mail list and contribute to source code. The persistence will be paid off.

Leave a Reply

Your email address will not be published. Required fields are marked *