Apache Hama

Apache Hama is a top-level open source project of the Apache Software Foundation that lets you run advanced analytics beyond MapReduce.

Many data analysis techniques, such as machine learning and graph algorithms, require iterative computations. This is where the Bulk Synchronous Parallel (BSP) model can be more effective than "plain" MapReduce. To run such iterative data analysis applications more efficiently, Hama offers a pure Bulk Synchronous Parallel computing engine.
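
To make the BSP idea concrete, here is a minimal single-process sketch of the superstep cycle: local computation, message exchange, then a barrier before the next round. It is a plain Python simulation, not Hama's API; the names run_bsp and ring_max, the ring topology, and the "stop when no messages were sent" rule are all assumptions chosen for illustration.

```python
def run_bsp(values, compute, max_supersteps=100):
    """Simulate BSP supersteps in one process (illustrative only, not Hama's API)."""
    n = len(values)
    state = list(values)
    inbox = [[] for _ in range(n)]
    for superstep in range(max_supersteps):
        outbox = [[] for _ in range(n)]
        sent = False
        # Phase 1: every peer computes on its local state and on the
        # messages delivered at the end of the previous superstep.
        for peer in range(n):
            state[peer], messages = compute(peer, state[peer], inbox[peer], n, superstep)
            # Phase 2: outgoing messages are queued, not delivered yet.
            for dest, payload in messages:
                outbox[dest].append(payload)
                sent = True
        # Phase 3: barrier. Queued messages become visible only now.
        inbox = outbox
        if not sent:
            break  # assumed halting rule: no peer sent anything
    return state


def ring_max(peer, value, inbox, n, superstep):
    """Hypothetical peer logic: spread the largest value around a ring."""
    best = max([value] + inbox)
    # Send in the first superstep, and afterwards only when the value improved.
    if superstep == 0 or best > value:
        return best, [((peer + 1) % n, best)]
    return best, []


if __name__ == "__main__":
    print(run_bsp([3, 9, 4, 1], ring_max))
```

Each pass through the loop is one superstep, and peer state persists across passes. An iterative algorithm expressed as chained MapReduce jobs typically has to write and re-read its intermediate state between iterations, which is the overhead the BSP model is meant to avoid.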

Hama can make jobs that take hours with Hadoop MapReduce run in minutes. For details, see the Performance Benchmarks.

Getting Started

Start by installing Hama on a Hadoop cluster.

Getting Involved

Hama is an open source volunteer project under the Apache Software Foundation. We encourage you to learn about the project and contribute your expertise. Here are some starter links: