Getting Giraph to build/run with CDH4 (cannot initialize cluster - check mapreduce.framework.name)

David Boyd 2013-02-25, 16:26
I am trying to get the Giraph 0.2 snapshot (pulled via Git on Friday)
to build and run with CDH4.

I modified the pom.xml to provide a profile for my specific version (4.1.1).
The build works (mvn -Phadoop_cdh4.1.1 clean package test) and passes
all the tests.

If I try to do the next step and submit to my cluster with the command:
mvn -Phadoop_cdh4.1.1 test -Dprop.mapred.job.tracker=

the JSON test in core fails. If I move that test out of the way, a
whole bunch of tests in examples fail. They all fail with:
> java.io.IOException: Cannot initialize Cluster. Please check your
> configuration for mapreduce.framework.name and the correspond server
> addresses.

I have tried passing mapreduce.framework.name as both local and classic.
I have also set those values in my mapreduce-site.xml.
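
For anyone wanting to confirm what a site file actually contains, here is a
minimal sketch (not part of Giraph or Hadoop) that parses a Hadoop-style
*-site.xml document and reports the two settings the error message points
at. The sample contents and the helper names read_site_properties and report
are made up for illustration; in practice the XML text would be read from
the real file on the client's classpath. As far as I know, the values Hadoop
2 accepts for mapreduce.framework.name are local, classic and yarn, and
classic mode also needs a JobTracker address (mapred.job.tracker).

```python
import xml.etree.ElementTree as ET


def read_site_properties(xml_text):
    """Return {name: value} for every <property> in a Hadoop *-site.xml document."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def report(props):
    """Format the settings relevant to the 'Cannot initialize Cluster' error."""
    keys = ("mapreduce.framework.name", "mapred.job.tracker")
    return [f"{key} = {props.get(key, '<not set>')}" for key in keys]


# Hypothetical file contents, for illustration only.
SAMPLE = """<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>classic</value>
  </property>
</configuration>
"""

if __name__ == "__main__":
    for line in report(read_site_properties(SAMPLE)):
        print(line)
```

With the sample above, the framework name is reported as classic while the
JobTracker address shows up as not set, which is the kind of mismatch the
error message hints at.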

Interestingly, I can run the PageRank benchmark from the giraph-core jar with the command:
> hadoop jar
> ./giraph-core/target/giraph-0.2-SNAPSHOT-for-hadoop-2.0.0-cdh4.1.3-jar-with-dependencies.jar
> org.apache.giraph.benchmark.PageRankBenchmark
> -Dmapred.child.java.opts="-Xmx64g -Xms64g -XX:+UseConcMarkSweepGC
> -XX:-UseGCOverheadLimit" -Dgiraph.zkList= -e 1 -s 3 -v
> -V 50000 -w 83
And it completes just fine.

I have searched high and low for documents and examples on how to run
the example programs by some means other than Maven, but have not found
anything.

Any help or suggestions would be greatly appreciated.


