HBaseStorage in pig 0.8
We are trying to use the HBaseStorage LoadFunc in Pig 0.8 and are getting an exception:

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias raw
    at org.apache.pig.PigServer.openIterator(PigServer.java:754)
    at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:612)
    at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:303)
    at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:165)
    at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:141)
    at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:76)
    at org.apache.pig.Main.run(Main.java:465)
    at org.apache.pig.Main.main(Main.java:107)
Caused by: java.io.IOException: Couldn't retrieve job.
    at org.apache.pig.PigServer.store(PigServer.java:818)
    at org.apache.pig.PigServer.openIterator(PigServer.java:728)
    ... 7 more
Other jobs seem to work.

What are the requirements for getting HBaseStorage to work?

This is what I am doing:
1 - Added the HBase config and the Hadoop config to my PIG_CLASSPATH
2 - Ran this script with pig:

REGISTER ../lib/hbase-0.20.6.jar

raw = LOAD 'hbase://piggytest' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('content:field1 anchor:field1a anchor:field2a') as (content_field1, anchor_field1a, anchor_field2a);

dump raw;

---
What else am I missing?