Re: HCatalog select one column from hive - Pig - [mail # user]
...The pig script should be as easy as hive_table = LOAD 'a_table' USING org.apache.hcatalog.pig.HCatLoader(); hive_column = FOREACH hive_table GENERATE a_column; The trickier part could be the pi...
   Author: David LaBarbera, 2014-05-29, 12:43
Re: Embedded pig in java - Pig - [mail # user]
...Try running with java -cp pig.jar idlocal.Idlocal David On Apr 3, 2014, at 7:54 AM, Junior Tsire wrote: ...
   Author: David LaBarbera, 2014-04-03, 12:24
Re: Unit test for Pig UDF using DistributedCache - Pig - [mail # user]
...One approach is to separate your code from the pig wrapper. That way you only need to unit test your business logic. An example would be something like public class wrapperUdf extends EvalFunc...
   Author: David LaBarbera, 2014-02-10, 15:31
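The separation suggested in this reply might look roughly like the sketch below. The original example is truncated, so this is not the author's code: the class name TitleNormalizer and its normalize method are hypothetical, chosen only for illustration. The business logic lives in a plain Java class with no Pig imports, and the Pig wrapper (shown as a comment because it needs the Pig jar on the classpath) only unpacks the tuple and delegates.

```java
import java.util.Locale;

// Hypothetical business logic, kept free of Pig classes so it can be
// unit tested with plain Java (no cluster and no Pig runtime needed).
final class TitleNormalizer {
    private TitleNormalizer() {}

    /** Trims, lower-cases, and collapses runs of whitespace; null stays null. */
    static String normalize(String raw) {
        if (raw == null) {
            return null;
        }
        return raw.trim().toLowerCase(Locale.ROOT).replaceAll("\\s+", " ");
    }
}

// The Pig wrapper would then be a thin adapter. It is shown only as a
// comment because it depends on the Pig jar (an assumption about the setup):
//
// public class WrapperUdf extends EvalFunc<String> {
//     @Override
//     public String exec(Tuple input) throws IOException {
//         if (input == null || input.size() == 0) return null;
//         return TitleNormalizer.normalize((String) input.get(0));
//     }
// }
```

With this split, a unit test only needs to call TitleNormalizer.normalize directly; anything tied to Pig or to the DistributedCache stays in the wrapper and can be covered separately.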
[PIG-2949] JsonLoader only reads arrays of objects - Pig - [issue]
...I'm trying to load a vendor file that's JSON encoded into pig. One of the fields is an array of strings. The builtin JsonLoader only reads arrays composed of JSON objects {"object_array":[{"el...
http://issues.apache.org/jira/browse/PIG-2949    Author: David LaBarbera, 2013-06-02, 16:14
Re: How can I pass command-line parameters with whitespace to an apache pig script? - Pig - [mail # user]
...My first thought is to try flt='\'a1==1 AND a2=2\'' but mostly want to recommend running pig with the dry run (-r or -dryrun) flag so you can see how the substitution is bei...
   Author: David LaBarbera, 2013-04-24, 14:42
Re: pig script - failed reading input from s3 - Pig - [mail # user]
...Try fs.s3n.aws… and also load from s3 data = load 's3n://...' The "n" stands for native. I believe S3 also supports block device storage (s3://) which allows bigger ...
   Author: David LaBarbera, 2013-04-08, 13:27
Re: Loader for small files - Pig - [mail # user]
...What process creates the data in HDFS? You should be able to set the block size there and avoid the copy. I would test the dfs.block.size on the copy and see if you get the mapper spli...
   Author: David LaBarbera, 2013-02-11, 20:38
Re: Pig prints help options in start - Pig - [mail # user]
...Before the help information, do you see any message like JAVA_HOME not set … David On Feb 4, 2013, at 12:11 PM, Ionut Ignatescu wrote: ...
   Author: David LaBarbera, 2013-02-11, 13:49
Re: How to read Mahout generated sequence files in Pig - Pig - [mail # user]
...The Elephant Bird sequence file loader should work; you'll just need to register the Mahout jar with the vector writable they use. David On Feb 4, 2013, at 7:06 PM, Harsha ...
   Author: David LaBarbera, 2013-02-06, 20:37
Re: How do I load JSON in Pig? - Pig - [mail # user]
...Try com.twitter.elephantbird.pig.load.JsonLoader('-nestedLoad') This should allow access to nested objects as nested maps ($0#'level1'#'level2'#'level3' …) David On Nov 21, 20...
   Author: David LaBarbera, 2012-11-21, 14:25