Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 12 (0.316s).
Loading phrases to help you
refine your search...
[PIG-2949] JsonLoader only reads arrays of objects - Pig - [issue]
...I'm trying to load a vendor file that's json ecoded into pig. One of the fields is an array of strings. The builtin JsonLoader only reads arrays composed of json objects{"object_array}:[{"el...
http://issues.apache.org/jira/browse/PIG-2949    Author: David LaBarbera, 2013-05-20, 01:04
Re: How can I pass command-line parameters with whitespace to an apache pig script? - Pig - [mail # user]
...My first though is to try  flt='\'a1==1 AND a2=2\''  but mostly want to recommend running pig with the dry run (-r or -dryrun)  flag so you can see how the substitution is bei...
   Author: David LaBarbera, 2013-04-24, 14:42
Re: pig script - failed reading input from s3 - Pig - [mail # user]
...Try  fs.s3n.aws…  and also load from s3  data = load 's3n://...'   The "n" stands for native. I believe S3 also supports block device storage (s3://) which allows bigger ...
   Author: David LaBarbera, 2013-04-08, 13:27
Re: Loader for small files - Pig - [mail # user]
...What process creates the data in HDFS? You should be able to set the block size there and avoid the copy.  I would test the dfs.block.size on the copy and see if you get the mapper spli...
   Author: David LaBarbera, 2013-02-11, 20:38
Re: Loader for small files - Pig - [mail # user]
...You could store your data in smaller block sizes. Do something like hadoop fs HADOOP_OPTS="-Ddfs.block.size=1048576 -Dfs.local.block.size=1048576" -cp /org-input /small-block-input You might...
   Author: David LaBarbera, 2013-02-11, 18:29
Re: Pig prints help options in start - Pig - [mail # user]
...Before the help information, do you see any message like JAVA_HOME not set …  David  On Feb 4, 2013, at 12:11 PM, Ionut Ignatescu  wrote:  ...
   Author: David LaBarbera, 2013-02-11, 13:49
Re: How to read Mahout generated sequence files in Pig - Pig - [mail # user]
...The elephant bird sequence file loader should work, you'll just need to register the mahout jar with the vector writable they use.  David  On Feb 4, 2013, at 7:06 PM, Harsha  ...
   Author: David LaBarbera, 2013-02-06, 20:37
Re: How do I load JSON in Pig? - Pig - [mail # user]
...Try  com.twitter.elephantbird.pig.load.JsonLoader('-nestedLoad') This should allow access to nested object as nested map ($0#'level1#'level2'#'level3' …)  David  On Nov 21, 20...
   Author: David LaBarbera, 2012-11-21, 14:25
Re: force schema with TOBAG - Pig - [mail # user]
...Cheolsoo  That works. Thanks so much for the help. And congratulations on your new committer status!  David  On Oct 31, 2012, at 12:23 PM, Cheolsoo Park  wrote:  ...
   Author: David LaBarbera, 2012-10-31, 16:54
Re: force schema with TOBAG - Pig - [mail # user]
...Cheolsoo  Thank you for the response. This works in 0.10, but not on 0.9.2-amzn. I  get an error message that there's an unexpected symbol at or near  $ID_NULL (ID_NULL is bei...
   Author: David LaBarbera, 2012-10-31, 11:53
Sort:
project
Pig (12)
Hadoop (1)
type
mail # user (11)
issue (1)
date
last 7 days (1)
last 30 days (2)
last 90 days (3)
last 6 months (7)
last 9 months (12)
author
Dmitriy Ryaboy (1352)
Alan Gates (955)
Jonathan Coveney (731)
Daniel Dai (543)
Russell Jurney (485)
Olga Natkovich (453)
Prashant Kommireddi (367)
Bill Graham (334)
Cheolsoo Park (246)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (154)
Jeff Zhang (146)
Santhosh Srinivasan (142)
David LaBarbera