Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 76 (0.179s).
Loading phrases to help you
refine your search...
loaded records but gone - Pig - [mail # user]
...I wrote a record loader MyLoader  and used it to load  aa = LOAD 'input_on_hdfs' USING MyLoader() AS ( blah:chararray, blahblah:chararray );  bb = FOREACH aa generate *; store...
   Author: Yang, 2013-04-27, 06:52
SIZE() always leads to 1 reducer? - Pig - [mail # user]
...I set default_parallel=15  but when I did a  y = group z ALL; x = foreach y generate SIZE(z);  the 2 lines generate a MR job with only 1 reducer.   I guess it's because S...
   Author: Yang, 2013-04-11, 22:13
IMPORT issues - Pig - [mail # user]
...I like the new feature of IMPORT, it allows me to break up big chunks of code into manageable, reusable units.  but one issue is that if I get a compile error or run time error in one o...
   Author: Yang, 2013-03-20, 23:21
Re: STORE skipped in pigUnit test? - Pig - [mail # user]
...thanks!   On Fri, Mar 15, 2013 at 3:31 PM, meghana narasimhan  wrote:  ...
   Author: Yang, 2013-03-17, 05:00
STORE skipped in pigUnit test? - Pig - [mail # user]
...I'm trying to test a custom LOAD class, which also contains the code for STORE.  I put in a STORE in my pigUnit script. but the resulting file is never created.  is STORE always sk...
   Author: Yang, 2013-03-15, 22:22
null pointer error with a simple pig program - Pig - [mail # user]
...the following code gave null pointer exception   rbl_raw = load 's3://mybucket/rbl-logs/{2013/03/06,2013/03/05}' AS (line:chararray);  rbl = FOREACH rbl_raw GENERATE FLATTEN(loadrb...
   Author: Yang, 2013-03-11, 07:11
Re: generate multiple output files? - Pig - [mail # user]
...very nice, thanks!   On Fri, Jan 11, 2013 at 2:52 PM, Dmitriy Ryaboy  wrote:  ...
   Author: Yang, 2013-01-18, 00:25
generate multiple output files? - Pig - [mail # user]
...let's say I have an input dataset, each row has 2 fields, the first field is a value among 100 possible values. I want to just split the input dataset into 100 outputs , based on the  v...
   Author: Yang, 2013-01-09, 22:37
issues with using JOIN inside a MACRO? - Pig - [mail # user]
...hadoop@ip-10-245-54-191:~/top50/new$ cat a.pig DEFINE mymacro(blah, zoo) RETURNS foo { x = JOIN $blah BY id, $zoo BY id;         y = JOIN x BY $blah::id, $zoo BY id; &nbs...
   Author: Yang, 2012-11-07, 23:44
Re: debug feature?? - Pig - [mail # user]
...ok, I found this practice to be useful:   I divide my code into sections, each section implemented as a macro.  then I debug each macro separately, at the end of each macro, I manu...
   Author: Yang, 2012-11-07, 21:05
Sort:
project
Pig (76)
Avro (52)
Zookeeper (21)
Hadoop (15)
HBase (8)
MapReduce (5)
Hive (1)
type
mail # user (76)
date
last 7 days (0)
last 30 days (1)
last 90 days (6)
last 6 months (8)
last 9 months (76)
author
Dmitriy Ryaboy (1352)
Alan Gates (955)
Jonathan Coveney (732)
Daniel Dai (545)
Russell Jurney (485)
Olga Natkovich (453)
Prashant Kommireddi (367)
Bill Graham (334)
Cheolsoo Park (246)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (154)
Jeff Zhang (146)
Santhosh Srinivasan (142)
Yang