Search results 11 to 20 of 30 (0.123s).
Re: why the udf can not work - Pig - [mail # user]
...Looks like you need to learn a bit about how Java packages work. The path for the class file needs to be /home/huyong/test/*myudfs*/UPPER.class Or remove the first line in your U...
   Author: Dexin Wang, 2011-06-19, 17:24
Re: pig script takes much longer than java MR job - Pig - [mail # user]
...Yeah, sounds like a lot to dump if it takes 15 minutes to run. That alone can take a long time. I once forgot to comment out a debug line in my UDF. When run with produc t...
   Author: Dexin Wang, 2011-06-18, 00:34
Re: running pig on amazon ec2 - Pig - [mail # user]
...Thanks a lot for the good advice. I'll see if I can get LZO set up. Currently I'm using EMR, which uses Pig 0.6. I'll look into Whirr to start the Hadoop cluster on EC2. There i...
   Author: Dexin Wang, 2011-06-16, 04:16
Re: Setting the store file name with date - Pig - [mail # user]
...I don't think the version is a problem. Variables have probably been supported from the start of Pig. Using STORE result INTO 'out-$date'; I mentioned about, when y...
   Author: Dexin Wang, 2011-05-23, 16:18
Re: elephantbird JsonLoader doesn't like gz? - Pig - [mail # user]
...Turns out it's only a problem if I run it in local mode; running it on the cluster doesn't have this problem. I'm using EB1.2.5. Wonder how you fix the problem since it seems it's not EB p...
   Author: Dexin Wang, 2011-05-19, 04:32
Re: Can I pass an entire relation to a Pig UDF? - Pig - [mail # user]
...If the whole set is not that big, sorting in shell might be the easiest. I've done that with result sets of millions of records. On Apr 26, 2011, at 8:49 PM, Arun A K...
   Author: Dexin Wang, 2011-04-27, 04:14
Re: implementing "if" logic - Pig - [mail # user]
...Here's a trick I used: Together with $x, pass in another parameter $comment that's either '' (blank) when x>0 or '--' (double dashes) when x==0. Then result = SOME OPERATION $...
   Author: Dexin Wang, 2011-03-27, 21:10
Re: reducer throttling? - Pig - [mail # user]
...Thanks for your explanation Alex. In some cases, there isn't even a reduce phase. For example, we have some raw data, after our custom LOAD function and some filter function, it direct...
   Author: Dexin Wang, 2011-03-25, 01:18
Re: possibly Pig throttles the number of mappers - Pig - [mail # user]
...Thanks Alan! We are using 0.79. Also got an answer from the #hadoop channel, along with this Quora answer: http://www.quora.com/Where-does-Hadoop-latency-come-from-e-g-it-takes-15-25-se...
   Author: Dexin Wang, 2011-03-24, 00:58
Re: STORE with variable? - Pig - [mail # user]
...Unfortunately, it doesn't work. Seems the same problem as in https://issues.apache.org/jira/browse/PIG-1547 On Tue, Mar 8, 2011 at 1:22 PM, Dexin Wang wrote: ...
   Author: Dexin Wang, 2011-03-08, 22:04