Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 11 (0.053s).
Loading phrases to help you
refine your search...
Re: Spilling issue - Optimize "GROUP BY" - Pig - [mail # user]
...If it is indeed a balancing issue, you could load to counter 1 and 2, filter, group/count, and join. That way you assure that the filtering is done after the mappers, and then the combiner k...
   Author: Mehmet Tepedelenlioglu, 2014-01-10, 19:00
Re: Problem with using CROSS in PIG - Pig - [mail # user]
...Looks like a bug.  On Aug 2, 2013, at 1:51 AM, Simonffy Szilvia  wrote:  ...
   Author: Mehmet Tepedelenlioglu, 2013-08-02, 21:32
Re: Fwd: Problem with using CROSS in PIG - Pig - [mail # user]
...I had the same problem. You can search the mailing list to find out more about it. But, in a nut shell, this happens only when pig calculated the number of reducers it needs. It will go away...
   Author: Mehmet Tepedelenlioglu, 2013-08-02, 06:42
Re: Replicated Join and OOM errors - Pig - [mail # user]
...You can always split your tables such that same keys end up in same splits. Then you replicated join the corresponding splits and take the union.  On Jul 19, 2013, at 12:26 PM, Arun Ahu...
   Author: Mehmet Tepedelenlioglu, 2013-07-19, 19:58
Re: A UDF that is both Algebraic and Accumulator - Pig - [mail # user]
...It uses both. They are not contradictory.   ________________________________  From: Ahmed Eldawy  To: [EMAIL PROTECTED]  Sent: Tuesday, June 4, 2013 11:31 AM Subject: A U...
   Author: Mehmet Tepedelenlioglu, 2013-06-04, 18:46
Re: Synthetic keys - Pig - [mail # user]
...0.10.0-cdh4.1.2  On 5/28/13 11:07 AM, "Pradeep Gollakota"  wrote:  ...
   Author: Mehmet Tepedelenlioglu, 2013-05-28, 18:18
Re: Cross product bug pig 0.10? - Pig - [mail # user]
...Hi,  So I found a somewhat easy way to replicate this error with this script running in a cluster (distributed). The setting at the top are artificial to produce the result with only a ...
   Author: Mehmet Tepedelenlioglu, 2013-05-22, 00:34
Re: Join question - Pig - [mail # user]
...I am not sure if I understand you correctly, but you seem to want to find the average per id. For that all you need to do is group by id, and then take the avg for every group. You don't nee...
   Author: Mehmet Tepedelenlioglu, 2013-04-02, 01:20
Re: easiest way to get loops in PIG? - Pig - [mail # user]
...If I understand you correctly, and you want to find out what the  components of a graph are, the trans closure probably is not the way to  go as this is quadratic on the number of ...
   Author: Mehmet Tepedelenlioglu, 2012-06-21, 05:12
Re: DISTINCT with 2 fields in a tuple - Pig - [mail # user]
...Just group on those 2 fields. The 'group' field of the output will  contain all the distinct combinations. That is, of course, if that is what you wanted to  do in the first place....
   Author: Mehmet Tepedelenlioglu, 2012-04-11, 21:04
Sort:
project
Pig (11)
Hadoop (8)
MapReduce (1)
type
mail # user (11)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (1)
last 9 months (11)
author
Dmitriy Ryaboy (346)
Alan Gates (334)
Daniel Dai (315)
Cheolsoo Park (244)
Jonathan Coveney (237)
Russell Jurney (174)
Rohini Palaniswamy (136)
Bill Graham (132)
Olga Natkovich (129)
Prashant Kommireddi (106)
Julien Le Dem (84)
Aniket Mokashi (76)
Thejas Nair (69)
Thejas M Nair (62)
Mridul Muralidharan (61)
Mehmet Tepedelenlioglu