Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 13 (0.148s).
Loading phrases to help you
refine your search...
Re: extract tuple from bag in an order - Pig - [mail # user]
...You could separate the inner bags, flatten, rank, and join. It would be ugly and inefficient though. It is best to just write a udf which basically does what the python’s zip function does.O...
   Author: Mehmet Tepedelenlioglu, 2014-06-05, 16:51
[expand - 1 more] - Re: How to sample an inner bag? - Pig - [mail # user]
...I have no experience with the python udfs (I use Java). But I doubt the example you supplied would work. First, I am not sure if a bag is a subclass of sequence, which is, I believe, what yo...
   Author: Mehmet Tepedelenlioglu, 2014-05-28, 20:27
Re: Spilling issue - Optimize "GROUP BY" - Pig - [mail # user]
...If it is indeed a balancing issue, you could load to counter 1 and 2, filter, group/count, and join. That way you assure that the filtering is done after the mappers, and then the combiner k...
   Author: Mehmet Tepedelenlioglu, 2014-01-10, 19:00
Re: Problem with using CROSS in PIG - Pig - [mail # user]
...Looks like a bug.  On Aug 2, 2013, at 1:51 AM, Simonffy Szilvia  wrote:  ...
   Author: Mehmet Tepedelenlioglu, 2013-08-02, 21:32
Re: Fwd: Problem with using CROSS in PIG - Pig - [mail # user]
...I had the same problem. You can search the mailing list to find out more about it. But, in a nut shell, this happens only when pig calculated the number of reducers it needs. It will go away...
   Author: Mehmet Tepedelenlioglu, 2013-08-02, 06:42
Re: Replicated Join and OOM errors - Pig - [mail # user]
...You can always split your tables such that same keys end up in same splits. Then you replicated join the corresponding splits and take the union.  On Jul 19, 2013, at 12:26 PM, Arun Ahu...
   Author: Mehmet Tepedelenlioglu, 2013-07-19, 19:58
Re: A UDF that is both Algebraic and Accumulator - Pig - [mail # user]
...It uses both. They are not contradictory.   ________________________________  From: Ahmed Eldawy  To: [EMAIL PROTECTED]  Sent: Tuesday, June 4, 2013 11:31 AM Subject: A U...
   Author: Mehmet Tepedelenlioglu, 2013-06-04, 18:46
[expand - 3 more] - Re: Synthetic keys - Pig - [mail # user]
...0.10.0-cdh4.1.2  On 5/28/13 11:07 AM, "Pradeep Gollakota"  wrote:  ...
   Author: Mehmet Tepedelenlioglu, 2013-05-28, 18:18
[expand - 2 more] - Re: Cross product bug pig 0.10? - Pig - [mail # user]
...Hi,  So I found a somewhat easy way to replicate this error with this script running in a cluster (distributed). The setting at the top are artificial to produce the result with only a ...
   Author: Mehmet Tepedelenlioglu, 2013-05-22, 00:34
[expand - 1 more] - Re: Join question - Pig - [mail # user]
...I am not sure if I understand you correctly, but you seem to want to find the average per id. For that all you need to do is group by id, and then take the avg for every group. You don't nee...
   Author: Mehmet Tepedelenlioglu, 2013-04-02, 01:20
Sort:
project
Pig (13)
Hadoop (8)
MapReduce (1)
type
mail # user (13)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (2)
last 9 months (13)
author
Daniel Dai (405)
Dmitriy Ryaboy (345)
Alan Gates (333)
Cheolsoo Park (271)
Jonathan Coveney (230)
Russell Jurney (173)
Rohini Palaniswamy (172)
Olga Natkovich (131)
Bill Graham (130)
Prashant Kommireddi (110)
Julien Le Dem (81)
Aniket Mokashi (79)
Thejas Nair (70)
Thejas M Nair (64)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
liyunzhang_intel (37)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
Mehmet Tepedelenlioglu
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB