Results 81 to 90 of 1352 (0.785s).
Re: Unexpected data type - Pig - [mail # dev]
...This is a serialization error. What version of Pig are you running?   On Sat, Oct 6, 2012 at 10:38 AM, Allan  wrote: ralizedDataType(BinInterSedes.java:1166) BinInterSedesDatum(Bin...
   Author: Dmitriy Ryaboy, 2012-10-07, 00:17
Re: Optimizations in pig - Pig - [mail # user]
...bucketing and partitioning is just setting the files up right. You can do that explicitly.  Pig also lets you push down any filtering and projection into the loader, as long as said loa...
   Author: Dmitriy Ryaboy, 2012-10-04, 23:59
Re: regular expression as delimiter in PigStorage? - Pig - [mail # user]
...Hi Lei, This is currently not supported. However one can always create a new loadfunc and implement his own parsing (perhaps by extending PigStorage and overriding the parsing bits).  D...
   Author: Dmitriy Ryaboy, 2012-09-28, 23:26
Re: Pig multiple groupby problem - Pig - [mail # user]
...Can you check if your mapper logs said anything about in-map aggregation being turned off? In fact, the whole log of one of the mappers might help (POPartialAgg prints some helpful stats)...
   Author: Dmitriy Ryaboy, 2012-09-28, 22:58
Re: Pig multiple groupby problem - Pig - [mail # user]
...When you tried 2888, did you have pig.exec.mapPartAgg set to true, and pig.exec.mapPartAgg.minReduction set to a low value (2 or 3)?  You said you applied the patch -- what version are ...
   Author: Dmitriy Ryaboy, 2012-09-28, 22:12
Re: How can I access secure HBase in UDF - Pig - [mail # user]
...If someone figures this out all the way to working code, could you blog it?  D  On Thu, Sep 27, 2012 at 10:54 AM, Rohini Palaniswamy  wrote:  ...
   Author: Dmitriy Ryaboy, 2012-09-28, 00:59
Re: Using matches in generate clause? - Pig - [mail # user]
...With Pig 0.9 you can do this, though:  FOREACH html_pages GENERATE portal_id, (html matches 'some pattern' ? 1 : 0) as wp_match:int;    On Thu, Sep 27, 2012 at 10:38 AM, Alan ...
   Author: Dmitriy Ryaboy, 2012-09-27, 19:31
Re: How can I split the data with more reducers? - Pig - [mail # user]
...Neat pie chart! What produces this?  Trunk is not entirely stable right now, but it's stabilizing pretty rapidly (as long as you don't go using DateTime types and Cube operations.. don'...
   Author: Dmitriy Ryaboy, 2012-09-17, 09:07
Re: reuse same Tuple and ArrayList for every getNext call in LoadFunc? - Pig - [mail # user]
...Anything that builds a bag -- for example, I was just looking at the DefaultDataBag code (and by extension, DistinctDataBag, etc) and it does not do any tuple copies. We could, of course, ch...
   Author: Dmitriy Ryaboy, 2012-09-17, 05:30
Re: Issues with SAMPLE in PIG v0.8.1 - Pig - [mail # user]
...I just ran this very script three times using Pig 0.8 (svn revision 1148107) on a set of 2.5 million rows and got (2509), (2552), and (2473) as the output.  Don't know what to tell you....
   Author: Dmitriy Ryaboy, 2012-09-17, 05:24