Results 101 to 110 of 1352 (0.25s).
Re: Reading BytesWritable in sequence file - Pig - [mail # user]
...Yup: https://github.com/kevinweil/elephant-bird  D  On Tue, Sep 11, 2012 at 4:00 PM, Mohit Anchlia  wrote:...
   Author: Dmitriy Ryaboy, 2012-09-11, 23:21
Re: Reading BytesWritable in sequence file - Pig - [mail # user]
...Try the one in Elephant-Bird.  On Tue, Sep 11, 2012 at 11:22 AM, Mohit Anchlia  wrote:...
   Author: Dmitriy Ryaboy, 2012-09-11, 22:27
Re: Modifying databag on the fly - Pig - [mail # dev]
...FYI -- we wound up going with a much cleaner and memory-friendly solution of returning a new databag implementation which simply proxied all the calls to the original bag, but returned a spe...
   Author: Dmitriy Ryaboy, 2012-09-08, 06:10
Re: Using LoadFunc to get arbitrary data into Pig script - Pig - [mail # user]
...Hi Thomas, This isn't a complete answer, but take a look at mock.Storage that Julien wrote to make testing easy:  http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/builtin/mo...
   Author: Dmitriy Ryaboy, 2012-09-07, 16:56
Re: Machine Learning + Pig? - Pig - [mail # user]
...Please take a look at Alek and Jimmy's paper on ML in Pig; there are also a few presentations they did on this, here's one from the Hadoop Summit: https://speakerdeck.com/u/lintool/p/large-s...
   Author: Dmitriy Ryaboy, 2012-09-06, 23:01
Re: Current "patch available" and open issues - Pig - [mail # dev]
...+1  (we also need to train committers to actually review stuff.. guilty of not reviewing, myself..)  D  On Tue, Sep 4, 2012 at 10:59 AM, Alan Gates  wrote:...
   Author: Dmitriy Ryaboy, 2012-09-04, 18:42
Re: Extremely slow when loading small amount of data from HBase - Pig - [mail # user]
...I think the hbase folks recommend something like 40 regions per node per table, but I might be misremembering something. Have you tried emailing the hbase users list?  On Sep 4, 2012,...
   Author: Dmitriy Ryaboy, 2012-09-04, 11:54
Re: UDF Performance Problem - Pig - [mail # user]
...That's cause you used "group all" which groups everything into one group, which by definition can only go to one reducer.  What if instead you group into some large-enough number of buc...
   Author: Dmitriy Ryaboy, 2012-09-03, 17:21
Current "patch available" and open issues - Pig - [mail # dev]
...Hi folks, Here's a link to almost 40 JIRAs where patches are available. We should review them and either close as won't fix, drop "patch available" if the patch doesn't pass muster, or com...
   Author: Dmitriy Ryaboy, 2012-09-03, 00:01
Re: Custom DB Loader UDF - Pig - [mail # user]
...You can also look at what Vertica did for their Pig connector:  https://github.com/vertica/Vertica-Hadoop-Connector/blob/master/pig-connector/com/vertica/pig/VerticaLoader.java  (i...
   Author: Dmitriy Ryaboy, 2012-09-02, 21:17