Re: deserializing nested protobufs - Pig - [mail # user]
...extension are not supported yet. there is a patch pending : https://github.com/kevinweil/elephant-bird/pull/143  Can you check if that covers your use case?  On Tue, Apr 3, 2012 at...
   Author: Raghu Angadi, 2012-04-03, 23:41
Re: Is it possible to use Pig streaming (StreamToPig) in a way that handles multiple lines as a single input tuple? - Pig - [mail # user]
...why not pipe multi-line xml from the executable through another script that understands it?  On Wed, Mar 28, 2012 at 8:24 AM, Ahmed Sobhi  wrote:  ...
   Author: Raghu Angadi, 2012-04-03, 21:32
Re: Compressing output using block compression - Pig - [mail # user]
...SequenceFileStorage in elephant-bird lets you load and store to sequence files. If your input is text lines, you can store each line as 'value'. You can experiment with different codecs. ...
   Author: Raghu Angadi, 2012-04-03, 21:08
most high profile user - Pig - [mail # user]
...unbelievable!  https://twitter.com/#!/mcuban/status/182273293347328000  anyone has more scoop on this?...
   Author: Raghu Angadi, 2012-03-21, 16:25
Re: pig and hbase integration = hanging jobs - Pig - [mail # user]
...201203071602_0001  did you check that? From that link you can also navigate to output from mapper task. did you create "info" column family in the table?  On Wed, Mar 7, 2012 at 7:...
   Author: Raghu Angadi, 2012-03-07, 16:45
Re: Best practice for DB connection - Pig - [mail # user]
...On Tue, Mar 6, 2012 at 5:02 PM, Mark Kerzner wrote:   yes. you can initialize inside the first call to UDF.exec(). The same UDF object is used for the entire mapper.  Don't initial...
   Author: Raghu Angadi, 2012-03-07, 08:27
Re: HBaseStorage STORE method comparison - Pig - [mail # user]
...fastest might be to use local mode, and avoid even the first map only job :)  You are right, for 10 keys it does not really matter. Even doing 1000s of updates to the same row in #2 is ...
   Author: Raghu Angadi, 2012-03-07, 08:22
Re: LZO support for Pig-0.9.1 - Pig - [mail # user]
...btw, for simple use cases, you can load/store lzo files using PigStorage(). While stor  Only disadvantage is that files are not splittable.  grunt> set output.compression.enable...
   Author: Raghu Angadi, 2012-01-24, 00:32
Re: Multithreaded UDF - Pig - [mail # user]
...oh, this is much better than custom loader hack I mentioned to batch up input tuples.  On Wed, Nov 9, 2011 at 12:22 PM, Mridul Muralidharan wrote:  ...
   Author: Raghu Angadi, 2011-11-09, 21:08
Re: Multithreaded UDF - Pig - [mail # user]
...Assuming 1-5 seconds is mainly waiting for IO, using multiple reducers or mappers might not be suitable since it just takes too many mapper and reducer slots. Couple of options:  1. use...
   Author: Raghu Angadi, 2011-11-09, 18:38