Results 91 to 100 of 125 (0.13s).
Join,Filter on the same line and optimization - Pig - [mail # user]
...Is it possible to say something like   F = JOIN A BY (FILE_NAME,CREATED_DATE,FORM_ID,FORM_ID_ROOT), B BY (FILE_NAME,CREATED_DATE,FORM_ID,FORM_ID_ROOT) AND FILTER A BY FORM_ID == 0; ...
   Author: Mohit Anchlia, 2012-04-11, 22:39
Re: DISTINCT with 2 fields in a tuple - Pig - [mail # user]
...Thanks I tried something like this and it worked, but I have one more question:   grunt> B = foreach A GENERATE FORM_ID, SET_ID;  grunt> C= DISTINCT B;  What's the diffe...
   Author: Mohit Anchlia, 2012-04-11, 21:06
DISTINCT with 2 fields in a tuple - Pig - [mail # user]
...I am trying to get distinct from 2 fields in a record. something like select distinct a, b from c; So I wrote this in pig which is actually not working. I did:   A = LOAD '/examples/for...
   Author: Mohit Anchlia, 2012-04-11, 20:53
Re: Compressing output using block compression - Pig - [mail # user]
...Thanks for the examples. It appears that snappy is not splittable and suggested approach is to write to sequence files.  I know how to load from sequencefiles, but in pig I can't find a...
   Author: Mohit Anchlia, 2012-04-03, 20:57
Re: Compressing output using block compression - Pig - [mail # user]
...I am currently using Snappy in sequence files. I wasn't aware snappy uses block compression. Does it mean Snappy is splittable? If so then how can I use it in pig?  Thanks again  O...
   Author: Mohit Anchlia, 2012-04-03, 20:02
Re: Compressing output using block compression - Pig - [mail # user]
...Thanks for your input.  It looks like it's some work to configure LZO. What are the other alternatives? We read new sequence files and generate output continuously. What are my options?...
   Author: Mohit Anchlia, 2012-04-03, 19:18
Re: Compressing output using block compression - Pig - [mail # user]
...Is bzip2 not advisable? I think it can split too and is supported out of the box.  On Thu, Mar 29, 2012 at 8:08 PM, 帝归 wrote: ...
   Author: Mohit Anchlia, 2012-04-03, 18:39
Re: Compressing output using block compression - Pig - [mail # user]
...Thanks! When I store output how can I tell pig to compress it in LZO format?  On Thu, Mar 29, 2012 at 4:02 PM, Dmitriy Ryaboy  wrote:  ...
   Author: Mohit Anchlia, 2012-03-29, 23:07
Compressing output using block compression - Pig - [mail # user]
...We currently have 100s of GB of uncompressed data which we would like to zip using some compression that is block compression so that we can use multiple input splits. Does pig support any s...
   Author: Mohit Anchlia, 2012-03-28, 16:45
Re: Duplicate rows when using regular expression - Pig - [mail # user]
...I disabled it and it worked. However, in order to see the number of tasks that got re-scheduled I went to map/reduce admin page->Completed Job->click one job and tried to look inside map ta...
   Author: Mohit Anchlia, 2012-03-27, 21:42