Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 27 (4.485s).
Loading phrases to help you
refine your search...
Re: Pig write to single file - Pig - [mail # user]
...What I'm doing is at the end of each day I deduce and store all my log files in lzo format in an archive directory. I thought that since LZO is splittable and Hadoop likes larger files that ...
   Author: Mark, 2013-05-01, 17:21
Pig write to single file - Pig - [mail # user]
...Thought I understood how to output to a single file but It doesn't seem to be working. Anything I'm missing here?    rows   = LOAD '$input'; unique = DISTINCT rows PARELLEL 1;...
   Author: Mark, 2013-05-01, 16:51
Sorting/Partitioning of Pig output - Pig - [mail # user]
...I understand in the traditional map/reduce paradigm that each key will get sent to the same reducer sorted but in pig there is no such thing as a "key".  I'm curious to know how pig kno...
   Author: Mark, 2013-03-27, 18:46
General Pig store questions - Pig - [mail # user]
...In map/reduce all values for 1 key are guaranteed to go to the same reducer. Is there something analogous to this in Pig? If so, what determines the key when I output a bunch of tuples? &nbs...
   Author: Mark, 2013-03-22, 19:39
Re: Please help with grouped count - Pig - [mail # user]
...Not entirely following you on the Order UDF.. doesn one exist or are you  proposing I create this myself?  Thanks.  On 5/11/12 10:49 AM, Jonathan Coveney wrote:...
   Author: Mark, 2012-05-11, 18:05
Re: Please help with grouped count - Pig - [mail # user]
...Also, using your example, how could I limit the number of terms per country?  On 5/11/12 9:47 AM, Mark wrote:...
   Author: Mark, 2012-05-11, 16:48
Re: Please help with grouped count - Pig - [mail # user]
...Thank you so much, that's pretty much what I was going for but with a  slightly different output.  Just to be clear... are these equivalent?  b = foreach (group a by (country,...
   Author: Mark, 2012-05-11, 16:47
Please help with grouped count - Pig - [mail # user]
...We have logs in the following format  us, foo us, foo fr, fizz us, bar fr, baz fr, fizz us, foo fr, fizz  Where the first column is a country and the second column is a search term...
   Author: Mark, 2012-05-11, 00:23
Replacing values - Pig - [mail # user]
...I have some tuples and I would like to be able to transform the first  row using the REGEX_EXTRACT method but leave all the other rows  untouched. Now I can easily do this with kno...
   Author: Mark, 2011-08-10, 05:25
Select x amount of input paths/files - Pig - [mail # user]
...We have our data in folders partitioned by day:  ie /user/pig/logs/2011/06/30  Is there any way to select the last x amount of days to use as input?  Thanks...
   Author: Mark, 2011-07-01, 05:06
Sort:
project
HBase (80)
Hadoop (77)
Kafka (43)
Pig (27)
Hive (7)
MapReduce (4)
Zookeeper (4)
Chukwa (1)
HDFS (1)
type
mail # user (27)
date
last 7 days (0)
last 30 days (2)
last 90 days (4)
last 6 months (4)
last 9 months (27)
author
Dmitriy Ryaboy (1352)
Alan Gates (955)
Jonathan Coveney (731)
Daniel Dai (544)
Russell Jurney (485)
Olga Natkovich (453)
Prashant Kommireddi (367)
Bill Graham (334)
Cheolsoo Park (246)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (154)
Jeff Zhang (146)
Santhosh Srinivasan (142)
Mark