Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 131 to 140 from 163 (0.123s).
Loading phrases to help you
refine your search...
Re: Reducers slowing down? (UNCLASSIFIED) - Pig - [mail # user]
...I am not sure why the rate at which output is generated is slowing down. But cross in pig is not optimized ­ it uses only one reducer. (a major limitation if you are trying to process lots o...
   Author: Thejas Nair, 2010-03-05, 23:17
Re: LOAD from multiple directories - Pig - [mail # user]
...I was going to suggest - "/20100121/{10,11}/{30,40,50,00,10,20}" but that would not work because it will also match - "/20100121/10/00" . I don't think hadoop file path globing can be used f...
   Author: Thejas Nair, 2010-01-21, 19:29
Re: Initial Benchmark Results - Pig - [mail # user]
...Hi Rob, the following pig-latin statement is unnecessary - words = FOREACH myinput GENERATE FLATTEN(TOKENIZE(\$0));   Ie, equivalent the pig-latin script for the hive query is -  m...
   Author: Thejas Nair, 2010-01-19, 18:50
Re: ParseException when using PigServer.registerScript against a file  that has a %declare directive - Pig - [mail # user]
...Hi Christian, Parameter substitution is currently supported only for the command line mode (batch mode).  A preprocessor processes the entire pig-script at once to do parameter substitu...
   Author: Thejas Nair, 2010-01-06, 18:19
Re: Complex data types as value in a map function - Pig - [mail # user]
...This is an issue in PigStorage  is present in recent versions of pig. Ie you cannot have complex types (bag, tuple, map) as a value in map type, if you are using PigStorage . See - http...
   Author: Thejas Nair, 2010-01-05, 18:38
Re: Variable support in Amazon's Elastic MapReduce version of Hive - Pig - [mail # user]
...The parameter substitution in pig is done using a query pre-processor, this code is mostly independent of rest of pig code, so it can be understood in isolation. It uses javacc. The code is ...
   Author: Thejas Nair, 2010-01-04, 15:05
Re: using multi-query through Java API - Pig - [mail # user]
...I don't think the order of the jobs is guaranteed. Yes, api's need to be added to support the association of job to store . ExecJob should return alias or the FileSpec of the store . To to t...
   Author: Thejas Nair, 2009-12-15, 21:38
Re: Pig Relation Sorting, labeling, partitioning - Pig - [mail # user]
...If you are ok with approximately dividing A into 100 parts on sorted order , you could do  B = order A by x parallel 100; That will generate 100 part files, (somewhat) evenly distributi...
   Author: Thejas Nair, 2009-11-20, 21:13
Re: Is Pig dropping records? - Pig - [mail # user]
...Another thing to verify is that clickurl's position in the schema is correct. -Thejas    On 11/19/09 11:43 AM, "Ashutosh Chauhan"  wrote:  ...
   Author: Thejas Nair, 2009-11-19, 19:48
Re: Is it possible to use an env variable in parameters file? - Pig - [mail # user]
...You should be able to get access to the env variable using shell commands - Eg - UDF_PATH=`echo $LIB_DIR`;  http://wiki.apache.org/pig/ParameterSubstitution  -Thejas   On 11/1...
   Author: Thejas Nair, 2009-11-13, 20:03
Pig (163)
Hive (114)
Hadoop (2)
Bigtop (1)
mail # user (123)
mail # dev (40)
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (1)
last 9 months (163)
Daniel Dai (384)
Dmitriy Ryaboy (345)
Alan Gates (334)
Cheolsoo Park (267)
Jonathan Coveney (230)
Russell Jurney (174)
Rohini Palaniswamy (160)
Olga Natkovich (131)
Bill Graham (130)
Prashant Kommireddi (108)
Aniket Mokashi (82)
Julien Le Dem (82)
Thejas Nair (70)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
Serega Sheypak (29)