Results 31 to 40 of 78.
Re: Local vs mapreduce mode - Pig - [mail # user]
...Really dumb question but... when running in MapReduce mode, is your input file on HDFS?   On Tue, Nov 5, 2013 at 9:17 AM, Sameer Tilak  wrote:  ...
   Author: Pradeep Gollakota, 2013-11-05, 17:37
Re: Java UDF and incompatible schema - Pig - [mail # user]
...This is most likely because you haven't defined the outputSchema method of the UDF. The AS keyword merges the schema generated by the UDF with the user specified schema. If the UDF does not ...
   Author: Pradeep Gollakota, 2013-11-05, 01:08
Re: limit map tasks for load function - Pig - [mail # user]
...You would only be able to set it for the script... which means it will apply to all 8 jobs. However, my guess is that you don't need to control the number of map tasks per machine.   On...
   Author: Pradeep Gollakota, 2013-11-04, 01:25
Re: simple pig logic - Pig - [mail # user]
...If I understood your question correctly, given the following input:  main_data.txt {"id": "foo", "some_field": 12354, "score": 0} {"id": "foobar", "some_field": 12354, "score": 0} {"id"...
   Author: Pradeep Gollakota, 2013-10-31, 19:08
Re: UDFContext NULL JobConf - Pig - [mail # user]
...Are you able to post your UDF (or at least a sanitized version)?   On Wed, Oct 30, 2013 at 10:46 AM, Henning Kropp wrote:  ...
   Author: Pradeep Gollakota, 2013-10-30, 17:58
Re: count distinct on multiple columns - Pig - [mail # user]
...Great question. There seems to be some confusion about how DISTINCT operates. I remembered (and thankfully found) this message that explains the behavior.  As per the other post, it loo...
   Author: Pradeep Gollakota, 2013-10-29, 19:24
Re: Parent Child Relationships in Pig - Pig - [mail # user]
...Not really...  In my experience, Pig is only good at dealing with tabular data. The type of graphical data you have is not workable in Pig. Have you considered using a Graph database (s...
   Author: Pradeep Gollakota, 2013-10-25, 05:25
Re: Attach bag for each tuple and pass to UDF - Pig - [mail # dev]
...A replicated cross (implemented as a replicated join on a synthetic key) is probably your best bet.   On Wed, Oct 23, 2013 at 2:09 PM, Daniel Dai  wrote:  ...
   Author: Pradeep Gollakota, 2013-10-23, 22:32
Re: Elephant-Bird: Building error - Pig - [mail # user]
...Repo: Read the docs at https://github.com/kevinweil/elephant-bird   On Thu, Oct 17, 2013 at 4:17 PM, Sameer Tilak  wrote:  ...
   Author: Pradeep Gollakota, 2013-10-18, 00:53
Re: number of M/R jobs for a Pig Script - Pig - [mail # user]
...Can you describe what your input data looks like and what you want your output data to look like?  I don’t understand your question. A group by is really straightforward to do on a dat...
   Author: Pradeep Gollakota, 2013-10-15, 20:12