Results 31 to 40 of 82 (0.165s).
Re: replicated join gets extra job - Pig - [mail # user]
...Use the ILLUSTRATE or EXPLAIN keywords to look at the details of the physical execution plan... from first glance it doesn't look like you'd need a 2nd job to do the joins, but if you can po...
   Author: Pradeep Gollakota, 2013-11-12, 04:30
Re: Reading from local and writing to HDFS? - Pig - [mail # user]
...You're pretty much stuck with options 1 and 2, with option 1 being the accepted solution. The whole idea of MapReduce is that you're not able to use a single machine to compute your answers. You...
   Author: Pradeep Gollakota, 2013-11-07, 16:24
Re: Bag of tuples - Pig - [mail # user]
...Each element in A is not a Bag. A relation is a collection of tuples (just like a bag). So each element in A is a tuple whose first element is a Bag.  If you want to order the tuples by...
   Author: Pradeep Gollakota, 2013-11-07, 01:03
Re: Pig Distributed Cache - Pig - [mail # user]
...I see... do you have to do a full cross product or are you able to do a join?   On Tue, Nov 5, 2013 at 11:07 AM, burakkk  wrote:  ...
   Author: Pradeep Gollakota, 2013-11-05, 19:50
Re: Local vs mapreduce mode - Pig - [mail # user]
...Really dumb question but... when running in MapReduce mode, is your input file on HDFS?   On Tue, Nov 5, 2013 at 9:17 AM, Sameer Tilak  wrote:  ...
   Author: Pradeep Gollakota, 2013-11-05, 17:37
Re: Java UDF and incompatible schema - Pig - [mail # user]
...This is most likely because you haven't defined the outputSchema method of the UDF. The AS keyword merges the schema generated by the UDF with the user specified schema. If the UDF does not ...
   Author: Pradeep Gollakota, 2013-11-05, 01:08
Re: limit map tasks for load function - Pig - [mail # user]
...You would only be able to set it for the script... which means it will apply to all 8 jobs. However, my guess is that you don't need to control the number of map tasks per machine.   On...
   Author: Pradeep Gollakota, 2013-11-04, 01:25
Re: simple pig logic - Pig - [mail # user]
...If I understood your question correctly, given the following input:  main_data.txt {"id": "foo", "some_field": 12354, "score": 0} {"id": "foobar", "some_field": 12354, "score": 0} {"id"...
   Author: Pradeep Gollakota, 2013-10-31, 19:08
Re: UDFContext NULL JobConf - Pig - [mail # user]
...Are you able to post your UDF (or at least a sanitized version)?   On Wed, Oct 30, 2013 at 10:46 AM, Henning Kropp wrote:  ...
   Author: Pradeep Gollakota, 2013-10-30, 17:58
Re: count distinct on multiple columns - Pig - [mail # user]
...Great question. There seems to be some confusion about how DISTINCT operates. I remembered (and thankfully found) this message that explains the behavior.  As per the other post, it loo...
   Author: Pradeep Gollakota, 2013-10-29, 19:24