Results 51 to 60 of 74 (0.052s).
Re: Replace join with custom implementation - Pig - [mail # user]
...Oh... sorry... I missed the part where you were saying that you want to reimplement the replicated join algorithm   On Fri, Aug 2, 2013 at 9:13 AM, Pradeep Gollakota wrote:  ...
   Author: Pradeep Gollakota, 2013-08-02, 13:14
Re: error while executing command after creating own udf - Pig - [mail # user]
...I seem to remember another person asking a similar question on the mailing list before.  I think the answer was a mismatch of the version number of pig that you're executing with vs ver...
   Author: Pradeep Gollakota, 2013-07-31, 12:03
Re: Get the tree structure of a HDFS dir, similar to dir/files - Pig - [mail # user]
...Huy,  I think this question probably belongs in the Hadoop mailing list over the Pig mailing list. However, I think you're looking for http://hadoop.apache.org/docs/r1.0.4/api/org/apach...
   Author: Pradeep Gollakota, 2013-07-28, 00:45
Re: Pig and Storm - Pig - [mail # dev]
...I've added a wiki page for a "Pig on Storm Proposal" at https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal  I've included a primer on Storm (and Trident) as well as s...
   Author: Pradeep Gollakota, 2013-07-25, 03:43
Re: pig 0.8.1 - Iterating contents of a Bag - Pig - [mail # user]
...Amit,  It looks like the FLATTEN operator is exactly what you're looking for (based on both the 'output you'd like to see' and the fact that your UDF accepts chararrays and not Bags). ...
   Author: Pradeep Gollakota, 2013-07-23, 22:09
Re: Filter bag with multiple output - Pig - [mail # user]
...You can do the SPLIT outside the nested FOREACH. I'm assuming you have a UDF defined for VALID.  So, your script can be written as:  rawRecords = LOAD '/data' as ...; grouped = GROUP...
   Author: Pradeep Gollakota, 2013-07-23, 14:19
Re: Large Bag (100GB of Data) in Reduce Step - Pig - [mail # user]
...There's only one thing that comes to mind for this particular toy example.  "pig.cached.bag.memusage" property is the "Percentage of the heap that Pig will allocate for all of the bags ...
   Author: Pradeep Gollakota, 2013-07-22, 14:12
Re: Execute multiple PIG scripts parallely - Pig - [mail # user]
...You could probably just use nohup if they're all parallel and send them into the background.  nohup pig script1.pig & nohup pig script2.pig & etc. On Jul 22, 2013 7:12 AM, "[EMAIL PROTE...
   Author: Pradeep Gollakota, 2013-07-22, 12:14
Re: Getting dimension values for Facts - Pig - [mail # user]
...Unfortunately I can't think of any good way of doing this (other than what Bertrand suggested with using a different language to generate the script).  I'd also recommend Hive... it may...
   Author: Pradeep Gollakota, 2013-07-18, 17:51
RE: Want to add data in same file in Apache PIG? - Pig - [mail # user]
...If you want persistent storage like that, your best bet is to use a database like HBase. On Jul 18, 2013 7:56 AM, "Bhavesh Shah"  wrote:  ...
   Author: Pradeep Gollakota, 2013-07-18, 12:02