Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 61 to 70 from 82 (0.155s).
Loading phrases to help you
refine your search...
Re: Get the tree structure of a HDFS dir, similar to dir/files - Pig - [mail # user]
...Huy,  I think this question probably belongs in the Hadoop mailing list over the Pig mailing list. However, I think you're looking for http://hadoop.apache.org/docs/r1.0.4/api/org/apach...
   Author: Pradeep Gollakota, 2013-07-28, 00:45
[expand - 2 more] - Re: Pig and Storm - Pig - [mail # dev]
...I've added a wiki page for a "Pig on Storm Proposal" at https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal  I've included a primer on Storm (and Trident) as well as s...
   Author: Pradeep Gollakota, 2013-07-25, 03:43
Re: pig 0.8.1 - Iterating contents of a Bag - Pig - [mail # user]
...Amit,  It looks like the FLATTEN operator is exactly what you're looking for (based on both the 'output you'd like to see' and the fact that your UDF accept's chararry's and not Bags). ...
   Author: Pradeep Gollakota, 2013-07-23, 22:09
[expand - 1 more] - Re: Filter bag with multiple output - Pig - [mail # user]
...You can do the SPLIT outside the nested FOREACH. I'm assuming you have UDF defined for VALID.  So, your scrpit can be written as:  rawRecords = LOAD '/data' as ...; grouped = GROUP...
   Author: Pradeep Gollakota, 2013-07-23, 14:19
Re: Large Bag (100GB of Data) in Reduce Step - Pig - [mail # user]
...There's only one thing that comes to mind for this particular toy example.  "pig.cached.bag.memusage" property is the "Percentage of the heap that Pig will allocate for all of the bags ...
   Author: Pradeep Gollakota, 2013-07-22, 14:12
Re: Execute multiple PIG scripts parallely - Pig - [mail # user]
...You could probably just use nohup if they're all parallel and send them into the background.  Nohup pig script1.pig & Nohup pig script2.pig & Etc. On Jul 22, 2013 7:12 AM, "[EMAIL PROTE...
   Author: Pradeep Gollakota, 2013-07-22, 12:14
[expand - 1 more] - Re: Getting dimension values for Facts - Pig - [mail # user]
...Unfortunately I can't think of any good way of doing this (other than what Bertrand suggested with using a different language to generate the script).  I'd also recommend Hive... it may...
   Author: Pradeep Gollakota, 2013-07-18, 17:51
RE: Want to add data in same file in Apache PIG? - Pig - [mail # user]
...If you want persistent storage like that, you're best bet is to use a database like HBase On Jul 18, 2013 7:56 AM, "Bhavesh Shah"  wrote:  ...
   Author: Pradeep Gollakota, 2013-07-18, 12:02
Re: header of a tuple/bag - Pig - [mail # user]
...It generally depends on what type of Storage mechanism is used. If it's PigStorage() then this information is not encoded into the data.  Assuming that the storage is PigStorage() and t...
   Author: Pradeep Gollakota, 2013-07-16, 18:30
Re: Problem with nested FOREACH, bag semantics and UDF - Pig - [mail # user]
...Just to confirm... you want your output to read as follows,  {1, {(1, count), (2, count), ..., (10, count)}} {1, {(11, count), (12, count), ..., (20, count)}} ... correct?  I think...
   Author: Pradeep Gollakota, 2013-07-15, 17:41
Sort:
project
Pig (82)
HBase (21)
Kafka (13)
Hadoop (8)
MapReduce (6)
Ambari (2)
Avro (2)
HDFS (2)
Hive (2)
Accumulo (1)
type
mail # user (76)
mail # dev (5)
issue (1)
date
last 7 days (0)
last 30 days (1)
last 90 days (4)
last 6 months (7)
last 9 months (82)
author
Daniel Dai (438)
Dmitriy Ryaboy (345)
Alan Gates (335)
Cheolsoo Park (273)
Jonathan Coveney (230)
Rohini Palaniswamy (201)
Russell Jurney (175)
Olga Natkovich (131)
Bill Graham (129)
Prashant Kommireddi (110)
Julien Le Dem (81)
Aniket Mokashi (79)
Thejas Nair (70)
Thejas M Nair (65)
Mridul Muralidharan (61)
liyunzhang_intel (50)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (39)
Jeff Zhang (37)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB