Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 61 to 70 from 79 (0.107s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: Filter bag with multiple output - Pig - [mail # user]
...You can do the SPLIT outside the nested FOREACH. I'm assuming you have UDF defined for VALID.  So, your scrpit can be written as:  rawRecords = LOAD '/data' as ...; grouped = GROUP...
   Author: Pradeep Gollakota, 2013-07-23, 14:19
Re: Large Bag (100GB of Data) in Reduce Step - Pig - [mail # user]
...There's only one thing that comes to mind for this particular toy example.  "pig.cached.bag.memusage" property is the "Percentage of the heap that Pig will allocate for all of the bags ...
   Author: Pradeep Gollakota, 2013-07-22, 14:12
Re: Execute multiple PIG scripts parallely - Pig - [mail # user]
...You could probably just use nohup if they're all parallel and send them into the background.  Nohup pig script1.pig & Nohup pig script2.pig & Etc. On Jul 22, 2013 7:12 AM, "[EMAIL PROTE...
   Author: Pradeep Gollakota, 2013-07-22, 12:14
[expand - 1 more] - Re: Getting dimension values for Facts - Pig - [mail # user]
...Unfortunately I can't think of any good way of doing this (other than what Bertrand suggested with using a different language to generate the script).  I'd also recommend Hive... it may...
   Author: Pradeep Gollakota, 2013-07-18, 17:51
RE: Want to add data in same file in Apache PIG? - Pig - [mail # user]
...If you want persistent storage like that, you're best bet is to use a database like HBase On Jul 18, 2013 7:56 AM, "Bhavesh Shah"  wrote:  ...
   Author: Pradeep Gollakota, 2013-07-18, 12:02
Re: header of a tuple/bag - Pig - [mail # user]
...It generally depends on what type of Storage mechanism is used. If it's PigStorage() then this information is not encoded into the data.  Assuming that the storage is PigStorage() and t...
   Author: Pradeep Gollakota, 2013-07-16, 18:30
Re: Problem with nested FOREACH, bag semantics and UDF - Pig - [mail # user]
...Just to confirm... you want your output to read as follows,  {1, {(1, count), (2, count), ..., (10, count)}} {1, {(11, count), (12, count), ..., (20, count)}} ... correct?  I think...
   Author: Pradeep Gollakota, 2013-07-15, 17:41
Re: Concatenate strings within a group - Pig - [mail # user]
...I'm not aware of any native PIG commands that can do this. So you'll have to implement a UDF to do this. My implementation would look as follows:  A = load 'data' as (id: int, seg_num: ...
   Author: Pradeep Gollakota, 2013-07-14, 17:52
Re: Error Handling in Scripts? - Pig - [mail # user]
...I have not tried this yet, but hadoop has a built-in mechanism for skipping bad records. I'm guessing that it would work fine with Pig. http://hadoop.apache.org/docs/r1.1.2/mapred_tutorial.h...
   Author: Pradeep Gollakota, 2013-07-11, 17:46
Re: ERROR 1071: Cannot convert a generic_writablecomparable to a String - Pig - [mail # user]
...Instead of doing "values.add((Text) value);" try doing "values.add(value.toString());" (And make sure that values is of type List instead of List)  I'm not too sure of the details but t...
   Author: Pradeep Gollakota, 2013-07-10, 20:01
Sort:
project
Pig (79)
HBase (21)
Kafka (9)
Hadoop (8)
MapReduce (6)
Ambari (2)
Avro (2)
HDFS (2)
Accumulo (1)
type
mail # user (74)
mail # dev (4)
issue (1)
date
last 7 days (0)
last 30 days (1)
last 90 days (4)
last 6 months (6)
last 9 months (79)
author
Daniel Dai (409)
Dmitriy Ryaboy (345)
Alan Gates (333)
Cheolsoo Park (271)
Jonathan Coveney (230)
Rohini Palaniswamy (180)
Russell Jurney (174)
Olga Natkovich (131)
Bill Graham (130)
Prashant Kommireddi (110)
Julien Le Dem (81)
Aniket Mokashi (79)
Thejas Nair (70)
Thejas M Nair (64)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
liyunzhang_intel (40)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB