Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 11 to 20 from 36 (0.132s).
Loading phrases to help you
refine your search...
Re: self join in pig - Pig - [mail # user]
...You need to load your data twice and then use it as any other join. Self-join is just like any other join to Pig.  Regards, Shahab   On Sun, Sep 15, 2013 at 1:37 PM, Raj hadoop &nb...
   Author: Shahab Yunus, 2013-09-15, 18:10
Re: Problem while using merge join - Pig - [mail # user]
...Wouldn't this slow down your data retrieval? Once column in each call instead of a batch?  Regards, Shahab   On Fri, Sep 13, 2013 at 2:34 PM, John  wrote:  ...
   Author: Shahab Yunus, 2013-09-13, 19:00
[expand - 1 more] - Re: Sort Order in HBase with Pig/Piglatin in Java - Pig - [mail # user]
..."but since hbase returns the values sorted"  You are right. I just wanted to highlight the subtlety that you are essentially relying on the external mechanism for the desired feature (s...
   Author: Shahab Yunus, 2013-09-13, 16:55
Re: Delete Output Folder in Pig Script - Pig - [mail # user]
...You can use a shell script to invoke your Pig scripts. Then you can do anything you want in that shell script. We are following this design. It gives you power to conditionally run, modify P...
   Author: Shahab Yunus, 2013-09-11, 14:12
Re: piglipstick - Pig - [mail # user]
...Netflix seem to has decent enough documentation on that: https://github.com/Netflix/Lipstick/wiki/Getting-Started  Have you already seen that?  Regards, Shahab   On Fri, Sep 6...
   Author: Shahab Yunus, 2013-09-06, 12:28
Re: Multiple reduce no effect - Pig - [mail # user]
...How is your key distribution in your data? There might be a chance that the 2 reducers are getting bulk of your data because of skewed key/data distribution.  higher values than the set...
   Author: Shahab Yunus, 2013-09-03, 12:54
Re: Reading json file. - Pig - [mail # user]
...Have you seen these? http://pig.apache.org/docs/r0.11.0/api/org/apache/pig/builtin/JsonStorage.html  http://hortonworks.com/blog/jsonize-anything-in-pig-with-tojson/  Regards, Shah...
   Author: Shahab Yunus, 2013-08-29, 22:22
[expand - 1 more] - Re: error in processing data in pig - Pig - [mail # user]
...I just tried this exact example and at the start it didn't work when I used the 'data' file as is how I copy and pasted it from the Pig Latin reference page and I was seeing similar issues t...
   Author: Shahab Yunus, 2013-08-04, 18:05
Re: Is it safe to have static methods in Hadoop Framework - Pig - [mail # user]
...If each job (its child tasks) is running in its own JVM then this should not be a problem.  Regards, Shahab   On Thu, Jul 25, 2013 at 2:46 PM, Huy Pham  wrote:  ...
   Author: Shahab Yunus, 2013-07-25, 19:09
Re: pig 0.8.1 - Iterating contents of a Bag - Pig - [mail # user]
...Amit, have you looked into TOBAG and TOTUPLE built-in UDFs? They are not helpful?  Regards, Shahab   On Tue, Jul 23, 2013 at 5:46 PM, Amit  wrote:  ...
   Author: Shahab Yunus, 2013-07-23, 21:58
Sort:
project
HBase (56)
Hadoop (53)
MapReduce (43)
Pig (36)
HDFS (17)
Cassandra (1)
Spark (1)
type
mail # user (36)
date
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (2)
last 9 months (36)
author
Daniel Dai (412)
Dmitriy Ryaboy (345)
Alan Gates (334)
Cheolsoo Park (271)
Jonathan Coveney (230)
Rohini Palaniswamy (180)
Russell Jurney (174)
Olga Natkovich (131)
Bill Graham (130)
Prashant Kommireddi (110)
Julien Le Dem (81)
Aniket Mokashi (79)
Thejas Nair (70)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
liyunzhang_intel (40)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
Shahab Yunus
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB