Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 21 to 30 from 75 (0.122s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: Splitting by unique values in a relation - Pig - [mail # user]
...Sorry. I didn't know/understand that you had unknown values. Yes, in your case MultiStorage is a good way to split the data according to the values of a column. It worked for me in similar c...
   Author: Ruslan Al-Fakikh, 2013-09-16, 02:25
Re: self join in pig - Pig - [mail # user]
...Also keep in mind that a Pig relation is not aware of the order of records unless right after the ORDER statement. I guess that you need to use a row with a preceding row,corrent? Probably y...
   Author: Ruslan Al-Fakikh, 2013-09-15, 23:33
Re: COALESCE UDF? - Pig - [mail # user]
...Hi,  I think you could mimic it with an expression like this: b = foreach a generate ((field1 is null) ? ((field2 is null) ? null : field2) : field1);  Hope that helps, Ruslan &nbs...
   Author: Ruslan Al-Fakikh, 2013-09-04, 11:30
[expand - 1 more] - Re: Avro to Tuples during UnitTest - Pig - [mail # user]
...If you don't want to run the whole Pig script, then you probably shouldn't use the AvroStorage, because it loads data to the Pig as per my understanding. But as far as I understand you want ...
   Author: Ruslan Al-Fakikh, 2013-08-30, 07:34
Re: Reading json file. - Pig - [mail # user]
...Hi,  There are different json loaders available, but none of them worked for me when I had to deal with json. I ended up loading the file as text file, reading one line at a time and th...
   Author: Ruslan Al-Fakikh, 2013-08-30, 07:20
Re: Misplaced pigsample_123456.... file fails the pig job ! - Pig - [mail # user]
...Which hadoop distro are you using? I've heard Hortonworks has a windows-compatible hadoop.   On Wed, Aug 28, 2013 at 2:36 PM, Darpan R  wrote:  ...
   Author: Ruslan Al-Fakikh, 2013-08-29, 09:06
Re: Date Function in Pig - Pig - [mail # user]
...Hi,  I think the easiest way would be to use the piggybank converstion functions for such tasks: http://svn.apache.org/viewvc/pig/trunk/contrib/piggybank/java/src/main/java/org/apache/p...
   Author: Ruslan Al-Fakikh, 2013-08-27, 11:45
Re: dev How can I add a row number per input file to the data - Pig - [mail # user]
...Hi!  Probably these can help: http://pig.apache.org/docs/r0.11.1/basic.html#rank http://pig.apache.org/docs/r0.11.1/func.html#pigstorage (look for -tagsource)  I've never tried thi...
   Author: Ruslan Al-Fakikh, 2013-08-21, 15:03
[expand - 1 more] - Re: Removing characters from a bag - Pig - [mail # user]
...I guess that if you use newlines as row separator than Pig will load them using ALL the newlines. I don't think it can distinguish them. So you end up having too many rows. I think this type...
   Author: Ruslan Al-Fakikh, 2013-06-30, 01:01
Re: nested FOREACH statements - Pig - [mail # user]
...Hi!  I haven't tried this script, but here is an idea: flattenned = FOREACH data2 GENERATE group AS initialGroup, FLATTEN(data1); grouped = GROUP flattenned BY (initialGroup, lt, ln); c...
   Author: Ruslan Al-Fakikh, 2013-06-25, 10:09
Sort:
project
Pig (75)
Hive (17)
MapReduce (6)
Sqoop (5)
Avro (3)
Hadoop (3)
type
mail # user (75)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (75)
author
Daniel Dai (405)
Dmitriy Ryaboy (345)
Alan Gates (333)
Cheolsoo Park (272)
Jonathan Coveney (230)
Rohini Palaniswamy (175)
Russell Jurney (173)
Olga Natkovich (131)
Bill Graham (130)
Prashant Kommireddi (110)
Julien Le Dem (81)
Aniket Mokashi (79)
Thejas Nair (70)
Thejas M Nair (64)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
liyunzhang_intel (39)
Koji Noguchi (38)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB