Home | About | Sematext search-lucene.com search-hadoop.com
Results 31 to 40 of 110 (0.172s).
Re: Getting Error : java.io.IOException: Spill failed - Pig - [mail # user]
...How big is the output of the join expected to be? (For example, if you have a large number of join keys with the same value in both files, the output could be very large.) Are you using re...
   Author: Thejas M Nair, 2011-06-01, 15:46
Re: No of reducers - Pig - [mail # dev]
...In Pig 0.8 the default number of reducers changed from 1 to a value computed based on input data size - http://pig.apache.org/docs/r0.8.1/cookbook.html#Use+the+Parallel+Features  -Thej...
   Author: Thejas M Nair, 2011-05-31, 18:41
Re: How to make a UDF that can take a variable number of arguments while using getArgToFuncMapping? - Pig - [mail # user]
...As a workaround, you can use MAX(TOBAG(a,b,c)); For example, if the a,b,c columns are of type int, then TOBAG(a,b,c) will have a schema of bag of integers, and the org.apache.pig.bui...
   Author: Thejas M Nair, 2011-05-20, 18:41
Re: Question about immediately projecting on a strsplit() return tuple... - Pig - [mail # user]
...On 5/17/11 12:20 PM, "Daniel Eklund" wrote: ... Yes, that is correct. ... I think you would need to use the pig jar without hadoop in it, if you are using C...
   Author: Thejas M Nair, 2011-05-17, 20:41
Re: java.lang.OutOfMemoryError while running Pig Job - Pig - [mail # user]
...The stack trace shows that the OOM error is happening when the distinct is being applied. It looks like in some record(s) of the relation group_it, one or more of the following bags is very la...
   Author: Thejas M Nair, 2011-05-13, 23:46
Re: order by throwing exception in cluster - Pig - [mail # user]
...The exception stack has LocalJobRunner, which is strange. Have you specified the command-line option "-x mapreduce"? Is the hadoop conf dir in the class path? -Thejas  On 5/13/11 1...
   Author: Thejas M Nair, 2011-05-13, 23:13
Re: Order By Sampling - Pig - [mail # user]
...The sampling algorithm for order-by samples 100 records from every map task, using a reservoir sampling algorithm. I can't think of a way to store data that could adversely affect this samp...
   Author: Thejas M Nair, 2011-05-06, 19:33
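The snippet above mentions reservoir sampling of 100 records per map task. As a rough illustration of the general technique only (this is not Pig's actual sampler code), here is a minimal sketch of the classic "Algorithm R" reservoir sampler. The function name `reservoir_sample` and the parameters `k` and `rng` are hypothetical and chosen for this example; `k=100` simply mirrors the figure quoted in the post.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(
    records: Iterable[T],
    k: int = 100,
    rng: Optional[random.Random] = None,
) -> List[T]:
    """Return up to k records chosen uniformly at random from a stream.

    Illustrative only: a generic Algorithm R sketch, not Pig's implementation.
    """
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, record in enumerate(records):
        if i < k:
            # Fill the reservoir with the first k records.
            reservoir.append(record)
        else:
            # Pick a slot in [0, i]; the new record survives with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = record
    return reservoir


if __name__ == "__main__":
    sample = reservoir_sample(range(10_000), k=100, rng=random.Random(42))
    print(len(sample))
```

Because each record is kept with equal probability regardless of its position in the stream, the order in which data is stored should not bias a sampler of this kind, which is consistent with the remark in the post.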
Re: cannot cast issue: Pig filter against flatten column - Pig - [mail # user]
...Are you also using CassandraStorage? Can you paste the entire stack trace? It seems CassandraStorage is returning a DataByteArray instead of String, can you try declaring the schema ...
   Author: Thejas M Nair, 2011-04-29, 00:43
Re: How to improve the performs of PIG Join - Pig - [mail # user]
...Here is the (theoretical) rule of thumb for replicated join: for replicated join to perform significantly better than default join, the size of the replicated input should be sm...
   Author: Thejas M Nair, 2011-04-19, 15:41
Re: COUNT sometimes returning a float value? - Pig - [mail # user]
...This is strange. Looking at the COUNT code, there does not seem to be any way it could return a float. Do you have some example data/query that can be used to reproduce this? Can you paste...
   Author: Thejas M Nair, 2011-04-16, 00:13