Re: pig reduce OOM - Pig - [mail # user]
...there's also a reason for the OOM: I group the data by all, and the parallelism is 1. With a big data bag, the reducer hits OOM. After digging into the Pig source code, I find ...
   Author: Haitao Yao, 2012-07-09, 06:18
Re: pig reduce OOM - Pig - [mail # user]
...sorry for the improper statement. The problem is the DataBag. The BinSedesTuple reads the full data of the DataBag, and when COUNT is used on the data, it causes OOM. The di...
   Author: Haitao Yao, 2012-07-09, 03:11
pig reduce OOM - Pig - [mail # user]
...hi, I wrote a Pig script in which one of the reducers always hits OOM no matter how I change the parallelism. Here's the script snippet: Data = group SourceData al...
   Author: Haitao Yao, 2012-07-06, 06:44
pig error message enhancement - Pig - [mail # user]
...hi, all. I encountered an exception like this: ERROR org.apache.pig.tools.grunt.Grunt - org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1000: Error during ...
   Author: Haitao Yao, 2012-07-04, 03:17
Re: how can I distinct one field of a relation - Pig - [mail # user]
...well, not exactly. I want a subset of A with all fields, where field a1 is distinct. For example, A is: 1,2,3 1,2,3 4,5,6. What I want is: 1,2,3 4,5,6. How can I do this...
   Author: Haitao Yao, 2012-06-27, 09:47
Re: how can I distinct one field of a relation - Pig - [mail # user]
...I want a subset of A with distinct a1 values. The current distinct compares all the fields in A, which is not what I want. Haitao Yao [EMAIL PROTECTED] weibo: @haitao_y...
   Author: Haitao Yao, 2012-06-27, 06:00
how can I distinct one field of a relation - Pig - [mail # user]
...hi, How can I distinct only one field of a relation? Here's the demo: A = LOAD 'data' AS (a1:int,a2:int,a3:int); B = distinct A by a1; How can I do this? ...
   Author: Haitao Yao, 2012-06-27, 02:54
Re: how can I store multiple result once a time? - Pig - [mail # user]
...Yeah, union can do this. But my real purpose is to reduce the MapReduce job count. Although I union 2 result sets into one, it still submits 2 map reduce j...
   Author: Haitao Yao, 2012-03-02, 03:47
how can I store multiple result once a time? - Pig - [mail # user]
...Hi, all. How can I store multiple results using one store function? For example: store Result1, Result 2 into '/tmp/result' using PigStorage(','); the default store...
   Author: Haitao Yao, 2012-03-02, 03:03
Re: How to run pig batch? - Pig - [mail # user]
...the Pig script is generated by a system and is more than 4000 lines long. The Pig client takes a long time to compile it. I have solved the problem by dividing the big script ...
   Author: Haitao Yao, 2012-02-21, 01:53