Results 51 to 60 of 875 (0.072s).
Re: JOIN comparasion PIG V/S HIVE - Pig - [mail # user]
...Could you provide sample data and script that would allow us to reproduce this? Hive is faster at some things. Pig is faster at others. Both produce correct results.  D  On Mon, Oc...
   Author: Dmitriy Ryaboy, 2012-10-23, 04:19
Re: Help on running pig in local mode - Pig - [mail # user]
...What are you running this on?  This is really odd: /proc/<pid>/status does not have information about swap space used(VmSwap).  D  On Fri, Oct 19, 2012 at 11:11 AM, lei tang  ...
   Author: Dmitriy Ryaboy, 2012-10-23, 00:39
Re: debug feature?? - Pig - [mail # user]
...Some testing tips:  1) parametrize your load/store statements so that if you have to run in hadoop mode, it's easy to switch to debug inputs / outputs (and debug input/output loaders an...
   Author: Dmitriy Ryaboy, 2012-10-23, 00:32
Re: _SUCCESS file -> _FAILURE file? - Pig - [mail # user]
...That's a Hadoop mapreduce feature, not a Pig feature, so that request should go there.  Can't really do the _failure thing though, if you think about it -- programs can fail by crashing...
   Author: Dmitriy Ryaboy, 2012-10-18, 21:03
Re: How can I read Hive text files on S3 from Pig? - Pig - [mail # user]
...The same underlying class is used by PigStorage in 11, so we should clean this up to make S3 users happy.  D  On Thu, Oct 18, 2012 at 5:22 AM, Martin Goodson  wrote:...
   Author: Dmitriy Ryaboy, 2012-10-18, 20:59
Re: NEED HELP in Hive Query - Pig - [mail # user]
...B = group A by ( name, date, url); and "A" which is a collection of tuples from A with the same name-date-url  counts = foreach B generate flatten(group) as (name, date, url), COUNT_STA...
   Author: Dmitriy Ryaboy, 2012-10-18, 04:12
Re: Pig storage and load functions and Cache - Pig - [mail # user]
...I am not sure I understand the question. You are trying to decide how to store results of your computation? Text (PigStorage, the default) is probably easiest to work with, but there are man...
   Author: Dmitriy Ryaboy, 2012-10-18, 04:09
Re: CHANGES.txt in branches - Pig - [mail # dev]
...Guilty.. I guess we should be putting them under 0.11 in trunk.  On Tue, Oct 16, 2012 at 8:18 PM, Jonathan Coveney  wrote:...
   Author: Dmitriy Ryaboy, 2012-10-17, 04:46
Re: Pig 0.11 - Pig - [mail # dev]
...Thanks Olga and welcome back!  I know there's some process for linking jiras to releases, but I'm not sure what that is. If you could explain and maybe cover a portion of that work, th...
   Author: Dmitriy Ryaboy, 2012-10-13, 00:59
Re: NEED HELP in PigStorage - Pig - [mail # user]
...Sounds like however you wrote the data, it has some sort of a binary delimiter. Figure out what that delimiter is, and tell PigStorage to use it. For example:  my_data = load 'path/to/d...
   Author: Dmitriy Ryaboy, 2012-10-12, 20:20