Results 11 to 20 of 37 (0.099s).
Re: comparing two files using pig - Pig - [mail # user]
...Now here's where it gets fun :)  First, I do want to show you that (given sufficient coffee) there is a set theoretic approach to your first question that allows you to solve it with ju...
   Author: Jacob Perkins, 2013-06-21, 13:38
Re: Spreading data in Pig - Pig - [mail # user]
...Hi John,  The only way I can think of to do this is using the RANK operator (available only in Pig version 0.11) along with a custom UDF as follows:  * RANK the users relation to r...
   Author: Jacob Perkins, 2013-03-31, 18:13
[PIG-2317] Ruby/Jruby UDFs - Pig - [issue]
...It should be possible to write UDFs in Ruby. These UDFs will be registered in the same way as python and javascript UDFs....
http://issues.apache.org/jira/browse/PIG-2317    Author: Jacob Perkins, 2012-04-03, 18:21
Re: Once more... - Pig - [mail # user]
...Michael,  Why not just:  D = foreach (join C by datapoint2, B by datapoint1) generate B::datapoint1, B::datapoint2;  Does that get you what you need? ...
   Author: Jacob Perkins, 2012-03-19, 20:03
PigServer vs PigRunner - Pig - [mail # user]
...Hello,  I find myself needing to run a pig script iteratively from within a java program. Since I'm writing the data to a db (Cassandra) I can't (as far as I can tell) use PigServer's s...
   Author: Jacob Perkins, 2012-03-01, 16:15
Re: exclude rows from group - Pig - [mail # user]
...Marco,  What you want is a combination of COGROUP and FILTER, see:  $: cat foo.tsv  1 rich 1 happy 2 rich 3 happy 4 rich    A = LOAD 'foo.tsv' AS (user_id:int, user_...
   Author: Jacob Perkins, 2012-02-28, 15:59
Re: Left joins with != condition - Pig - [mail # user]
...If I understand correctly, this is nothing more than an anti-join which can be done with pig using a cogroup.  So your SQL below:   becomes something like:  a = load 'yee' as ...
   Author: Jacob Perkins, 2012-01-03, 13:34
Re: JOIN not printing properly - Pig - [mail # user]
...Have you taken a look at Pygmalion (http://github.com/jeromatron/pygmalion) which makes it MUCH easier to work with tabular data from Cassandra like you're trying to do?  For example: ...
   Author: Jacob Perkins, 2011-11-04, 14:57
Re: Store Groups Separately - Pig - [mail # user]
...You'll have to run a FOREACH...GENERATE over the data first and generate a single key to look like the filename you want. Then you can use MultiStorage() from the piggybank. See:  org.a...
   Author: Jacob Perkins, 2011-10-10, 16:57
Re: RDBMS and Pig - Pig - [mail # user]
...You might also take a look at  http://pig.apache.org/docs/r0.8.1/api/index.html?org/apache/pig/piggybank/storage/DBStorage.html  which is going to require that you 'register' the p...
   Author: Jacob Perkins, 2011-07-26, 13:11