Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 9 from 9 (0.185s).
Loading phrases to help you
refine your search...
Re: how to access solr from pig - Pig - [mail # user]
...If the index contains term vectors, using mahout to extract them may be quicker than going in via the query interface:  http://www.lucidimagination.com/blog/2010/03/16/integrating-apach...
   Author: Andrew Clegg, 2011-11-30, 18:33
[expand - 3 more] - Re: Two-level access in Pig 0.8.1 - Pig - [mail # user]
...Sorry, scratch that question, clearly it doesn't, because I'm using it in 0.8.1 and it's not setting two-level access. Oh well.  On 1 November 2011 11:03, Andrew Clegg  wrote: &nbs...
   Author: Andrew Clegg, 2011-11-01, 11:06
Re: Is there a way to set reducer number of pig besides using parallel keyword? - Pig - [mail # user]
...Something I was wondering the other day... If you do a "group  all" and then pass the result to a non-algebraic aggregate function, will that guarantee that all the records go to a sing...
   Author: Andrew Clegg, 2011-10-12, 22:47
Re: replace value of a given field - Pig - [mail # user]
...Try:  foreach A generate (name == 'John' ? 'Marco' : name) as name;  or for multiple:  foreach A generate (name == 'John' ? 'Marco' : (name == 'Sally' ? 'Anne' : name)) as nam...
   Author: Andrew Clegg, 2011-10-11, 07:58
[expand - 2 more] - Re: outputSchema for UDF EvalFunc returning a DataBag - Pig - [mail # user]
...Yep, getSchemaFromString is what I was looking for, but I can't get it to generate a schema (for unit test purposes) that matches what I get inside my script during a real run.  As an e...
   Author: Andrew Clegg, 2011-10-04, 13:01
Re: Does the pig optimizer keep track of relations that are already sorted when doing a JOIN? - Pig - [mail # user]
...I'd never thought about this before, but some of my scripts could probably be made much quicker by taking advantage of this. From what operations are relations guaranteed to be sorted? Disti...
   Author: Andrew Clegg, 2011-08-21, 11:27
Re: RowCount - Pig - [mail # user]
...+1  On 26 July 2011 15:18, Grant Ingersoll  wrote:  patch that simply did: x = rowcount(foo) ?  I find myself doing sanity  checks on scripts a fair amount and am startin...
   Author: Andrew Clegg, 2011-07-26, 14:26
Schema changes when storing to a file - Pig - [mail # user]
...Hello again,  I have a relation with the following schema:  regrouped: {group: (artistid: int,country: int,week: chararray),projected_joined_albums: {key: (artistid: int,country: i...
   Author: Andrew Clegg, 2011-07-22, 13:30
[expand - 1 more] - Re: Confused by FOREACH .. GENERATE .. TOP semantics - Pig - [mail # user]
...Dmitriy -- my requirements have changed slightly in this particular instance, I actually now need to order by several columns, so I think that means I have to use an inner order-by, rather t...
   Author: Andrew Clegg, 2011-07-22, 13:15
Sort:
project
Pig (9)
type
mail # user (9)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (9)
author
Ted Yu (1644)
Harsh J (1293)
Jun Rao (1032)
Todd Lipcon (1002)
Stack (973)
Jonathan Ellis (842)
Andrew Purtell (797)
Jean-Daniel Cryans (753)
jacques@... (738)
stack (716)
Yusaku Sako (710)
Jarek Jarcec Cecho (699)
Eric Newton (697)
Jonathan Hsieh (675)
Roman Shaposhnik (659)
Brock Noland (656)
Namit Jain (649)
Neha Narkhede (647)
Hitesh Shah (626)
Owen O'Malley (625)
Steve Loughran (617)
Siddharth Seth (614)
Josh Elser (565)
Eli Collins (545)
Arun C Murthy (543)
Andrew Clegg