Results 1 to 10 of 20.
Re: how to find top N values using map-reduce ? - Hadoop - [mail # user]
...Maybe look at the pig source to see how it does it?  Russell Jurney http://datasyndrome.com  On Feb 1, 2013, at 11:37 PM, praveenesh kumar  wrote:  ...
   Author: Russell Jurney, 2013-02-02, 08:10
Re: how to find top N values using map-reduce ? - Hadoop - [mail # user]
...Pig. Datafu. 7 lines of code.  https://gist.github.com/4696443 https://github.com/linkedin/datafu   On Fri, Feb 1, 2013 at 11:17 PM, praveenesh kumar wrote:     Russell J...
   Author: Russell Jurney, 2013-02-02, 07:30
Re: Map-Reduce V/S Hadoop Ecosystem - Hadoop - [mail # user]
...Hourly consultants may prefer MapReduce. Everyone else should be using Pig, Hive, Cascading, etc.  Russell Jurney twitter.com/rjurney   On Nov 7, 2012, at 8:08 PM, yogesh dhari ...
   Author: Russell Jurney, 2012-11-07, 20:48
Re: Which hardware to choose - Hadoop - [mail # user]
...I believe he means per node.  Russell Jurney http://datasyndrome.com  On Oct 2, 2012, at 6:15 PM, hadoopman  wrote:  ...
   Author: Russell Jurney, 2012-10-03, 01:19
Re: Bad records - Hadoop - [mail # user]
...The job is failing because of exceptions parsing records, presumably. Trace your exception from logs, wrap the parsing code that is failing in try/catch. Increment counters and continue in y...
   Author: Russell Jurney, 2012-07-07, 22:55
Re: Bad records - Hadoop - [mail # user]
...Throw, catch and handle an exception on bad records.  Don't error out.  Log the error in your exception handler, increment a counter.  For general discussion, see: http://www....
   Author: Russell Jurney, 2012-07-07, 21:22
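The advice in the two "Bad records" replies above (catch the parse exception, log it, increment a counter, and keep going) might look roughly like the sketch below. It is plain Python rather than an actual Hadoop job: `parse_record`, the tab-separated record layout, and the counter names are all hypothetical, and `collections.Counter` only stands in for whatever counter mechanism the real job would use.

```python
import logging
from collections import Counter

logger = logging.getLogger("bad_records")


def parse_record(line):
    """Hypothetical parser: expects 'key<TAB>integer' and raises ValueError otherwise."""
    key, value = line.split("\t")
    return key, int(value)


def process(lines):
    """Parse every line, skipping (and counting) records that fail to parse."""
    counters = Counter()  # stand-in for job counters (assumption)
    parsed = []
    for line in lines:
        try:
            parsed.append(parse_record(line))
            counters["good_records"] += 1
        except ValueError as exc:
            # Log the bad record and continue instead of failing the whole run.
            logger.warning("Skipping bad record %r: %s", line, exc)
            counters["bad_records"] += 1
    return parsed, counters
```

Calling `process(["a\t1", "garbage", "c\t3"])` would keep the two well-formed records and count the malformed one, so the bad-record count can be checked after the run instead of the job dying on the first exception.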
Re: Splunk + Hadoop - Hadoop - [mail # user]
...Because that isn't Cube.  Russell Jurney twitter.com/rjurney [EMAIL PROTECTED] datasyndrome.com  On May 18, 2012, at 2:01 PM, Ravi Shankar Nair  wrote:  ...
   Author: Russell Jurney, 2012-05-18, 22:58
Re: Splunk + Hadoop - Hadoop - [mail # user]
...I'm playing with using Hadoop and Pig to load MongoDB with data for Cube to consume. Cube is a realtime tool... but we'll be replaying events from the past.  Does that count? ...
   Author: Russell Jurney, 2012-05-18, 19:29
Re: activity on IRC . - Hadoop - [mail # user]
...I get good answers on Twitter.  Russell Jurney twitter.com/rjurney [EMAIL PROTECTED] datasyndrome.com  On Mar 28, 2012, at 12:27 PM, Todd Lipcon  wrote:  ...
   Author: Russell Jurney, 2012-03-28, 19:56
Re: Convergence on File Format? - Hadoop - [mail # user]
...Avro support in Pig will be fairly mature in 0.10.  Russell Jurney twitter.com/rjurney [EMAIL PROTECTED] datasyndrome.com  On Mar 8, 2012, at 3:10 PM, Serge Blazhievsky  wrote...
   Author: Russell Jurney, 2012-03-09, 00:01