Results 1 to 10 of 50 (0.115s).
Re: Running Hadoop client as a different user - Hadoop - [mail # user]
...Here is the issue - 1 - I am running a Java client on a machine unknown to the cluster - my default name on this pc is HYPERCHICKEN\local_admin - the name known to the cluster is slewis ...
   Author: Steve Lewis, 2013-05-17, 09:09
Running Hadoop client as a different user - Hadoop - [mail # user]
...would run a Java client on my local machine and would run as the local user on the cluster. I say * String connectString = "hdfs://" + host + ":" + port + "/...
   Author: Steve Lewis, 2013-05-13, 09:55
Any samples of how to write a custom FileSystem - Hadoop - [mail # user]
...Specifically how do I register a Custom FileSystem - any sample code ...
   Author: Steve Lewis, 2012-02-01, 04:11
Has anyone written a Hadoop FileSystem where path refers to resources - Hadoop - [mail # user]
...I am writing some unit tests for custom splitters and it would be nice to keep the data as resources rather than reading files on a specific file system ...
   Author: Steve Lewis, 2012-01-31, 21:29
When to use a combiner? - Hadoop - [mail # user]
...In working a sample issue I used a combiner - I noticed that the Combiner output records were 90% of the Combiner Input records and when looking at the data found relatively few duplicated k...
   Author: Steve Lewis, 2012-01-24, 17:33
Re: Problems with timeout when a Hadoop job generates a large number of key-value pairs - Hadoop - [mail # user]
...I have been silent for a few days because on my cluster I was UNABLE to reproduce the issue. What I do see is that merge is taking a HUGE amount of time -  In my hands the mapper reache...
   Author: Steve Lewis, 2012-01-23, 16:57
Re: Problems with timeout when a Hadoop job generates a large number of key-value pairs - Hadoop - [mail # user]
...Interesting - I strongly suspect a disk IO or network problem since my code is very simple and very fast. If you add lines to generateSubStrings to limit String length to 100 cha...
   Author: Steve Lewis, 2012-01-20, 22:23
Re: Problems with timeout when a Hadoop job generates a large number of key-value pairs - Hadoop - [mail # user]
...Good catch on the Configured - In my tests it extends my subclass of Configured, but I took out any dependencies on my environment. Interesting - I strongly suspect a disk IO or netwo...
   Author: Steve Lewis, 2012-01-20, 22:16
Re: Problems with timeout when a Hadoop job generates a large number of key-value pairs - Hadoop - [mail # user]
...One thing I can say for sure is that  generateSubStrings() is not slow - Every input line in my sample is 100 characters and the timing should be very similar from one run to the next. ...
   Author: Steve Lewis, 2012-01-20, 22:01
Re: Problems with timeout when a Hadoop job generates a large number of key-value pairs - Hadoop - [mail # user]
...On Fri, Jan 20, 2012 at 12:18 PM, Michel Segel wrote: The code seems to work well for small input files and is basically a standard sample. The map method starts out ...
   Author: Steve Lewis, 2012-01-20, 21:57