Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 81 to 90 from 140 (0.153s).
Loading phrases to help you
refine your search...
Re: add worker nodes when job is running - Hadoop - [mail # general]
...I don't think you need to do extra anything, the Sheduler on JT will do it for you. When there's node becoming free, Scheduler will assign new task to these nodes.   On Mon, Jan 18, 201...
   Author: Jeff Zhang, 2010-01-18, 06:09
Re: Is it always called part-00000? - Hadoop - [mail # user]
...Hi Mark,  1. If you use the old API, the ouput file is named part-00000, and if you use the new API, the output file will be part-r-00000, and there will be usually more than 1 output f...
   Author: Jeff Zhang, 2010-01-18, 02:15
Re: Quick Clarification of sort mechanism - Hadoop - [mail # user]
...Hi Rob,  The sort is an internal mechanism in hadoop, the reduce step will always do sort on the keys. If you want to sort the result by count, you could start a second job with the inp...
   Author: Jeff Zhang, 2010-01-16, 02:56
Re: DataNodeCluster - Hadoop - [mail # user]
...Hi ryan,  I think you can use MiniDFSCluster in the test package, lots of testcase of hadoop use this class to create a cluster locally. You can refer some testcase for details.   ...
   Author: Jeff Zhang, 2010-01-13, 06:37
[expand - 1 more] - Re: about translating Hadoop:The Definitive Guide into chinese one - Hadoop - [mail # general]
...Hi Wang,  This is good, I agree with the plan. And this is msn: [EMAIL PROTECTED] We can talk using IM later.    On Sat, Jan 9, 2010 at 10:32 PM, Andrew Wang wr ote:  . m...
   Author: Jeff Zhang, 2010-01-10, 06:35
RE: custom InputFormat - Hadoop - [mail # user]
...Hi valentine,  I am not sure what's your first job's OutputFormat. But I suggest you  use SequenceFileOutputFormat which will write to SequenceFile as the intermediate data store f...
   Author: Jeff Zhang, 2010-01-09, 20:06
Re: Is it possible to share a key across maps? - Hadoop - [mail # user]
...Actually you can treat the mapper task as a template design pattern, here's the persuade code:  Mapper.configure(JobConf) for each record in InputSplit:       do Mapper.m...
   Author: Jeff Zhang, 2010-01-09, 04:15
[expand - 1 more] - Re: How to reuse the nodes in blacklist ? - Hadoop - [mail # user]
...Thanks, it works.  Jeff Zhang   On Tue, Jan 5, 2010 at 5:00 PM, Amareshwari Sri Ramadasu  wrote:  ...
   Author: Jeff Zhang, 2010-01-05, 09:12
Re: how to use InputSampler & TotalOrderPartitioner? - Hadoop - [mail # user]
...Because the shuffle phase start as soon as any mapper task finish, and the shuffle phase needs the Partitioner to route the output of mapper to reducer. So the sampler must complete before t...
   Author: Jeff Zhang, 2010-01-05, 06:26
Re: Killing a Hadoop job - Hadoop - [mail # user]
...invoke command: hadoop job -kill jobID   Jeff Zhang   On Tue, Dec 29, 2009 at 10:02 PM, Mark Kerzner wrote:  ...
   Author: Jeff Zhang, 2009-12-30, 06:07
Sort:
project
Hadoop (138)
Tez (117)
Pig (114)
MapReduce (32)
HDFS (28)
HBase (21)
Hive (16)
YARN (8)
Spark (3)
Avro (2)
Sqoop (2)
Ambari (1)
type
mail # user (120)
mail # general (11)
mail # dev (8)
issue (1)
date
last 7 days (0)
last 30 days (1)
last 90 days (4)
last 6 months (4)
last 9 months (140)
author
Harsh J (571)
Steve Loughran (437)
Owen O'Malley (393)
Todd Lipcon (239)
Allen Wittenauer (212)
Eli Collins (184)
Chris Nauroth (180)
Alejandro Abdelnur (179)
Ted Yu (169)
Arun C Murthy (168)
Tom White (121)
Daryn Sharp (117)
Nigel Daley (115)
Konstantin Shvachko (111)
Colin Patrick McCabe (110)
Doug Cutting (96)
Aaron Kimball (94)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Kai Zheng (77)
Akira AJISAKA (75)
Hairong Kuang (75)
Benoy Antony (73)
Konstantin Boudnik (73)
Jeff Zhang
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB