Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 39 (0.297s).
Loading phrases to help you
refine your search...
Re: reducer gets values with empty attributes - MapReduce - [mail # user]
...Hi Alex,               Can you please attach your code? and the sample input data.  Best, Mahesh Balija, Calsoft Labs.   On Tue, Apr 30, 2013 at ...
   Author: Mahesh Balija, 2013-04-30, 06:33
Re: Writing data from HDFS to Tpae - MapReduce - [mail # user]
...Can you do the following, hadoop fs -copyToLocal    Best, Mahesh Balija, CalsoftLabs.    On Wed, Apr 24, 2013 at 12:12 PM, G, Prashanthi wrote:  ...
   Author: Mahesh Balija, 2013-04-26, 06:55
[expand - 1 more] - Re: Hadoop sampler related query! - Hadoop - [mail # user]
...Agreed with your explanation. One downside with your approach could be, if we collect samples from the intermediate keys on demand it might limit the partitioning to occur until all the mapp...
   Author: Mahesh Balija, 2013-04-24, 07:58
Re: namenode memory test - Hadoop - [mail # user]
...Can you manually go into the directory configured for hadoop.tmp.dir under core-site.xml and do an ls -l to find the disk usage details, it will have fsimage, edits, fstime, VERSION. or the ...
   Author: Mahesh Balija, 2013-04-24, 07:00
Re: R environment with Hadoop - MapReduce - [mail # user]
...Mahout is an alternative for R, if you are NOT aware of.  Thanks, Mahesh Balija, CalsoftLabs.   On Thu, Apr 11, 2013 at 12:25 AM, Ted Yu  wrote:  ...
   Author: Mahesh Balija, 2013-04-10, 21:21
Re: Need help optimizing reducer - Hadoop - [mail # user]
...The reason why the reducer is fast upto 66% is be because of the Sorting and Shuffling phase of the reduce and when the actual task is NOT yet started.  The reduce side is divided into ...
   Author: Mahesh Balija, 2013-03-05, 09:00
Re: Hadoop file system - MapReduce - [mail # user]
...You can be able to use Hdfs alone in the distributed mode to fulfill your requirement. Hdfs has the Filesystem java api through which you can interact with the HDFS from your client. HDFS is...
   Author: Mahesh Balija, 2013-03-05, 08:44
Re: mapper combiner and partitioner for particular dataset - MapReduce - [mail # user]
...What Harsh means by that is, you should create a custom partitioner which should take care of partitioning the records based on the input record data (Key, Value). i.e., if you have multiple...
   Author: Mahesh Balija, 2013-03-05, 08:05
Re: Running terasort with 1 map task - MapReduce - [mail # user]
...does passing the dfs.block.size=134217728 resolves your issue? or is it something else fixed your problem?  On Tue, Feb 26, 2013 at 6:04 PM, Arindam Choudhury  wrote:  ...
   Author: Mahesh Balija, 2013-02-26, 23:07
[expand - 1 more] - Re: WordPairCount Mapreduce question. - MapReduce - [mail # user]
...byte array comparison is for performance reasons only, but NOT the way you are thinking. This method comes from an interface called RawComparator which provides the prototype (public int com...
   Author: Mahesh Balija, 2013-02-25, 08:14
Sort:
project
MapReduce (23)
Hadoop (8)
HDFS (7)
HBase (1)
type
mail # user (35)
mail # dev (4)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (39)
author
Ted Yu (1833)
Harsh J (1303)
Jun Rao (1014)
Todd Lipcon (995)
Stack (986)
Andrew Purtell (875)
Jonathan Ellis (854)
stack (758)
Jean-Daniel Cryans (751)
Jarek Jarcec Cecho (747)
Yusaku Sako (743)
Eric Newton (706)
Jonathan Hsieh (683)
Hitesh Shah (680)
Roman Shaposhnik (677)
Josh Elser (673)
Steve Loughran (652)
Namit Jain (648)
Siddharth Seth (643)
Brock Noland (634)
Owen O'Malley (623)
Hyunsik Choi (582)
Neha Narkhede (566)
Arun C Murthy (548)
Eli Collins (545)
Mahesh Balija
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB