Results 1 to 10 of 39.
Re: reducer gets values with empty attributes - MapReduce - [mail # user]
...Hi Alex, Can you please attach your code and the sample input data? Best, Mahesh Balija, Calsoft Labs. On Tue, Apr 30, 2013 at ...
   Author: Mahesh Balija, 2013-04-30, 06:33
Re: Writing data from HDFS to Tpae - MapReduce - [mail # user]
...Can you do the following: hadoop fs -copyToLocal  Best, Mahesh Balija, CalsoftLabs. On Wed, Apr 24, 2013 at 12:12 PM, G, Prashanthi wrote: ...
   Author: Mahesh Balija, 2013-04-26, 06:55
Re: Hadoop sampler related query! - Hadoop - [mail # user]
...Agreed with your explanation. One downside of your approach could be that, if we collect samples from the intermediate keys on demand, it might delay partitioning until all the mapp...
   Author: Mahesh Balija, 2013-04-24, 07:58
Re: namenode memory test - Hadoop - [mail # user]
...Can you manually go into the directory configured for hadoop.tmp.dir in core-site.xml and run ls -l to find the disk usage details? It will have fsimage, edits, fstime, and VERSION, or the ...
   Author: Mahesh Balija, 2013-04-24, 07:00
Re: R environment with Hadoop - MapReduce - [mail # user]
...Mahout is an alternative to R, in case you are NOT aware of it. Thanks, Mahesh Balija, CalsoftLabs. On Thu, Apr 11, 2013 at 12:25 AM, Ted Yu wrote: ...
   Author: Mahesh Balija, 2013-04-10, 21:21
Re: Need help optimizing reducer - Hadoop - [mail # user]
...The reducer is fast up to 66% because that portion covers the shuffle and sort phases of the reduce, when the actual reduce task has NOT yet started. The reduce side is divided into ...
   Author: Mahesh Balija, 2013-03-05, 09:00
Re: Hadoop file system - MapReduce - [mail # user]
...You can use HDFS alone in distributed mode to fulfill your requirement. HDFS has the FileSystem Java API, through which you can interact with HDFS from your client. HDFS is...
   Author: Mahesh Balija, 2013-03-05, 08:44
Re: mapper combiner and partitioner for particular dataset - MapReduce - [mail # user]
...What Harsh means is that you should create a custom partitioner that takes care of partitioning the records based on the input record data (Key, Value), i.e., if you have multiple...
   Author: Mahesh Balija, 2013-03-05, 08:05
Re: Running terasort with 1 map task - MapReduce - [mail # user]
...Does passing dfs.block.size=134217728 resolve your issue, or did something else fix your problem? On Tue, Feb 26, 2013 at 6:04 PM, Arindam Choudhury wrote: ...
   Author: Mahesh Balija, 2013-02-26, 23:07
Re: WordPairCount Mapreduce question. - MapReduce - [mail # user]
...The byte array comparison is for performance reasons only, NOT for the reason you are thinking. This method comes from an interface called RawComparator, which provides the prototype (public int com...
   Author: Mahesh Balija, 2013-02-25, 08:14