Results 111 to 120 of 148.
Writing small files to one big file in hdfs - Hadoop - [mail # user]
...We have small xml files. Currently I am planning to append these small files to one file in hdfs so that I can take advantage of splits, larger blocks and sequential IO. What I am unsure is ...
   Author: Mohit Anchlia, 2012-02-21, 17:15
Re: Hadoop install - Hadoop - [mail # user]
...Thanks. Do I have to do something special to get Mahout xmlinput format and Pig with the new release of Hadoop? On Sat, Feb 18, 2012 at 6:42 AM, Tom Deutsch wrote: ...
   Author: Mohit Anchlia, 2012-02-19, 01:55
Hadoop install - Hadoop - [mail # user]
...What's the best way or guide to install the latest Hadoop? Is the latest Hadoop still .20, which comes up in a Google search? Could someone guide me to the latest Hadoop distribution? I also need...
   Author: Mohit Anchlia, 2012-02-18, 14:24
Re: Processing small xml files - Hadoop - [mail # user]
...On Fri, Feb 17, 2012 at 11:37 PM, Srinivas Surasani wrote:   I can't seem to find examples of how to do xml processing in Pig. Can you please send me some pointers? Basically I need to ...
   Author: Mohit Anchlia, 2012-02-18, 14:12
Re: Processing small xml files - Hadoop - [mail # user]
...On Tue, Feb 14, 2012 at 10:56 AM, W.P. McNeill wrote: I need to install Hadoop. Does this xmlinput format come as part of the install? Can you please give me some pointers that...
   Author: Mohit Anchlia, 2012-02-18, 06:18
Re: Processing small xml files - Hadoop - [mail # user]
...On Sun, Feb 12, 2012 at 9:24 AM, W.P. McNeill  wrote:  Thanks for the input.  Do you first convert it into flat format and then run another hadoop job or do you just read xml ...
   Author: Mohit Anchlia, 2012-02-12, 20:30
Processing small xml files - Hadoop - [mail # user]
...What would be the best way to process small number of xml files? I read about Mahout xmlInputFormat, wondering what would be the best way for processing when small files are involved....
   Author: Mohit Anchlia, 2012-02-12, 16:59
Developing MapReduce - Hadoop - [mail # user]
...I use eclipse. Is this http://wiki.apache.org/hadoop/EclipsePlugIn still the best way to develop mapreduce programs in hadoop? Just want to make sure before I go down this path.  Or sho...
   Author: Mohit Anchlia, 2011-10-10, 14:34
Re: incremental loads into hadoop - Hadoop - [mail # user]
...This process of managing looks like more pain long term. Would it be easier to store in HBase, which has a smaller block size? What's the avg. file size? On Sun, Oct 2, 2011 at 7:34...
   Author: Mohit Anchlia, 2011-10-03, 13:56
Re: Binary content - Hadoop - [mail # user]
...On Thu, Sep 1, 2011 at 1:25 AM, Dieter Plaetinck  wrote: nary files are small)  Thanks! Is there a specific tutorial I can focus on to see how it could be  done?...
   Author: Mohit Anchlia, 2011-09-01, 15:37