Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce, mail # user - is HDFS RAID "data locality" efficient?


Copy link to this message
-
Re: is HDFS RAID "data locality" efficient?
Steve Loughran 2012-08-08, 17:31
On 8 August 2012 09:46, Sourygna Luangsay <[EMAIL PROTECTED]> wrote:

>  Hi folks!****
>
> One of the scenario I can think in order to take advantage of HDFS RAID
> without suffering this penalty is:**
>
> **-          **Using normal HDFS with default replication=3 for my ���fresh
> data”****
>
> **-          **Using HDFS RAID for my historical data (that is barely
> used by M/R)****
>
> ** **
>
>
>
exactly: less space use on cold data, with the penalty that access
performance can be worse. As the majority of data on a hadoop cluster is
usually "cold", it's a space and power efficient story for the archive data

--
Steve Loughran
Hortonworks Inc