Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Hive >> mail # user >> hive - snappy and sequence file vs RC file

Chalcy Raja 2012-06-26, 13:05
Bejoy Ks 2012-06-26, 13:21
Chalcy Raja 2012-06-26, 15:17
Copy link to this message
Re: hive - snappy and sequence file vs RC file
SequenceFile compared to RCFile:
  * More widely deployed.
  * Available from MapReduce and Pig
  * Doesn't compress as small (in RCFile all of each columns values are put
  * Uncompresses and deserializes all of the columns, even if you are only
reading a few

In either case, for long term storage, you should seriously consider the
default codec since that will provide much tighter compression (at the cost
of cpu to compress it).

-- Owen
yongqiang he 2012-06-27, 04:40
Chalcy Raja 2012-06-27, 23:01