Hive >> mail # user >> When to use bucketed tables with/instead of partitioned tables


When to use bucketed tables with/instead of partitioned tables
I am accustomed to using partitioned tables to obtain separate directories
for data files in each partition.

Looking at the documentation for bucketed tables, it seems they are
typically used in conjunction with DISTRIBUTE BY/SORT BY and an appropriate
partitioning key, and thus provide the ability to do map-side joins.
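
To make the question concrete, here is a minimal sketch of the combination I
have in mind. The table and column names (page_views, raw_page_views, users,
user_id, url, view_date) are made up for illustration, and the settings are
the ones I understand Hive releases of this era to need, so treat it as a
sketch under those assumptions rather than a reference.

```sql
-- Hypothetical table: one directory per view_date partition,
-- and within each partition the rows are hashed on user_id into 32 files.
CREATE TABLE page_views (
  user_id BIGINT,
  url     STRING
)
PARTITIONED BY (view_date STRING)
CLUSTERED BY (user_id) SORTED BY (user_id ASC) INTO 32 BUCKETS;

-- Ask Hive to honor the declared bucketing and sorting when writing.
SET hive.enforce.bucketing = true;
SET hive.enforce.sorting = true;

-- raw_page_views is an assumed staging table with the same columns.
INSERT OVERWRITE TABLE page_views PARTITION (view_date = '2013-06-17')
SELECT user_id, url
FROM raw_page_views
WHERE view_date = '2013-06-17';

-- Bucket map join: assumes a users table that is also bucketed on user_id,
-- with a bucket count that is a multiple or factor of 32.
SET hive.optimize.bucketmapjoin = true;

SELECT /*+ MAPJOIN(u) */ v.url, u.user_id
FROM page_views v
JOIN users u ON v.user_id = u.user_id
WHERE v.view_date = '2013-06-17';
```

As I understand it, the partition column decides which directory a row lands
in, while the CLUSTERED BY column decides which file within that directory.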

An explanation of when to use bucketed tables by themselves (in lieu of
partitioned tables), as well as in conjunction with partitioned tables,
would be appreciated.

thanks!

stephenb
bejoy_ks@... 2013-06-17, 07:38