HBase >> mail # user >> Export / Import and table splits

Export / Import and table splits

When we do an export, we export only the data. When we import it
back, we need to make sure the table is pre-split correctly;
otherwise we might hotspot some servers.

If you simply export and then import without pre-splitting at all,
you will most probably bring some servers down, because they will be
overwhelmed with splits and compactions.
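
To make the hotspot concrete, here is a minimal Python sketch. It is not HBase code, only a model of how row keys map to regions, assuming each region covers the range from its start key (inclusive) to the next start key (exclusive) and that keys compare as raw bytes. The function name `rows_per_region` and the sample keys are made up for the illustration.

```python
from bisect import bisect_right
from collections import Counter


def rows_per_region(row_keys, split_keys):
    """Count how many rows land in each region of a modelled table.

    Region 0 has no lower bound, the last region has no upper bound,
    and every split key is the inclusive start key of the next region.
    """
    splits = sorted(split_keys)
    # bisect_right puts a key equal to a split key into the region
    # that starts at that split key.
    counts = Counter(bisect_right(splits, key) for key in row_keys)
    return [counts.get(i, 0) for i in range(len(splits) + 1)]


# Hypothetical import of 1000 rows with zero-padded numeric keys.
rows = [b"%04d" % i for i in range(1000)]

no_splits = rows_per_region(rows, [])
pre_split = rows_per_region(rows, [b"0250", b"0500", b"0750"])

print(no_splits)  # a single region receives every row
print(pre_split)  # the load is spread over four regions
```

In this model the table without split keys sends every imported row to one region, which is the situation that then triggers the splits and compactions mentioned above.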

Do we have any tool to pre-split a table the same way another table
is already split?

Something like
> duplicate 'source_table', 'target_table'

Which would create a new table called 'target_table' with exactly the
same parameters as 'source_table' and the same region boundaries?
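
Until something like that exists, the region boundaries part could probably be approximated by hand. Below is a rough Python sketch, under the assumption that the start keys of the source table's regions have already been collected somehow (for example with `HTable.getStartKeys()` from the Java client) and that they are printable strings. It only builds a shell `create` statement using the existing `SPLITS` option; it does not copy column family parameters. The helper names and the sample keys are hypothetical.

```python
def split_keys_from_start_keys(start_keys):
    """Turn region start keys into split keys.

    The first region of a table has an empty start key, which is not
    a split point, so empty keys are dropped.
    """
    return sorted(k for k in start_keys if k)


def create_statement(table, families, split_keys):
    """Build an HBase shell 'create' statement with a SPLITS option.

    Assumption: every name and key is printable and contains no single
    quote or backslash, so it can be placed inside a single-quoted
    shell string without escaping.
    """
    for name in [table, *families, *split_keys]:
        if "'" in name or "\\" in name or not name.isprintable():
            raise ValueError("cannot quote %r safely" % name)
    fams = ", ".join("'%s'" % f for f in families)
    splits = ", ".join("'%s'" % k for k in split_keys)
    return "create '%s', %s, SPLITS => [%s]" % (table, fams, splits)


# Hypothetical start keys of the four regions of 'source_table'.
source_start_keys = ["", "g", "n", "t"]

statement = create_statement(
    "target_table", ["f1"], split_keys_from_start_keys(source_start_keys)
)
print(statement)
# create 'target_table', 'f1', SPLITS => ['g', 'n', 't']
```

Binary row keys would need proper escaping, which this sketch deliberately refuses to attempt.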

If we don't have one, would it be useful to have one?

Or even something like:
> create 'target_table', 'f1', {SPLITS_MODEL => 'source_table'}