Accumulo >> mail # user >> compressing values returned to scanner


Re: compressing values returned to scanner
In other words, "handle compression/decompression in the application" :)

I'm looking to see if there's a way to do this in Accumulo itself, maybe a
table-level config parameter. There's already "table.file.compress.type",
which, when set to NONE, disables compression. Instead, I would like to keep
compression on and defer the decompression to the client.

Ameet
On Mon, Oct 1, 2012 at 3:19 PM, Marc Parisi <[EMAIL PROTECTED]> wrote:

> You could compress the data in the value, and decompress the data upon
> receipt by the scanner.
>
>
> On Mon, Oct 1, 2012 at 3:03 PM, ameet kini <[EMAIL PROTECTED]> wrote:
>
>>
>> My understanding of compression in Accumulo 1.4.1 is that it is on by
>> default and that data is decompressed by the tablet server, so data on the
>> wire between server/client is decompressed. Is there a way to shift the
>> decompression from happening on the server to the client? I have a use case
>> where each Value in my table is relatively large (~ 8MB) and I can benefit
>> from compression over the wire. I don't have any server side iterators, so
>> the values don't need to be decompressed by the tablet server. Also, each
>> scan returns a few rows, so client-side decompression can be fast.
>>
>> The only way I can think of now is to disable compression on that table,
>> and handle compression/decompression in the application. But if there is a
>> way to do this in Accumulo, I'd prefer that.
>>
>> Thanks,
>> Ameet
>>
>
>
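
For reference, the application-level approach suggested in the thread (compress
the bytes before writing the Value, decompress them after the scanner returns
it) could look roughly like the sketch below. It is only an illustration, not
an Accumulo feature: the class name ValueCodec and the choice of gzip are
assumptions, and the Accumulo calls are left as comments so the snippet stays
self-contained.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper: gzip a payload before it is stored as a Value,
// and gunzip it on the client after a scan.
final class ValueCodec {

    // Compress raw bytes; the result is what would be stored,
    // e.g. mutation.put(family, qualifier, new Value(ValueCodec.compress(raw))).
    static byte[] compress(byte[] raw) {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(buf)) {
            gz.write(raw);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // The gzip stream is closed here, so the trailer has been written.
        return buf.toByteArray();
    }

    // Decompress bytes read back from a scan,
    // e.g. ValueCodec.decompress(entry.getValue().get()).
    static byte[] decompress(byte[] packed) {
        try (GZIPInputStream gz =
                 new GZIPInputStream(new ByteArrayInputStream(packed))) {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] chunk = new byte[8192];
            int n;
            while ((n = gz.read(chunk)) != -1) {
                out.write(chunk, 0, n);
            }
            return out.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

With this arrangement the tablet server only ever sees the compressed bytes, so
they also travel compressed over the wire; the cost is that any server-side
iterator would have to decompress the value itself before inspecting it.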