Search Hadoop -
Anoop Sam John
Stephen Yuan Jiang
and all its subprojects:
[HBASE-5509] MR based copier for copying HFiles (trunk version)
...This copier is a modification of the distcp tool in HDFS. It does the following: 1. List out all the regions in the HBase cluster for the required table. 2. Write the above out to a file. 3. Each...
, 2014-03-14, 00:56
[HBASE-6925] Change socket write size from 8K to 64K for HBaseServer
...Creating a JIRA for this, but the change is trivial: change NIO_BUFFER_LIMIT from 8K to 64K in HBaseServer. This seems to increase scan throughput....
, 2013-09-18, 22:21
[HBASE-5355] Compressed RPC's for HBase
...Some applications need the ability to do large batched writes and reads from a remote MR cluster. These eventually get bottlenecked on the network. These results are also pretty compressible some...
, 2013-06-05, 00:27
[HBASE-6874] Implement prefetching for scanners
...I did some quick experiments by scanning data that should be completely in memory and found that adding pre-fetching increases the throughput by about 50% from 26MB/s to 39MB/s. The idea is t...
, 2013-05-25, 08:07
[HBASE-7477] Remove Proxy instance from HBase RPC
...Currently, we use HBaseRPC.getProxy() to get an Invoker object to serialize the RPC parameters. This is pretty inefficient as it uses reflection to look up the current method name. The aim is ...
, 2013-04-30, 23:32
[HBASE-6770] Allow scanner setCaching to specify size instead of number of rows
...Currently, we have the following APIs to customize the behavior of scans: setCaching() - how many rows to cache on client to speed up scans; setBatch() - max columns per row to return per row ...
, 2013-04-25, 19:04
[HBASE-6923] Create scanner benchmark
...Create a simple program to benchmark performance/throughput of scanners, and print some results at the end....
, 2013-04-25, 19:03
[HBASE-5783] Faster HBase bulk loader
...We can get a 3x to 4x gain based on a prototype demonstrating this approach in effect (hackily) over the MR bulk loader for very large data sets by doing the following: 1. Do direct multi-put...
, 2013-03-24, 05:17
[HBASE-6423] Writes should not block reads on blocking updates to memstores
...We have a big data use case where we turn off WAL and have a ton of reads and writes. We found that: 1. flushing a memstore takes a while (GZIP compression) 2. incoming writes cause the new me...
, 2012-12-18, 02:19
[HBASE-6583] Enhance Hbase load test tool to automatically create column families if not present
...The load test tool currently disables the table and applies any changes to the cf descriptor if any, but does not create the cf if not present....
, 2012-12-03, 21:47