[HIVE-7679] JOIN operator should update the column stats when number of rows changes
...JOIN operator does not update the column stats when the number of rows changes. All other operators scale the column statistics up or down when the number of rows changes. Same should be done ...
2014-08-12, 18:49
[HIVE-5579] Update statistics rules for different types of joins
...This is a followup of HIVE-5369. Current statistics rules for join operator are generic and are not specific to the type of join. Update the rules based on the type of joins....
2014-08-11, 21:32
[HIVE-7231] Improve ORC padding
...Current ORC padding is not optimal because of fixed stripe sizes within a block. The padding overhead will be significant in some cases. Also padding percentage relative to stripe size is not ...
2014-08-10, 00:26
[HIVE-7490] Revert ORC stripe size
...HIVE-6037 reverted the changes to ORC stripe size introduced by HIVE-7231....
2014-08-10, 00:23
[HIVE-7219] Improve performance of serialization utils in ORC
...ORC uses serialization utils heavily for reading and writing data. The bitpacking and unpacking code in writeInts() and readInts() can be unrolled for better performance. Also double reader/...
2014-08-09, 20:41
[HIVE-6578] Use ORC file footer statistics through StatsProvidingRecordReader interface for analyze command
...ORC provides file level statistics which can be used in analyze partialscan and noscan cases to compute basic statistics like number of rows, number of files, total file size and raw data si...
2014-08-07, 09:04
[HIVE-7589] Some fixes and improvements to statistics annotation rules
...FIXES: 1) JOIN rule does not properly propagate the column statistics from its parent 2) Multi-way join rule computes the denominator for #rows estimation wrongly 3) GROUPBY rule does not accou...
2014-08-06, 19:33
[HIVE-6287] batchSize computation in Vectorized ORC reader can cause BufferUnderFlowException when PPD is enabled
...nextBatch() method that computes the batchSize is only aware of stripe boundaries. This will not work when predicate pushdown (PPD) in ORC is enabled as PPD works at row group level (stripe ...
2014-07-31, 19:46
[HIVE-6382] PATCHED_BLOB encoding in ORC will corrupt data in some cases
...In PATCHED_BLOB encoding (added in HIVE-4123), gapVsPatchList is an array of long that stores gap (g) between the values that are patched and the patch value (p). The maximum distance of gap...
2014-07-31, 19:44
[HIVE-6711] ORC maps uses getMapSize() from MapOI which is unreliable
...HIVE-6707 had issues with map size. getMapSize() of LazyMap and LazyBinaryMap does not deserialize the keys and count the number of unique keys. Since getMapSize() may return non-distinct co...
2014-07-31, 19:39