1. all key components: rowkey + column family + column qualifier + timestamp
2. all key components except for column family (since the index lives in an HFile, and an HFile belongs to the storage of a single column family): rowkey + column qualifier + timestamp
3. rowkey only
I've tried to find this information, but the articles only say that the index contains a 'key'. As a newbie I find that confusing, since in a KeyValue all of the key components together make up the 'key', while a rowkey is also called a 'key'.
So, is it safe to assume that there is no documentation for the exact content of the block index? I think that reading the source code should be the last resort, since one cannot be sure whether what one finds there is an implementation detail or a specification that can be relied upon. The exact content of the block index matters because it affects the size of the index (not to mention query performance) and therefore the schema design.
Or, could you please simply provide the exact answer to my original question (with a proper reference)?
It seems a bit strange to include the CF in the index, since every block index is contained in an HFile that belongs to one specific CF, but I'm sure there is a good reason for it (maybe comparison performance). Anyway, it should be almost a non-issue, since the CF name should be short (mostly one byte).
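To put rough numbers on the point about CF length, here is a minimal sketch in Python. It assumes the KeyValue key layout described in the HBase reference guide (2-byte row length, row, 1-byte family length, family, qualifier, 8-byte timestamp, 1-byte key type) and that each index entry stores one such full key. The function names, the sample row/family/qualifier values, and the 12-byte per-entry overhead (an 8-byte block offset plus a 4-byte on-disk size) are my own assumptions for illustration, not an HBase API or a documented guarantee.

```python
def keyvalue_key_length(row: bytes, family: bytes, qualifier: bytes) -> int:
    """Bytes in the key portion of a serialized KeyValue (assumed layout)."""
    row_len_field = 2      # row length, stored as a short
    family_len_field = 1   # family length, stored as a byte
    timestamp_field = 8    # timestamp, stored as a long
    key_type_field = 1     # key type (Put, Delete, ...)
    return (row_len_field + len(row)
            + family_len_field + len(family)
            + len(qualifier)
            + timestamp_field + key_type_field)


def estimate_index_bytes(data_bytes: int, key_len: int,
                         block_size: int = 64 * 1024,
                         entry_overhead: int = 12) -> int:
    """Very rough index size: one entry per data block.

    entry_overhead is an assumption (8-byte offset + 4-byte on-disk size);
    multi-level indexes, compression and encoding are ignored.
    """
    blocks = -(-data_bytes // block_size)  # ceiling division
    return blocks * (key_len + entry_overhead)


# Hypothetical sample cell: 8-byte rowkey, 1-byte CF, 4-byte qualifier.
full_key = keyvalue_key_length(b"user0001", b"d", b"name")
print(full_key)                       # 25 bytes, of which the CF name is 1
print(estimate_index_bytes(10 * 1024**3, full_key))
```

Under these assumptions a one-byte CF name adds a single byte per index entry, so the rowkey and qualifier lengths dominate the index size, which is why keeping them short matters more for schema design than the CF being present in the key.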