1. all key components: rowkey + column family + column qualifier + timestamp
2. all key components except for column family (since the index is in a HFile that is a part of the storage for a column family): rowkey + column qualifier + timestamp
3. rowkey only
I've tried to find the information, But the articles only say that it contains a 'key'. For me as a newbie, it is confusing since in a KeyValue the all key components comprise the 'key', while a rowkey is also a 'key'.
So, is it safe to assume that there is no documentation for the exact content of the block index? I think that reading the source code should be the last resort, since one cannot sure whether it is an implementation detail, or it is the specification that can be relied upon. The information on the exact content of the block index is important, since it is related in the size of the index (let alone the query performance) and therefore the schema design.
Or, could you please simply provide the exact answer to my original question(with proper reference)?
Though it's a bit strange to include CF in the index, since all the block index is contained in a HFile for a specific CF, I'm sure there would be a good reason (maybe for the performance of the comparison). Anyways it should be almost no issue since the length of the CF should be short(mostly one byte).
All projects made searchable here are trademarks of the Apache Software Foundation.
Service operated by Sematext