Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive >> mail # user >> What does ROW__OFFSET__INSIDE__BLOCK FROM mean?


Copy link to this message
-
What does ROW__OFFSET__INSIDE__BLOCK FROM mean?
Hi,

Could anybody explain me what ROW__OFFSET__INSIDE__BLOCK means?
For example, I make the following query, and return two rows. But why does
the column of ROW__OFFSET__INSIDE__BLOCK show 0?
For my understanding from the name of column, it should return the line
number of the records in the block files, but now both are 0.  So, what is
the BLOCK, BLOCK offset, and row offset in a block?
The Hive bitmap document is very confusing.
hive> SELECT  `url`,  INPUT__FILE__NAME,BLOCK__OFFSET__INSIDE__FILE,
ROW__OFFSET__INSIDE__BLOCK FROM `testresult` WHERE url='
http://www.domain022.tl04/page035.html';

http://www.domain022.tl04/page035.html
hdfs://pc01:54310/user/hive/warehouse/testresult/testresults.csv 0 0
http://www.domain022.tl04/page035.html  hdfs://pc01:54310/
user/hive/warehouse/testresult/testresults.csv 3200250 0
Time taken: 19.653 seconds
hive>
Regards,
afancy