I am trying to join two tables, the smaller being of size 4GB. When I set
hive.mapjoin.smalltable.filesize parameter above 500MB, Hive tries to
perform a local task to read the smaller file. This of-course fails since
the file size is greater and the backup common join is then run. What I do
not understand is why did Hive attempt a map join when small file size was
greater than the smalltable.filesize parameter.