Mohammad Tariq 2012-06-28, 18:47
Internally In hive the processing is done using MapReduce. So like in mapreduce the splits are calculated on job submission and a mapper is assigned per split. So a mapper ideally process a split and not a row.
You can store data in various formats as text, sequence files, RC files etc. No restriction just on text files.
Sent from handheld, please excuse typos.
From: Mohammad Tariq <[EMAIL PROTECTED]>
Date: Fri, 29 Jun 2012 00:17:05
To: user<[EMAIL PROTECTED]>
Reply-To: [EMAIL PROTECTED]
Subject: Hive mapper creation
Since Hive tables are assumed to be of text input format, is
it right to assume that a mapper is created per row of a particular
table??Please correct me if my understanding is wrong. Also let me
know how mappers are created corresponding to a Hive query. Many
Mohammad Tariq 2012-06-28, 18:59
Bejoy KS 2012-06-28, 19:07
Mohammad Tariq 2012-06-28, 19:25
Bejoy KS 2012-06-28, 19:29
Mohammad Tariq 2012-06-28, 19:35
Nitin Pawar 2012-06-28, 18:51