Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive, mail # user - Re: OPTIMIZING A HIVE QUERY


Copy link to this message
-
Re: OPTIMIZING A HIVE QUERY
Bertrand Dechoux 2012-08-14, 15:39
You may want to be clearer. Is your question : how can I change the
serialization strategy of Hive? (If so I let other users answer and I am
also interested in the answer.)

Else the answer is simple. If you want to join data which can not be stored
into memory, you need to serialize them. The only solution is to store the
data in a smarter way which would not require you to do the join. By the
way, how do you know the serialisation is the bottleneck?

Bertrand

On Tue, Aug 14, 2012 at 5:11 PM, sudeep tokala <[EMAIL PROTECTED]>wrote:

>
>
> On Tue, Aug 14, 2012 at 11:08 AM, sudeep tokala <[EMAIL PROTECTED]>wrote:
>
>> Hi all,
>>
>> How to avoid serialization and deserialization overhead in hive join
>> query ? will this optimize my query performance.
>>
>> Regards
>> sudeep
>>
>
>
--
Bertrand Dechoux