Hive, mail # user - Why no two aggregations can have different DISTINCT columns ?


Re: Why no two aggregations can have different DISTINCT columns ?
Amr Awadallah 2010-02-25, 09:25
+1, please post jira/patch.

-- amr

On 2/25/2010 1:20 AM, Zheng Shao wrote:
> Yes definitely. Do you want to open a JIRA and post a patch?
> Please link the new JIRA to the other two JIRAs that were mentioned in the
> same email thread.
>
> Zheng
>
> On Thu, Feb 25, 2010 at 1:16 AM, Mafish Liu<[EMAIL PROTECTED]>  wrote:
>    
>> Hive does not support multi-distinct in one query.
>>
>> We have implemented multi-distinct on top of Hive 0.4.2rc to meet our own needs.
>> We don't know whether the Hive project is interested in this feature.
>>
>> 2010/2/25 Jeff Zhang<[EMAIL PROTECTED]>:
>>      
>>> Hi all,
>>>
>>> I read the Hive tutorial, and it says that "no two aggregations can have
>>> different DISTINCT columns". Could anyone tell me the reason? Will the
>>> DISTINCT aggregations in the following query be translated into a
>>> map-reduce job, or are they computed locally?
>>>
>>>      INSERT OVERWRITE TABLE pv_gender_agg
>>>      SELECT pv_users.gender, count(DISTINCT pv_users.userid),
>>>             count(DISTINCT pv_users.ip)
>>>      FROM pv_users
>>>      GROUP BY pv_users.gender;
>>>
>>> --
>>> Best Regards
>>>
>>> Jeff Zhang
>>>
>>>        
>>
>>
>> --
>> [EMAIL PROTECTED]
>>
>>      
>
>
>
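
Editor's note: the query quoted above uses two different DISTINCT columns, which the thread says Hive does not accept in a single query. One possible workaround, sketched here only as an illustration and not as the patch discussed in the thread, is to compute each distinct count in its own subquery and join the results on gender. The aliases u, i, userid_cnt and ip_cnt are made up for this example, and pv_gender_agg is assumed to have three columns (gender, distinct user count, distinct IP count).

```sql
-- Sketch only: each subquery contains a single DISTINCT column,
-- so neither one mixes different DISTINCT columns.
INSERT OVERWRITE TABLE pv_gender_agg
SELECT u.gender, u.userid_cnt, i.ip_cnt
FROM (
  -- distinct users per gender
  SELECT pv_users.gender AS gender,
         count(DISTINCT pv_users.userid) AS userid_cnt
  FROM pv_users
  GROUP BY pv_users.gender
) u
JOIN (
  -- distinct IPs per gender
  SELECT pv_users.gender AS gender,
         count(DISTINCT pv_users.ip) AS ip_cnt
  FROM pv_users
  GROUP BY pv_users.gender
) i
ON (u.gender = i.gender);
```

Because this is an inner join on gender, rows whose gender is NULL would not appear in the result; whether that matters depends on the data. The rewrite also scans pv_users twice, so it will likely cost more than a native multi-distinct implementation.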