Hive, mail # user - ROW_NUMBER() equivalent in Hive


Re: ROW_NUMBER() equivalent in Hive
Owen O'Malley 2013-02-21, 16:08
What are the semantics of ROW_NUMBER? Is it a global row number? Per
partition? Per bucket?

-- Owen
On Wed, Feb 20, 2013 at 11:33 PM, kumar mr <[EMAIL PROTECTED]> wrote:

> Hi,
>
>  This is Kumar, and this is my first question in this group.
>
>  I need to implement the equivalent of Teradata's ROW_NUMBER() in Hive,
> where partitioning is on multiple columns and ordering is also on multiple
> columns.
> This is easy to implement in Hadoop MR, but I have to do it in Hive. With a
> UDF I can assign a rank within the same grouping key, given that the dataset
> is small, but the ordering needs to be done in a prior step.
> Is there a simpler way to do this?
>
>  Thanks in advance.
>
>  Regards,
> Kumar
>
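
For reference, the semantics being asked about are those of the SQL form ROW_NUMBER() OVER (PARTITION BY ... ORDER BY ...): the counter restarts at 1 for each distinct combination of the partition columns and increases in the order given by the ordering columns. The following is only a minimal Python sketch of that behaviour, not Hive code. The column names (region, dept, year, sales) and the helper row_number are hypothetical and chosen purely for illustration.

```python
from itertools import groupby


def row_number(rows, partition_by, order_by):
    """Return copies of rows with a 1-based 'row_num' that restarts per partition.

    rows         -- list of dicts (hypothetical sample records)
    partition_by -- column names that define a partition
    order_by     -- column names that define ascending order inside a partition
    """
    def part_key(row):
        return tuple(row[c] for c in partition_by)

    def sort_key(row):
        # Sort by partition columns first so each partition is contiguous,
        # then by the ordering columns within the partition.
        return tuple(row[c] for c in list(partition_by) + list(order_by))

    numbered = []
    for _, group in groupby(sorted(rows, key=sort_key), key=part_key):
        for n, row in enumerate(group, start=1):
            numbered.append({**row, "row_num": n})
    return numbered


# Hypothetical sample data: two partitions on (region, dept).
rows = [
    {"region": "east", "dept": "a", "year": 2013, "sales": 5},
    {"region": "east", "dept": "a", "year": 2012, "sales": 7},
    {"region": "west", "dept": "b", "year": 2012, "sales": 3},
    {"region": "east", "dept": "a", "year": 2012, "sales": 2},
    {"region": "west", "dept": "b", "year": 2011, "sales": 9},
]

for r in row_number(rows, ["region", "dept"], ["year", "sales"]):
    print(r)
```

In this sketch the numbering is per partition, not global and not per bucket. The sort on the combined key is what the "ordering in a prior step" in the question corresponds to.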