Hive, mail # user - question..


RE: question..
Ashish Thusoo 2010-08-20, 19:18
I think the query did finish properly. Can you recheck the data to see if you would really get a few rows of output?

Ashish
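
One way to do the recheck suggested above is to run the two halves of the quoted query separately and compare the counts. This is only a sketch: it assumes the table and column names from the query quoted below (pages.id, pages.page, callbacks.id) and nothing else about the schema.

```sql
-- Step 1: how many rows in pages match any of the search-engine patterns?
-- If this returns 0, the filtered subquery is empty and the join cannot emit rows.
SELECT count(1)
FROM pages p
WHERE p.page LIKE '%google.com/search%'
   OR p.page LIKE '%google.com/custom%'
   OR p.page LIKE '%google.com/#hl%'
   OR p.page LIKE '%google.com/cse%'
   OR p.page LIKE '%search.yahoo.com/search%'
   OR p.page LIKE '%bing.com/search%'
   OR p.page LIKE '%google.com/product%';

-- Step 2: how many of those ids also appear in callbacks?
-- A non-zero count in step 1 with 0 here means the ids do not line up between the tables.
SELECT count(1)
FROM callbacks cb
JOIN (SELECT p.id
      FROM pages p
      WHERE p.page LIKE '%google.com/search%'
         OR p.page LIKE '%google.com/custom%'
         OR p.page LIKE '%google.com/#hl%'
         OR p.page LIKE '%google.com/cse%'
         OR p.page LIKE '%search.yahoo.com/search%'
         OR p.page LIKE '%bing.com/search%'
         OR p.page LIKE '%google.com/product%') s
ON (s.id = cb.id);
```

If both counts are non-zero, the original query would be expected to return rows, and the problem is more likely in how the output is being read than in the data.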

________________________________
From: Ronak Bhatt [mailto:[EMAIL PROTECTED]]
Sent: Friday, August 20, 2010 12:12 PM
To: [EMAIL PROTECTED]
Subject: question..

Hi - in my Hive environment, I ran the following query expecting to see rows (the data is present). However, after 2339 seconds, the output I get is as shown below (I've pasted the last 5-10 lines of screen output).

Is there anything that I'm missing? Did the process finish correctly? Is there something that could point me to how to debug this?

----------------------------- Query ------------------------------

select substr(CB.EXEC_DATE,1,10), count(CB.ID)
    from callbacks CB JOIN
      (select * from  pages p where
               p.page like '%google.com/search%'
            or p.page like '%google.com/custom%'
            or p.page like '%google.com/#hl%'
            or p.page like '%google.com/cse%'
            or p.page like '%search.yahoo.com/search%'
            or p.page like '%bing.com/search%'
            or p.page like '%google.com/product%' ) s
ON s.id = cb.id
    group by substr(CB.EXEC_DATE,1,10);
================= o/p on screen =================
10/08/20 11:13:38 INFO mapred.TaskRunner: Task 'attempt_local_0001_r_000000_0' done.
2010-08-20 11:13:39,526 map = 100%,  reduce =100%
10/08/20 11:13:39 INFO exec.ExecDriver: 2010-08-20 11:13:39,526 map = 100%,  reduce =100%
Ended Job = job_local_0001
10/08/20 11:13:39 INFO exec.ExecDriver: Ended Job = job_local_0001
10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001 to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate
10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/10001

OK
Time taken: 2339.331 seconds
====================================================

thanks, ronak

408 504 4847
My Blog : http://ronakbaps.posterous.com