|
|
-
RE: question..Ashish Thusoo 2010-08-20, 19:18
I think the query did finish properly. Can you recheck the data to see if you would really get a few rows of output?
Ashish ________________________________ From: Ronak Bhatt [mailto:[EMAIL PROTECTED]] Sent: Friday, August 20, 2010 12:12 PM To: [EMAIL PROTECTED] Subject: question.. Hi - in my HIVE environment, I ran the following query and expecting to see the rows (the data is present)...however, after 2339 seconds, the o/p I get is as shown below in the email (I've pasted last 5~10 lines of screen output)... Is there anything that I'm missing? Did the process finish correctly? should there be something that could point me how to debug? ----------------------------- Query ------------------------------ select substr(CB.EXEC_DATE,1,10), count(CB.ID<http://CB.ID>) from callbacks CB JOIN (select * from pages p where p.page like '%google.com/search%<http://google.com/search%><http://google.com/search%>' or p.page like '%google.com/custom%<http://google.com/custom%><http://google.com/custom%>' or p.page like '%google.com/#hl%<http://google.com/#hl%><http://google.com/#hl%>' or p.page like '%google.com/cse%<http://google.com/cse%><http://google.com/cse%>' or p.page like '%search.yahoo.com/search%<http://search.yahoo.com/search%><http://search.yahoo.com/search%>' or p.page like '%bing.com/search%<http://bing.com/search%><http://bing.com/search%>' or p.page like '%google.com/product%<http://google.com/product%><http://google.com/product%>' ) s ON s.id<http://s.id> = cb.id<http://cb.id> group by substr(CB.EXEC_DATE,1,10); ================= o/p on screen ================= 10/08/20 11:13:38 INFO mapred.TaskRunner: Task 'attempt_local_0001_r_000000_0' done. 2010-08-20 11:13:39,526 map = 100%, reduce =100% 10/08/20 11:13:39 INFO exec.ExecDriver: 2010-08-20 11:13:39,526 map = 100%, reduce =100% Ended Job = job_local_0001 10/08/20 11:13:39 INFO exec.ExecDriver: Ended Job = job_local_0001 10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001<http://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001> to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate<http://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate> 10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate<http://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate> to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/10001<http://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/10001> OK Time taken: 2339.331 seconds ==================================================== thanks, ronak 408 504 4847 My Blog : http://ronakbaps.posterous.com |