Results 81 to 90 of 112 (0.071s).
Re: Re: bz2 Splits. - Hive - [mail # user]
... Will do. My bad here. I think I forgot to import some files when running the queries earlier. The counts are exactly the same. However the timings for "select count(1)" q...
Author: Saurabh Nanda, 2009-07-27, 05:05

Re: Importing log files in custom (non-delimited) format - Hive - [mail # user]
... After I manage to load previous log files into the Hadoop cluster, that's precisely what I'm going to do -- change the log format to be tab delimited. Saurabh. http://nandz.blog...
Author: Saurabh Nanda, 2009-07-25, 13:37

Re: Re: bz2 Splits. - Hive - [mail # user]
... This really clears things up. I guess adding a note in the Wiki will put an end to the confusion permanently. A little note on the approach (compressed textfile vs compressed sequence...
Author: Saurabh Nanda, 2009-07-25, 10:48

Re: Re: bz2 Splits. - Hive - [mail # user]
...I tried both the approaches. Approach #1: a) gunzip log file b) import into textfile table c) set hive.exec.compress.output to true d) inserted into sequencefile table It seems t...
Author: Saurabh Nanda, 2009-07-25, 10:46

Re: Re: bz2 Splits. - Hive - [mail # user]
... I'm completely confused right now. If sequencefiles are not used for compressed data storage then what are they used for? If I have a gz file, and I want to import it as is (with...
Author: Saurabh Nanda, 2009-07-25, 10:27

Re: Re: bz2 Splits. - Hive - [mail # user]
...I tried the following and ran into an error message: create table compressed_raw(line string) partitioned by(dt string) row format delimited fields terminated by '\t' lines terminated ...
Author: Saurabh Nanda, 2009-07-25, 10:05

Re: bz2 Splits. - Hive - [mail # user]
...Please excuse my ignorance, but can I import gzip compressed files directly as Hive tables? I have separate gzip files for each day's weblog data. Right now I am gunzipping them and then impo...
Author: Saurabh Nanda, 2009-07-24, 15:09

Re: Importing log files in custom (non-delimited) format - Hive - [mail # user]
...Hi Zheng, Thanks for the reply, but I gave up on UDFs & SerDe and resorted to custom map/reduce scripts instead. In case you're interested, I've written about my Hive experience at htt...
Author: Saurabh Nanda, 2009-07-24, 12:27

Re: insert into not supported? - Hive - [mail # user]
...Never ran into this problem till now, but will soon begin to once I start loading more days of weblog data into the cluster. Can I append data into a table if each INSERT INTO is in a new pa...
Author: Saurabh Nanda, 2009-07-22, 05:38

Re: dense_rank() equivalent in Hive? - Hive - [mail # user]
... I guess the answer to my query is given at http://wiki.apache.org/hadoop/Hive/LanguageManual/SortBy -- "Hive uses the columns in *Distribute By* to distribute the rows among redu...
Author: Saurabh Nanda, 2009-07-20, 06:37