Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 81 to 90 from 112 (0.071s).
Loading phrases to help you
refine your search...
Re: Re: bz2 Splits. - Hive - [mail # user]
... Will do.     My bad here. I think I forgot to import some files when running the queries earlier. The counts are exactly the same. However the timings for "select count(1)" q...
   Author: Saurabh Nanda, 2009-07-27, 05:05
Re: Importing log files in custom (non-delimited) format - Hive - [mail # user]
...  After I manage to load previous log files into the Hadoop cluster, that's precisely what I'm going to do -- change the log format to be tab delimited.  Saurabh. http://nandz.blog...
   Author: Saurabh Nanda, 2009-07-25, 13:37
Re: Re: bz2 Splits. - Hive - [mail # user]
...  This really clears things up. I guess adding a note in the Wiki will put an end to the confusion permanently. A little note on the approach (compressed textfile vs compressed sequence...
   Author: Saurabh Nanda, 2009-07-25, 10:48
Re: Re: bz2 Splits. - Hive - [mail # user]
...I tried both the approaches.  Approach #1: a) gunzip log file b) import into textfile table c) set hive.exec.compress.output to true d) inserted into sequencefile table  It seems t...
   Author: Saurabh Nanda, 2009-07-25, 10:46
Re: Re: bz2 Splits. - Hive - [mail # user]
... I'm completely confused right now. If sequencefiles are not used for compressed data storage then what are they used for?  If I have a gz file, and I want to import it as is (with...
   Author: Saurabh Nanda, 2009-07-25, 10:27
Re: Re: bz2 Splits. - Hive - [mail # user]
...I tried the following and ran into an error message:  create table compressed_raw(line string) partitioned by(dt string) row format delimited fields terminated by '\t' lines terminated ...
   Author: Saurabh Nanda, 2009-07-25, 10:05
Re: bz2 Splits. - Hive - [mail # user]
...Please excuse my ignorance, but can I import gzip compressed files directly as Hive tables? I have separate gzip files for each days weblog data. Right now I am gunzipping them and then impo...
   Author: Saurabh Nanda, 2009-07-24, 15:09
Re: Importing log files in custom (non-delimited) format - Hive - [mail # user]
...Hi Zheng,  Thanks for the reply, but I gave up on UDFs & SerDe and resorted to custom map/reduce scripts instead. In case you're interested, I've written about my Hive experience at htt...
   Author: Saurabh Nanda, 2009-07-24, 12:27
Re: insert into not supported? - Hive - [mail # user]
...Never ran into this problem till now, but will soon begin to once I start loading more days of weblog data into the cluster. Can I append data into a table if each INSERT INTO is in a new pa...
   Author: Saurabh Nanda, 2009-07-22, 05:38
Re: dense_rank() equivalent in Hive? - Hive - [mail # user]
... I guess the answer to my query is given at http://wiki.apache.org/hadoop/Hive/LanguageManual/SortBy --  "Hive uses the columns in *Distribute By* to distribute the rows among redu...
   Author: Saurabh Nanda, 2009-07-20, 06:37
Sort:
project
Hive (112)
Hadoop (4)
type
mail # user (111)
issue (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (1)
last 9 months (112)
author
Namit Jain (802)
Edward Capriolo (636)
Zheng Shao (613)
Carl Steinbach (604)
John Sichi (297)
Mark Grover (269)
Ning Zhang (255)
Ashutosh Chauhan (250)
Nitin Pawar (220)
Ashish Thusoo (169)
Kevin Wilfong (161)
He Yongqiang (156)
Prasad Chakka (152)
Bejoy Ks (132)
Navis (128)
Saurabh Nanda