Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive >> mail # user >> ORC vs TEXT file


Copy link to this message
-
Re: ORC vs TEXT file
Thanks Owen and Hongzhan.
I understand the behavior now.
On Tue, Aug 13, 2013 at 6:28 AM, hongzhan li <[EMAIL PROTECTED]>wrote:

> if you select all the columns ,the orc will not faster than textfile.but
> if you select some column (not all of the colimns),orc will run faster.
> —
>
>
> On Mon, Aug 12, 2013 at 6:40 PM, pandees waran <[EMAIL PROTECTED]> wrote:
>
>> Hi,
>>
>> Currently, we use TEXTFILE format in hive 0.8 ,while creating the
>> external tables in intermediate processing .
>> I have read about ORC in 0.11. I have created the same table in 0.11
>> with ORC format.
>> Without any compression, the ORC file(totally 3 files) occupied the
>> space twice more than the TEXTFILE(only one file).
>> Even, when i query the data from ORC:
>> Select count(*) from orc_table
>>
>> It took more time than the same query against textfile.
>> But, i see cumulative CPU time is lesser in ORC than the text file.
>>
>> What sort of queries will benefit, if we use ORC?
>> In which cases TEXTFILE will be preferred more than ORC?
>>
>> Thanks.
>>
>
>
--
Thanks,
Pandeeswaran
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB