Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.
Mark Grover 2012-06-04, 16:26
As far as I know, newlines are the only supported way to separate records right now. As a corollary, if a single logical record spans multiple lines, you will have to get rid of the extra newlines so that all of it ends up in the same record.
So, to get around it, you can do one of two things:
1) Pre-process your files so that records are separated by newlines and no record contains an embedded newline.
2) As Ed Capriolo suggested in a previous email thread, you could try to use streaming: parse your XML in the streaming script and emit multiple records from there.
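
For option 1, here is a rough sketch of what the pre-processing could look like in Python. It assumes the raw file already ends each record with '\001' (as in your CREATE TABLE attempt) and that it is acceptable to replace the newlines inside the XML with a single space. The function name flatten_records and its parameters are made up for illustration; they are not part of Hive.

```python
def flatten_records(raw, record_sep="\x01", newline_replacement=" "):
    """Return `raw` rewritten so that each record occupies exactly one line.

    Assumptions (not from Hive itself): records in `raw` are terminated by
    `record_sep` ('\\001' by default), and newlines embedded in a record can
    safely be replaced by `newline_replacement`.
    """
    flattened = []
    for record in raw.split(record_sep):
        # Normalize Windows/Mac line endings, then drop newlines that only
        # surround the record (e.g. a line break written after the separator).
        record = record.replace("\r\n", "\n").replace("\r", "\n").strip("\n")
        if not record:
            # Skip the empty chunk left after the final separator.
            continue
        # Replace the newlines that are part of the record's content.
        flattened.append(record.replace("\n", newline_replacement))
    # One record per line, which is what LINES TERMINATED BY '\n' expects.
    return "".join(line + "\n" for line in flattened)
```

You would run something like this over each input file before loading it, then create the table with FIELDS TERMINATED BY '|' and the default newline line terminator. Note that replacing newlines with spaces changes the whitespace inside the XML; if that matters, pick a different replacement string and reverse it when reading the column back.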
----- Original Message -----
From: "tabraiz anwer" <[EMAIL PROTECTED]>
To: "hive group" <[EMAIL PROTECTED]>
Sent: Monday, June 4, 2012 12:08:12 PM
Subject: Problem: LINES TERMINATED BY only supports newline '\n' right now.
I tried to create the table with "LINES TERMINATED BY '\001'",
and it gives me this error:
Error in semantic analysis: 3:66 LINES TERMINATED BY only supports newline '\n' right now. Error encountered near token ''\001''
CREATE TABLE xmlgw4 ( transactionid string, typeid string,
sentxml string,receivedxml string )
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\001'
STORED AS TEXTFILE;
Instead of '\n' I am using '\001' because I have an XML value that I want to store in Hive, and it includes \n characters.
Using Hive version: hive-0.8.1