-Re: Declaring schema for unknown number of columns
Chan, Tim 2013-01-08, 01:48
Since I don't know how many columns I will have, I do something like this.
six_month_and_variable_month_sales_2 = FOREACH
GENERATE $0 AS ed_style_id,
$1 AS sale_start_month,
$2 AS sale_month_1,
$3 AS sale_month_2,
$4 AS sale_month_3,
$5 AS sale_month_4,
$6 AS sale_month_5,
$7 AS sale_month_6,
I still get the same error when I try to join on this relation.
On Mon, Jan 7, 2013 at 2:27 PM, Jinyuan Zhou <[EMAIL PROTECTED]> wrote:
> If you can load it but join operation need the complete schema, then you
> can try do a generate statement to project your original relation to
> produce the one you can define schema for all fields.
> On Mon, Jan 7, 2013 at 2:19 PM, Chan, Tim <[EMAIL PROTECTED]> wrote:
> > Is it possible to declare a schema when doing a LOAD for data in which
> > do not know the total number of columns?
> > For instance. I know the data contains 6 or more columns. These columns
> > of the same data type.
> > I basically want to join this data with another data set, but I was
> > the following error:
> > ERROR 1109: Input (six_month_and_variable_month_sales) on which outer
> > join is desired should have a valid schema
> -- Jinyuan (Jack) Zhou