how to compute histogram on non-numeric data set?
Richard 2012-03-12, 05:18
I have noticed histogram_numeric(col, n), but it seems to require a numeric column. I have a string column whose values look numeric but are really category labels, e.g. 100001 and 200034. Two different strings are two different categories, and the numeric value itself does not mean anything, so it is not appropriate to use histogram_numeric(CAST(col AS BIGINT), n). Thanks. Richard
Re: how to compute histogram on non-numeric data set?
Philip Tromans 2012-03-12, 12:36
Is that not just a COUNT(1) and a GROUP BY?
Phil.
2012/3/12 Richard <[EMAIL PROTECTED]>:
> I have noticed histogram_numeric(col, n), but it seems to require numeric
> column.
> I have a string column, they are numeric like string but are category label,
> e.g,
>
> 100001, 200034
>
> two different strings are two different category but the numeric value does
> not mean anything,
> so it is not proper to use histogram_numeric(cast(col to BIGINT), n).
>
> thanks.
> Richard
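
For reference, the COUNT(1) and GROUP BY suggestion could be sketched roughly as below. The table name `events` and the column name `category` are assumptions made for illustration; neither appears in the thread. Because the column is grouped as a string, each distinct label becomes its own bucket and no numeric cast is needed.

```sql
-- Hypothetical table `events` with a STRING column `category`
-- holding labels such as '100001' and '200034'.
-- Each distinct label gets one output row with its frequency.
SELECT category,
       COUNT(1) AS freq
FROM events
GROUP BY category;
```

Unlike histogram_numeric, this returns one row per distinct label rather than a fixed number of bins, so the output size grows with the number of categories.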