Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 274 (0.69s).
Loading phrases to help you
refine your search...
Querying Directories - Drill Wiki - Apache Software Foundation - Drill - [wiki]
...                Go to start of metadata                          ...
https://cwiki.apache.org/confluence/display/DRILL/Querying+Directories    Author: Bob Rumsby, 2014-08-20, 00:00
Re: Is hive UDF are supported in HiveContext - Spark - [mail # user]
...there is no collect_list in hive 0.12try this after this ticket is donehttps://issues.apache.org/jira/browse/SPARK-2706i am also looking forward to this.View this message in context: http://...
   Author: chutium, 2014-08-19, 23:58
[expand - 4 more] - Multiple column families vs Multiple tables - HBase - [mail # user]
...We are doing schema design for our application, One thing we are not soclear about is multiple column families (more than 3, probably 4 - 5) vsmultiple tables. In our use case, we will have ...
   Author: Wei Liu, 2014-08-19, 23:56
[expand - 6 more] - Re: Impala queries running as root - Impala - [mail # user]
...Hi,You can download the latest rpms (1.4.0) that work with CDH 4 from:http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Cloudera-Impala-Version-and-Download-Inform...
   Author: Ippokratis Pandis, 2014-08-19, 23:56
[expand - 3 more] - Re: Naive Bayes - Spark - [mail # user]
...The ratio should be okay. Could you try to pre-process the data andmap -999.0 to 0 before calling NaiveBayes? Btw, I added a check toensure nonnegative features values:https://github.com/apa...
   Author: Xiangrui Meng, 2014-08-19, 23:51
[expand - 1 more] - Re: Only master is really busy at KMeans training - Spark - [mail # user]
...There are only 5 worker nodes. So please try to reduce the number ofpartitions to the number of available CPU cores. 1000 partitions aretoo bigger, because the driver needs to collect to tas...
   Author: Xiangrui Meng, 2014-08-19, 23:50
[expand - 1 more] - Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting? - Spark - [mail # user]
...No. Please create one but it won't be able to catch the v1.1 train. -XiangruiOn Tue, Aug 19, 2014 at 4:22 PM, Chris Fregly  wrote: ...
   Author: Xiangrui Meng, 2014-08-19, 23:47
[expand - 1 more] - Re: Decision tree: categorical variables - Spark - [mail # user]
...The categorical features must be encoded into indices starting from 0:0, 1, ..., numCategories - 1. Then you can provide thecategoricalFeatureInfo map to specify which columns containcategor...
   Author: Xiangrui Meng, 2014-08-19, 23:46
Re: slower worker node in the cluster - Spark - [mail # user]
...perhaps creating Fair Scheduler Pools might help?  there's no way to pincertain nodes to a pool, but you can specify minShares (cpu's).  not sureif that would help, but worth looki...
   Author: Chris Fregly, 2014-08-19, 23:41
Issue with Hadoop/Kerberos security as client - Hadoop - [mail # user]
...We are encountering a really strange issue accessing Hadoop securely as a client.  We go through the motions of calling setting the security configuration:    YarnConfigu...
   Author: John Lilley, 2014-08-19, 23:36
Sort:
project
Spark (68)
HBase (32)
Cassandra (26)
Ambari (25)
Hive (24)
Hadoop (23)
Kafka (14)
Impala (8)
Bigtop (7)
HDFS (7)
Mesos (5)
Sqoop (5)
Tez (5)
YARN (5)
Drill (4)
Pig (4)
Storm (3)
Zookeeper (3)
Flume (2)
MapReduce (2)
Accumulo (1)
Avro (1)
type
issue (138)
mail # user (110)
mail # dev (23)
wiki (2)
mail # general (1)
date
last 7 days (3179)
last 30 days (12168)
last 90 days (28014)
last 6 months (43232)
last 9 months (162752)
author
Ted Yu (7)
Misty Stanley-Jones (5)
Xiangrui Meng (5)
Guozhang Wang (4)
Roman Shaposhnik (4)
Xuefu Zhang (4)
Akira AJISAKA (3)
Andrew Purtell (3)
Chengxiang Li (3)
Chris Fregly (3)
Dmitry Lysnichenko (3)
Jean-Marc Spaggiari (3)
Joel Koshy (3)
Reynold Xin (3)
Robert Stupp (3)
Sean Owen (3)
Xi Wang (3)
salemi (3)
Abraham Elmahrek (2)
Aleksandr Kovalenko (2)
Andrey Stepachev (2)
Andrii Babiichuk (2)
Arpit Agarwal (2)
Ashish Kumar Singh (2)
Charles Lamb (2)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB