Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
Elasticsearch, Apache Solr, Apache HBase, Hadoop, Redis, Cassandra, Amazon CloudWatch, MySQL, Memcached, Apache Kafka, Apache ZooKeeper, Apache Storm, Ubuntu, CentOS, Red Hat, Debian, Puppet Labs, Java, SenseiDB
Results 1 to 10 of 290 (0.618s).
HiveClient - Apache Hive - Apache Software Foundation - Hive - [wiki]
https://cwiki.apache.org/confluence/display/Hive/HiveClient    Author: Lefty Leverenz, 2014-08-20, 00:00
Querying Directories - Drill Wiki - Apache Software Foundation - Drill - [wiki]
https://cwiki.apache.org/confluence/display/DRILL/Querying+Directories    Author: Bob Rumsby, 2014-08-20, 00:00
Re: Is hive UDF are supported in HiveContext - Spark - [mail # user]
...there is no collect_list in hive 0.12. try this after this ticket is done: https://issues.apache.org/jira/browse/SPARK-2706 i am also looking forward to this. ...
   Author: chutium, 2014-08-19, 23:58
Multiple column families vs Multiple tables - HBase - [mail # user]
...We are doing schema design for our application. One thing we are not so clear about is multiple column families (more than 3, probably 4 - 5) vs multiple tables. In our use case, we will have ...
   Author: Wei Liu, 2014-08-19, 23:56
Re: Impala queries running as root - Impala - [mail # user]
...Hi, You can download the latest rpms (1.4.0) that work with CDH 4 from: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Cloudera-Impala-Version-and-Download-Inform...
   Author: Ippokratis Pandis, 2014-08-19, 23:56
Re: Naive Bayes - Spark - [mail # user]
...The ratio should be okay. Could you try to pre-process the data and map -999.0 to 0 before calling NaiveBayes? Btw, I added a check to ensure nonnegative feature values: https://github.com/apa...
   Author: Xiangrui Meng, 2014-08-19, 23:51
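The pre-processing step suggested in this reply can be sketched in plain Python. This is only an illustration under assumptions: the snippet does not show the poster's data format, so feature vectors are taken to be lists of floats, -999.0 is treated as a missing-value sentinel, and the helper names `replace_sentinel` and `check_nonnegative` are hypothetical.

```python
from typing import List

SENTINEL = -999.0  # the placeholder value named in the reply


def replace_sentinel(features: List[float], sentinel: float = SENTINEL) -> List[float]:
    """Return a copy of the feature vector with every sentinel mapped to 0.0."""
    return [0.0 if x == sentinel else x for x in features]


def check_nonnegative(features: List[float]) -> None:
    """Raise ValueError if any feature is negative (hypothetical guard,
    mirroring the kind of check the reply mentions)."""
    for i, x in enumerate(features):
        if x < 0:
            raise ValueError(f"feature {i} is negative: {x}")


# Made-up sample rows, for illustration only.
rows = [[1.0, -999.0, 3.0], [-999.0, 2.0, 0.5]]
cleaned = [replace_sentinel(r) for r in rows]
for r in cleaned:
    check_nonnegative(r)
```

In a Spark job the same function would presumably be applied to each record in a `map` step before training; that wiring is not shown in the snippet and is left out here.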
Re: Only master is really busy at KMeans training - Spark - [mail # user]
...There are only 5 worker nodes. So please try to reduce the number of partitions to the number of available CPU cores. 1000 partitions are too many, because the driver needs to collect to tas...
   Author: Xiangrui Meng, 2014-08-19, 23:50
Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting? - Spark - [mail # user]
...No. Please create one but it won't be able to catch the v1.1 train. -Xiangrui On Tue, Aug 19, 2014 at 4:22 PM, Chris Fregly wrote: ...
   Author: Xiangrui Meng, 2014-08-19, 23:47
Re: Decision tree: categorical variables - Spark - [mail # user]
...The categorical features must be encoded into indices starting from 0: 0, 1, ..., numCategories - 1. Then you can provide the categoricalFeatureInfo map to specify which columns contain categor...
   Author: Xiangrui Meng, 2014-08-19, 23:46
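The encoding described in this reply can be sketched in plain Python. Assumptions are stated loudly: the helper names `build_index` and `encode_columns` are hypothetical, the sample rows are made up, and the map is assumed to go from column index to number of categories, since the snippet is cut off before it says what the map contains.

```python
from typing import Dict, Hashable, List, Sequence, Tuple


def build_index(values: Sequence[Hashable]) -> Dict[Hashable, int]:
    """Assign each distinct value an index 0, 1, ..., numCategories - 1."""
    return {v: i for i, v in enumerate(sorted(set(values)))}


def encode_columns(
    rows: List[list], categorical_columns: List[int]
) -> Tuple[List[List[float]], Dict[int, int]]:
    """Encode the given columns as category indices.

    Returns the encoded rows and a map of column index -> number of
    categories (the assumed shape of the map named in the reply).
    """
    indexes = {
        c: build_index([row[c] for row in rows]) for c in categorical_columns
    }
    encoded = [
        [float(indexes[c][v]) if c in indexes else float(v)
         for c, v in enumerate(row)]
        for row in rows
    ]
    info = {c: len(idx) for c, idx in indexes.items()}
    return encoded, info


# Made-up sample: column 0 is categorical, column 1 is numeric.
rows = [["red", 1.5], ["blue", 0.2], ["red", 3.0]]
encoded, categorical_feature_info = encode_columns(rows, [0])
```

Sorting the distinct values keeps the index assignment deterministic across runs, which matters if the same encoding has to be reapplied at prediction time.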
Re: slower worker node in the cluster - Spark - [mail # user]
...perhaps creating Fair Scheduler Pools might help? there's no way to pin certain nodes to a pool, but you can specify minShares (CPUs). not sure if that would help, but worth looki...
   Author: Chris Fregly, 2014-08-19, 23:41