Big Data / Search / DevOps
  • About
  • project

    • Spark (125748)
    • HBase (172323)
    • Ambari (165441)
    • Hive (125745)
    • Hadoop (105828)
    • Cassandra (103593)
    • Flink (67001)
    • Kafka (65262)
    • Zeppelin (61768)
    • Accumulo (60374)
    • Drill (51933)
    • Mesos (49960)
    • Metron (47742)
    • HDFS (46265)
    • Storm (43598)
    • Trafodion (41256)
    • Pig (37966)
    • Bigtop (34959)
    • Tez (34737)
    • MapReduce (34524)
    • Phoenix (33867)
    • Impala (29729)
    • YARN (27241)
    • NiFi (25473)
    • Lens (25436)
    • Aurora (24461)
    • Kylin (24360)
    • Beam (24198)
    • ZooKeeper (23501)
    • Eagle (22537)
    • Falcon (21772)
    • Flume (21212)
    • Tajo (20685)
    • Sqoop (20199)
    • Calcite (20177)
    • Oozie (19122)
    • Apex (17786)
    • Giraph (17329)
    • Ranger (17103)
    • Sentry (16198)
    • SystemML (15341)
    • Reef (13591)
    • Samza (13023)
    • Avro (12921)
    • CarbonData (11987)
    • Hama (11524)
    • Kudu (11348)
    • Bookkeeper (9701)
    • Atlas (9647)
    • Helix (9239)
    • Knox (8680)
    • Crunch (7696)
    • Arrow (7598)
    • Parquet (7452)
    • Gora (6387)
    • Chukwa (6100)
    • Curator (6044)
    • Orc (5485)
    • MetaModel (4568)
    • Fluo (3550)

    author

    • Reynold Xin (2572)
    • Sean Owen (2466)
    • Ted Yu (2136)
    • Michael Armbrust (1806)
    • Akhil Das (1510)
    • Tathagata Das (1418)
    • Patrick Wendell (1411)
    • Xiangrui Meng (1362)
    • Matei Zaharia (1259)
    • Mich Talebzadeh (1242)
    • Marcelo Vanzin (1005)
    • Nicholas Chammas (915)
    • Cody Koeninger (910)
    • Josh Rosen (873)
    • Davies Liu (864)
    • Cheng Lian (862)
    • Joseph K. Bradley (755)
    • Jacek Laskowski (754)
    • Koert Kuipers (710)
    • Andrew Or (698)
    • ayan guha (675)
    • Yin Huai (631)
    • Mark Hamstra (599)
    • Xiao Li (560)
    • Shixiong Zhu (536)
    • Wenchen Fan (527)
    • Steve Loughran (525)
    • Andrew Ash (480)
    • Hyukjin Kwon (469)
    • Nick Pentreath (464)
    • Yanbo Liang (463)
    • Jörn Franke (447)
    • Sandy Ryza (438)
    • kant kodali (438)
    • shane knapp (416)
    • Aaron Davidson (399)
    • Debasish Das (395)
    • Shivaram Venkataraman (386)
    • Saisai Shao (372)
    • Felix Cheung (368)
    • Holden Karau (354)
    • Takeshi Yamamuro (354)
    • Liang-Chi Hsieh (348)
    • Mayur Rustagi (343)
    • Nan Zhu (327)
    • DB Tsai (326)
    • Imran Rashid (308)
    • Jeff Zhang (299)
    • Burak Yavuz (292)
    • Dongjoon Hyun (287)
    • Gourav Sengupta (251)
    • Tobias Pfeiffer (242)
    • ÐΞ€ρ@... (241)
    • Andy Davidson (232)
    • Gerard Maas (215)
    • Yana Kadiyska (214)
    • Joseph Bradley (210)
    • Stephen Boesch (209)
    • holdenk (209)
    • Denny Lee (205)

    type

    • mail # user (71983)
    • issue (23782)
    • mail # dev (20007)
    • javadoc (7094)
    • source code (2867)
    • web site (211)
    • wiki (23)
  • date

    • last 7 days (283)
    • last 30 days (1191)
    • last 90 days (3222)
    • last 6 months (5919)
    • last 9 months (115787)
clear query| facets| time Search criteria: .   Results from 41 to 50 from 125748 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-21661] SparkSQL can't merge load table from Hadoop - Spark - [issue]
...Here is the original text of external table on HDFS:Permission Owner Group Size Last Modified Replication Block Size Name-rw-r--r-- root supergroup 0 B 8/6/2017, 11:43:03 PM 3 256 MB income_...
http://issues.apache.org/jira/browse/SPARK-21661    Author: Dapeng Sun , 2018-04-26, 13:21
  
[SPARK-23151] Provide a distribution of Spark with Hadoop 3.0 - Spark - [issue]
...Provide a Spark package that supports Hadoop 3.0.0. Currently the Spark packageonly supports Hadoop 2.7 i.e. spark-2.2.1-bin-hadoop2.7.tgz. The implication isthat using up to date Kinesis li...
http://issues.apache.org/jira/browse/SPARK-23151    Author: Louis Burke , 2018-04-26, 13:08
  
[SPARK-23929] pandas_udf schema mapped by position and not by name - Spark - [issue]
...The return struct of a pandas_udf should be mapped to the provided schema by name. Currently it's not the case.Consider these two examples, where the only change is the order of the fields i...
http://issues.apache.org/jira/browse/SPARK-23929    Author: Omri , 2018-04-26, 11:26
  
[expand - 3 more] [collapse] - saveAsNewAPIHadoopDataset must not enable speculation for parquet file? - Spark - [mail # dev]
...sorry, not noticed this followup. Been busy with other issuesOn 3 Apr 2018, at 11:19, cane > wrote:Now, if we use saveAsNewAPIHadoopDataset with speculation enable.It may causedata loss.I...
   Author: Steve Loughran , 周康 , ... , 2018-04-26, 11:24
  
saveAsNewAPIHadoopDataset must not enable speculation for parquet file? - Spark - [mail # dev]
...I observe that.If commit Job done on driver and commit task done on executor.With speculation enable,it may cause data loss.Since commit Job will call listStatus and commit Task will delete ...
   Author: 周康 , 2018-04-08, 05:20
saveAsNewAPIHadoopDataset must not enable speculation for parquet file? - Spark - [mail # dev]
...> On 3 Apr 2018, at 11:19, cane  wrote:> > Now, if we use saveAsNewAPIHadoopDataset with speculation enable.It may cause> data loss.> I check the comment of thi api:> ...
   Author: Steve Loughran , 2018-04-03, 20:45
saveAsNewAPIHadoopDataset must not enable speculation for parquet file? - Spark - [mail # dev]
...Now, if we use saveAsNewAPIHadoopDataset with speculation enable.It may causedata loss.I check the comment of thi api:  We should make sure our tasks are idempotent when speculation is ...
   Author: cane , 2018-04-03, 10:19
[SPARK-4781] Column values become all NULL after doing ALTER TABLE CHANGE for renaming column names (Parquet external table in HiveContext) - Spark - [issue]
...I have a table say created like follows:CREATE EXTERNAL TABLE pmt (  `sorted::cre_ts` string)STORED AS PARQUETLOCATION '...'And I renamed the column from sorted::cre_ts to cre_ts by doi...
http://issues.apache.org/jira/browse/SPARK-4781    Author: Jianshi Huang , 2018-04-26, 11:13
  
[SPARK-11748] Result is null after alter column name of table stored as Parquet - Spark - [issue]
...1. Test with the following code    hctx.sql(" create table " + table + " (id int, str string) STORED AS PARQUET ")    val df = hctx.jsonFile("g:/vip.json")    d...
http://issues.apache.org/jira/browse/SPARK-11748    Author: pin_zhang , 2018-04-26, 11:12
  
[SPARK-11334] numRunningTasks can't be less than 0, or it will affect executor allocation - Spark - [issue]
...With Dynamic Allocation function, a task failed over maxFailure time, all the dependent jobs, stages, tasks will be killed or aborted. In this process, SparkListenerTaskEnd event will be beh...
http://issues.apache.org/jira/browse/SPARK-11334    Author: meiyoula , 2018-04-26, 10:54
  
[SPARK-23885] trying to spark submit 2.3.0 on minikube - Spark - [issue]
... spark-submit on minikube(kubernets) failing .Kindly refere link for details  https://stackoverflow.com/questions/49689298/exception-in-thread-main-org-apache-spark-sparkexception-must-speci...
http://issues.apache.org/jira/browse/SPARK-23885    Author: anant pukale , 2018-04-26, 10:27
  
[SPARK-24098] ScriptTransformationExec should wait process exiting before output iterator finish - Spark - [issue]
...In our spark cluster, some users find that spark may lost data when they use transform in sql.We check the output file and discovery that some file are empty.Then we check the executor's log...
http://issues.apache.org/jira/browse/SPARK-24098    Author: Lijia Liu , 2018-04-26, 08:58
  
Standard scaler on multiple columsn without a vector - Spark - [mail # user]
...Hi,we are heavy users of spark, but only for data munging, not for trainingour models. Now we are planing to use MLLib transformers for our offlinetransformations, and want to have something...
   Author: Brammert Ottens , 2018-04-26, 08:05
  
< Prev 3 4 5 6 7 Next >
Apache Lucene, Apache Solr and all other Apache Software Foundation project and their respective logos are trademarks of the Apache Software Foundation.
Elasticsearch, Kibana, Logstash, and Beats are trademarks of Elasticsearch BV, registered in the U.S. and in other countries. This site and Sematext Group is in no way affiliated with Elasticsearch BV.
Service operated by Sematext