Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 28 (0.222s).
Loading phrases to help you
refine your search...
ETL and workflow management on Spark - Spark - [mail # user]
...Hi,We are moving into adopting the full stack of Spark. So far, we have usedShark to do some ETL work, which is not bad but is not prefect either. Weended writing UDF and UDGF, UDAF that can...
   Author: William Kang, 2014-05-22, 14:50
Hadoop 2.3 Centralized Cache vs RDD - Spark - [mail # user]
...Hi,Any comments or thoughts on the implications of the newly released featurefrom Hadoop 2.3 on the centralized cache? How different it is from RDD?Many thanks.Cao ...
   Author: William Kang, 2014-05-16, 14:19
Finding the latest updated rows - HBase - [mail # user]
...Hi, In HBase, the time stamp is set for each column, not for the entire row. If somehow I want to find the latest updated (put new row, or update only certain columns in some rows, etc) rows...
   Author: William Kang, 2014-01-21, 04:06
[expand - 1 more] - Re: HBase load distribution vs. scan efficiency - HBase - [mail # user]
...Hi James, Thanks for the link.  Does this mean that the system has to remember the prefix, and append the prefix to the original key before the scan starts?  If this is the case, i...
   Author: William Kang, 2014-01-21, 02:59
[expand - 1 more] - Re: Books and good starting point for Hive - Hive - [mail # user]
...Hi guys, Thank you very much for pointing me to the right direction. I am glad that this community is so active.   William  On Sun, Feb 24, 2013 at 4:29 PM, Dean Wampler  wrot...
   Author: William Kang, 2013-02-26, 02:28
[expand - 1 more] - Re: Just started - Pig - [mail # user]
...Hi Alan, Thanks a lot for the great suggestion. The book looks very helpful.   William  On Sun, Feb 24, 2013 at 11:54 AM, Alan Gates  wrote:...
   Author: William Kang, 2013-02-26, 02:26
[expand - 2 more] - Re: Anyway to load certain Key/Value pair fast? - Hadoop - [mail # user]
...Hi Harsh, Thanks a lot for your reply and great suggestions.  In the practical cases, the values usually do not reside in the same data node. Instead, they are mostly distributed by the...
   Author: William Kang, 2013-02-13, 06:08
[expand - 3 more] - Re: Build Hadoop 1.0.4 eclipse plugin - Hadoop - [mail # general]
...Hi Gaurav, I looked up in the eclipse's error log, and the errors are listed: The command ("dfs.browser.action.delete") is undefined The command ("dfs.browser.action.refresh") is undefined T...
   Author: William Kang, 2013-01-20, 18:46
[expand - 2 more] - Re: Hbase 0.90.2 problems - HBase - [mail # user]
...Hi Harsh J, Thanks for your reply.  I will try the append version. But I already set HBase manage zookeeper. For some reason, it did not start it.   William  On Sat, Apr 9, 20...
   Author: William Kang, 2011-04-09, 18:45
[expand - 1 more] - Re: HBase random access in HDFS and block indices - HBase - [mail # user]
...Hi JG and Ryan, Thanks for the excellent answers.  So, I am going to push everything to the extremes without considering the memory first. In theory, if in HBase, every cell size equals...
   Author: William Kang, 2010-10-19, 03:21
HBase (19)
Hadoop (5)
Spark (2)
Hive (1)
Pig (1)
mail # user (27)
mail # general (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (2)
last 9 months (28)
Ted Yu (1700)
Harsh J (1296)
Todd Lipcon (995)
Stack (978)
Jun Rao (969)
Jonathan Ellis (844)
Andrew Purtell (817)
Jean-Daniel Cryans (752)
Yusaku Sako (718)
stack (714)
Jarek Jarcec Cecho (703)
Eric Newton (688)
Jonathan Hsieh (673)
Roman Shaposhnik (662)
Namit Jain (649)
Hitesh Shah (627)
Owen O'Malley (625)
Steve Loughran (624)
Siddharth Seth (614)
Josh Elser (557)
Brock Noland (549)
Eli Collins (545)
Neha Narkhede (545)
Arun C Murthy (543)
Doug Cutting (533)
William Kang