Results 1 to 10 of 94 (0.165s).
Re: high memory usage for catalogd - Impala - [mail # user]
...For some reason reverting to the older metadata load path by using -load_catalog_in_background=false avoids the issue. Norbert On Tue, Sep 16, 2014 at 12:59 PM, Norbert Burger wrote: To unsubscr...
   Author: Norbert Burger, 2014-09-16, 18:13
Re: regex filters and scanner caching - HBase - [mail # user]
...Thanks Ted. I'll try to extract a unit test. Norbert On Wed, Mar 26, 2014 at 9:46 PM, Ted Yu wrote: ...
   Author: Norbert Burger, 2014-03-27, 02:44
Re: pig based mapreduce jobs: /usr/lib/hadoop/lib - Pig - [mail # user]
...I'll usually do something like this: CLASSPATH=$(hadoop classpath):/usr/lib/pig/pig-withouthadoop.jar java -jar... Norbert On Sat, Mar 22, 2014 at 8:14 PM, Jay Vyas wrote: ...
   Author: Norbert Burger, 2014-03-24, 02:47
Re: Metastore performance on HDFS-backed table with 15000+ partitions - Hive - [mail # user]
...Thanks everyone for the feedback. Just to follow up in case someone else runs into this: I can confirm that local client works around the OOMEs, but it's still very slow. It does seem lik...
   Author: Norbert Burger, 2014-02-27, 12:57
Re: 答复: one table flushes at much smaller sizes than other? - HBase - [mail # user]
...Thanks Ted - this config change appears to have reduced quite a bit of the memstore flushes. Norbert On Fri, Dec 27, 2013 at 12:03 AM, Ted Yu wrote: ...
   Author: Norbert Burger, 2013-12-27, 17:44
Re: Custom counters from Jython UDF function - Pig - [mail # user]
...Yes - take a look at PigCounterHelper.  Instantiate a variable of this type, and then you can call the method incrCounter() on it:  _counter = PigCounterHelper() _counter.incrCount...
   Author: Norbert Burger, 2013-07-31, 16:24
Re: Nb of reduce tasks when GROUPing - Pig - [mail # user]
...As Jonathan mentioned, TOP should obviate this particular use case.  But for future examples, the parameters pig.exec.reducers.bytes.per.reducer and pig.exec.reducers.max might be usefu...
   Author: Norbert Burger, 2013-05-21, 17:23
Re: Ignore first record of a file - Pig - [mail # user]
...Perhaps the general way to do this is to write a custom loader, but for this simpler use case, can you just filter out the record?  FILTER ... BY $0 MATCHES '^[0-9]+'  Norbert  ...
   Author: Norbert Burger, 2013-03-14, 17:38
Re: removing dupes from a bag while saving first occurrence - Pig - [mail # user]
...Looking at your sample, it seems you have a GROUP BY generating these bags...?  Could you just insert a DISTINCT before this GROUP BY?  Norbert  On Fri, Mar 8, 2013 at 5:00 PM,...
   Author: Norbert Burger, 2013-03-08, 22:10
Re: too many memory spills - Pig - [mail # user]
...I thought Todd Lipcon's Hadoop Summit presentation [1] had some good info on this topic.  [1] http://www.slideshare.net/cloudera/mr-perf  Norbert  On Thu, Mar 7, 2013 at 7:25 ...
   Author: Norbert Burger, 2013-03-08, 02:47