Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Accumulo >> mail # user >> how to use CountingIterator to count records?


Copy link to this message
-
RE: how to use CountingIterator to count records?
It's an adaptation of a feature table where the weight is the number of
occurrences found during ingest.  The rowId's are features that are
relevant to my queries/row counts (e.g. timespan, geo-space, document
partition id, keywords, etc.)  

Example:

ROWID FAM QUAL VIS VALUE
===== === ==== === ====White KEYWORD OTHER public 123
14SU GEO MGRS public 456
9223 TIMESPAN EPOC public 7890
DOCPART1 DOCUMENT PARTITION public 1234567
One tablet server will know how many rows exist across the cluster for
any ROWID.  So I can quickly determine how many rows exist in all my
tablet servers with one simple scan.

Obviously you have counter them all on ingest and update the edge table.
-----Original Message-----
From: David Medinets [mailto:[EMAIL PROTECTED]]
Sent: Thursday, June 07, 2012 09:00
To: [EMAIL PROTECTED]
Subject: Re: how to use CountingIterator to count records?

Can you describe the Edge Table approach or provide a reference?

On Thu, Jun 7, 2012 at 8:55 AM,  <[EMAIL PROTECTED]> wrote:
> have moved to the Edge Table approach for a direct look up of
occurrences.