The Common Big Data Architecture is a broad description that encompasses many big data systems and consists of 7 components: filesystem, ingest processes, databases, analytic processes, web services, resource scheduler, and elastic computing. A reference will most likely appear in IEEE HPEC 2014.
Accumulo is the database of choice in many CBDA systems.
The D4M schema is used in many Accumulo systems.
On Mon, Apr 28, 2014 at 05:23:00PM -0400, David Medinets wrote:
I'm not getting what exactly the "Common Big Data Architecture" is?
Is it just a term that describes any system that has the 7 components Jeremy mentioned (fs, ingest, DB, analytics, web services, resource scheduler, elastic compute)? If so, what's the significance of naming this collection?
And how exactly is D4M related to this? (I understand it (D4M) hits a subset of those features, but don't think it encompasses all of those)
Apologies if these are obtuse questions, I just feel like I"m not comprehending what information is trying to be conveyed? On Mon, Apr 28, 2014 at 8:51 PM, Jeremy Kepner <[EMAIL PROTECTED]> wrote:
(1) A set of software for doing analytics. (2) A schema for ingesting and indexing diverse data into a NoSQL database like Accumulo
It hits two parts of the Common Big Data Architecture. The CBDA is merely a restating of the obvious components a system needs to effective at processing Big Data. It can be implemented with a variety of technologies.
Regards. -Jeremy On Apr 28, 2014, at 9:08 PM, Chris Bennight <[EMAIL PROTECTED]> wrote:
David started this thread yesterday. Since then I have read everything, I think, and like what I see. I still have a question. To populate an Accumulo database, using the D4M schema, it would appear one would do so using Mutation objects et. al. just as if one were not using D4M schema. Am I correct? All the examples appear to focus on the analytic side of things.
Thanks On Mon, Apr 28, 2014 at 9:12 PM, Kepner, Jeremy - 0553 - MITLL < [EMAIL PROTECTED]> wrote:
You could do mutations or bulk loading. As long as you can phrase your data in terms of keys and values, you can store it in Accumulo. On Tue, Apr 29, 2014 at 1:48 PM, Geoffry Roberts <[EMAIL PROTECTED]>wrote:
All projects made searchable here are trademarks of the Apache Software Foundation.
Service operated by Sematext