I am in the throws of converting some(else's) code from MongoDB to Accumulo. I am seeing a situation where one DBObject if being embedded into another DBObject. I see that Mutation supports a method called getRow() that returns a byte array. I gather I can use this to achieve a similar result if I were so inclined.
Am I so inclined? i.e. Is this the way we do things in Accumulo?
DBObject, roughly speaking, is Mongo's counterpart to Mutation.
The use case is, I am walking a complex object graph and persisting what I find there. Said object graph in my case is always EMF (eclipse modeling framework) compliant. An EMF graph can have in if references to--brace yourself--a non-cross document containment reference. When using Mongo, these were persisted as a DBObject embedded into a containing DBObject. I'm trying to decide whether I want to follow suit.
Any thoughts? On Thu, Apr 24, 2014 at 4:03 PM, Sean Busbey <[EMAIL PROTECTED]> wrote: There are ways and there are ways,
Interesting, multiple mutations that is. Are we talking multiples on the same row id?
Upon reflection, I realized the embedded thing is nothing special. I think I'll keep adding columns to a single mutation. This will make for a wide row, but I'm not seeing that as a problem. I am I being naive?
Another question if I may. As I walk my graph, I must keep track of the type of the value being persisted. I am using the qualifier for this, putting in it a URI that indicates the type. Is this a proper use for the qualifier?
Thanks for the discussion On Thu, Apr 24, 2014 at 11:23 PM, William Slacum < [EMAIL PROTECTED]> wrote:
I am storing the object's name in the colfam: e.g. "patientId", the object's data type goes in the colq: e.g "org.hl7.v3.II", then the value in the colval. I think the largest graph I'm likely to have is < 5k and you say I soul have memory problems. This is good topic. How then can I estimate? On Fri, Apr 25, 2014 at 10:17 AM, Josh Elser <[EMAIL PROTECTED]> wrote:
I think you told me something. I must watch the rowid colfam colq sequence and be sure they are unique within the row. Will do. I believe I do have distinct datatypes for now (they're medical) but the future may rear it's ugly head. On Fri, Apr 25, 2014 at 11:02 AM, Josh Elser <[EMAIL PROTECTED]> wrote:
There are ways and there are ways,
NEW: Monitor These Apps!
Apache Lucene, Apache Solr and all other Apache Software Foundation project and their respective logos are trademarks of the Apache Software Foundation.
Elasticsearch, Kibana, Logstash, and Beats are trademarks of Elasticsearch BV, registered in the U.S. and in other countries. This site and Sematext Group is in no way affiliated with Elasticsearch BV.
Service operated by Sematext