Storing tags in a graph database

Tags:

I've found some advice for setting up tagging systems in relational and document databases, but nothing for graph/multi-model databases.

I am trying to set up a tagging system for documents (let's call them "articles") in ArangoDB. I can think of two obvious ways to store tags in a multi-model (graph+document) database like Arango:

as an array within each article document (document database-style)
as a separate document class with each tag as a unique document and edges connecting tag documents to the article documents (something closer to relational database-style)

Are these in fact the two main ways to do this? Neither seems ideal. For example:

If I'm storing tags within each article document, I can index the tags and presumably ArangoDB is optimizing the space they use. However, I can't use graph features to link or traverse tags (or I have to do it separately).
If I'm storing tags as separate tag documents, it seems like extra overhead (an extra query) when I just want to get a list of tags on a document.

Which leads me to an explicit question: with regard to the latter option, is there any simple way to automatically make connected 'tag' documents show up within the article documents? E.g. have an array property that somehow 'mirrored' the tag.name properties of the connected tag documents?

General advice is also welcome.

573

asked Mar 02 '16 19:03

ropeladder

1 Answers

You already mention most of the available decision criterias. Maybe I can add some more:

Relational tags inside the documents could use array indices to filter on them, which could make queries on them fast. However, if you like to add a rating or an explanation to each item of that tag array, there is no way to. If you want to count the documents tagged, this may also be more expensive than counting all edges that originate from a specific tag, or maybe find all tags matching a search criteria.

One of the powers of multi model is, that you don't need to decide between the both aproaches. You can have an edge collection connecting tags with attributes to your documents, and have an indexed array with the same (flat) tags inside of the document. If you find all (or most) of your queries just use one method, try to convert the rest and remove the other solution. If that doesn't work, your application simply needs both of them.

In both cases finding other tagged documents alongside could be done in a subequery:

LET docs=(FOR ftDoc IN FULLTEXT(articles, 'text', 'search')
    COLLECT tags = ftDoc.tags INTO tags RETURN {tags, ftDoc})
LET tags = FLATTEN(FOR t IN docs[*].tags RETURN t)
LET otherArticles = (FOR oneTag IN tags 
    FOR oneD IN articles FILTER oneTag IN oneD.tag RETURN oneD._key)
RETURN {articles: docs, tags: tags, otherArticles: otherArticles}

149

answered Sep 21 '22 04:09

dothebart

Related questions
                            
                                How to navigate to previous page using Cassandra manual paging
                            
                                Dynamically load templates (partials) in Angular.js
                            
                                Displaying a JSON dataset as a table with Node.js and Express
                            
                                Typescript ES7 descriptors with ES3 output?
                            
                                Perf is not defined - running react app with webpack-dev-server
                            
                                onchange event of table td?
                            
                                Vue.js v-for not working in the application
                            
                                Rails: Updating quantities in shopping cart
                            
                                Flowtype: dynamically extending classes
                            
                                How can I make the regular expression not result in “catastrophic backtracking”?
                            
                                Close Modal using CSS only
                            
                                what does function () {} mean when assigned to a variable
                            
                                getSubscription returns a null subscription
                            
                                Is it possible to override native fetch api to use a desired Promise library instead of native browser Promise?
                            
                                Javascript: split by this|that
                            
                                Detect if input placeholder is visible
                            
                                Angular use root scope vs services to share data
                            
                                Add Dependency in Webpack Plugin
                            
                                add a string to a textarea using jQuery
                            
                                How to extend ElementFinder object in ProtractorJS?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Storing tags in a graph database

Tags:

javascript

graph-databases

arangodb

multi-model-database

ropeladder

People also ask

1 Answers

dothebart

Recent Activity

Donate For Us