Is it possible to get the position of an element in an RDF Collection in SPARQL?


Suppose that I have the following Turtle declaration:

    @prefix : <http://example.org#> .

    :ls :list (:a :b :c) .

Is there a way to get the positions of the elements in the collection?

For example, with this query:

    PREFIX :     <http://example.org#>
    PREFIX rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

    SELECT ?elem
    WHERE {
      ?x :list ?ls .
      ?ls rdf:rest*/rdf:first ?elem .
    }

I get:

    --------
    | elem |
    ========
    | :a   |
    | :b   |
    | :c   |
    --------

But I would like a query to obtain:

    --------------
    | elem | pos |
    ==============
    | :a   |  0  |
    | :b   |  1  |
    | :c   |  2  |
    --------------

Is it possible?


asked Jul 08 '13 09:07

Labra

2 Answers

I have found a way to do it using the property function library in ARQ. As Steve Harris says, this is non-standard.

    PREFIX :     <http://example.org#>
    PREFIX rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX list: <http://jena.hpl.hp.com/ARQ/list#>

    SELECT ?elem ?pos
    WHERE {
      ?x :list ?ls .
      ?ls list:index (?pos ?elem) .
    }

answered Oct 26 '22 09:10

Labra

A Pure SPARQL 1.1 Solution

I've extended the data to make the problem a little harder. Let's add a duplicate element to the list, e.g., an additional :a at the end:

    @prefix : <http://example.org#> .

    :ls :list (:a :b :c :a) .

Then we can use a query like this to extract each list node (and its element) along with the position of the node in the list. The idea is that a pattern like [] :list/rdf:rest* ?node matches every individual node in the list. The zero-based position of a node is the number of nodes that precede it, counting from the head of the list. We can match each of those preceding nodes (plus ?node itself, since rdf:rest* also matches a path of length zero) by breaking the pattern down into

    [] :list/rdf:rest* ?mid .
    ?mid rdf:rest* ?node .

Then if we group by ?node, the number of ?mid bindings in each group is one more than the position of ?node in the list, because ?mid also binds to ?node itself; subtracting one gives the zero-based position. Thus we can use the following query, which also grabs the element (the rdf:first) associated with each node, to get the positions of elements in the list:

    prefix :    <http://example.org#>
    prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

    select ?element (count(?mid)-1 as ?position) where {
      [] :list/rdf:rest* ?mid .
      ?mid rdf:rest* ?node .
      ?node rdf:first ?element .
    }
    group by ?node ?element

    ----------------------
    | element | position |
    ======================
    | :a      | 0        |
    | :b      | 1        |
    | :c      | 2        |
    | :a      | 3        |
    ----------------------

This works because an RDF list is a linked list with the structure shown below. Here ?head is the beginning of the list (the object of :list); it is also one of the bindings of ?mid, because the pattern [] :list/rdf:rest* ?mid matches a path with zero rdf:rest steps:

[Figure: graphical representation of RDF list, https://i.stack.imgur.com/MQJUT.png]
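To make the counting argument concrete, here is a small Python sketch that models the list (:a :b :c :a) as a linked list and mimics the join-and-count that the query performs. It is only an illustration of the idea, not how a SPARQL engine evaluates the query; the node names n0..n3 and the helper functions are made up for this example.

```python
# Toy model of the RDF list (:a :b :c :a). Each list node has an rdf:first
# value and an rdf:rest successor. Node names n0..n3 are hypothetical stand-ins
# for the blank nodes of the real list.
NIL = "rdf:nil"
first = {"n0": ":a", "n1": ":b", "n2": ":c", "n3": ":a"}
rest = {"n0": "n1", "n1": "n2", "n2": "n3", "n3": NIL}
head = "n0"  # the object of :list


def rest_star(start):
    """Yield every node reachable from start via zero or more rdf:rest steps.

    rdf:nil is skipped: it has no rdf:first, so the real query drops it anyway.
    """
    node = start
    while node != NIL:
        yield node
        node = rest[node]


def positions(head):
    """Pair every ?mid with every ?node it can reach, then count per ?node."""
    counts = {}
    for mid in rest_star(head):        # [] :list/rdf:rest* ?mid
        for node in rest_star(mid):    # ?mid rdf:rest* ?node
            counts[node] = counts.get(node, 0) + 1  # group by ?node, count(?mid)
    # ?node rdf:first ?element, with count(?mid) - 1 as ?position
    return sorted((counts[n] - 1, first[n]) for n in counts)


result = positions(head)
print(result)
```

Grouping by the node rather than by the element is what keeps the two occurrences of :a apart: they are different nodes with different counts.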

Comparison with Jena ARQ Extensions

The asker of the question also posted an answer that uses Jena's ARQ extensions for working with RDF lists. The solution posted in that answer is

    PREFIX :     <http://example.org#>
    PREFIX rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX list: <http://jena.hpl.hp.com/ARQ/list#>

    SELECT ?elem ?pos
    WHERE {
      ?x :list ?ls .
      ?ls list:index (?pos ?elem) .
    }

This answer depends on using Jena's ARQ and enabling the extensions, but it is more concise and transparent. What isn't obvious is whether either approach performs clearly better. As it turns out, for small lists the difference isn't particularly significant, but for larger lists the ARQ extensions perform much better. The runtime for the pure SPARQL query quickly becomes prohibitively long, while the runtime of the version using the ARQ extensions barely changes.

    -------------------------------------------
    | num elements | pure SPARQL | list:index |
    ===========================================
    |      50      |    1.1s     |    0.8s    |
    |     100      |    1.5s     |    0.8s    |
    |     150      |    2.5s     |    0.8s    |
    |     200      |    4.8s     |    0.8s    |
    |     250      |    9.7s     |    0.8s    |
    -------------------------------------------

These specific values will obviously differ depending on your setup, but the general trend should be observable anywhere. Since things could change in the future, here's the particular version of ARQ I'm using:

    $ arq --version
    Jena:       VERSION: 2.10.0
    Jena:       BUILD_DATE: 2013-02-20T12:04:26+0000
    ARQ:        VERSION: 2.10.0
    ARQ:        BUILD_DATE: 2013-02-20T12:04:26+0000

As such, if I knew that I had to process lists of non-trivial sizes and that I had ARQ available, I'd use the extension.
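One plausible contributor to that trend is the number of intermediate solutions the pure SPARQL query has to group. The sketch below only counts the (?mid, ?node) pairs that the two rdf:rest* patterns produce for a list of n elements; it is not a timing model, it ignores rdf:nil, and it says nothing about how ARQ actually evaluates property paths or list:index. The function name is made up for this example.

```python
def join_solutions(n):
    """Count the (?mid, ?node) pairs for an n-element list.

    The node at zero-based position k is reachable from k + 1 choices of ?mid
    (its k predecessors plus itself), so the total is 1 + 2 + ... + n.
    """
    return sum(k + 1 for k in range(n))


# List sizes taken from the timing table above.
for n in (50, 100, 150, 200, 250):
    print(n, join_solutions(n))
```

The count grows as n(n+1)/2, so a 250-element list yields 31375 pairs before grouping, whereas a single walk along the list visits only 250 nodes.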

answered Oct 26 '22 08:10

Joshua Taylor
