Slow DESCRIBE #1680

ktk · 2024-12-15T14:31:53Z

We use DESCRIBE queries on SHACL shapes, so a single query can potentially return a lot of results. For this particular query, the first run takes quite some time: https://qlever.cs.uni-freiburg.de/lindas/rer85n

joka921 · 2024-12-16T11:08:24Z

I just had a brief look at this.

This entity is connected to rather long RDF collections (<el1> <el2> <el3>), which expand to triples with blank nodes according to the RDF standards. QLever's implementation of the SHACL DESCRIBE currently follows all blank nodes recursively, and the maximal depth of these blank-node chains is the limiting factor here (a single long chain of blank nodes, as in your example, gives the worst ratio of computation time to output size).
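To make the expansion concrete, here is a minimal Python sketch of how a collection turns into a chain of blank nodes linked by rdf:first / rdf:rest. The function name `expand_collection` and the blank-node labels `_:b0`, `_:b1`, ... are made up for illustration; this is not QLever code.

```python
RDF_FIRST = "rdf:first"
RDF_REST = "rdf:rest"
RDF_NIL = "rdf:nil"


def expand_collection(items):
    """Expand an RDF collection into (head, triples).

    Each item gets its own blank node; the nodes form a linked list
    that is terminated by rdf:nil.
    """
    if not items:
        return RDF_NIL, []
    # One blank node per list element (labels are illustrative only).
    nodes = [f"_:b{i}" for i in range(len(items))]
    triples = []
    for i, item in enumerate(items):
        triples.append((nodes[i], RDF_FIRST, item))
        rest = nodes[i + 1] if i + 1 < len(items) else RDF_NIL
        triples.append((nodes[i], RDF_REST, rest))
    return nodes[0], triples


head, triples = expand_collection(["<el1>", "<el2>", "<el3>"])
for t in triples:
    print(t)
```

A collection with n elements therefore yields 2n triples, but reaching the last element requires following n blank nodes one after the other, which is why the depth of the chain rather than the number of triples dominates the cost.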
The first easy fix would be to not follow RDF collections or to limit the number of hops.
The more involved fix would be to make the DESCRIBE implementation more efficient by reducing the cost per hop, for example through some kind of caching. This would help especially for datasets that are rather small but contain rather long reification chains.
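As a rough illustration of the hop limit (again a toy Python sketch, not how QLever implements it), the following expands a resource level by level over an in-memory triple list. The function `describe`, the parameter `max_hops`, and the sample data are assumptions made up for this example. Each round of the loop stands in for one per-hop lookup, so a chain of k blank nodes needs k + 1 rounds unless the limit cuts it short.

```python
from collections import defaultdict


def describe(triples, start, max_hops=None):
    """Collect the triples of `start`, following blank-node objects.

    Returns (result_triples, rounds). `max_hops` caps the number of
    expansion rounds; None means follow blank nodes without a limit.
    """
    by_subject = defaultdict(list)
    for s, p, o in triples:
        by_subject[s].append((s, p, o))

    result = []
    seen = {start}
    frontier = [start]
    rounds = 0
    while frontier:
        next_frontier = []
        for node in frontier:
            for s, p, o in by_subject[node]:
                result.append((s, p, o))
                # Only blank nodes are followed further.
                if o.startswith("_:") and o not in seen:
                    seen.add(o)
                    next_frontier.append(o)
        rounds += 1
        if max_hops is not None and rounds >= max_hops:
            break
        frontier = next_frontier
    return result, rounds


# A shape whose sh:in value is the collection (<el1> <el2> <el3>).
data = [
    ("<shape>", "sh:in", "_:b0"),
    ("_:b0", "rdf:first", "<el1>"), ("_:b0", "rdf:rest", "_:b1"),
    ("_:b1", "rdf:first", "<el2>"), ("_:b1", "rdf:rest", "_:b2"),
    ("_:b2", "rdf:first", "<el3>"), ("_:b2", "rdf:rest", "rdf:nil"),
]

full, rounds = describe(data, "<shape>")
print(len(full), rounds)        # 7 triples, 4 rounds
limited, rounds = describe(data, "<shape>", max_hops=2)
print(len(limited), rounds)     # 3 triples, 2 rounds
```

The trade-off is that a hop limit truncates long collections in the output, whereas caching keeps the result complete and only lowers the cost of each round.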
