-
Notifications
You must be signed in to change notification settings - Fork 177
Description
Current Behavior
we run sp2b perf test against our software nightly and noticed a performance discrepancy while trying out rdf4j 5.2.0. You can read about sp2b here: https://observablehq.com/@lomoramic/sp2b-benchmark-results
effectively, it's a set of benchmark sparql queries and includes a data generator. there about 15 different queries in the test.
many of the queries are returning equivalent number of results but there are a few that return different number of results btwn 5.2.0 and 5.1.6
In particular:
PREFIX rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
PREFIX rdfs: http://www.w3.org/2000/01/rdf-schema#
PREFIX foaf: http://xmlns.com/foaf/0.1/
PREFIX dc: http://purl.org/dc/elements/1.1/
PREFIX dcterms: http://purl.org/dc/terms/
SELECT ?yr ?name ?document
WHERE {
?class rdfs:subClassOf foaf:Document .
?document rdf:type ?class .
?document dcterms:issued ?yr .
?document dc:creator ?author .
?author foaf:name ?name
OPTIONAL {
?class2 rdfs:subClassOf foaf:Document .
?document2 rdf:type ?class2 .
?document2 dcterms:issued ?yr2 .
?document2 dc:creator ?author2
FILTER (?author=?author2 && ?yr2<?yr)
} FILTER (!bound(?author2))
}
Is returning 229 triples in 5.2.0 consuming 1037ms and 36 triples in 5.1.6 consuming 147ms against a corpus of ~10000 triples.
I will dig around to see if there are other parity issues btwn other queries.
include 10k triples for local testing.
Expected Behavior
Is this expected behavior for 5.2.0?
Steps To Reproduce
No response
Version
5.2.0
Are you interested in contributing a solution yourself?
None
Anything else?
No response