Skip to content

Query result inconsistency btwn 5.2.0 and 5.1.6 on sp2b #5569

@nguyenm100

Description

@nguyenm100

Current Behavior

sp2b.txt

we run sp2b perf test against our software nightly and noticed a performance discrepancy while trying out rdf4j 5.2.0. You can read about sp2b here: https://observablehq.com/@lomoramic/sp2b-benchmark-results

effectively, it's a set of benchmark sparql queries and includes a data generator. there about 15 different queries in the test.

many of the queries are returning equivalent number of results but there are a few that return different number of results btwn 5.2.0 and 5.1.6

In particular:

PREFIX rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
PREFIX rdfs: http://www.w3.org/2000/01/rdf-schema#
PREFIX foaf: http://xmlns.com/foaf/0.1/
PREFIX dc: http://purl.org/dc/elements/1.1/
PREFIX dcterms: http://purl.org/dc/terms/

SELECT ?yr ?name ?document
WHERE {
?class rdfs:subClassOf foaf:Document .
?document rdf:type ?class .
?document dcterms:issued ?yr .
?document dc:creator ?author .
?author foaf:name ?name
OPTIONAL {
?class2 rdfs:subClassOf foaf:Document .
?document2 rdf:type ?class2 .
?document2 dcterms:issued ?yr2 .
?document2 dc:creator ?author2
FILTER (?author=?author2 && ?yr2<?yr)
} FILTER (!bound(?author2))
}

Is returning 229 triples in 5.2.0 consuming 1037ms and 36 triples in 5.1.6 consuming 147ms against a corpus of ~10000 triples.

I will dig around to see if there are other parity issues btwn other queries.

include 10k triples for local testing.

Expected Behavior

Is this expected behavior for 5.2.0?

Steps To Reproduce

No response

Version

5.2.0

Are you interested in contributing a solution yourself?

None

Anything else?

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    wontfixissue won't be fixed (close reason)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions