You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: section/introduction.tex
+10-8Lines changed: 10 additions & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -12,31 +12,33 @@ \section{Introduction}
12
12
The goal of those methods is to lower server-side workloads while keeping fast query execution to the client~\cite{Azzam2021}.
13
13
The TREE hypermedia specification is an effort in that direction~\cite{ColpaertMaterializedTREE, lancker2021LDS}, that introduces the concept of domain-oriented fragmentation of large RDF datasets.
14
14
For example, in the case of periodic measurements of sensor data, a fragmentation can be made on the publication date of each data entity.
15
+
A fragment can be considered an RDF document published in a server.
15
16
TREE aims to describes dataset fragmentation in ways that enable clients to easily fetch query-relevant subsets.
16
17
The data inside a fragment are bounded with constraints expressed using hypermedia descriptions~\cite{thomasFieldingPhdThesis}.
17
-
More precisely, each fragment describes the constraints of the data of any reachable fragment.
18
-
In this paper, we refer to those constraints as domain-specific logical expressions.
19
-
They can be expressions such as $?t > \text{2022-01-09T00:00:00.000000} \implies\text{ex:afterFirstSeptember}$ given that $?t$ is the date of publication of sensor data and the implication pertains to the location of the data.
18
+
More precisely, each fragment describes the constraints of the data of every reachable fragment.
19
+
In this paper, we refer to those constraints as domain-specific expressions.
20
+
They can be expressions such as $?t > \text{2022-01-09T00:00:00.000000} \implies\text{ex:afterFirstSeptember}$
21
+
given that $?t$ is the date of publication of sensor data and the implication pertains to the location of the data respecting the constraint.
20
22
In English, the expression means ``the data produced by the sensors after the first of September are stored at \texttt{ex:afterFirstSeptember}.''
21
23
Because of the hyperlinked nature of the documents network, clients must traverse them to find the relevant data to answer their queries.
22
24
We propose to use Link Traversal Query Processing (LTQP)~\cite{Hartig2016} as a query mechanism to perform those queries.
23
25
24
26
LTQP starts by dereferencing a set of user-provided URLs~\cite{Hartig2016}.
25
27
From these dereferenced documents, links to other documents are dereferenced recursively and inserted in an internal data store.
26
28
LDQL~\cite{hartig2016Ldql} is a theoretical query language to define the traversal of LTQP queries.
27
-
However, LDQL is centered around nested regular expressions, thus, is not made to express the traversal of links based on domain-specific logical expressions
29
+
However, LDQL is centered around nested regular expressions, thus, is not made to express the traversal of links based on domain-specific expressions
28
30
such as time relations.
29
31
The subweb specifications language (SWSL)~\cite{bogaerts_rulemlrr_2021}, allows data providers to define traversal paths concerning the information they publish.
30
32
Thus, given that the query engine trusts the data publisher it can adapt its traversal to follow the paths given by the specification.
31
-
Akin to LDQL, it is difficult with the SWSL to express traversal using domain-specific logical expressions, because its syntax is centered around the matching of triple patterns and not reasoning rules or evaluation of literals.
32
-
Furthermore, SWSL does not propose a mechanism for using the query or input from the user in the source selection process, unlike LDQL.
33
+
Akin to LDQL, it is difficult with the SWSL to express traversal using domain-specific expressions, because its syntax is centered around the matching of triple patterns and not reasoning rules or evaluation of literals.
34
+
Furthermore, SWSL does not propose a mechanism for using the query or input from the user to impact the source selection process, unlike LDQL.
33
35
Given those limitations, we propose to return to the more abstract concept of reachability criteria~\cite{hartig2012},
34
36
to define a mechanism of traversal centered around rules.
35
37
36
38
In this paper, we propose to use a boolean solver as the main link pruning mechanism for a reachability criterion to traverse TREE documents.
37
39
The logical operators are defined by the \href{https://treecg.github.io/specification/}{TREE specification}.~\sepfootnote{sf:treeSpec}
38
40
As a concrete use case, we consider the publication of (historical) sensor data.
39
-
An example query is presented in Figure~\ref{lst:system} along with the triples representing the link between two documents describing sensor data following the TREE specification.
41
+
An example query is presented in Figure~\ref{lst:system} along with the triples representing the link between two documents expressed using the TREE specification.
0 commit comments