Multiple Graphs in RDFa (“RDFa Quads”)

Buzzword.org.uk Draft 6 December 2010

This version:: http://buzzword.org.uk/2009/rdfa4/spec-20101206
Latest version:: http://buzzword.org.uk/2009/rdfa4/spec
Previous version:: http://buzzword.org.uk/2009/rdfa4/spec-20090120
Authors:: Toby Inkster; Kjetil Kjernsmo
Editor:: Toby Inkster

Abstract

A set of RDF triples may be considered as a directed graph where the resources and literals form its nodes, and the predicates its edges. This graph in turn can be thought of as a resource in its own right, and described in another graph. This document explores an extension for expressing multiple graphs in RDFa.

Status of this Document

This document is published by buzzword.org.uk, a web site that hosts various specifications, articles and tools of use to web publishers. This is not a W3C recommendation. It is not even a buzzword.org.uk recommendation yet.

The authors welcome feedback on this draft. You can usually find one or both of us on #swig on freenode (our IRC nicks are tobyink and KjetilK).

Licence

This document is available under a licence which allows the creation of derivative works under certain conditions. For the purpose of licensing, implementations of the ideas considered in this specification shall not be considered derivative works.

Introduction
Design Decisions
Markup for Graphs
Processing Graphs

References
Implementations
Future Development Ideas

1. Introduction

RDFa is a family of attributes intended for the embedding of RDF data in XML markup [RDFA]. Its best known application is XHTML+RDFa — a format for embedding RDF data in XHTML documents — but RDFa can be incorporated into any XML-based or DOM-like file format. Indeed, the recent SVG Tiny 1.2 recommendation includes RDFa [SVGTINY12].

One limitation of RDFa is that it is only capable of embedding a single RDF graph per document. As this limitation is shared by RDF/XML [RDFXML], N-Triples [NTRIPLES] and Turtle [TURTLE], most people would not class it as a weakness of RDFa. However, a number of formats catering for multiple graphs do exist: Notation 3 [N3], TriX [TRIX], TriG [TRIG] and N-Quads [NQUADS] all include support for multiple graphs, allowing each graph to be identified with a URI, and thus be referred to by other graphs.

In the SPARQL Query Language for RDF [SPARQL], graph names are used to construct the default graph to query and also to restrict the query to certain named graphs.

Another use case for multiple graphs is describing assertions. A particular collection of RDF triples is bundled up as a graph and called an an assertion. We can then use another graph to describe that assertion — who asserted it? When? Has it been verified by an independent resource?

Graphs can also be used to model information that has changed over time. For example one graph might say that Ethelred the Unready is ruler of England; another might say that Elizabeth II is ruler of England. We could then use a third graph to note that the first graph was true in AD 1009, whereas the second is true in AD 2009.

This document investigates one possible method for marking up multiple graphs in RDFa. It does require some small changes to the RDFa parser to implement, but is backwards-compatible with parsers that do not support multiple graphs.

2. Design Decisions

3. Markup for Graphs

The attribute to use for marking up different subgraphs is graph in the same namespace as other RDFa attributes. By default RDFa attributes are not in any namespace, so neither is graph.

Although the value space of this attribute is the set of URIs and blank nodes, graph has a lexical space identical to about. Therefore, if the base URI of the document is http://example.com/document then the attribute graph="foo" represents the URI http://example.com/foo. This allows any absolute or relative URI to be used as a named graph. Safe CURIEs and blank nodes are allowed.

When the graph attribute has been set on an element, all triples found on that element and its descendants are taken to be part of the subgraph specified. The following is an example XHTML document using multiple graphs:

3.1. Triples on Multiple Elements

In RDFa, many triples are generated from attributes split across multiple elements. A slightly contrived example:

When subgraphs are specified, it may seem unclear as to which graph the triples should be added.

The rule is that a triple is added to the graph of the element which set the predicate of the triple. So, in the previous example, the following Notation 3 is generated.

4. Processing Graphs

The standard RDFa processing sequence [RDFA] requires only minor modifications to allow for multiple graphs. The modifications required are as follows:

4.1. Other Attributes: “Private Agreement”

An alternative attribute, such as id may be used to markup graph information rather than graph, but only through private agreement between producers and consumers. An alternative attribute may have a different lexical space.

It is not meant by "private agreement" that consumers and producers would need to personally discuss and agree on an attribute to be used. Instead, a consumer that needs a named graph facility would publish a link to this draft in their documentation together with the specific details of how their parser consumes named graphs. People then targetting that particular consumer would follow the directions in the consumer's documentation.