Swignition Output Formats
- RDF/XML
- The most detailed output format, including every bit of information
which Swignition has been able to glean from the page. To make sense of
it you probably need a fairly good knowledge of RDF. RDF/XML is fairly
tricky to parse with a plain XML parser — if you need to
access the information as RDF triples, RDF/JSON or TriX are probably
better bets.
- http://www.w3.org/TR/rdf-syntax-grammar/
- TriX
- An easy to parse XML syntax for RDF.
- http://www.hpl.hp.com/techreports/2004/HPL-2004-56.html
- RDF/JSON
- Pretty much as detailed as the RDF/XML output, but far easier to
parse — a JSON library should
give very easy access to the underlying triples.
- http://n2.talis.com/wiki/RDF_JSON_Specification
- Turtle and Notation3
- Turtle is the most readable RDF-based output. Turtle is a subset
of a more complex format called Notation3, so by definition
Swignition's Turtle output is also a Notation3 output. There is also
a dedicated Notation3 output which compacts the output slightly, but
uses a few non-Turtle syntax quirks.
- http://www.dajobe.org/2004/01/turtle/
- http://www.w3.org/DesignIssues/Notation3
- vCard
- vCard is an interoperable format for exchanging contact
information. Only information about contacts (people, groups,
organisations, etc) is output as vCard — e.g. information
gleaned from hCard,
XFN,
FOAF or
the W3C PIM
vocabulary. Swignition mostly uses vCard 3.0 but includes some
properties introduced in the draft vCard 4.0 standard. Swignition's vCards should be
importable in most address book applications such as Apple Address
Book and Microsoft Outlook.
- http://www.ietf.org/rfc/rfc2426.txt
- jCard
- jCard exposes the same information as vCard, but in a format that's
easier to parse. If you want to use Swignition to glean contact
information from a web page and then pass that along to another script
for further processing, jCard is a good choice of output format.
- http://microformats.org/wiki/jcard
- iCalendar
- iCalendar is an interoperable format for exchanging event and
todo information. Only the information about events, todo lists,
free/busy times, alarms and journals are output as
iCalendar — e.g. information gleaned from hCalendar or hAtom. Swignition's iCalendar files should be importable into
most calendar applications such as Apple iCal and Microsoft
Outlook.
- http://www.ietf.org/rfc/rfc2445.txt
- Atom
- Atom is a format for syndicating (mostly textual) content,
such as blog articles or news stories. RSS– and Atom-like
content found on pages (including hAtom) is output
as Atom and can be read in feed readers such as NewsGator/NetNewsWire or
Google
Reader.
- http://www.ietf.org/rfc/rfc4287.txt
- KML
- KML is a file format for recording annotated geographic
co-ordinates. Any appropriately marked up co-ordinates found on the
page, such as co-ordinates for an address or for the location where
an event is occuring can be output as KML, which can be imported
into some mapping software, such as Google Earth.
- http://code.google.com/apis/kml/documentation/
- M3U
- This is a playlist format supported by most common media
players, such as Wimamp,
iTunes and XMMS. Any audio files marked up
in hAudio or the
RDF Audio Vocabulary
can be exported as M3U playlists.
- http://en.wikipedia.org/wiki/M3U
- RecipeBook XML
- XML-based format for recipes for food and drink. Data provided
using the RDF recipe schema or hRecipe can be output in this format.
- http://www.happy-monkey.net/recipebook/