CS Lab HUT

Tik-76.275 Seminar on Knowledge Engineering
Spring 2001 (2 cp)

The Semantic Web - The evolution towards machine-understandable content formats

Semantic Web -- a web of data that can be processed directly or indirectly by machines
- Tim BL, Weaving the Web

Initial list of presentation topics

XML

The Extensible Markup Language (XML) has become the universal format for structured documents and data on the Web.

This talk should cover the central aspects of XML, URIs, XML Inclusing, XML Namespaces, the cencepts of XML Processors (and XML Infosets), relationship of XML to SGML, etc.

XML Pointer, XML Base and XML Linking

XML Linking Language (XLink) allows elements to be inserted into XML documents in order to create and describe links between resources. It uses XML syntax to create structures that can describe links similar to the simple unidirectional hyperlinks of today's HTML, as well as more sophisticated links.

XML Base allows authors to explicitly specify a document's base URI for the purpose of resolving relative URIs in links to external images, applets, form-processing programs, style sheets, and so on

XPointer, which is based on the XML Path Language (XPath), supports addressing into the internal structures of XML documents. It allows for traversals of a document tree and choice of its internal parts based on various properties, such as element types, attribute values, character content, and relative position.

XML Schema

"XML Schemas express shared vocabularies and allow machines to carry out rules made by people. They provide a means for defining the structure, content and semantics of XML documents."

XML Query

"The mission of the XML Query working group is to provide flexible query facilities to extract data from real and virtual documents on the Web, therefore finally providing the needed interaction between the web world and the database world. Ultimately, collections of XML files will be accessed like databases."

XML Protocol

This representation covers XML-based messaging protocol that is being developed at W3C. One of the major inputs to that effort is SOAP (Simple Object Access Protocol) defined by Microsoft. In short, SOAP provides a remote procedure call facility on top of HTTP and XML.

XML Style sheets

This talk should cover the different mechanisms to specify documents styles, including CSS, XSL and XSLT.

RDF Model and Syntax

RDF Model and Syntax specification defines a simple data model based on labeled directed graphs. The specification defines an XML syntax for these kind of graphs.

RDF Schema Language

RDF Schema specification explains how RDF can be used to describe RDF vocabularies.

ebXML - Electronic Business XML

"ebXML is a set of specifications that together enable a modular electronic business framework. The vision of ebXML is to enable a global electronic marketplace where enterprises of any size and in any geographical location can meet and conduct business with each other through the exchange of XML based messages. ebXML is a joint initiative of the United Nations (UN/CEFACT) and OASIS, developed with global participation for global usage."

UDDI - Universal Description, Discvorvery and Integration of Business for the Web

"Introduced in August 2001 by Microsoft, IBM and Ariba, UDDI is intended to be an Internet standard for creating an online business registry."
"A cornerstone of UDDI is WSDL (Web Services Description Language) - a language that allows businesses to describe their offerings in a standard way."

Ontology languages and Web standards

Talk should discuss about ontology languages and their relations to Web standards, such as XML Schema and RDF(S). Some links are Noy & Hafner: The State of the Art in Onbtology Design, Fensel: Relating Ontology Languages and Web Standards and Staab, Erdmann & Maedche: Ontologies in RDF(S).

OIL - Ontology Inference Layer

"The Ontology Inference Layer OIL is a proposal for a web-based representation and inference layer for ontologies, which combines the widely used modelling primitives from frame-based languages with the formal semantics and reasoning services provided by description logics. It is compatible with RDF Schema (RDFS), and includes a precise semantics for describing term meanings (and thus also for describing implied information)."

SHOE - Simple HTML Ontology Extension

Talk should provide an overview of SHOE and at least cover the following papers: Heflin & Hendler: Semantic Interoperability on the Web, Heflin & Hendler: Dynamic Ontologies on the Web and Heflin & Hendler: Searching the Web with SHOE.

Topic Maps

"Topic maps is an ISO Standard (ISO13250) that enables information resources to be classified and navigated in a consistent manner. It allows for the concepts or topics that underlie a set of information objects to be exposed to those people or applications processing the information." See an article about the topic map technology.

A topic map browser is also available.

Building web site with Topic Maps

Material for presentation:

Managing links with Topic Maps

Material for presentation:

CC/PP - Composite Capabilities/Preference Profiles

Specification of device profiles using RDF.

RSS - RDF Site Summary

Application of RDF to specity site information, especially channels.

DAML - Darpa Agent Markup Language

Language for agent description developed by DARPA.

Collaboration and annotation

Plans for annotation infrastructure developed at W3C.

Tools for RDF

Overview of tools developed for manipulation and storage of RDF, such as Redland, Jena and rdfdb. Also ideas for storing RDF in relational databases.

RDF and logics

Some possible starting points:

Semantic portals

How the technologies developed for the Semantic Web can be utilized in the implementation of a portal. Suggested starting point: Staab et al: Semantic Community Web Portals.

PRISM & NewsML


The following topics have been removed from the seminar due to their peripheral nature with respect to the semantic web.

XHTML and its modularization

HTML 4 reformulated as an XML application. The reformulation work has continued in the definition of the core of XHTML (XHTML Basic) that could be used in resource constrained Web clients such as mobile phones, PDAs, etc., and in the the specification of the abstract modules of XHTML that provide different types of functionality (XHTML Modularization) to make XHTML usable in devices that support different capabilities (digital TVs, mobile devices, etc.).

DOM

"Document Object Model (DOM) is a standard API (Application Programmer Interface) to the document structure and aims to make it easy for programmers to access components and delete, add or edit their content, attributes and style."
"The Document Object Model is a platform- and language-neutral interface that will allow programs and scripts to dynamically access and update the content, structure and style of documents. The document can be further processed and the results of that processing can be incorporated back into the presented page."

SVG - Scalable Vector Graphics

"SVG is a language for describing two-dimensional graphics in XML." SVG is an approach towards a structured representation of graphics on the Web. It is an XML instance, on one hand, and connected to DOM, on the other.

Seppo Törmä
Last modified: Mon Apr 2 15:46:13 EET DST 2001