Synthetic Biology:Semantic web ontology/Semantic Web

Home About Conferences Labs Courses Resources FAQ

Main components

RDF/XML: statements of facts or assertions about resources in the form of triples, e.g.: doc.html has author Jeremy and is about Genetics
RDF Schema:
- definition of the vocabulary or ontology used in the triples, e.g. author, first name, article, etc.
- description of hierarchies of concepts and relations, e.g.:
  - article is a kind of document
  - firstname is a kind of designation
  - firstname applies to persons
OWL:
- provides formal characterisation of types and relations, e.g.:
  - hasSpouse is a symmetric relation
  - hasChild is the inverse of hasParent
  - a Mother is a woman with at least one child
- allows data merging
SWRL: semantic web rule language

Definition

"an extension of the current web with metadata for machines" (Fabien Gandon)

"an initiative to enable cross-platform data exchange and reuse through well-defined ontologies and a common XML-based framework."

"The goal of the Semantic Web initiative is to create a universal medium for the exchange of data where data can be shared and processed by automated tools as well as by people." [11]

allows to model real things, not just documents or database tables (knowledge representation)
consists of statements about resources in the form of triples:

SUBJECT -> PROPERTY -> VALUE
-OR-
SUBJECT -> PREDICATE -> OBJECT

identifies every resource with a globally unique URI: don't say "color", say <http://example.com/2005/std6#col>
allows “serendipitous reuse”: integration with data sources in other fields (“web join”)

A semantic Web will not require proof generation to be useful: proof validation will be enough.
The first uses, such as access control on web sites, involve validation of a previously prepared proof, not a requirement to answer an arbitrary question, find the path the construct a valid proof. It is well known that to search for and generate a proof for an arbitrary question is typically an intractable process for many real world problems, and RDF does not require this (unsolvable) problem to be solved to be useful.
from http://www.w3.org/DesignIssues/RDFnot.html

Levels

(from Semantic Web roadmap)

Basic model contains just the concept of an assertion, and the concept of quotation - making assertions about assertions.
Schema layer makes simple assertions about permitted combinations
Conversion language contains rules for converting a document in one RDF schema into another one (which presumably one has an innate understanding of how to process).
Logical layer - ways of writing logic into documents to allow such things as rules the deduction of one type of document from a document of another type; the checking of a document against a set of rules of self-consistency; and the resolution of a query by conversion from terms unknown into terms known. A simple example of the application of this layer is when two databases, constructed independently and then put on the web, are linked by semantic links which allow queries on one to converted into queries on another.

Annotation problem (evolvability)

In the case of a program which finds a version 2 document and wants to find the rules to convert it into a version 1 document, then the version 2 schema would naturally contain or point to the rules.
In the case of retrospective documentation of the relationship between two independently invented schemas, then of course pointers to the rules could be added to either schema, but if that is not (socially) practical, then we have another example of the the annotation problem. This can be solved by third party indexes which can be searched for connections between two schemata. In practice of course search engines provide this function very effectively - you would just have to ask a search engine for all references to one schema and check the results for rules which like the two.

Engines

If an engine of the future combines a reasoning engine with a search engine, it may be able to get the best of both worlds, and actually be able to construct proofs in a certain number of cases of very real impact. It will be able to reach out to indexes which contain very complete lists of all occurrences of a given term, and then use logic to weed out all but those which can be of use in solving the given problem.
Many real life problems can be solved using just a few (say two) steps of inference out on the wild web.
Engines and algorithms which will efficiently tackle specific types of problem.

Semantic Web and Entity-Relationship models (from http://www.w3.org/DesignIssues/RDFnot.html)

RDF is more general than ER and can be a basis for it.
The mapping is very direct:
- a record is an RDF node;
- the field (column) name is RDF propertyType; and
- the record field (table cell) is a value.

Trust

(access control, user authentication, etc)

W3C ACL System
Authentication schemes used by Intellidimension RDF Gateway
URIQA from Nokia has a web form based, ontology driven metadata editor with validation and user authentication & access control
Authentication for Semantic Web
- use metadata for access control
Trust and Security Resource Guide

Critique

Lowercase semantic web

microformats.org, microformats @ Wikipedia - humans first, machines second
RDFa vs microformats
Structured Blogging plugins to create and syndicate structured information, such as reviews and events
Ontology vs Taxonomy vs Folksonomy vs Collabulary
- Taxonomy - concepts and relationships but no attributes
- Controlled vocabulary - only concepts
Ontology vs taxonomy vs thesaurus vs list
"Folktologies" - beyond folksonomy vs ontology.

Resources

Semantic Web Science Association (SWSA)
Web Science Research Initiative (WSRI)
Semantic Web User Interaction (SWUI): mailing list and workshop series
UMBC Semantic Web Reference Card - v2
Get Semantic - Semantic Web for web developers
ESW Wiki
- Semantic Web Architecture
A No-Nonsense Guide to Semantic Web Specs for XML People Part I and Part II - good intro for those familiar with XML
Semantic Web (for Web Developers)
Semantic Web In Breadth
The Semantic Web: An Introduction - overview of many topics based on "Semantic Web In Breadth"
The Semantic Web Intro article in Scientific American
W3C Semantic Web Activity (links to RDF, OWL, etc)
- W3C Semantic Web FAQ
Semantic Web primer from 2000 at xml.com
Web Services - Semantic Web by Tim-Berners Lee
Introduction to the Semantic Web and RDF by A.M. Kuchling
Wikipedia article on Semantic Web
Semantic Web tutorials from W3C
Semantic Web tutorial using N3
Primer: Getting into RDF & Semantic Web using N3
Semantic Web: interview with Tim Berners-Lee
W3C press release
Design Issues @ W3C - web architecture and metadata
Notions & Notations of the Semantic Web - MIT 6.898 Fall Seminar Course
Semantic Web news aggregator
Semantic Web Trust and Security Resource Guide
Semantic Web links in Google Directory
Semantic Web Road map An attempt to give a high-level plan of the architecture of the Semantic WWW
Semantic Web Trust and Security Resource Guide
DAML+OIL (March 2001) Reference Description
KSL Software and Network Services
What is Semantic Web? - very light intro with examples from Altova
semanticweb.org - Semantic Web Community Portal
W3C Semantic Web Best Practices and Deployment Working Group
Journal of Web Semantics
Decentralized Information Group @ CSAIL
"semantic web" tag on Connotea
Links from DynamO project
Collection of some Semantic Web related resourses translated to Russian.
The future of the Web is Semantic - from IBM

Books

Owl: Representing Information Using the Web Ontology Language
- Author: Lee W. Lacy
- ISBN: 1412034485
- Published: January 1, 2005

Periodicals

International Journal on Semantic Web and Information Systems
Biological Knowledge is an open access, peer-reviewed, online journal. The fields covered by the journal include, but are not limited to: ontologies, knowledge representation, and knowledge bases; reasoning, discovery, and machine learning; natural language processing and linguistics; history, philosophy, sociology, and anthropology; cognitive science, including cognitive and social psychology; education; and system design and software for knowledge creation, extraction, and manipulation.

Articles

Web 3.0
Search Engines for Semantic Web Knowledge from umbc.edu (PDF)
A Life Science Semantic Web: Are We There Yet? from Sanofi-Aventis Pharmaceuticals
Semantically speaking from PC Pro magazine (mSpace as a semantic web browser)
Towards a World Wide Database (WWDB): "The Semantic Web solves the problem of data interoperability by providing open standards for defining and integrating data schemas using formal ontologies."
The 7 (f)laws of the Semantic Web
Ontology recapitulates philology by Sydney Brenner
A Web That Thinks Like You

Blogs

TimBL's blog
Connoly's blog
semantic weblog by benjamin nowack
TechnicaLee Speaking: semantic web archives
Dave Beckett - semantic web blog
Burningbird (formerly practicalrdf.info)
Raw - Danny Ayers
James A. Hendler - University of Maryland
Semantic Wave
Planet RDF
ARQtick: related to SPARQL, RDF, ARQ, Jena, Joseki

People

Eric Miller Semantic Web Activity Lead @ W3C
Eric Neumann - semantic web for life sciences
John Wilbanks - Creative Commons
James A. Hendler University of Maryland
Ken Baclawski - Northeastern University, presentations
Chris Bizer - D2R Map, etc
Dennis Quan
Ora Lassila
Peter F. Patel-Schneider's talks (OWL, knowledge representation, SWRL, ontology, etc)
Raik Grünberg
Jeremy Zucker
Danny Ayers
Carole Goble - Semantic Web and e-Science/Grids
Jason Morrison - biology and semantic web

Semantic Web for Life Sciences

"Browsers that can automatically identify entities such as protein and gene names, molecular processes, diseases, types of tissue, etc. and the relationships between them, in any Web document, collect these entities and then apply further analyses to them using applicable Web and Grid services." (Fabien Gandon)

ESW Wiki
- Semantic Web For Life Sciences
- Semantic Bioinformatics
BioIT World
- Berners-Lee - Semantic Web Life Sciences - BioIT World Expo 2005 (May 2005)
- Masters of the Semantic Web - Oct 2005
- RDF — The Web’s Missing Link by Eric Neumann (Mar 2006)
  "Many suggest that ontologies will be the key to integrate data properly. Although I see clear value in defining and using ontologies, there is nothing intrinsic to them that will make data integration any easier for scientists. Ontologies will require a lot of consensus building between community members (see BioPAX.org), and a common enterprise architecture that validates and connects data using ontologies is still lacking (though OWL makes this tractable). I for one see connecting existing data first through RDF’s triple model as a way to greatly facilitate the building of ontologies in a practical bottom-up approach."
  - XML, RDF, Ontologies and Bioinformatic Data Mining - comment on article
    "I would argue without both the bottom up and the top down approach to semantic mapping of biomedical data (raw, reduced, analyzed and the research literature), we will not be able to capture the labyrinthine interactions complemented by subtle, real-world detail critical to understanding complex phenotype and disease."
W3C Workshop on Semantic Web for Life Sciences
Which gene did you mean? BMC Bioinformatics
W3C Semantic Web Health Care and Life Sciences Interest Group
public-semweb-lifesci@w3.org - mail archives
Scientific publishing on the 'semantic web' - nature.com
A Little Semantic Web Goes a Long Way in Biology
Relations in biomedical ontologies
Biosemantics Group at the University Medical Center Rotterdam (Netherlands) aims at developing and validating advanced techniques for the processing and analysis of large, complex, and heterogeneous medical and biological data sets

nodalpoint.org

The semantic web for life sciences now - getting started guide/UniProt RDF tutorial (Redland RDF, LSID, Uniprot, data integration, etc)
The biological ontolgy backlash - "Are the current ontologies in biology good ontologies?" article by Larisa Soldatova and Ross King
- Wrestling with Bio-ontologies - discussion
Semantic Web term @ nodalpoint.org
BioDASH Demo - Semantic Web prototype of a Drug Development Dashboard built on Haystack

This site is hosted on OpenWetWare and can be edited by all members of the Synthetic Biology community.
Making life better, one part at a time.