Dub Dub Dub '06

The 15th International World Wide Web conference is currently underway in Edinburgh, Bonny Scotland. As usual, this popular conference has some good papers, only 11%* of submissions are accepted. One particular paper caught my eye: One Document to Bind Them: Combining XML, Web Services, and the Semantic Web. This paper has probably been selected because it will wind people up (sorry I mean “spark a debate”) so its an entertaining and sometimes enlightening read.

In this paper, Harry Halpin and Henry Thompson make some observations about the state of the web in 2006:

But, according to the authors, it doesn't have to be this way...

  1. Many (but not all) web services are functions that are available on the web,
  2. The semantic web gives us an elaborate type system, using ontologies, which can extend what we already have with XML Schema
  3. The combination of the first two, gives us
    Semantic Web Services which are typed functions. This allows us to invoke web services not just by their URI (e.g. http://xml.nig.ac.jp/xddbj/Blast for a Blast service), but by the type of information they have. E.g. you have an output of type BLAST_report or perhaps InterProScan_report, what services will take this as input? What operations can be performed on this data? This sounds a lot like BioMOBY, with bells on.

What Harry and Henry propose is tying all this together using a single XML vocabulary, called Semantic ƒXML, to put “a unified abstraction of data, types and functions” so that the web can compute. This is all a bit pie-in-the-sky vision of the future stuff, but what might it mean for your average bioinformatican? It would be seriously useful if we could make the current molecular biology web services easier to use, but agreeing on and using an ontology for annotating the types of the inputs and outputs of all the services is non-trivial task. Bioinformaticians already have a (somewhat limited) universal type system for describing all data in bioinformatics, its called string. Persuading them to use something more powerful is not easy unless the benefits are immediately obvious.

At the moment, it is difficult to tell if sƒXML will ever have any impact on bioinformatics but who cares? Despite this, the paper is enjoyable reminder of what is interesting about services on the Web. They transform the web from a place where we can merely search and browse for data (sequences, genes, proteins, metabolic pathways, systems etc), into “one vast de-centralised computer” a bit like the one described in can computers explain biology? This, in my humble opinion, is what makes the web and bioinformatics an exciting place to work in 2006.

* Footnote: Of nearly 700 papers submitted: only 81 research papers were accepted (11%). This is a 25% increase on the number of submissions last year to www2005 in Chiba, Japan.


Comments

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

IIRC Henry S. Thompson was

IIRC Henry S. Thompson was behind XML Schema, which hasn't exactly been a resounding success. Also the misuse of the term microformat was also annoying (RSS is not a microformat). Nonetheless it was an amusing read, thanks for pointing it out.

Harry Haplin also has a blog.


Good read

I enjoyed the paper (nice post too by the way); it's informative, interesting and very readable. Everything that most technical literature on XML/semantic web normally is not :)


glad you enjoyed it

Hi Neil, glad you enjoyed it. The other papers at www2006 are a good read too, the quality of this conference seems to be getting better every year...