archives

Date
  • 01
  • 02
  • 03
  • 04
  • 05
  • 06
  • 07
  • 08
  • 09
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30

3rd Integrative Bioinformatics Workshop Day 1

The 3rd Integrative Bioinformatics Workshop began yesterday at Rothamsted Research Centre in Harpenden, England. Rothamsted Research is a lovely campus reminescent of the Wellcome Trust Genome Campus just south of Cambridge. It has a long history of plant research, andwas also the workplace of the mathematician and statistician Ronald Fisher. This is a summary of the first day of the workshop (mainly discussing the keynote speech), on 4 September 2006.


Clarifying simple things that end up complicated: Percentage Identity

A new paper by Raghava and Barton has just gone online "Quantification of the variation in percentage identity for protein sequence alignments" at BMC Bioinformatics.

Initially I was shocked .. how, in 2006, could anyone manage to publish anything original about percentage identity (PID), that simple but oft used/abused measure that is fundamental in the definition of the "twilight-zone" of sequence similarity (for infering structural similarity or relatedness by sequence alone).

Well, it turns out (and becomes obvious when you try to code it), that there is more than one way to calculate the PID of a multiple sequence alignment, and each method yields different results. Authors rarely state exactly which method they used and, not surprisingly, no matter how you chose to measure the PID the multiple alignment algorithm used also has a substantial impact.