<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.nodalpoint.org" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>nodalpoint.org - The End of Relational Databases? - Comments</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases</link>
 <description>Comments for &quot;The End of Relational Databases?&quot;</description>
 <language>en</language>
<item>
 <title>RDF Databases</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2909</link>
 <description>&lt;p&gt;I too am excited about the possibility of using RDF databases but disappointed by current performance issues. There&#039;s some obvious optimisations they could be doing wrt transitive closures, hopefully that will come soon.&lt;/p&gt;
&lt;p&gt;You may want to check out chado - &lt;a href=&quot;http://www.gmod.org/schema&quot;&gt;http://www.gmod.org/schema&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;In some ways it&#039;s a hybrid schema between a truly generic RDF type model and a more traditional relation schema. I think it&#039;s quite a nice middle ground, but then I would say that.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Fri, 27 Jan 2006 22:39:38 -0500</pubDate>
 <dc:creator>cjm</dc:creator>
 <guid isPermaLink="false">comment 2909 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Trolls: My remark was to the</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2905</link>
 <description>&lt;blockquote&gt;&lt;p&gt;Trolls: My remark was to the article in BioIT-World - your coverage is perfectly adequate.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;Thank you for clarifying.&lt;/p&gt;
&lt;p&gt;The problem is not so much the suitability of relational databases for storing data, but in terms of their flexibilty for integrating data. Every data warehouse implementation (SRS, Discovery Link) requires maintenance of the various wrappers used to pull-in data from external data sources. Each one of those wrappers is ad-hoc as the underlying relational framework does not facilitate that kind of diverse data integration.&lt;/p&gt;
&lt;p&gt;RDF goes some way towards solving the flexibilty problem, but does have performance issues (not to mention higher level semantic issues). I have also had the opportunity to talk to Kei Hoi Cheung (also of YeastHub) who expressed similar concerns about using RDF databases (in their case Sesame) for large scale data storage. Even given the limitations I will still be pushing RDF as a solution locally as I see this as a research opportunity for myself (let&#039;s see how quickly that gets crushed by the powers that be and the LIMS vendors).&lt;/p&gt;
&lt;p&gt;At the end of the day, technology choices are hard.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Wed, 25 Jan 2006 06:04:36 -0500</pubDate>
 <dc:creator>Greg</dc:creator>
 <guid isPermaLink="false">comment 2905 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Alternative trolls</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2898</link>
 <description>&lt;p&gt;Trolls: My remark was to the article in BioIT-World - your coverage is perfectly adequate. &lt;/p&gt;
&lt;p&gt;In 1998, Lion Bioscience, my employer back then, commercialized the SRS system (the current &lt;a href=&quot;http://srs.ebi.ac.uk&quot;&gt;EBI installation&lt;/a&gt;), which was targeted to replace relational databases in the life sciences. Thus, I&#039;ve heard the requiem before. However, SRS could only only serve the biological knowledge side, not the transactional side that you need for a LIMS.&lt;br /&gt;
Nowadays, you can connect SRS to relational databases in every which way, using Oracles Gateway or IBMs Discovery Link and many other things. But SRS is only one of many ways to tackle to problem.&lt;br /&gt;
All of the modern relational databases can store and query XML documents, thereby removing many limitations on rich data structures - I don&#039;t really see relational DBs coughing, let alone dying.&lt;/p&gt;
&lt;p&gt;Alternative: RDF is all nice and shiny but seems to loose on the performance end. I talked to Andrew Smith (yeasthub) and you pay for the flexibilty by CPU time.  I would be surprised to see a system which is performant on normalized and highly non-normalized, rich data structures at the same time. However, semantic web technologies appear very appealing for the biological knowledge representation.&lt;/p&gt;
&lt;p&gt;Or do you think such RDF based extensions to Oracle can offer the speed and transactional safety of the highly optimized table  based system that it relies upon?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Sat, 21 Jan 2006 12:01:14 -0500</pubDate>
 <dc:creator>Spitshine</dc:creator>
 <guid isPermaLink="false">comment 2898 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>industrially strengthen and rock solid ?</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2897</link>
 <description>&lt;p&gt;We are missing an important point here. From an academic point of view, of course we must think and work towards making our bioinformatics future better through more semantics and integration of our data. But how about data holding vaults (see &lt;a href=&quot;http://www.gusdb.org&quot;&gt;GUS&lt;/a&gt; implemented in &lt;a href=&quot;http://www.oracle.com&quot;&gt;Oracle&lt;/a&gt;) for big institutes or companies in bioinformatics ?  Sure AceDB holds sequence + expression data from 2-3 species. But how about 10 genomes (&lt;a href=&quot;https://patric.vbi.vt.edu/&quot;&gt;PATRIC&lt;/a&gt;) ? What I am advocating for, is that those kind of big-mama DBMS can hold tera-bytes of data, and they are industry rated for they maintenance capabilities. Their DBMS server is rock-solid, and you can build whatever you want on it, without being afraid that it will break on large volumes of data.&lt;/p&gt;
&lt;p&gt;But they cost big bucks (including the certified DBAs to manage those complex systems). And true, modeling biological data in relational schemas can be tough. I believe that Myhrvold has got a point in his writing, but the big change will come when those BioIT - specific database systems (developed by academic labs, usually pushed by the industry) reach some maturity. An by maturity I mean all the above I mentioned for those big-mama DBMS above. &lt;/p&gt;
&lt;p&gt;Another point is that open-minded people working in BioIT (there are some still doing it with Excel) should be trailing along with those BioIT-specific DB systems as they mature. In this way those systems can end-up having the right objects for biological data, the right query mechanisms, plus semantic capabilities. Otherwise we&#039;ll be at the starting line again, trying to integrate the non-homogenous sources.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Fri, 20 Jan 2006 12:24:08 -0500</pubDate>
 <dc:creator>agbiotec</dc:creator>
 <guid isPermaLink="false">comment 2897 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Me trolling ? Never. 
At the</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2894</link>
 <description>&lt;p&gt;Me trolling ? Never. &lt;/p&gt;
&lt;p&gt;At the moment I&#039;m dealing with two LIMS systems, one for microarray data and the other for proteomics, both are backed by relational databases. I know at some point I will be asked to &#039;mine data&#039; from both. The issue for me is how easy will that be ? Assuming that both systems provide development libraries (object interfaces etc.) for the relational stores this will be reasonably straight forward after investigating both schemas and writing a few scripts. In this sense relational databases are &#039;adequate&#039;. &lt;/p&gt;
&lt;p&gt;These &#039;interfaces&#039; (or APIs) are what Chris Dwan is referring to in his article &lt;a href=&quot;http://www.nodalpoint.org/node/1767&quot;&gt;Interfaces Will Save the World&lt;/a&gt;. Now I would argue that the interface technologies he suggests, SOAP/WSDL, are not likely to save the world, more likely it will be simple REST based interfaces like those provided by the NCBI (Eutils). Irrespective of what technology is used, SOAP or REST, APIs do not help you deal with data structure heterogeneity i.e. tables vs. tress vs. graphs. On top of data structure heterogeneity you then have to deal with data type heterogeneity. &lt;/p&gt;
&lt;p&gt;The alternative that you missed is using a single data structure to eliminate the first level of heterogeneity. This is what the Resource Description Framework (RDF) is all about and why it is the underlying technology of the semantic web. Thus the direction we need to be looking is RDF databases, for example &lt;a href=&quot;http://www.kowari.org/&quot;&gt;Kowari&lt;/a&gt;, &lt;a href=&quot;http://www.openrdf.org/&quot;&gt;Sesame&lt;/a&gt;, &lt;a href=&quot;http://www.oracle.com/technology/tech/semantic_technologies/index.html&quot;&gt;Oracle&#039;s semantic extensions&lt;/a&gt; and maybe Google&#039;s &lt;a href=&quot;http://andrewhitchcock.org/?post=214&quot;&gt;BigTable&lt;/a&gt;. Ontologies come into play at the next level, data type heterogeneity. Using ontologies, ontology mappings and automated reasoning to sift through data in an RDF database is where we want to be going. Not installing yet another LIMS with a relational database backend.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Fri, 20 Jan 2006 02:38:22 -0500</pubDate>
 <dc:creator>Greg</dc:creator>
 <guid isPermaLink="false">comment 2894 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>BioSQL</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2891</link>
 <description>&lt;p&gt;I&#039;ve played around with it.  It&#039;s not very mature and I don&#039;t think there&#039;s a lot of active development, judging by &lt;a href=&quot;http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/sql/?cvsroot=biosql&quot;&gt;the CVS activity&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;It does seem like a possible solution to local storage though.  In theory you can take any of the common biological databases and just drop them into the schema.  If you understand the schema, it should be trivial to hook up a CGI web frontend with Perl/PHP, extract sequences using scripts and DB bindings and so on.  There&#039;s a guide to setting up and using BioSQL with a Swissprot example &lt;a href=&quot;http://bioteam.net/dag/BioTeam-HOWTO-1-BIOSQL.html&quot;&gt;at this Bioteam web page&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I think relational databases tend to be slow when used in this context mostly because biological databases are so large.  There are plenty of ways to optimise your databases though - check the mysql documentation.  People are always telling me that postgresql performs a lot better but I&#039;ve never looked into it.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 19 Jan 2006 22:21:37 -0500</pubDate>
 <dc:creator>Neil</dc:creator>
 <guid isPermaLink="false">comment 2891 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Fixed.</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2890</link>
 <description>&lt;p&gt;Fixed.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 19 Jan 2006 22:06:26 -0500</pubDate>
 <dc:creator>Greg</dc:creator>
 <guid isPermaLink="false">comment 2890 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>To paraphrase Winston Churchill...</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2889</link>
 <description>&lt;p&gt;It has been said that relational databases are the worst form of genomics data storage except all the others that have been tried.&lt;/p&gt;
&lt;p&gt;Is there a viable alternative to established relational databases? Perhaps if development of AceDB started up again. Personally I find the hassle invovled in twisting table schemas to meet your needs gets outweighed in the long run by speed, ease of use and connectivity (try connecting to AceDB from, say, a .NET app).&lt;/p&gt;
&lt;p&gt;Does anybody have experience with the BioSQL schema? Is it any good (I suppose it might depend on what you want to do with it)?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 19 Jan 2006 06:06:03 -0500</pubDate>
 <dc:creator>Stewb</dc:creator>
 <guid isPermaLink="false">comment 2889 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Broken link</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2888</link>
 <description>&lt;p&gt;There&#039;s a loose &amp;lt;br&amp;gt; tag at the end of the Bio-IT World link.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 19 Jan 2006 05:36:40 -0500</pubDate>
 <dc:creator>karbak</dc:creator>
 <guid isPermaLink="false">comment 2888 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Did I miss the alternative?</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases#comment-2887</link>
 <description>&lt;p&gt;If I would post the original story, people would laugh or accuse me of trolling. The problems we are facing in Proteomics research are not at all different from other data base systems, so relational solutions seem to be as adequate. Besides, when was the last time you put &quot;knowledge&quot; in a LIMS. Warehousing solutions (separating the transaction and the analysis elements on a larger scale) seem appropiate the last time I looked at it... &lt;/p&gt;
&lt;p&gt;Anyway, I can only re-iterate your last statement - the real problem is lack of support and resources from stakeholders who are &quot;not interested how it works&quot;. While the problem is abundant in IT, it is   pretty severe in *omics research as you have to tackle complicated data structures that are not at all well studied yet (compared to address fields and parts lists).&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 19 Jan 2006 05:33:42 -0500</pubDate>
 <dc:creator>Spitshine</dc:creator>
 <guid isPermaLink="false">comment 2887 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>The End of Relational Databases?</title>
 <link>http://www.nodalpoint.org/the_end_of_relational_databases</link>
 <description>&lt;p&gt;&lt;a href=&quot;http://en.wikipedia.org/wiki/Nathan_Myhrvold&quot;&gt;Nathan Myhrvold&lt;/a&gt; is predicting the end of relational databases for *omics data storage:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;&#039;heavy reliance on relational databases like Oracle, SQL, and DB2 within the field of proteomics and genomics is &quot;completely boneheaded.&quot;&#039;&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt; This quote is from a short  &lt;a href=&quot;http://www.bio-itworld.com/archive/071102/businesswatch.html&quot;&gt;news piece&lt;/a&gt; on BioIT World. On the one hand it is quite a glib statement to make, &quot;the end of X as we know it...&quot;, however there might be some truth to it...&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;&lt;p&gt;&lt;a href=&quot;http://www.nodalpoint.org/the_end_of_relational_databases&quot;&gt;read more&lt;/a&gt;&lt;/p&gt;</description>
 <comments>http://www.nodalpoint.org/the_end_of_relational_databases#comments</comments>
 <category domain="http://www.nodalpoint.org/master_list/bioinformatics">Bioinformatics</category>
 <pubDate>Wed, 18 Jan 2006 04:09:28 -0500</pubDate>
 <dc:creator>Greg</dc:creator>
 <guid isPermaLink="false">1775 at http://www.nodalpoint.org</guid>
</item>
</channel>
</rss>
