<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.nodalpoint.org" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>nodalpoint.org - Yahoo! Research! Laboratories! - Comments</title>
 <link>http://www.nodalpoint.org/nodalpoint_tags/yahoo_research_laboratories</link>
 <description>Comments for &quot;Yahoo! Research! Laboratories!&quot;</description>
 <language>en</language>
<item>
 <title>Yeah, need to update my</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4335</link>
 <description>&lt;p&gt;Yeah, need to update my blog...&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 19:47:42 -0500</pubDate>
 <dc:creator>ejain</dc:creator>
 <guid isPermaLink="false">comment 4335 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>700?</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4334</link>
 <description>&lt;p&gt;At 700 a pop for an Action Figure?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 13:18:11 -0500</pubDate>
 <dc:creator>nuin</dc:creator>
 <guid isPermaLink="false">comment 4334 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Dude, Where&#039;s My NAR?</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4333</link>
 <description>&lt;p&gt;Hello Maximillian, I wasn&#039;t attacking NAR, was just wondering about what proportion of the data is dead. It is an interesting technical challenge to find this out. It would also be useful to measure the cost of gathering noisy, redundant and poorly understood data in terms of wasted resources (people, time, money, computers, false positives etc). Perhaps somebody has done something like this already? Especially with all the irresponsible sequencing just for sake of it that goes on...&lt;/p&gt;
&lt;p&gt;As for the physicists, I mentioned them for comparison. Like you, I doubt they will use 100% of their data either, but will probably use much more of it. However, they keep &lt;a href=&quot;http://www.newscientist.com/channel/fundamentals/mg19426103.300-particle-smasher-aims-for-may-2008-switchon.html&quot;&gt;delaying switching their big machine on&lt;/a&gt;, which means they are still waiting for the data. That is, unless when they finally flip the switch on the &lt;a href=&quot;http://en.wikipedia.org/wiki/Large_Hadron_Collider&quot;&gt;LHC&lt;/a&gt;, &lt;a href=&quot;http://popsci.typepad.com/popsci/2007/07/the-large-hadro.html&quot;&gt;we all disappear into a black hole&lt;/a&gt;, tombs and all :)&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 13:09:27 -0500</pubDate>
 <dc:creator>Duncan</dc:creator>
 <guid isPermaLink="false">comment 4333 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>It doesn&#039;t matter, they got published</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4332</link>
 <description>&lt;blockquote&gt;&lt;p&gt;Does it matter that large quantities of this data will probably never be used?&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;Probably not, as their authors are already happy enough to have those published, and then they can &lt;strike&gt;stop mantaining&lt;/strike&gt; move on to another &lt;strike&gt;project&lt;/strike&gt; publication. It is indeed a sad state of affairs.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 06:46:50 -0500</pubDate>
 <dc:creator>lbbros</dc:creator>
 <guid isPermaLink="false">comment 4332 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Swiss Prot Databases / Action Figures</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4331</link>
 <description>&lt;p&gt;Hi Eric, yeah even more redundancy there. Talking of SWISS Prot people, I&#039;m just wondering when your &lt;a href=&quot;http://eric.jain.name/2007/12/04/amos-bairoch-action-figure/&quot;&gt;Amos Bairoch action figure&lt;/a&gt; will be available in shops?!?! What are you up to now that you&#039;ve moved on from SWISS Prot to Seattle?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Mon, 21 Jan 2008 18:12:33 -0500</pubDate>
 <dc:creator>Duncan</dc:creator>
 <guid isPermaLink="false">comment 4331 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Why NAR</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4328</link>
 <description>&lt;p&gt;I add the usual defense of the NAR databases: At least they are indexed by pubmed, so as there are biologists that are not used to checking in google if there are websites about their subject, they will find them quicker via NAR. The data is not dead,  websites can always be exported. (I&#039;ve made the experience that for smaller databses it&#039;s usually quicker to scrape the data from an html page with something like HTTP::Recorder than writing to the person responsible for the data to send you an sql dump.) In addition, people get papers for their databases like this and other people can cite the database properly, so NAR makes the web citable and advances someone&#039;s career a little bit. It already eases the transition from a traditional paper-based science to a more web-orientated world. Databases are peer-reviewed, so they don&#039;t contain complete crap, a paper in NAR assures some minimal quality. And a write-only database is better than none at all, at least someone has collected something and you can scrape it from its tomb.&lt;/p&gt;
&lt;p&gt;I would just love to see a minimal requirement for a publication in NAR: They should all offer some simple text-based export, e.g. tab-delimited flatfiles. That could save me a lot of time...  &lt;/p&gt;
&lt;p&gt;Of course, large quantities of these data are never used and never read. But, heck, this is research, right? 90% will not be used in the end. It&#039;s not too different from those 500 alignments algorithms, 200 genomic analyses, hundreds of papers that describe &quot;new&quot; cloning strategies, yet another new gene, etc.  I don&#039;t believe that the physisists use 100% of their 1,5 GB / sec either.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Sun, 20 Jan 2008 10:59:14 -0500</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">comment 4328 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Also, think of all the</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4325</link>
 <description>&lt;p&gt;Also, think of all the redundant effort that goes into setting up the basic technical infrastructure each database needs (data storage, user interfaces etc)...&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Fri, 18 Jan 2008 20:12:16 -0500</pubDate>
 <dc:creator>ejain</dc:creator>
 <guid isPermaLink="false">comment 4325 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>See also: The problem with</title>
 <link>http://www.nodalpoint.org/2005/12/16/rethinking_the_semantic_web_part_1#comment-2859</link>
 <description>&lt;p&gt;See also: &lt;a href=&quot;http://mixingmemory.blogspot.com/2005/12/problem-with-meaning.html&quot;&gt;The problem with meaning&lt;/a&gt;.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Wed, 21 Dec 2005 12:26:55 -0500</pubDate>
 <dc:creator>Greg</dc:creator>
 <guid isPermaLink="false">comment 2859 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>proof of concept</title>
 <link>http://www.nodalpoint.org/2005/12/16/rethinking_the_semantic_web_part_1#comment-2858</link>
 <description>&lt;p&gt;I must admit, discussions about the semantic web leave me baffled.  I get the idea - that we use metadata to deliver relevant content rather than relying on keyword searches.  However, reading articles like those mentioned leaves me with no sense of how this is being achieved (if it is).  It seems to be an almost deliberately abstruse and jargon-riddled field - jargon for jargon&#039;s sake - which, for people whose goal is information delivery seems rather absurd.&lt;/p&gt;
&lt;p&gt;I&#039;d like to see some practical demonstrations, proof of concept, particularly with relevance to biology.  Perhaps such sites exist?  Set up a server with some content, write a prototype client and show me how it works and the advantages over what I already use.  Otherwise it&#039;s all just so much highbrow intellectual arm-waving.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 20 Dec 2005 23:23:41 -0500</pubDate>
 <dc:creator>Neil</dc:creator>
 <guid isPermaLink="false">comment 2858 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Rather interesting piece. It</title>
 <link>http://www.nodalpoint.org/2005/12/16/rethinking_the_semantic_web_part_1#comment-2857</link>
 <description>&lt;p&gt;Rather interesting piece. It is nice that Rob McCooool has offered up his reasons why the semantic web won&#039;t work, I mean *who hasn&#039;t* criticized the semantic web at some point ? It is becoming like a club or something, &quot;The semantic web is a &lt;a href=&quot;http://slashdot.org/askslashdot/01/03/21/0739222.shtml&quot;&gt;pipe dream&lt;/a&gt; and won&#039;t work because of...&quot;. Funny thing is I don&#039;t think TBL at any point has said the semantic web will be an easy thing to achieve... of course Rob has a solution, doesn&#039;t everyone have a solution to the semantic web ? but I think the contribution is not so much his solution but *how* the solution is presented.&lt;/p&gt;
&lt;p&gt;Think about it, the article is entitled &quot;Rethinking the Semantic Web part 1&quot; and the solution is presented in the &quot;To be continued...&quot; part II. Pure genius, you just can&#039;t help yourself begging for more ? So from this point on I plan to publish all papers in in either two parts or three. Part one will will have the introduction, materials and methods and a results teaser... just to keep you on the edge of your seat for part II, full results and discussion. I haven&#039;t figured out the trilogy format yet.&lt;/p&gt;
&lt;p&gt;Seriously though.&lt;/p&gt;
&lt;p&gt;I am curious about his proposed solution as I agree with many of the failings he states e.g. how OWL deails with exceptions and context. This is close to my current issue with the semantic web, that people miss the point that &lt;a href=&quot;http://kashori.com/2004/12/it-takes-agent-to-be-semantic.html&quot;&gt;It Takes an Agent to be Semantic&lt;/a&gt;. It is not just about controlled vocabularies and reasoners.&lt;/p&gt;
&lt;p&gt;However for someone claims to have a solution to the semantic web&#039;s problems there are some odd statements in the article. For example &quot;RDF triples must be maintained, either as separate files or within separate blocks inside HTML files.&quot; What about in a triples database ?&lt;/p&gt;
&lt;p&gt;&quot;The ontological data model makes representation of any nontrivial factual information difficult because it can’t represent context of any kind.&quot; What about named graphs ? What about the fact that context (in the semantic sense and not just the provenance of the triple) is dependent on shared meaning (&lt;a href=&quot;http://kashori.com/2005/11/comment-on-danny-ayers-context-what.html&quot;&gt;common knowledge&lt;/a&gt;) ?&lt;/p&gt;
&lt;p&gt;See &lt;a href=&quot;http://kashori.com/2005/09/semantic-web-awaits-real-semantic.html&quot;&gt;Kashori (John Black) for more&lt;/a&gt; on how the semantic web is likely to emerge.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 20 Dec 2005 07:05:26 -0500</pubDate>
 <dc:creator>Greg</dc:creator>
 <guid isPermaLink="false">comment 2857 at http://www.nodalpoint.org</guid>
</item>
</channel>
</rss>
