<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.nodalpoint.org" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>nodalpoint.org - Data management - Comments</title>
 <link>http://www.nodalpoint.org/computer_science/data_management</link>
 <description>Comments for &quot;Data management&quot;</description>
 <language>en</language>
<item>
 <title>User-friendly, High compatibility but low requirements</title>
 <link>http://www.nodalpoint.org/2008/05/03/introducing_the_eyelims_project#comment-4448</link>
 <description>&lt;p&gt;Hi Neil, &lt;/p&gt;
&lt;p&gt;And thanks for your interest in the project. I fixed the mistake in &quot;invaluable&quot;.&lt;/p&gt;
&lt;p&gt;I also completely agree on the facts that:&lt;br /&gt;
- we need a free open source Laboratory Information Management System&lt;br /&gt;
- to be adopted, it needs to be user-friendly, cross-platform, highly compatible with existing open source solutions.&lt;/p&gt;
&lt;p&gt;I had that in mind when I joined the eyeOS team, three years ago and I always kept it in mind during eyeOS development. And it is because eyeOS fits these needs that I really think it provides the perfect platform for a LIMS:&lt;br /&gt;
- For users, eyeOS provides a really intuitive and user-friendly desktop-like environment, accessible from anywhere.&lt;br /&gt;
- For developers, eyeOS provides a complete Toolkit and set of libraries to easily and quickly develop new eyeOS applications.&lt;br /&gt;
- For network manager, eyeOS has very low requirements. It only needs PHP and XML (javascript being provided by the navigator). It does NOT require a MySQL (or other) database or a Java server (such as Tomcat). For this reason, it can be installed on any basic hosting plan. However, it allows the easy integration of Flash (swf), Java applets or MySQL applications. Importantly, it constitutes a secured platform to store and share documents with a Virtual File System controlling the user access to files.       &lt;/p&gt;
&lt;p&gt;To conclude, the last but not the least, eyeOS is Free and Open Source. To my knowledge, it is unique. &lt;/p&gt;
&lt;p&gt;I hope you will take time to discover this platform and agree with me on its potential. &lt;/p&gt;
&lt;p&gt;Best regards, &lt;/p&gt;
&lt;p&gt;Alexandre&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.eyeLIMS.com&quot; title=&quot;www.eyeLIMS.com&quot;&gt;www.eyeLIMS.com&lt;/a&gt; - Free Open Source Laboratory Information Management System powered by eyeOS&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 06 May 2008 17:37:22 -0400</pubDate>
 <dc:creator>amejat</dc:creator>
 <guid isPermaLink="false">comment 4448 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Value</title>
 <link>http://www.nodalpoint.org/2008/05/03/introducing_the_eyelims_project#comment-4446</link>
 <description>&lt;p&gt;&lt;i&gt;provides an unvaluable system&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;*Invaluable*, surely.  I hope?&lt;/p&gt;
&lt;p&gt;I&#039;ll be interested to see how this develops:  the research world is in desperate need of a quality, open-source LIMS.  However, I&#039;m concerned that this LIMS is tied to an existing product.  Everyone has a web browser, but not everyone will want to sign up for eyeOS and learn how to use it.  A true web application is something more like Google Documents, where you just open a page and start work.&lt;/p&gt;
&lt;p&gt;Remember, biologists will look for any excuse not to learn how to use an application; you need to keep the entry barrier as low as possible.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 06 May 2008 02:55:35 -0400</pubDate>
 <dc:creator>Neil</dc:creator>
 <guid isPermaLink="false">comment 4446 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Yeah, need to update my</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4335</link>
 <description>&lt;p&gt;Yeah, need to update my blog...&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 19:47:42 -0500</pubDate>
 <dc:creator>ejain</dc:creator>
 <guid isPermaLink="false">comment 4335 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>700?</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4334</link>
 <description>&lt;p&gt;At 700 a pop for an Action Figure?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 13:18:11 -0500</pubDate>
 <dc:creator>nuin</dc:creator>
 <guid isPermaLink="false">comment 4334 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Dude, Where&#039;s My NAR?</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4333</link>
 <description>&lt;p&gt;Hello Maximillian, I wasn&#039;t attacking NAR, was just wondering about what proportion of the data is dead. It is an interesting technical challenge to find this out. It would also be useful to measure the cost of gathering noisy, redundant and poorly understood data in terms of wasted resources (people, time, money, computers, false positives etc). Perhaps somebody has done something like this already? Especially with all the irresponsible sequencing just for sake of it that goes on...&lt;/p&gt;
&lt;p&gt;As for the physicists, I mentioned them for comparison. Like you, I doubt they will use 100% of their data either, but will probably use much more of it. However, they keep &lt;a href=&quot;http://www.newscientist.com/channel/fundamentals/mg19426103.300-particle-smasher-aims-for-may-2008-switchon.html&quot;&gt;delaying switching their big machine on&lt;/a&gt;, which means they are still waiting for the data. That is, unless when they finally flip the switch on the &lt;a href=&quot;http://en.wikipedia.org/wiki/Large_Hadron_Collider&quot;&gt;LHC&lt;/a&gt;, &lt;a href=&quot;http://popsci.typepad.com/popsci/2007/07/the-large-hadro.html&quot;&gt;we all disappear into a black hole&lt;/a&gt;, tombs and all :)&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 13:09:27 -0500</pubDate>
 <dc:creator>Duncan</dc:creator>
 <guid isPermaLink="false">comment 4333 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>It doesn&#039;t matter, they got published</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4332</link>
 <description>&lt;blockquote&gt;&lt;p&gt;Does it matter that large quantities of this data will probably never be used?&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;Probably not, as their authors are already happy enough to have those published, and then they can &lt;strike&gt;stop mantaining&lt;/strike&gt; move on to another &lt;strike&gt;project&lt;/strike&gt; publication. It is indeed a sad state of affairs.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Tue, 22 Jan 2008 06:46:50 -0500</pubDate>
 <dc:creator>lbbros</dc:creator>
 <guid isPermaLink="false">comment 4332 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Swiss Prot Databases / Action Figures</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4331</link>
 <description>&lt;p&gt;Hi Eric, yeah even more redundancy there. Talking of SWISS Prot people, I&#039;m just wondering when your &lt;a href=&quot;http://eric.jain.name/2007/12/04/amos-bairoch-action-figure/&quot;&gt;Amos Bairoch action figure&lt;/a&gt; will be available in shops?!?! What are you up to now that you&#039;ve moved on from SWISS Prot to Seattle?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Mon, 21 Jan 2008 18:12:33 -0500</pubDate>
 <dc:creator>Duncan</dc:creator>
 <guid isPermaLink="false">comment 4331 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Why NAR</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4328</link>
 <description>&lt;p&gt;I add the usual defense of the NAR databases: At least they are indexed by pubmed, so as there are biologists that are not used to checking in google if there are websites about their subject, they will find them quicker via NAR. The data is not dead,  websites can always be exported. (I&#039;ve made the experience that for smaller databses it&#039;s usually quicker to scrape the data from an html page with something like HTTP::Recorder than writing to the person responsible for the data to send you an sql dump.) In addition, people get papers for their databases like this and other people can cite the database properly, so NAR makes the web citable and advances someone&#039;s career a little bit. It already eases the transition from a traditional paper-based science to a more web-orientated world. Databases are peer-reviewed, so they don&#039;t contain complete crap, a paper in NAR assures some minimal quality. And a write-only database is better than none at all, at least someone has collected something and you can scrape it from its tomb.&lt;/p&gt;
&lt;p&gt;I would just love to see a minimal requirement for a publication in NAR: They should all offer some simple text-based export, e.g. tab-delimited flatfiles. That could save me a lot of time...  &lt;/p&gt;
&lt;p&gt;Of course, large quantities of these data are never used and never read. But, heck, this is research, right? 90% will not be used in the end. It&#039;s not too different from those 500 alignments algorithms, 200 genomic analyses, hundreds of papers that describe &quot;new&quot; cloning strategies, yet another new gene, etc.  I don&#039;t believe that the physisists use 100% of their 1,5 GB / sec either.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Sun, 20 Jan 2008 10:59:14 -0500</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">comment 4328 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Also, think of all the</title>
 <link>http://www.nodalpoint.org/2008/01/18/one_thousand_databases_high_and_rising#comment-4325</link>
 <description>&lt;p&gt;Also, think of all the redundant effort that goes into setting up the basic technical infrastructure each database needs (data storage, user interfaces etc)...&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Fri, 18 Jan 2008 20:12:16 -0500</pubDate>
 <dc:creator>ejain</dc:creator>
 <guid isPermaLink="false">comment 4325 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>You might try out MedWorm too</title>
 <link>http://www.nodalpoint.org/2007/11/13/journal_article_search_via_rss_mashup#comment-4271</link>
 <description>&lt;p&gt;A nice introduction to Yahoo Pipes. You might also try out &lt;a href=&quot;http://www.medworm.com&quot;&gt;MedWorm&lt;/a&gt; which allows you to create RSS feeds from queries on its sources - which currently includes over 5000 medically related RSS feeds (sorry, I too am recommending my own site - which is where I came across this posting, since I like to keep a check on &lt;a href=&quot;http://www.medworm.com/rss/search.php?qu=rss&amp;amp;t=rss&amp;amp;r=Exact&amp;amp;o=d&amp;amp;f=tagblog&quot;&gt;blog postings tagged with &#039;RSS&#039;&lt;/a&gt; - I hope you may excuse this since the site is highly relevant to this posting).&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Mon, 19 Nov 2007 00:17:08 -0500</pubDate>
 <dc:creator>frankie</dc:creator>
 <guid isPermaLink="false">comment 4271 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Scintilla</title>
 <link>http://www.nodalpoint.org/2007/11/13/journal_article_search_via_rss_mashup#comment-4269</link>
 <description>&lt;p&gt;For the science side of things you could give &lt;a href=&quot;http://scintilla.nature.com/&quot;&gt;Scintilla&lt;/a&gt; a try (sorry for plugging my own site, but it is designed for this kind of thing: if it doesn&#039;t suit the way you work then let me know).&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Fri, 16 Nov 2007 05:43:28 -0500</pubDate>
 <dc:creator>alf</dc:creator>
 <guid isPermaLink="false">comment 4269 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>that works too,,,</title>
 <link>http://www.nodalpoint.org/2007/11/13/journal_article_search_via_rss_mashup#comment-4266</link>
 <description>&lt;p&gt;,,, but on a second thought, with the Pipes you can mashup and filter RSS from any source,,, so in addition to having keyword-filtered articles from papers, you can also have blog posts, newgroups and whatever else RSS content cleansed and delivered all in a single feed,,, I guess you could also do that by searching within Google reader after you aggregate your feeds there,,,&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 15 Nov 2007 13:58:15 -0500</pubDate>
 <dc:creator>agbiotec</dc:creator>
 <guid isPermaLink="false">comment 4266 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>What about PubMed?</title>
 <link>http://www.nodalpoint.org/2007/11/13/journal_article_search_via_rss_mashup#comment-4265</link>
 <description>&lt;p&gt;If your articles of interest are there of course (and a big if here). I have a couple saved searches in PubMed and subscribe to the feeds for those searches. One of these is a long list of scientists in my field for example. I subscribe using Google Reader so new papers come through every day and scanning through them in Reader is much easier than using PubMed directly. I then use a set of labels to prioritize articles for follow up and importing into the excellent pdf manager Papers.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 15 Nov 2007 12:32:41 -0500</pubDate>
 <dc:creator>dbtodd</dc:creator>
 <guid isPermaLink="false">comment 4265 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Pipes are pretty nifty</title>
 <link>http://www.nodalpoint.org/2007/11/13/journal_article_search_via_rss_mashup#comment-4264</link>
 <description>&lt;p&gt;Pipes are pretty nifty indeed. &lt;a href=&quot;http://opennfo.wordpress.com/2007/10/07/pipe-dreams/&quot;&gt;I created one&lt;/a&gt; for the same purpose myself a little while back for filtering items from a few different journals. I&#039;ve since thought of an even easier way to search through a large number of journal feeds, but have not had the time to blog about it lately because I&#039;ve been so caught-up in school work. Basically though, the idea is to combine &lt;a href=&quot;http://www.google.com/reader&quot;&gt;google reader&lt;/a&gt; + yahoo pipes. Yahoo pipes is powerful, but it is a pain to handle a high number of feeds. It is very easy to add feeds to google reader however, so rather than adding the feeds directly to yahoo pipes, you could instead create a google reader account just for journal feeds. Give them all a tag like &quot;article,&quot; and then feed your google reader &quot;article&quot; feed (which will include all items from all of the journals) into Yahoo pipes. This way you will only have to deal with a single feed in Yahoo pipes, and adding new journals becomes trivial.&lt;/p&gt;
&lt;p&gt;Not bad, huh? :)&lt;br /&gt;
Keith&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 15 Nov 2007 10:33:28 -0500</pubDate>
 <dc:creator>pwnedd</dc:creator>
 <guid isPermaLink="false">comment 4264 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>some points</title>
 <link>http://www.nodalpoint.org/2007/08/06/roundup_extract_a_sequence_from_a_fasta_file#comment-4224</link>
 <description>&lt;p&gt;Thanks Neil. I think there is no limit to the number of sequences you can download but a limit to the number of keywords you can search for. I did  &quot;id1 or id2 or id3 ...&quot; and hit the limit there. There should be probably a better way to interface with NCBI but I don&#039;t know it.&lt;/p&gt;
&lt;p&gt;Why 1 GB? Most EST-projects don&#039;t give their sequences genbank-compatible ids. They start with something homegrown, of course, then put the clones onto plates and distribute them. This is similar for mouse clones, zebrafish clones and also in my case ascidian cDNA clones. You see, a word in the description can be a real ID, if a sequence historically always had 2 different IDs. My EST sequences were just selected sequences, there is no way to filter from the sequence.&lt;/p&gt;
&lt;p&gt;I think Badboys is right in that very often plain old Perl is more than enough. With hindsight, I could have hacked a python/perl script together in 20 minutes. It&#039;s so simple: Slurp the ids into a hash, iterate over the lines of the fasta and output everything that has a fasta id that contains a word in the hash. My problem was that I assumed that there was already a solution for this. &lt;/p&gt;
&lt;p&gt;This happends to me all the time: I find programs on the internet that claim to do something and end up hacking something together 3 days later because the tools a) don&#039;t come with source code b) won&#039;t run at all c) do something slightly different which compromises the whole application for which I need them for.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 11 Oct 2007 13:01:13 -0400</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">comment 4224 at http://www.nodalpoint.org</guid>
</item>
</channel>
</rss>
