<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.nodalpoint.org" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>nodalpoint.org - 10.000 times faster blast, in JAVA??? - Comments</title>
 <link>http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java</link>
 <description>Comments for &quot;10.000 times faster blast, in JAVA???&quot;</description>
 <language>en</language>
<item>
 <title>BLAT and PatternHunter</title>
 <link>http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java#comment-3164</link>
 <description>&lt;p&gt;thanks for the reference to PatternHunter... I&#039;ve heard about it before but forgot it here. &lt;/p&gt;
&lt;p&gt;Yes, it seems they simply put the database into memory. But well, given today&#039;s RAM prices that&#039;s not such a bad idea. BLAT does the same, with less sensitivity. And buying RAM is cheaper than buying some strange special hardware.&lt;/p&gt;
&lt;p&gt;Neil: a) PatternHunter is also &lt;a href=&quot;http://www.bioinformaticssolutions.com/products/ph/&quot;&gt;commercial&lt;/a&gt; though free for academic use.&lt;br /&gt;
b) You don&#039;t want to use BLAT in many appplications as it is a lot less sensitive than BLAST. I would rely on a blat only if the sequence is from the same or a very, very close species (like chimp or mouse to human) but that might also depend on the gene... So BLAT is really no alternative. Besides, BLAT is only free for academic use.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Wed, 11 Oct 2006 11:27:17 -0400</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">comment 3164 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>How can we know?</title>
 <link>http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java#comment-3163</link>
 <description>&lt;p&gt;10 000 x does seem like an outrageous claim.  How are we to know?  The website is high on &quot;corporate speak&quot; and low on technical information.  &quot;More options than all other products combined&quot;, &quot; a new paradigm in biotechnology&quot;, &quot;the ultimate genomics search solution&quot; &lt;i&gt;etc&lt;/i&gt;.  The only way to test their claims is to obtain the product.  And that my friends is why open source is superior.&lt;br /&gt;
Itatsumaki has already noted the huge RAM dependency - 4 GB is hardly standard even on a modern laptop.  If you want a fast, in-memory BLAST alternative, why not try &lt;a href=&quot;http://www.soe.ucsc.edu/~kent/src/&quot;&gt;BLAT&lt;/a&gt;?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Wed, 11 Oct 2006 00:34:51 -0400</pubDate>
 <dc:creator>Neil</dc:creator>
 <guid isPermaLink="false">comment 3163 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>RAM Dependent</title>
 <link>http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java#comment-3162</link>
 <description>&lt;p&gt;I looked at this, and it seems like they&#039;re just playing tricks.  Check out the RAM dependency they quote, it&#039;s huge.  They suggest your required RAM is 24 times the size of the database in letters, divided by the word-size.  So, for example if you try this algorithm against a large database like nt (non-redundant nucleotides) you would need a minimum of: 24 x 928,057,554 / 12 = 1.7 GB.  So it seems like they are making a speed/space trade-off somewhere.  Interestingly, that factor of 10k they quote sounds suspiciously like the difference between HD &amp;amp; memory access.&lt;/p&gt;
&lt;p&gt;It also seems likely that the played around with the seeding.  I think it&#039;s wrong to say that &quot;local alignments have been worked on for decades without a lot of speed improvement&quot;.  I think PatternHunter&#039;s asymmetric seeds are pretty convincing at doing just that, although I&#039;ve never benchmarked it myself.  Check out:&lt;br /&gt;
Bioinformatics. 2002 Mar;18(3):440-5.&lt;br /&gt;
PatternHunter: faster and more sensitive homology search.&lt;br /&gt;
Ma B, Tromp J, Li M.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Wed, 11 Oct 2006 00:03:26 -0400</pubDate>
 <dc:creator>Itatsumaki</dc:creator>
 <guid isPermaLink="false">comment 3162 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>10.000 times faster blast, in JAVA???</title>
 <link>http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java</link>
 <description>&lt;p&gt;Given that the backbone of sequence analysis and as such bioinformatics is alignment, this news has the potential to shake some ground in the community: There is a new company claiming that &lt;a href=&quot;http://www.slimsearch.co.nz/&quot;&gt;their new BLAST&lt;/a&gt; is as sensitive but 10.000 times faster than the original by improving the seed searching phase. I&#039;m very sceptical... local alignments have been worked on for decades without a lot of speed improvement while keeping accuracy. Which is why there are &lt;a href=&quot;http://www.timelogic.com/benchmark_blast.html&quot;&gt;special hardware solutions&lt;/a&gt; for this problem (I wonder who is buying them).&lt;br /&gt;
But the company&#039;s website does not look like a joke and they&#039;re shipping  demo versions... I have no clue how that could work. Suffix arrays? But that has been applied to local alignment seed search, right? Anyone out there with rumors about this company ?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;&lt;p&gt;&lt;a href=&quot;http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java&quot;&gt;read more&lt;/a&gt;&lt;/p&gt;</description>
 <comments>http://www.nodalpoint.org/2006/09/29/10_000_times_faster_blast_in_java#comments</comments>
 <category domain="http://www.nodalpoint.org/master_list/bioinformatics">Bioinformatics</category>
 <category domain="http://www.nodalpoint.org/computer_science/data_management">Data management</category>
 <category domain="http://www.nodalpoint.org/bioinformatics/sequence_analysis">Sequence analysis</category>
 <pubDate>Fri, 29 Sep 2006 15:20:09 -0400</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">1962 at http://www.nodalpoint.org</guid>
</item>
</channel>
</rss>
