<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.nodalpoint.org" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>nodalpoint.org - Content Creation and Text processing - Comments</title>
 <link>http://www.nodalpoint.org/node/1765</link>
 <description>Comments for &quot;Content Creation and Text processing&quot;</description>
 <language>en</language>
<item>
 <title>ad voc scripts vs. integrated systems</title>
 <link>http://www.nodalpoint.org/node/1765#comment-2873</link>
 <description>&lt;p&gt;Yes, I prefer the *idea* of a streamlined approach (just because I posted that article doesn&#039;t mean I agree with what it says :) ).&lt;/p&gt;
&lt;p&gt;I think if the style of text being processed doesn&#039;t have a defined format, and it is a once off task, the ad hoc set of tools (which occasionally may be reusable) is often the way to go. Luckily, most raw bioinformatics data has a defined format, making a streamlined approach much more sensible.&lt;/p&gt;
&lt;p&gt;I guess the question you have to ask yourself is ... am I ever likely to use this script again ?&lt;br /&gt;
(or am I feeling altruistic, and will someone else use it after me even if I won&#039;t ever need it again ?).&lt;/p&gt;
&lt;p&gt;I guess it is the streamlined approach which is slowly emerging from the BioPython, BioPerl, Bio* etc projects, which is nice.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Wed, 11 Jan 2006 00:19:08 -0500</pubDate>
 <dc:creator>pansapiens</dc:creator>
 <guid isPermaLink="false">comment 2873 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>scripting != programming</title>
 <link>http://www.nodalpoint.org/node/1765#comment-2870</link>
 <description>&lt;p&gt;I like this part: &quot;Before embarking on writing scripts, you need one of two things: the right frame of mind to write a script, or someone else to do it for you. In either case, the frame of mind is very different from what&#039;s needed for making a product, so not all professional programmers are good at doing this until they understand the differences.&quot;&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Sat, 07 Jan 2006 10:26:36 -0500</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">comment 2870 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>clunky ^ 2</title>
 <link>http://www.nodalpoint.org/node/1765#comment-2869</link>
 <description>&lt;p&gt;as jim kent puts it: It&#039;s safer on the lagging edge.&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Sat, 07 Jan 2006 09:52:35 -0500</pubDate>
 <dc:creator>maximilianh</dc:creator>
 <guid isPermaLink="false">comment 2869 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>clunky</title>
 <link>http://www.nodalpoint.org/node/1765#comment-2867</link>
 <description>&lt;p&gt;Shouldn&#039;t we dream of a more streamlined approach, where you don&#039;t need glued together ad-hocery to process data?&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <pubDate>Thu, 05 Jan 2006 14:24:15 -0500</pubDate>
 <dc:creator>chris</dc:creator>
 <guid isPermaLink="false">comment 2867 at http://www.nodalpoint.org</guid>
</item>
<item>
 <title>Content Creation and Text processing</title>
 <link>http://www.nodalpoint.org/node/1765</link>
 <description>&lt;p&gt;Liam Quin from W3C has given &lt;a href=&quot;http://www.kuro5hin.org/story/2005/12/28/223217/93&quot;&gt;a few useful tips&lt;/a&gt; relating to processing documents (eg error-prone re-typed or scanned text) into XML.&lt;/p&gt;
&lt;p&gt;Many of these practises are important for the sort of text processing tasks that seem to come up in bioinformatics.&lt;/p&gt;
&lt;p&gt;&lt;i&gt;Article summary:&lt;/i&gt; use lots of small one-off scripts to make small changes, continually validate your output, briefly document your steps, automate steps with a meta-script or Makefile and keep input and output text seperate (.. well duh!).&lt;/p&gt;
&lt;br class=&quot;clear&quot; /&gt;</description>
 <comments>http://www.nodalpoint.org/node/1765#comments</comments>
 <category domain="http://www.nodalpoint.org/master_list/markup_technologies">Markup Technologies</category>
 <category domain="http://www.nodalpoint.org/markup_technologies/xml">XML</category>
 <pubDate>Sun, 01 Jan 2006 00:37:10 -0500</pubDate>
 <dc:creator>pansapiens</dc:creator>
 <guid isPermaLink="false">1765 at http://www.nodalpoint.org</guid>
</item>
</channel>
</rss>
