<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
> <channel><title>Autarchy of the Private Cave &#187; bacteria</title> <atom:link href="https://bogdan.org.ua/tags/bacteria/feed" rel="self" type="application/rss+xml" /><link>https://bogdan.org.ua</link> <description>Tiny bits of bioinformatics, [web-]programming etc</description> <lastBuildDate>Wed, 28 Dec 2022 16:09:04 +0000</lastBuildDate> <language>en-US</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>https://wordpress.org/?v=3.8.27</generator> <item><title>Nobody wants higher-quality, complete bacterial genomes</title><link>https://bogdan.org.ua/2016/05/24/nobody-wants-higher-quality-complete-bacterial-genomes.html</link> <comments>https://bogdan.org.ua/2016/05/24/nobody-wants-higher-quality-complete-bacterial-genomes.html#comments</comments> <pubDate>Tue, 24 May 2016 15:18:07 +0000</pubDate> <dc:creator><![CDATA[Bogdan]]></dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Rant]]></category> <category><![CDATA[assembly]]></category> <category><![CDATA[bacteria]]></category> <category><![CDATA[basic income]]></category> <category><![CDATA[genome]]></category> <category><![CDATA[rant]]></category> <category><![CDATA[sequencing]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2424</guid> <description><![CDATA[This is a piece of rant. Disclaimer The story, all names, characters, genomes and incidents portrayed in this blog post are fictitious. No identification with actual persons (living, dead or undead), places, companies, and processes is intended or should be inferred. No animals were harmed in the making of this blog post. Let&#8217;s try answering [&#8230;]]]></description> <content:encoded><![CDATA[<p>This is a piece of rant.</p><p><strong>Disclaimer</strong></p><blockquote><p>The story, all names, characters, genomes and incidents portrayed in this blog post are fictitious.<br
/> No identification with actual persons (living, dead or undead), places, companies, and processes is intended or should be inferred.<br
/> No animals were harmed in the making of this blog post.</p></blockquote><p>Let&#8217;s try answering a question:</p><blockquote><p>why are there many incomplete/draft bacterial genomes, and much fewer complete genomes?</p></blockquote><p><span
id="more-2424"></span></p><p>The answer is simple: insufficient value/cost ratio.<br
/> This can also be summarized as the <em>good enough</em> principle: if something is <em>good enough</em>, it does not get improved.</p><p><strong>Sample scenario 1</strong>.<br
/> Players: Principal Investigator (<strong>PI</strong>), Bacterial Genome (<strong>BG</strong>), Biologist (<strong>B</strong>), Sequencing Company (<strong>SC</strong>), (optional) Bioinformatician (<strong>oBI</strong>), Genomes Database (<strong>GD</strong>).</p><p><strong>B</strong> is interested to work with <strong>BG</strong>, and gets <strong>PI</strong>&#8216;s approval to sequence it.<br
/> Biomaterial is sent to <strong>SC</strong>, which sequences and even assembles the <strong>BG</strong>.<br
/> <strong>BG</strong> looks overall great and comes in just a handful fragments.<br
/> <strong>oBI</strong> is (optionally) involved, to annotate and describe the <strong>BG</strong>.<br
/> <strong>B</strong> works happily with the <strong>BG</strong>, describing and characterizing all the interesting biosynthetic features it contains.<br
/> An article is prepared, and <strong>oBI</strong> is (optionally) involved again, to prepare and submit the <strong>BG</strong> to the <strong>GD</strong>.<br
/> Preparing the <strong>BG</strong>, <strong>oBI</strong> has to answer a question if this <strong>BG</strong> contains any plasmids.<br
/> Upon closer examination, <strong>oBI</strong> finds that one of the fragments is actually the complete chromosome, and all others are just unplaced fragments of it.<br
/> <strong>oBI</strong> knows that this genome could probably be merged into a single draft scaffold<br
/> using bioinformatics tools and manual examination in maybe a few days (or a week&#8230; or two? <img
src="https://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif" alt=":)" class="wp-smiley" /> ).<br
/> <strong>oBI</strong> also knows that with a little bit of <strong>B</strong>&#8216;s help (a few primer walking experiments) it should be possible to have the complete <strong>BG</strong> within a month or two.<br
/> However, <strong>BG</strong> stays a draft, and is not going to be complete any time soon.</p><p>Why?</p><p>Let&#8217;s look at motivations of all the players, and see if any of the players <em>wants</em> the complete <strong>BG</strong>:</p><ul><li><strong>PI</strong> wants publications; spending extra time/effort to make <strong>BG</strong> complete does not present any obvious benefits;</li><li><strong>BG</strong> wants to be left alone;</li><li><strong>B</strong> wants to publish exciting new findings; they are already supported by the draft <strong>BG</strong>, so there is clearly no need for a complete <strong>BG</strong>;</li><li><strong>SC</strong> was happy to get payment in time; <strong>SC</strong> is also proud to be able to provide genome assembly as an extra service with its (primary) sequencing offers;</li><li><strong>oBI</strong> has interest in finishing the <strong>BG</strong>: it will then be complete; however, there are 5 more other BGs awaiting processing, and the backlog of semi-written manuscripts only keeps growing&#8230; finishing this specific <strong>BG</strong> will not result in a perceived benefit to <strong>oBI</strong>;</li><li><strong>GD</strong> stores genomes; it doesn&#8217;t care much if the genome submitted could have been better.</li></ul><p><em>Surprise</em>!<br
/> Looks like <strong>none of the players sees benefits in actually finishing the BG</strong>,<br
/> simply because efforts spent (or time waited) does not bring any perceived benefits to any of the players.</p><p><strong>Sample scenario 2</strong>.<br
/> Players: Bacterial Genome (<strong>BG</strong>), Biologist (<strong>B</strong>), Sequencing Company (<strong>SC</strong>), non-optional Bioinformatician (<strong>noBI</strong>), Genomes Database (<strong>GD</strong>).</p><p>This time, <strong>B</strong> (who is interested in <em>quickly</em> publishing a short genome announcement) asks for <strong>noBI</strong>&#8216;s help from the moment the <strong>BG</strong> is provided by the <strong>SC</strong>.<br
/> <strong>noBI</strong> has a cursory look at the <strong>BG</strong>, and although there is a huge discrepancy between thousands of contigs on the one hand and insanely high coverage on the other,<br
/> the <strong>BG</strong> otherwise appears <em>good enough</em> for further work, especially after scaffolding; after all, this is <em>just</em> a genome announcement, not a full-blown <em>article</em>!<br
/> There is also some weirdness about the coverage distribution of the <strong>BG</strong>, but <strong>noBI</strong> carelessly ignores that.<br
/> The <strong>BG</strong> is worked on: annotated, examined, described, prepared for submission to the <strong>GD</strong>.<br
/> Meanwhile, the announcement article is also nearly complete.<br
/> Genome is submitted, and <strong>GD</strong>&#8216;s response comes back: some scaffolds contain <em>orangutan</em> and <em>human</em> DNA, and some scaffolds contain known <em>adapter sequences</em> in the middle&#8230;<br
/> &#8220;<em>Oh crap</em>&#8220;, thinks <strong>noBI</strong>, &#8220;<em>I should have checked the raw reads for adapters and contamination, in spite of having the <strong>BG</strong> assembly already</em>&#8230;&#8221;<br
/> The <strong>GD</strong> also kindly offers an easy way out: just remove the obviously-orangutan scaffolds, and remove/mask/discard adapter sequences.<br
/> This is the <strong>easy way</strong>, leading to a <em>quicker</em> genome announcement, and a slight bump to the personal publication records of both <strong>B</strong> and <strong>noBI</strong>.</p><p>The <strong>right way</strong> is, of course, to clean raw reads from adapters and contamination, re-assemble, re-scaffold, re-annotate, re-describe the BG,<br
/> then prepare again for submission. This can delay the <em>quick</em> genome announcement by about a week,<br
/> but will highly likely result in a more contiguous and more correct BG &#8211; although still not complete.</p><p>As we have learned from Scenario 1, perceived benefits of going the <em>right</em> way (as opposed to the <em>easy</em> way) are nearly non-existent&#8230;</p><p>There was a genome I have finalized manually a few years ago.<br
/> I had some good quality data, obtained a 300-something contigs initial assembly,<br
/> then scaffolded and manually finalized to about 10 scaffolds.<br
/> There was simply not enough evidence (data) to keep merging scaffolds, so I had to stop.</p><p>Nowadays, as <em>bacterial genome sequencing prices are akin to weekend supermarket shopping expenses</em>,<br
/> nobody is going the extra mile to produce a better quality, more contiguous, or even a complete genome.<br
/> And this feels sad&#8230;</p><p>On the other hand, consumer markets function like that for decades.<br
/> An old water heater with a failed heating element is not repaired: it is replaced by a new water heater,<br
/> because human time cost to repair the old one is higher than just buying a new one.</p><p>Funnily, universal basic income might change that: without the need to spend 40+ hours a week at work<br
/> (and thus being unable to repair that water heater on one&#8217;s own),<br
/> one might just order that heating element and fix it &#8211; instead of buying the new one.</p><p>Would universal basic income have the same effect on draft and incomplete bacterial genomes? I have no idea.</p><p><a
class="a2a_button_citeulike" href="https://www.addtoany.com/add_to/citeulike?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2016%2F05%2F24%2Fnobody-wants-higher-quality-complete-bacterial-genomes.html&amp;linkname=Nobody%20wants%20higher-quality%2C%20complete%20bacterial%20genomes" title="CiteULike" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_pocket" href="https://www.addtoany.com/add_to/pocket?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2016%2F05%2F24%2Fnobody-wants-higher-quality-complete-bacterial-genomes.html&amp;linkname=Nobody%20wants%20higher-quality%2C%20complete%20bacterial%20genomes" title="Pocket" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_kindle_it" href="https://www.addtoany.com/add_to/kindle_it?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2016%2F05%2F24%2Fnobody-wants-higher-quality-complete-bacterial-genomes.html&amp;linkname=Nobody%20wants%20higher-quality%2C%20complete%20bacterial%20genomes" title="Kindle It" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_evernote" href="https://www.addtoany.com/add_to/evernote?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2016%2F05%2F24%2Fnobody-wants-higher-quality-complete-bacterial-genomes.html&amp;linkname=Nobody%20wants%20higher-quality%2C%20complete%20bacterial%20genomes" title="Evernote" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_pinterest" href="https://www.addtoany.com/add_to/pinterest?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2016%2F05%2F24%2Fnobody-wants-higher-quality-complete-bacterial-genomes.html&amp;linkname=Nobody%20wants%20higher-quality%2C%20complete%20bacterial%20genomes" title="Pinterest" rel="nofollow noopener" target="_blank"></a><a
class="a2a_dd addtoany_share_save addtoany_share" href="https://www.addtoany.com/share#url=https%3A%2F%2Fbogdan.org.ua%2F2016%2F05%2F24%2Fnobody-wants-higher-quality-complete-bacterial-genomes.html&#038;title=Nobody%20wants%20higher-quality%2C%20complete%20bacterial%20genomes" data-a2a-url="https://bogdan.org.ua/2016/05/24/nobody-wants-higher-quality-complete-bacterial-genomes.html" data-a2a-title="Nobody wants higher-quality, complete bacterial genomes"><img
src="https://static.addtoany.com/buttons/share_save_120_16.png" alt="Share"></a></p>]]></content:encoded> <wfw:commentRss>https://bogdan.org.ua/2016/05/24/nobody-wants-higher-quality-complete-bacterial-genomes.html/feed</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>Ensembl Genomes launches Protists, Bacteria and Metazoa</title><link>https://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html</link> <comments>https://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html#comments</comments> <pubDate>Tue, 21 Apr 2009 13:27:40 +0000</pubDate> <dc:creator><![CDATA[Bogdan]]></dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[Web]]></category> <category><![CDATA[bacteria]]></category> <category><![CDATA[Ensembl]]></category> <category><![CDATA[fungi]]></category> <category><![CDATA[genome]]></category> <category><![CDATA[metazoa]]></category> <category><![CDATA[plants]]></category> <category><![CDATA[protists]]></category> <category><![CDATA[sequence]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=670</guid> <description><![CDATA[The following sites are available: http://bacteria.ensembl.org http://protists.ensembl.org http://metazoa.ensembl.org During summer, two more sites &#8211; for Fungi and Plants &#8211; should be made available. Learn more about Ensembl Genomes project.]]></description> <content:encoded><![CDATA[<p>The following sites are available:</p><p><a
href="http://bacteria.ensembl.org/index.html">http://bacteria.ensembl.org</a><br
/> <a
href="http://protists.ensembl.org/index.html">http://protists.ensembl.org</a><br
/> <a
href="http://metazoa.ensembl.org/index.html">http://metazoa.ensembl.org</a></p><p>During summer, two more sites &#8211; for Fungi and Plants &#8211; should be made available.</p><p>Learn more about <a
href="http://www.ensemblgenomes.org/">Ensembl Genomes</a> project.</p><p><a
class="a2a_button_citeulike" href="https://www.addtoany.com/add_to/citeulike?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&amp;linkname=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" title="CiteULike" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_pocket" href="https://www.addtoany.com/add_to/pocket?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&amp;linkname=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" title="Pocket" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_kindle_it" href="https://www.addtoany.com/add_to/kindle_it?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&amp;linkname=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" title="Kindle It" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_evernote" href="https://www.addtoany.com/add_to/evernote?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&amp;linkname=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" title="Evernote" rel="nofollow noopener" target="_blank"></a><a
class="a2a_button_pinterest" href="https://www.addtoany.com/add_to/pinterest?linkurl=https%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&amp;linkname=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" title="Pinterest" rel="nofollow noopener" target="_blank"></a><a
class="a2a_dd addtoany_share_save addtoany_share" href="https://www.addtoany.com/share#url=https%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&#038;title=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" data-a2a-url="https://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html" data-a2a-title="Ensembl Genomes launches Protists, Bacteria and Metazoa"><img
src="https://static.addtoany.com/buttons/share_save_120_16.png" alt="Share"></a></p>]]></content:encoded> <wfw:commentRss>https://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>