<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
> <channel><title>Autarchy of the Private CaveBioinformatics &#187;</title> <atom:link href="http://bogdan.org.ua/categories/science/bioinformatics/feed" rel="self" type="application/rss+xml" /><link>http://bogdan.org.ua</link> <description>Tiny bits of bioinformatics, [web-]programming etc</description> <lastBuildDate>Tue, 15 May 2012 21:56:55 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.2</generator> <item><title>Amazonia! 6462 human microarray datasets</title><link>http://bogdan.org.ua/2011/03/06/amazonia-6462-human-microarray-datasets.html</link> <comments>http://bogdan.org.ua/2011/03/06/amazonia-6462-human-microarray-datasets.html#comments</comments> <pubDate>Sun, 06 Mar 2011 19:18:51 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[amazonia]]></category> <category><![CDATA[data]]></category> <category><![CDATA[expression]]></category> <category><![CDATA[microarray]]></category> <category><![CDATA[stem cells]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1460</guid> <description><![CDATA[Amazonia! &#8211; explore the jungle of microarray results Paradoxically, the tremendous downpour of microarray results prevents a simple use of expression data. Therefore, we propose a thematic entry to public transcriptomes: you may for instance query a gene on a &#8220;Stem Cells page&#8221;, where you will see the expression of your favorite gene across selected [...]]]></description> <content:encoded><![CDATA[<p><a
href="http://amazonia.transcriptome.eu/" rel="nofollow" ><img
src="http://bogdan.org.ua/wp-content/uploads/2011/03/AmaZoniaLogo.png" alt="Amazonia!" title="Amazonia!" width="357" height="100" class="alignleft size-full wp-image-1463" /></a><a
href="http://amazonia.transcriptome.eu/" rel="nofollow" >Amazonia! &#8211; explore the jungle of microarray results</a></p><blockquote><p>Paradoxically, the tremendous downpour of microarray results prevents a simple use of expression data. Therefore, we propose a thematic entry to public transcriptomes: you may for instance query a gene on a &#8220;Stem Cells page&#8221;, where you will see the expression of your favorite gene across selected microarray experiments related to stem cell biology. This selection of samples can be customized at will among the 6462 samples currently present in the database.</p></blockquote><blockquote><p>Every transcriptome study results in the identification of lists of genes relevant to a given biological condition. In order to include this valuable information in any new query in the Amazonia! database, we indicate for each gene in which lists it is included. This is a straightforward and efficient way to synthesize hundreds of microarray publications.</p><p>A special feature of Amazonia! is the field of human stem cells, notably embryonic stem cells.</p></blockquote><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2011/03/06/amazonia-6462-human-microarray-datasets.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2011/03/06/amazonia-6462-human-microarray-datasets.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2011%2F03%2F06%2Famazonia-6462-human-microarray-datasets.html&amp;title=Amazonia%21%206462%20human%20microarray%20datasets" id="wpa2a_4"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2011/03/06/amazonia-6462-human-microarray-datasets.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Introduction to Python for bioinformatics</title><link>http://bogdan.org.ua/2011/02/25/introduction-to-python-for-bioinformatics.html</link> <comments>http://bogdan.org.ua/2011/02/25/introduction-to-python-for-bioinformatics.html#comments</comments> <pubDate>Fri, 25 Feb 2011 12:03:55 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Python]]></category> <category><![CDATA[Software]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1451</guid> <description><![CDATA[This overview presentation is two years old, but still a highly valuable resource: modules and tools mentioned are alive and useful. I think this is the second presentation by Giovanni I&#8217;m embedding (first one being about GNU/make for bioinformatics). Introduction to python for bioinformatics]]></description> <content:encoded><![CDATA[<p>This overview presentation is two years old, but still a highly valuable resource: modules and tools mentioned are alive and useful.<br
/> I think this is the second presentation by Giovanni I&#8217;m embedding (first one being about GNU/make for bioinformatics).</p><div
style="width:425px" id="__ss_1320208"><strong
style="display:block;margin:12px 0 4px"><a
href="http://www.slideshare.net/giovanni/introduction-to-python-for-bioinformatics" rel="nofollow"  title="Introduction to python for bioinformatics">Introduction to python for bioinformatics</a></strong><object
id="__sse1320208" width="425" height="355"><param
name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=introduction-090421044444-phpapp02&#038;stripped_title=introduction-to-python-for-bioinformatics&#038;userName=giovanni" /><param
name="allowFullScreen" value="true"/><param
name="allowScriptAccess" value="always"/><embed
name="__sse1320208" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=introduction-090421044444-phpapp02&#038;stripped_title=introduction-to-python-for-bioinformatics&#038;userName=giovanni" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2011/02/25/introduction-to-python-for-bioinformatics.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2011/02/25/introduction-to-python-for-bioinformatics.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2011%2F02%2F25%2Fintroduction-to-python-for-bioinformatics.html&amp;title=Introduction%20to%20Python%20for%20bioinformatics" id="wpa2a_8"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2011/02/25/introduction-to-python-for-bioinformatics.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>How to replace newlines with commas, tabs etc (merge lines)</title><link>http://bogdan.org.ua/2010/11/16/how-to-replace-newlines-with-commas-tabs-etc-merge-lines.html</link> <comments>http://bogdan.org.ua/2010/11/16/how-to-replace-newlines-with-commas-tabs-etc-merge-lines.html#comments</comments> <pubDate>Tue, 16 Nov 2010 08:20:45 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[*nix]]></category> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[how-to]]></category> <category><![CDATA[Notepad]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[awk]]></category> <category><![CDATA[grep]]></category> <category><![CDATA[linux]]></category> <category><![CDATA[paste]]></category> <category><![CDATA[sed]]></category> <category><![CDATA[sort]]></category> <category><![CDATA[tr]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1208</guid> <description><![CDATA[Imagine you need to get a few lines from a group of files with missing identifier mappings. I have a bunch of files with content similar to this one: ENSRNOG00000018677 1368832_at 25233 ENSRNOG00000002079 1369102_at 25272 ENSRNOG00000043451 25353 ENSRNOG00000001527 1388013_at 25408 ENSRNOG00000007390 1389538_at 25493 In the example above I need '25353', which does not have corresponding [...]]]></description> <content:encoded><![CDATA[<p>Imagine you need to get a few lines from a group of files with missing identifier mappings. I have a bunch of files with content similar to this one:</p><blockquote><p> ENSRNOG00000018677      1368832_at      25233<br
/> ENSRNOG00000002079      1369102_at      25272<br
/> ENSRNOG00000043451                            25353<br
/> ENSRNOG00000001527      1388013_at      25408<br
/> ENSRNOG00000007390      1389538_at      25493</p></blockquote><p>In the example above I need '25353', which does not have corresponding affy_probeset_id in the 2nd column.</p><p>It is clear how to do that:</p><div
class="igBar"><span
id="lcode-7"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-7'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-7"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">sort -u *_affy_ensembl.<span
style="">txt</span> | grep -v <span
style="color:#CC0000;">'_at'</span> | awk <span
style="color:#CC0000;">'{print $2}'</span></div></li></ol></div></div></div><p></p><p>This outputs a column of required IDs (EntrezGene in this example):</p><blockquote><p> 116720<br
/> 679845<br
/> 309295<br
/> 364867<br
/> 298220<br
/> 298221<br
/> 25353</p></blockquote><p>However, I need these IDs as a comma-separated list, not as newline-separated list.</p><p>There are several ways to achieve the desired result (only the last pipe commands differ):</p><div
class="igBar"><span
id="lcode-8"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-8'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-8"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">sort -u *_affy_ensembl.<span
style="">txt</span> | grep -v <span
style="color:#CC0000;">'_at'</span> | awk <span
style="color:#CC0000;">'{print $2}'</span> | gawk <span
style="color:#CC0000;">'$1=$1'</span> ORS=<span
style="color:#CC0000;">', '</span></div></li></ol></div></div></div><p></p><div
class="igBar"><span
id="lcode-9"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-9'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-9"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">sort -u *_affy_ensembl.<span
style="">txt</span> | grep -v <span
style="color:#CC0000;">'_at'</span> | awk <span
style="color:#CC0000;">'{print $2}'</span> | tr <span
style="color:#CC0000;">'<span
style="color:#000099; font-weight:bold;">\n</span>'</span> <span
style="color:#CC0000;">','</span></div></li></ol></div></div></div><p></p><div
class="igBar"><span
id="lcode-10"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-10'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-10"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">sort -u *_affy_ensembl.<span
style="">txt</span> | grep -v <span
style="color:#CC0000;">'_at'</span> | awk <span
style="color:#CC0000;">'{print $2}'</span> | sed <span
style="color:#CC0000;">':a;N;$!ba;s/<span
style="color:#000099; font-weight:bold;">\n</span>/, /g'</span></div></li></ol></div></div></div><p></p><div
class="igBar"><span
id="lcode-11"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-11'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-11"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">sort -u *_affy_ensembl.<span
style="">txt</span> | grep -v <span
style="color:#CC0000;">'_at'</span> | awk <span
style="color:#CC0000;">'{print $2}'</span> | sed <span
style="color:#CC0000;">':q;N;s/<span
style="color:#000099; font-weight:bold;">\n</span>/, /g;t q'</span></div></li></ol></div></div></div><p></p><div
class="igBar"><span
id="lcode-12"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-12'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-12"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">sort -u *_affy_ensembl.<span
style="">txt</span> | grep -v <span
style="color:#CC0000;">'_at'</span> | awk <span
style="color:#CC0000;">'{print $2}'</span> | paste -s -d <span
style="color:#CC0000;">","</span></div></li></ol></div></div></div><p></p><p>These solutions differ in efficiency and (slightly) in output. <strong>sed</strong> will read all the input into its buffer to replace newlines with other separators, so it might not be best for large files. <strong>tr</strong> might be the most efficient, but I haven't tested that. <strong>paste</strong> will re-use delimiters, so you cannot really get comma-space ", " separation with it.</p><p>Sources: <a
href="http://www.linuxquestions.org/questions/programming-9/sed-how-do-you-replace-end-of-line-with-a-space-637013/" rel="nofollow" >linuxquestions 1 (explains used sed commands)</a>, <a
href="http://www.linuxquestions.org/questions/programming-9/merge-lines-in-a-file-using-sed-191121/" rel="nofollow" >linuxquestions 2</a>, <a
href="http://www.cyberciti.biz/faq/linux-unix-sed-replace-newline/" rel="nofollow" >nixcraft</a>.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2010/11/16/how-to-replace-newlines-with-commas-tabs-etc-merge-lines.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2010/11/16/how-to-replace-newlines-with-commas-tabs-etc-merge-lines.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2010%2F11%2F16%2Fhow-to-replace-newlines-with-commas-tabs-etc-merge-lines.html&amp;title=How%20to%20replace%20newlines%20with%20commas%2C%20tabs%20etc%20%28merge%20lines%29" id="wpa2a_12"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2010/11/16/how-to-replace-newlines-with-commas-tabs-etc-merge-lines.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Overlaying gene expression data onto pathways from databases</title><link>http://bogdan.org.ua/2010/11/05/overlaying-gene-expression-data-onto-pathways-from-databases.html</link> <comments>http://bogdan.org.ua/2010/11/05/overlaying-gene-expression-data-onto-pathways-from-databases.html#comments</comments> <pubDate>Fri, 05 Nov 2010 13:20:06 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[biocarta]]></category> <category><![CDATA[data]]></category> <category><![CDATA[expression]]></category> <category><![CDATA[genmapp]]></category> <category><![CDATA[KEGG]]></category> <category><![CDATA[microarray]]></category> <category><![CDATA[pathvisio]]></category> <category><![CDATA[pathway]]></category> <category><![CDATA[pathway explorer]]></category> <category><![CDATA[wikipathways]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1174</guid> <description><![CDATA[Superimposing gene expression data onto pathways from databases is a common task in the final steps of microarray data analysis - that is, biological interpretation and results discussion. I have found many tools which claim to facilitate this procedure. Some of them are reviewed below (in no specific order). Pathway Explorer by Bernhard Mlecnik was [...]]]></description> <content:encoded><![CDATA[<p>Superimposing gene expression data onto pathways from databases is a common task in the final steps of microarray data analysis - that is, biological interpretation and results discussion.</p><p>I have found many tools which claim to facilitate this procedure. Some of them are reviewed below (in no specific order).<br
/> <span
id="more-1174"></span><br
/> <a
href="https://pathwayexplorer.genome.tugraz.at/" rel="nofollow" class="broken_link">Pathway Explorer</a> by Bernhard Mlecnik was last updated in 2007, but is fully functional (I believe it is being maintained without changes to the last-updated date). Both online and downloadable Java applications are available. Note that for downloadable application you will need to obtain a license key - the procedure is well documented and was very fast for me.<br
/> <a
href="http://bogdan.org.ua/wp-content/uploads/2010/11/pe.png"><img
src="http://bogdan.org.ua/wp-content/uploads/2010/11/pe-500x371.png" alt="" title="Pathway Explorer" width="500" height="371" class="aligncenter size-medium wp-image-1176" /></a><br
/> Pathway Explorer supports import from 3 sources: KEGG xml files, biocarta URLs, and GenMAPP URLs. Import from KEGG does work as described in the short manual, and seems functional (I had some problems exporting/saving the resulting picture, but didn't investigate further). Biocarta import seems to work, but for some reason does not display expression levels of pathway components. I could not test the import of GenMAPP pathways, because they are not available online.</p><p>I found Pathway Explorer good, but then switched to PathVisio (reviewed next), because for some reason Зathway Explorer was recognising only a small fraction of genes from my expression data. It could be that identifiers mappings are outdated, but this is just a guess.</p><p><a
href="http://www.pathvisio.org/" rel="nofollow" >PathVisio</a> appears to be a spin-off of <a
href="http://www.genmapp.org/" rel="nofollow" >GenMAPP</a> and <a
href="http://www.wikipathways.org/" rel="nofollow" >WikiPathways</a>. It excells at importing/visualizing WikiPathways data, which even comes bundled with PathVisio Java application. It is easier to use than Pathway Explorer, and it seems to recognize more genes (although still not all the genes which are present in the data). There is KEGG pathways support, but it is not always usable - many edges (links between genes/proteins) are absent, so instead of a pathway you get a bunch of nodes relevant to a pathway, but cannot really see how they are connected. PathVisio supports an insanely long list of database identifiers, so it is highly unlikely that you will have to map your data to use a different identifier. This pathway mapper exports to several formats, including PNG and PDF.</p><p><a
href="http://bogdan.org.ua/wp-content/uploads/2010/11/pv.png"><img
src="http://bogdan.org.ua/wp-content/uploads/2010/11/pv-500x302.png" alt="" title="PathVisio" width="500" height="302" class="aligncenter size-medium wp-image-1181" /></a></p><p>I could not fully test <a
href="http://lbb.ut.ac.ir/projects/Affy2KEGG/WS/affy2keggWs.jsp" rel="nofollow" class="broken_link">AffyWEB</a>, because it doesn't list rat arrays we used. Trying their barley genome example did work, so the tool is probably functional. It overlays your expression data onto KEGG pathways.</p><p><a
href="http://bogdan.org.ua/wp-content/uploads/2010/11/map00030.png"><img
src="http://bogdan.org.ua/wp-content/uploads/2010/11/map00030-500x345.png" alt="" title="AffyWEB" width="500" height="345" class="aligncenter size-medium wp-image-1177" /></a></p><p><a
href="http://www.g-language.org/data/marray/software/map2swf.cgi " rel="nofollow" >G-language Microarray System</a> is a comparatively simple pathway visualizer. It accepts CSV files containing EntrezGene IDs column with a single column of expression values normalized to 1-100 range, fetches requested KEGG pathway, and generates a Flash (SWF) object depicting that pathway with coloured components. It does work with sample data. I was too lazy to normalize my expression data to [1;100] range, and SWF is not exactly a usable format, so I haven't tested this tool any further (you can right-click to zoom in the flash pathway below).<br
/> <object
classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000"
id="fm_ecj00010_1594649779"
class="flashmovie"
width="500"
height="800"><param
name="movie" value="http://bogdan.org.ua/wp-content/uploads/2010/11/ecj00010.swf" /> <!--[if !IE]>--> <object
type="application/x-shockwave-flash"
data="http://bogdan.org.ua/wp-content/uploads/2010/11/ecj00010.swf"
name="fm_ecj00010_1594649779"
width="500"
height="800"> <!--<![endif]--><p><a
href="http://adobe.com/go/getflashplayer" rel="nofollow" ><img
src="http://www.adobe.com/images/shared/download_buttons/get_flash_player.gif" alt="Get Adobe Flash player" /></a></p> <!--[if !IE]>--> </object> <!--<![endif]--> </object><p>If time permits (or work requires) this post may be extended with the reviews of <a
href="http://www.genmapp.org/" rel="nofollow" >GenMAPP</a>, <a
href="http://gepat.sourceforge.net/screenshots.htm" rel="nofollow" >GEPAT</a>, <a
href="https://stat.ethz.ch/pipermail/bioconductor/2008-June/023112.html" rel="nofollow" >KEGG2heatmap script</a>, <a
href="http://akt.ucsf.edu/EGAN/downloads.php" rel="nofollow" >EGAN</a>, <a
href="http://mapman.gabipd.org/web/guest/mapman" rel="nofollow" >MapMan</a>, <a
href="http://bioinformatics.oxfordjournals.org/content/20/13/2156.abstract" rel="nofollow" >Pathway Miner</a>, <a
href="http://nar.oxfordjournals.org/content/32/suppl_2/W460.full" rel="nofollow" >ArrayXPath</a>, <a
href="http://nar.oxfordjournals.org/content/35/suppl_2/W625.full" rel="nofollow" >VisANT</a>, <a
href="http://www.win.tue.nl/~mwestenb/spotxplore/" rel="nofollow" >SpotXplore</a>, and maybe others.</p><p>Please comment to share your experience using pathway expression overlaying tools or to suggest other tools.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2010/11/05/overlaying-gene-expression-data-onto-pathways-from-databases.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2010/11/05/overlaying-gene-expression-data-onto-pathways-from-databases.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2010%2F11%2F05%2Foverlaying-gene-expression-data-onto-pathways-from-databases.html&amp;title=Overlaying%20gene%20expression%20data%20onto%20pathways%20from%20databases" id="wpa2a_16"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2010/11/05/overlaying-gene-expression-data-onto-pathways-from-databases.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Batch-retrieve EntrezGene homologs using NCBI&#8217;s HomoloGene and R&#8217;s annotationTools</title><link>http://bogdan.org.ua/2010/10/27/batch-retrieve-entrezgene-homologs-using-ncbi-homologene-and-r.html</link> <comments>http://bogdan.org.ua/2010/10/27/batch-retrieve-entrezgene-homologs-using-ncbi-homologene-and-r.html#comments</comments> <pubDate>Wed, 27 Oct 2010 10:49:01 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[how-to]]></category> <category><![CDATA[Notepad]]></category> <category><![CDATA[annotationTools]]></category> <category><![CDATA[HomoloGene]]></category> <category><![CDATA[NCBI]]></category> <category><![CDATA[R]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1168</guid> <description><![CDATA[Install the annotationTools R package: source("http://bioconductor.org/biocLite.R") biocLite("annotationTools") Download full HomoloGene data file from ftp://ftp.ncbi.nlm.nih.gov/pub/HomoloGene/current library(annotationTools) homologene = read.delim("homologene.data", header=FALSE) mygenes = read.table("file with one entrez ID of the source organism per line.txt") getHOMOLOG(unlist(mygenes), taxonomy_ID_of_target_organism, homologene) [alternatively, wrap the call to getHOMOLOG into unlist to get a vector] It might be easier to achieve the same [...]]]></description> <content:encoded><![CDATA[<ol><li>Install the <a
href="http://bioconductor.org/packages/release/bioc/html/annotationTools.html" rel="nofollow" >annotationTools</a> R package:<br
/> source("http://bioconductor.org/biocLite.R")<br
/> biocLite("annotationTools")</li><li>Download full HomoloGene data file from <a
href="ftp://ftp.ncbi.nlm.nih.gov/pub/HomoloGene/current" rel="nofollow" >ftp://ftp.ncbi.nlm.nih.gov/pub/HomoloGene/current</a></li><li>library(annotationTools)</li><li>homologene = read.delim("homologene.data", header=FALSE)</li><li>mygenes = read.table("file with one entrez ID of the source organism per line.txt")</li><li>getHOMOLOG(unlist(mygenes), <a
href="http://www.ncbi.nlm.nih.gov/taxonomy" rel="nofollow" >taxonomy_ID_of_target_organism</a>, homologene) [alternatively, wrap the call to getHOMOLOG into unlist to get a vector]</li></ol><p>It might be easier to achieve the same results with a Perl script calling NCBI's e-utils.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2010/10/27/batch-retrieve-entrezgene-homologs-using-ncbi-homologene-and-r.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2010/10/27/batch-retrieve-entrezgene-homologs-using-ncbi-homologene-and-r.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2010%2F10%2F27%2Fbatch-retrieve-entrezgene-homologs-using-ncbi-homologene-and-r.html&amp;title=Batch-retrieve%20EntrezGene%20homologs%20using%20NCBI%E2%80%99s%20HomoloGene%20and%20R%E2%80%99s%20annotationTools" id="wpa2a_20"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2010/10/27/batch-retrieve-entrezgene-homologs-using-ncbi-homologene-and-r.html/feed</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>Tools for conversion of IDs in genomics</title><link>http://bogdan.org.ua/2010/08/10/tools-for-conversion-of-ids-in-genomics.html</link> <comments>http://bogdan.org.ua/2010/08/10/tools-for-conversion-of-ids-in-genomics.html#comments</comments> <pubDate>Tue, 10 Aug 2010 12:31:44 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1102</guid> <description><![CDATA[Tools for conversion of IDs in genomics]]></description> <content:encoded><![CDATA[<p><a
href="http://hum-molgen.org/NewsGen/08-2009/000020.html" rel="nofollow" >Tools for conversion of IDs in genomics</a></p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2010/08/10/tools-for-conversion-of-ids-in-genomics.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2010/08/10/tools-for-conversion-of-ids-in-genomics.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2010%2F08%2F10%2Ftools-for-conversion-of-ids-in-genomics.html&amp;title=Tools%20for%20conversion%20of%20IDs%20in%20genomics" id="wpa2a_24"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2010/08/10/tools-for-conversion-of-ids-in-genomics.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>R tutorial links</title><link>http://bogdan.org.ua/2010/03/29/r-tutorial-links.html</link> <comments>http://bogdan.org.ua/2010/03/29/r-tutorial-links.html#comments</comments> <pubDate>Mon, 29 Mar 2010 16:17:31 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Systems Biology]]></category> <category><![CDATA[R]]></category> <category><![CDATA[statistics]]></category> <category><![CDATA[tutorial]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=1015</guid> <description><![CDATA[R time series tutorial (2010, a website of the "Time Series Analysis and Its Applications: With R Examples" book) Statistics with R (2007) R for programmers PDF (2008, 104 pages, linked to from here) Brief R tutorial (2004) Statistical computing with R: a tutorial (2004) An introduction to R (from the official r-project website, should [...]]]></description> <content:encoded><![CDATA[<ul><li><a
href="http://www.stat.pitt.edu/stoffer/tsa2/R_time_series_quick_fix.htm" rel="nofollow" >R time series tutorial</a> (2010, a website of the "Time Series Analysis and Its Applications: With R Examples" book)</li><li><a
href="http://zoonek2.free.fr/UNIX/48_R/all.html" rel="nofollow" >Statistics with R</a> (2007)</li><li><a
href="http://heather.cs.ucdavis.edu/~matloff/R/RProg.pdf" rel="nofollow" >R for programmers</a> PDF (2008, 104 pages, linked to from <a
href="http://heather.cs.ucdavis.edu/~matloff/r.html" rel="nofollow" >here</a>)</li><li><a
href="http://mercury.bio.uaf.edu/mercury/R/R.html" rel="nofollow" >Brief R tutorial</a> (2004)</li><li><a
href="http://math.illinoisstate.edu/dhkim/rstuff/rtutor.html" rel="nofollow" >Statistical computing with R: a tutorial</a> (2004)</li><li><a
href="http://cran.r-project.org/doc/manuals/R-intro.html" rel="nofollow" >An introduction to R</a> (from the official r-project website, should be always up-to-date)</li><li><a
href="http://www.cyclismo.org/tutorial/R/" rel="nofollow" >R tutorial</a> (date unknown, definitely newer than 2005)</li></ul><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2010/03/29/r-tutorial-links.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2010/03/29/r-tutorial-links.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2010%2F03%2F29%2Fr-tutorial-links.html&amp;title=R%20tutorial%20links" id="wpa2a_28"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2010/03/29/r-tutorial-links.html/feed</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>R script to filter probesets with log-expression values below the lowest spike-in</title><link>http://bogdan.org.ua/2010/01/27/r-script-to-filter-probesets-with-log-expression-values-below-the-lowest-spike-in.html</link> <comments>http://bogdan.org.ua/2010/01/27/r-script-to-filter-probesets-with-log-expression-values-below-the-lowest-spike-in.html#comments</comments> <pubDate>Wed, 27 Jan 2010 12:44:02 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Programming]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Affymetrix]]></category> <category><![CDATA[filter]]></category> <category><![CDATA[log-expression]]></category> <category><![CDATA[microarray]]></category> <category><![CDATA[probeset]]></category> <category><![CDATA[R]]></category> <category><![CDATA[spike-in]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=985</guid> <description><![CDATA[Sometimes there is a need to remove all the probesets, which have expression values below the minimal spike-in intensity on the Affymetrix microarray. The reasoning behind this procedure is simple: minimal-expression spike-ins represent the bottom margin of microarray sensitivity, and anything below that margin cannot be reliably quantified - which also means that both fold-change [...]]]></description> <content:encoded><![CDATA[<p>Sometimes there is a need to remove all the probesets, which have expression values below the minimal spike-in intensity on the <a
href="http://www.affymetrix.com/" rel="nofollow" >Affymetrix</a> microarray. The reasoning behind this procedure is simple: minimal-expression spike-ins represent the bottom margin of microarray sensitivity, and anything below that margin cannot be reliably quantified - which also means that both fold-change and p-value of expression variance will be unreliable for these probesets.</p><p>Here's a simple <a
href="http://www.r-project.org/" rel="nofollow" >R</a> script to do just that. It is abundantly commented, and also contains an optional (commented out) fragment which allows the removal of more low-variance, low-intensity probesets.</p><p><span
id="more-985"></span><br
/> <em>Hint: click the "plain text" box header to be able to right-click the code, "Select All", and then "Copy".</em></p><div
class="igBar"><span
id="lcode-14"><a
href="#" rel="nofollow"  onclick="javascript:showPlainTxt('code-14'); return false;">PLAIN TEXT</a></span></div><div
class="syntax_hilite"><span
class="langName">CODE:</span><div
id="code-14"><div
class="code"><ol><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">filter_below_spikes = function<span
style="color:#006600; font-weight:bold;">&#40;</span>eset<span
style="color:#006600; font-weight:bold;">&#41;</span> <span
style="color:#006600; font-weight:bold;">&#123;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Finds max<span
style="color:#006600; font-weight:bold;">&#40;</span>lowest<span
style="color:#006600; font-weight:bold;">&#41;</span> AFFX/spike-in intensity, and removes rows consisting entirely of values below max<span
style="color:#006600; font-weight:bold;">&#40;</span>lowest<span
style="color:#006600; font-weight:bold;">&#41;</span>.</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # @param eset</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # ExpressionSet</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # @returns</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # exprs<span
style="color:#006600; font-weight:bold;">&#40;</span>ExpressionSet<span
style="color:#006600; font-weight:bold;">&#41;</span>, filtered</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Without Biobase exprs<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#006600; font-weight:bold;">&#41;</span> will not work.</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; <span
style="">require</span><span
style="color:#006600; font-weight:bold;">&#40;</span>Biobase<span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; expr = exprs<span
style="color:#006600; font-weight:bold;">&#40;</span>eset<span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # <span
style="color:#CC0000;">'expr'</span> sample:</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span
style="color:#800000;color:#800000;">1</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # <span
style="color:#800000;color:#800000;">1367452</span>_at <span
style="color:#800000;color:#800000;">10</span>.<span
style="color:#800000;color:#800000;">880208</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # <span
style="color:#800000;color:#800000;">1367453</span>_at <span
style="color:#800000;color:#800000;">10</span>.<span
style="color:#800000;color:#800000;">554647</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; cat<span
style="color:#006600; font-weight:bold;">&#40;</span>nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"rows before filtering.<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Make a vector of spike row names.</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; <span
style="">spikes</span> = grep<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#CC0000;">"AFFX"</span>, rownames<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, value = TRUE<span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; cat<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#CC0000;">"Expression matrix has"</span>, length<span
style="color:#006600; font-weight:bold;">&#40;</span>spikes<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"spike-in rows.<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; cat<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#CC0000;">"Summary of spike-in values distribution follows:<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; print<span
style="color:#006600; font-weight:bold;">&#40;</span>summary<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#91;</span>spikes, <span
style="color:#006600; font-weight:bold;">&#93;</span><span
style="color:#006600; font-weight:bold;">&#41;</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Find max<span
style="color:#006600; font-weight:bold;">&#40;</span>lowest<span
style="color:#006600; font-weight:bold;">&#41;</span> spike-in values.</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; <span
style="">minval_max</span> = max<span
style="color:#006600; font-weight:bold;">&#40;</span>as.<span
style="">double</span><span
style="color:#006600; font-weight:bold;">&#40;</span>substr<span
style="color:#006600; font-weight:bold;">&#40;</span>grep<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#CC0000;">"Min"</span>, summary<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#91;</span>spikes, <span
style="color:#006600; font-weight:bold;">&#93;</span><span
style="color:#006600; font-weight:bold;">&#41;</span>, value = TRUE<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#800000;color:#800000;">10</span>, <span
style="color:#800000;color:#800000;">14</span><span
style="color:#006600; font-weight:bold;">&#41;</span><span
style="color:#006600; font-weight:bold;">&#41;</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; cat<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#CC0000;">"max(minimal spike-in log-intensity values) ="</span>, minval_max, <span
style="color:#CC0000;">"<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Remove spike-ins from expr.</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; <span
style="">expr</span> = expr<span
style="color:#006600; font-weight:bold;">&#91;</span>grep<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#CC0000;">"AFFX"</span>, rownames<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, value = TRUE, invert = TRUE<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#006600; font-weight:bold;">&#93;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; nospike_rows = nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; cat<span
style="color:#006600; font-weight:bold;">&#40;</span>nospike_rows, <span
style="color:#CC0000;">"rows remaining after the removal of"</span>, length<span
style="color:#006600; font-weight:bold;">&#40;</span>spikes<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"spike-in probesets.<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Optional: calculate max<span
style="color:#006600; font-weight:bold;">&#40;</span>SD<span
style="color:#006600; font-weight:bold;">&#41;</span> of all removed rows.</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #bad_sds_max = max<span
style="color:#006600; font-weight:bold;">&#40;</span>apply<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#91;</span>!apply<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#006600; font-weight:bold;">&#40;</span>expr&gt; minval_max<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#800000;color:#800000;">1</span>, any<span
style="color:#006600; font-weight:bold;">&#41;</span>,<span
style="color:#006600; font-weight:bold;">&#93;</span>, <span
style="color:#800000;color:#800000;">1</span>, sd<span
style="color:#006600; font-weight:bold;">&#41;</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Now remove all rows, where each value is &lt;= minval_max.</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; <span
style="">expr</span> = expr<span
style="color:#006600; font-weight:bold;">&#91;</span>!apply<span
style="color:#006600; font-weight:bold;">&#40;</span><span
style="color:#006600; font-weight:bold;">&#40;</span>expr &lt;= minval_max<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#800000;color:#800000;">1</span>, all<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#006600; font-weight:bold;">&#93;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; cat<span
style="color:#006600; font-weight:bold;">&#40;</span>nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"rows remaining after filtering out"</span>, nospike_rows - nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"probesets with all values below"</span>, minval_max, <span
style="color:#CC0000;">"<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #cat<span
style="color:#006600; font-weight:bold;">&#40;</span>bad_sds_max, <span
style="color:#CC0000;">"is max(SD) of all"</span>, nospike_rows - nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"filtered probesets with all values below"</span>, minval_max, <span
style="color:#CC0000;">"<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; # Optional: Remove *some* of the rows, which have at least one value below minval_max, and row_SD &lt;= bad_sds_max.</div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #pre_final_rows = nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #expr = expr<span
style="color:#006600; font-weight:bold;">&#91;</span><span
style="color:#006600; font-weight:bold;">&#40;</span>apply<span
style="color:#006600; font-weight:bold;">&#40;</span>expr, <span
style="color:#800000;color:#800000;">1</span>, sd<span
style="color:#006600; font-weight:bold;">&#41;</span>&gt; bad_sds_max<span
style="color:#006600; font-weight:bold;">&#41;</span> | <span
style="color:#006600; font-weight:bold;">&#40;</span>apply<span
style="color:#006600; font-weight:bold;">&#40;</span>expr, <span
style="color:#800000;color:#800000;">1</span>, min<span
style="color:#006600; font-weight:bold;">&#41;</span>&gt; minval_max<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#006600; font-weight:bold;">&#93;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #cat<span
style="color:#006600; font-weight:bold;">&#40;</span>pre_final_rows-nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"rows with SD &lt;="</span>, bad_sds_max, <span
style="color:#CC0000;">"and min(row) &lt;="</span>, minval_max, <span
style="color:#CC0000;">"were removed.<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; #cat<span
style="color:#006600; font-weight:bold;">&#40;</span>nrow<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span>, <span
style="color:#CC0000;">"final rows returned.<span
style="color:#000099; font-weight:bold;">\n</span>"</span><span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp;</div></li><li
style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp; return<span
style="color:#006600; font-weight:bold;">&#40;</span>expr<span
style="color:#006600; font-weight:bold;">&#41;</span></div></li><li
style="font-weight: bold;color:#26536A;"><div
style="font-family: 'Courier New', Courier, monospace; font-weight: normal;"><span
style="color:#006600; font-weight:bold;">&#125;</span></div></li></ol></div></div></div><p></p><p>Sample use:</p><blockquote><p> > source("script.R")<br
/> > expr.filtered = filter_below_spikes(eset)</p></blockquote><p>Comments and suggestions are welcome.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2010/01/27/r-script-to-filter-probesets-with-log-expression-values-below-the-lowest-spike-in.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2010/01/27/r-script-to-filter-probesets-with-log-expression-values-below-the-lowest-spike-in.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2010%2F01%2F27%2Fr-script-to-filter-probesets-with-log-expression-values-below-the-lowest-spike-in.html&amp;title=R%20script%20to%20filter%20probesets%20with%20log-expression%20values%20below%20the%20lowest%20spike-in" id="wpa2a_32"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2010/01/27/r-script-to-filter-probesets-with-log-expression-values-below-the-lowest-spike-in.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>GNU make for bioinformatics presentation</title><link>http://bogdan.org.ua/2009/06/05/gnu-make-for-bioinformatics-presentation.html</link> <comments>http://bogdan.org.ua/2009/06/05/gnu-make-for-bioinformatics-presentation.html#comments</comments> <pubDate>Fri, 05 Jun 2009 09:37:21 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Programming]]></category> <category><![CDATA[make]]></category> <category><![CDATA[presentation]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=725</guid> <description><![CDATA[Giovanni Dall’olio has recently posted a presentation on using make. Although it has "bioinformatics" on the title page, this is a good and very easy to understand make intro. Original post is here.]]></description> <content:encoded><![CDATA[<p>Giovanni Dall’olio has recently posted a presentation on using <strong>make</strong>.</p><p>Although it has "bioinformatics" on the title page, this is a good and very easy to understand <strong>make</strong> intro.<br
/> <img
style="visibility:hidden;width:0px;height:0px;" border=0 width=0 height=0 src="http://counters.gigya.com/wildfire/IMP/CXNID=2000002.0NXC/bT*xJmx*PTEyNDQxOTQxMzg2NzImcHQ9MTI*NDE5NDE5ODg4NCZwPTEwMTkxJmQ9c3NfZW1iZWQmZz*yJnQ9Jm89ZTA5MGYzZTEyYzUzNDRiZmEyZTJjNWJhMmE2MTE*NGMmb2Y9MA==.gif" /><div
style="width:425px;text-align:left" id="__ss_1111258"><object
style="margin:0px" width="425" height="355"><param
name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=makefilesbioinfo-090306113207-phpapp02&#038;stripped_title=makefiles-bioinfo" /><param
name="allowFullScreen" value="true"/><param
name="allowScriptAccess" value="always"/><embed
src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=makefilesbioinfo-090306113207-phpapp02&#038;stripped_title=makefiles-bioinfo" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div><p>Original post is <a
href="http://bioinfoblog.it/2009/10/a-seminar-on-makefile-and-pipelines-of-shell-scripts/" rel="nofollow" >here</a>.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2009/06/05/gnu-make-for-bioinformatics-presentation.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2009/06/05/gnu-make-for-bioinformatics-presentation.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2009%2F06%2F05%2Fgnu-make-for-bioinformatics-presentation.html&amp;title=GNU%20make%20for%20bioinformatics%20presentation" id="wpa2a_36"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2009/06/05/gnu-make-for-bioinformatics-presentation.html/feed</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>Ensembl Genomes launches Protists, Bacteria and Metazoa</title><link>http://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html</link> <comments>http://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html#comments</comments> <pubDate>Tue, 21 Apr 2009 13:27:40 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[Web]]></category> <category><![CDATA[bacteria]]></category> <category><![CDATA[Ensembl]]></category> <category><![CDATA[fungi]]></category> <category><![CDATA[genome]]></category> <category><![CDATA[metazoa]]></category> <category><![CDATA[plants]]></category> <category><![CDATA[protists]]></category> <category><![CDATA[sequence]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=670</guid> <description><![CDATA[The following sites are available: http://bacteria.ensembl.org http://protists.ensembl.org http://metazoa.ensembl.org During summer, two more sites - for Fungi and Plants - should be made available. Learn more about Ensembl Genomes project.]]></description> <content:encoded><![CDATA[<p>The following sites are available:</p><p><a
href="http://bacteria.ensembl.org/index.html" rel="nofollow" >http://bacteria.ensembl.org</a><br
/> <a
href="http://protists.ensembl.org/index.html" rel="nofollow" >http://protists.ensembl.org</a><br
/> <a
href="http://metazoa.ensembl.org/index.html" rel="nofollow" >http://metazoa.ensembl.org</a></p><p>During summer, two more sites - for Fungi and Plants - should be made available.</p><p>Learn more about <a
href="http://www.ensemblgenomes.org/" rel="nofollow" >Ensembl Genomes</a> project.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2009%2F04%2F21%2Fensembl-genomes-launches-protists-bacteria-and-metazoa.html&amp;title=Ensembl%20Genomes%20launches%20Protists%2C%20Bacteria%20and%20Metazoa" id="wpa2a_40"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2009/04/21/ensembl-genomes-launches-protists-bacteria-and-metazoa.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>BioGPS: by the makers of Gene Expression Atlas</title><link>http://bogdan.org.ua/2008/11/22/biogps-makers-gene-expression-atlas.html</link> <comments>http://bogdan.org.ua/2008/11/22/biogps-makers-gene-expression-atlas.html#comments</comments> <pubDate>Sat, 22 Nov 2008 10:48:46 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[BioGPS]]></category> <category><![CDATA[gene expression atlas]]></category> <category><![CDATA[GNF]]></category> <category><![CDATA[SymAtlas]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=457</guid> <description><![CDATA[If you were a frequent user of GNF SymAtlas, then you'd better bookmark BioGPS - Your Gene Portal System. BioGPS is basically the same gene expression atlas, but with a completely different interface, and more flexible ideology (e.g. expression atlas is now just a "plugin", and more of those can be plugged in). There are [...]]]></description> <content:encoded><![CDATA[<p><img
src="http://bogdan.org.ua/wp-content/uploads/2008/11/biogps-logo4.gif" alt="BioGPS logo" width="83" height="24" class="alignleft size-full wp-image-458" /> If you were a frequent user of <a
href="http://biogps.gnf.org/" rel="nofollow" >GNF SymAtlas</a>, then you'd better bookmark <a
href="http://biogps.gnf.org/" rel="nofollow" >BioGPS - Your Gene Portal System</a>. BioGPS is basically the same gene expression atlas, but with a completely different interface, and more flexible ideology (e.g. expression atlas is now just a "plugin", and more of those can be plugged in).</p><p>There are also some easter-egg-like features: try hovering the BioGPS logo in the top left corner several times...</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/11/22/biogps-makers-gene-expression-atlas.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/11/22/biogps-makers-gene-expression-atlas.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F11%2F22%2Fbiogps-makers-gene-expression-atlas.html&amp;title=BioGPS%3A%20by%20the%20makers%20of%20Gene%20Expression%20Atlas" id="wpa2a_44"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/11/22/biogps-makers-gene-expression-atlas.html/feed</wfw:commentRss> <slash:comments>3</slash:comments> </item> <item><title>Ensembl release 51 is out!</title><link>http://bogdan.org.ua/2008/11/20/ensembl-release-51-is-out.html</link> <comments>http://bogdan.org.ua/2008/11/20/ensembl-release-51-is-out.html#comments</comments> <pubDate>Thu, 20 Nov 2008 13:51:34 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[51]]></category> <category><![CDATA[binding site]]></category> <category><![CDATA[COTRASIF]]></category> <category><![CDATA[custom track]]></category> <category><![CDATA[Ensembl]]></category> <category><![CDATA[finder]]></category> <category><![CDATA[genome browser]]></category> <category><![CDATA[release]]></category> <category><![CDATA[search tool]]></category> <category><![CDATA[taxonomy]]></category> <category><![CDATA[TFBS]]></category> <category><![CDATA[webcode]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=446</guid> <description><![CDATA[This in itself wouldn't be so exciting, if it were not for the new webcode! If you do visit Ensembl now, you will be definitely surprised with the page loading speed - it is amazingly faster than it used to be! Also, as you dig deeper and deeper, you'll see that there are a lot [...]]]></description> <content:encoded><![CDATA[<p><img
src="http://bogdan.org.ua/wp-content/uploads/2008/11/e-ensembl.gif" alt="ensembl logo" title="Ensembl" width="150" height="40" class="alignleft size-medium wp-image-448" />This in itself wouldn't be so exciting, if it were not for the new webcode! If you do visit <a
href="http://www.ensembl.org/index.html" rel="nofollow" >Ensembl</a> now, you will be definitely surprised with the page loading speed - it is amazingly faster than it used to be! Also, as you dig deeper and deeper, you'll see that there are a lot more other differences - starting with the new design, and not forgetting the changed page organization logic.</p><p>To cut the long story short, here's the <a
href="http://www.ensembl.org/info/docs/webcode/version_4.html" rel="nofollow" >list of changes in the new Ensembl 51 webcode release</a>. Other changes to Ensembl in release 51 are also <a
href="http://www.ensembl.org/info/website/news/index.html" rel="nofollow" class="broken_link">available</a>.</p><p>One of the new features which caught my attention is the ability to add custom tracks in Ensembl (which is a long-available feature in UCSC Genome Browser). Interestingly, you do not even have to be logged in to use this feature. We shall be considering providing the TFBS custom track for several species, as predicted de-novo by our <a
href="http://biomed.org.ua/COTRASIF/" rel="nofollow" >evolutionary conserved tfbs search tool</a> (<a
href="http://biomed.org.ua/COTRASIF/" rel="nofollow" >binding site finder</a>), but this is a long shot, given the already published <a
href="http://biomed.org.ua/COTRASIF/" rel="nofollow" >COTRASIF</a> <a
href="http://groups.google.com/group/cotrasif/msg/2b68dfcefab1669e" rel="nofollow" >development roadmap</a>.</p><p>There is one more <strong>great news</strong> which is kinda insufficiently highlighted: the brand-new <a
href="http://www.ensemblgenomes.org/" rel="nofollow" >Ensembl Genome Browser</a> website, which (as planned, it hasn't yet started operations) will provide access not only to vertebrates, but also to other taxonomy groups. The full list is:</p><ul><li>Metazoa</li><li>Protists</li><li>Bacteria</li><li>Plants</li><li>Fungi</li></ul><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/11/20/ensembl-release-51-is-out.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/11/20/ensembl-release-51-is-out.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F11%2F20%2Fensembl-release-51-is-out.html&amp;title=Ensembl%20release%2051%20is%20out%21" id="wpa2a_48"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/11/20/ensembl-release-51-is-out.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>CLANS: CLuster ANalysis of Sequences</title><link>http://bogdan.org.ua/2008/10/25/clans-cluster-analysis-of-sequences.html</link> <comments>http://bogdan.org.ua/2008/10/25/clans-cluster-analysis-of-sequences.html#comments</comments> <pubDate>Fri, 24 Oct 2008 23:20:39 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[analysis]]></category> <category><![CDATA[clustering]]></category> <category><![CDATA[java]]></category> <category><![CDATA[sequences]]></category> <category><![CDATA[tool]]></category> <category><![CDATA[visualization]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=423</guid> <description><![CDATA[CLANS (CLuster ANalysis of Sequences). Found via Freelancing science: CLANS runs (PSI)BLAST on your sequences, all vs all, and clusters them in 2D or 3D according to their similarity. This method allows for rapid classification of huge datasets and has the advantage over, lets say, phylogenetic tree, that one can quickly assess results of the [...]]]></description> <content:encoded><![CDATA[<p><a
href="http://bioinfoserver.rsbs.anu.edu.au/programs/clans/" rel="nofollow" >CLANS</a> (CLuster ANalysis of Sequences).<br
/> <span
id="more-423"></span><br
/> Found via <a
href="http://freelancingscience.com/2008/01/22/clans-java-tool-for-cluster-analysis-of-sequences/" rel="nofollow" >Freelancing science</a>:</p><blockquote><p>CLANS runs (PSI)BLAST on your sequences, all vs all, and clusters them in 2D or 3D according to their similarity. This method allows for rapid classification of huge datasets and has the advantage over, lets say, phylogenetic tree, that one can quickly assess results of the clustering in a visual way (I cannot imagine making any sense of looking at phylogenetic tree with 1500 branches, while the graphical output, as on the animation below, is pretty easy to read).</p></blockquote><div
align="center"><img
src="http://bogdan.org.ua/wp-content/uploads/2008/10/clansmicroanim.gif" alt="" title="animated 3D cluster" width="500" height="434" class="aligncenter size-full wp-image-424" /></div><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/10/25/clans-cluster-analysis-of-sequences.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/10/25/clans-cluster-analysis-of-sequences.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F10%2F25%2Fclans-cluster-analysis-of-sequences.html&amp;title=CLANS%3A%20CLuster%20ANalysis%20of%20Sequences" id="wpa2a_52"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/10/25/clans-cluster-analysis-of-sequences.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Weird orthology species names in Ensembl</title><link>http://bogdan.org.ua/2008/09/30/weird-orthology-species-names-in-ensembl.html</link> <comments>http://bogdan.org.ua/2008/09/30/weird-orthology-species-names-in-ensembl.html#comments</comments> <pubDate>Tue, 30 Sep 2008 11:01:59 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[compara]]></category> <category><![CDATA[convention]]></category> <category><![CDATA[cow]]></category> <category><![CDATA[database]]></category> <category><![CDATA[Ensembl]]></category> <category><![CDATA[non-unique]]></category> <category><![CDATA[orthology]]></category> <category><![CDATA[release]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=398</guid> <description><![CDATA[For the COTRASIF tool, I've been using the Ensembl Compara database (since release 47) to automatically import into COTRASIF gene orthology mappings. However, with the E!50 release, the Compara database was dropped. Looking for another option to get orthologs from Ensembl (using martservice, via biomart.org), I tried using the standard query - selecting "Homologs" group [...]]]></description> <content:encoded><![CDATA[<p>For the <a
href="http://biomed.org.ua/COTRASIF/" rel="nofollow" >COTRASIF tool</a>, I've been using the Ensembl Compara database (since release 47) to automatically import into COTRASIF gene orthology mappings.</p><p>However, with the E!50 release, the Compara database was dropped.</p><p>Looking for another option to get orthologs from Ensembl (using martservice, via <a
href="http://www.biomart.org/" rel="nofollow" >biomart.org</a>), I tried using the standard query - selecting "Homologs" group on the "Attributes" page for a single species database, and then selecting appropriate second species to get orthology mappings.</p><p>Imagine my surprise, when not only in the interface, but also in the generated XML file I found attribute names like "cow_ensembl_gene" :-O</p><p>I only need 11 species at the moment, and excluding the sufficiently unique name mappings like "zebrafish - danio rerio", there is a number of questionable mappings: "yeast" for <em>S. cerevisiae</em> (could be <em>S.pombe</em>), "rat" for <em>R. norvegicus</em> (could be <em>R.rattus</em>), "anopheles" for <em>A.gambiae</em> (could be some other <em>Anopheles</em>). Other mappings might be also non-unique, especially for people working with different species of the same genus.</p><p>Am I missing some system in this naming "convention", or am I the only one who finds it strange?</p><p>Is there a way <em>not</em> to use "common species names" when importing orthology data from Ensembl with the help of martservice?</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/09/30/weird-orthology-species-names-in-ensembl.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/09/30/weird-orthology-species-names-in-ensembl.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F09%2F30%2Fweird-orthology-species-names-in-ensembl.html&amp;title=Weird%20orthology%20species%20names%20in%20Ensembl" id="wpa2a_56"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/09/30/weird-orthology-species-names-in-ensembl.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>The 9th ICSB-2008 in Gothenburg, Sweden</title><link>http://bogdan.org.ua/2008/09/12/the-9th-icsb-2008-in-gothenburg-sweden.html</link> <comments>http://bogdan.org.ua/2008/09/12/the-9th-icsb-2008-in-gothenburg-sweden.html#comments</comments> <pubDate>Fri, 12 Sep 2008 09:52:46 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Systems Biology]]></category> <category><![CDATA[CellDesigner]]></category> <category><![CDATA[conference]]></category> <category><![CDATA[Gothenburg]]></category> <category><![CDATA[ICSB]]></category> <category><![CDATA[ICSB 2008]]></category> <category><![CDATA[SBW]]></category> <category><![CDATA[Sweden]]></category> <category><![CDATA[tutorial]]></category> <category><![CDATA[workshop]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=325</guid> <description><![CDATA[It has been quite a time since my last serious and long post. On the one hand, summer is vacations time - so I've been to one in Turkey; on the other hand - the long-awaited ICSB-2008 conference finally took place in Gothenburg, Sweden, on August 23-27 (or 22-28, counting in tutorials and workshops). Update: [...]]]></description> <content:encoded><![CDATA[<p><a
href="http://bogdan.org.ua/wp-content/uploads/2008/08/newbannericsb2008.png"><img
src="http://bogdan.org.ua/wp-content/uploads/2008/08/newbannericsb2008-300x19.png" alt="" title="ICSB banner" width="300" height="19" class="alignleft size-medium wp-image-333" /></a> It has been quite a time since my last serious and long post. On the one hand, summer is vacations time -  so I've been to one in Turkey; on the other hand - the long-awaited ICSB-2008 conference finally took place in Gothenburg, Sweden, on August 23-27 (or 22-28, counting in tutorials and workshops).</p><p><ins
datetime="2010-06-08T04:59:28+00:00">Update:</ins> it appears that conference domain name http://www.icsb-2008.org/ was not maintained after the conference, so I had to remove all the links to that website (which now appears to be owned by someone not related to ICSB). However, there seems to be a mirror of ICSB-2008 at <a
href="http://www.gmm.gu.se/groups/icsb08test/" rel="nofollow" >http://www.gmm.gu.se/groups/icsb08test/</a>.</p><p>Synopsis: in this post I present a personal-perspective report on the 9th ICSB (with a condensed <a
href="http://gallery.bogdan.org.ua/v/ICSB-2008/" rel="nofollow" >ICSB-2008 photo-report</a> in my <a
href="http://gallery.bogdan.org.ua/main.php" rel="nofollow" >gallery</a>).</p><p><span
id="more-325"></span></p><p><img
src="http://bogdan.org.ua/wp-content/uploads/2008/08/sb_main_smaller.png" alt="ICSB-2008" title="ICSB-2008 dates and place" width="300" height="257" class="wp-image-329" align="left" /></p><p>I took part in almost all the conference events, starting with pre-conference tutorials and finishing with post-conference workshops, and including the Viking Dinner.</p><p>On August 22 (the tutorials day), I attended the tutorials by <a
href="http://www.sys-bio.org/sbwWiki/tutorials/icsb2008" rel="nofollow" >SBW</a> team, <a
href="http://www.async.ece.utah.edu/" rel="nofollow" >Chris Myers</a> (iBioSim), and <a
href="http://www.celldesigner.org/tutorial/CellDesignerTutorialICSB2008.pdf" rel="nofollow" >CellDesigner 4</a>. Out of these, I liked iBioSim tutorial the most: it had a balanced combination of presented theory and hands-on experience using the program. iBioSim is positioned as (and actually appears to be) a gene-network computer-aided construction software (or, gene regulatory network CAD - GRN CAD). I recommend reading through the <a
href="http://www.async.ece.utah.edu/" rel="nofollow" >Myers group homepage</a>, installing the iBioSim tool (I had problems running it on Linux, but on Windows and Mac it works fine - as long as your saved project's path has no non-latin symbols and spaces), and reading through the tutorial which is included in the iBioSim tool (in the help menu, choose Help, then scroll down the opened page, and click Tutorial).</p><p>CellDesigner's tutorial was also good, and referring to my older post on <a
href="http://bogdan.org.ua/2007/05/10/choosing-cell-modelling-software-virtual-cell-cytoscape-celldesigner-e-cell.html">choosing cell modelling software</a>, it is clear that CellDesigner is getting better with the new versions; it does include SBW inter-connectivity feature, and also can use COPASI for modelling. The SBW tutorial was probably somewhat outside the field of my experience, though I did gain some tool use experience and did learn the purposes for which the SBW package can be used. It should be noted that SBW isn't only a "workbench" now, as it includes reaction modelling and simulation tools itself, so it can be used without the conjunction with external tools (as it was initially intended).</p><p>On the 23rd of August, there were two more tutorials - <a
href="http://www.gmm.gu.se/groups/icsb08test/downloads/tutorials/MendesICSB2008.pdf" rel="nofollow" >Introduction into parameter estimation with COPASI</a> (see also <a
href="http://www.copasi.org/tiki-integrator.php?repID=9&#038;file=ch04s03.html" rel="nofollow" >here</a>), and <a
href="http://www.gmm.gu.se/groups/icsb08test/downloads/tutorials/StellingICSB2008.pdf" rel="nofollow" >Reaction Network theory for systems biology</a>. To me personally, these were not as exciting as iBioSim <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> , but nevertheless useful.</p><p>Here's <a
href='http://bogdan.org.ua/wp-content/uploads/2008/09/tutorial-schedule-icsb-2008.pdf'>ICSB tutorials program</a>.</p><p>During the main conference event (on August 23-27), we had a number of parallel sessions on various SB-related topics. As from my point of view, there were two strongly exposed main topics: quantitative systems biology (actually, I was really impressed by the number of high-quality <abbr
title="quantitative systems biology">QSB</abbr> studies presented), and... funding! The FUNDING topic took about 25-30% of the time of discussions; it was also tightly intertwined with the needs of the pharmaceutical industry (there was a dedicated session on this, too, and it was also somewhat about funding). It should be noted, that AstraZeneca and Novo Nordisk were the Platinum sponsors of the ICSB conference. Also, funding opportunities seem to be relatively good in Europe, especially for the integrative German projects - such as a probably well-known HepatoSys project (the results of which, unfortunately, are not publicly available - at least yet). However, in addition to the already known HepatoSys, I learned about a number of other large Systems Biology projects being carried out - such as ENFIN, APO-SYS, SYBILLA, UNICELLSYS and many others (see the <a
href="http://bogdan.org.ua/wp-content/uploads/2008/09/icsb-detailed-schedule.pdf">full ICSB conference program</a> for details).</p><p>For me personally, the main (and the most easy to comprehend) bits of information were obtained from the posters devoted to the reconstruction of <abbr
title="Gene Regulatory Network">GRNs</abbr>. I've found around 15 [potentially] interesting posters - all of which were scrupulously photocopied for further referencing (our systems biology group has just started working on the GRN reconstruction problem).</p><p>Also, there was some interest in the tool I presented - the <a
href="http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html">COTRASIF</a>. Although it is not strictly systems-biology (actually, it is pure genomics tool), there were several people interested, and also providing valuable suggestions and feedback. To summarize:</p><ol><li>we'd better put in TFBS enrichment statistics - showing how strongly is each promoter "enriched" with the sought TFBS (the higher the enrichment - the higher chances that that gene does respond to the TF; this parameter is a higher-level statistic than a plain old score)</li><li>we need to release the standalone (non-web) binary of the HMM finder tool. This was suggested by the ChIP-seq practitioner - such a tool would allow to better define the boundaries of the experimentally identified binding sites. Clearly, if it were to be used as ChIP-seq-tool, it would better have some good I<sub>c</sub> filter to cut off non-informative sequence flanks, and also an internal filter of the resulting Markov matrix I<sub>c</sub></li><li>we do need to increase the pre-defined promoter size. Actually, we'll increase the upstream-from-TSS part to 2kb (from the current 800b). And closer to the end of October we'll introduce the whole-genome search option</li><li>we need to publish an article comparing PWM method to HMM method, and providing the FP/FN statistics for both methods as implemented by our tool, plus the FP/FN statistics after applying orthology filtering. Actually, we are working on that, and an article should be re-submitted at the beginning of October</li></ol><p>We also had social program - an excursion through the Archipelago to the Island of Marstrand, followed by the Viking dinner at the Carlsten Fortress - one of the best-preserved stone fortresses of the Northern Europe. The 2.5h boat travel was pleasant - especially thanks to the unusually warm and sunny weather, which lasted for the whole conference (it only got somewhat colder on the final workshops day). I liked the dinner, although 3 meat dishes is too much even for such a seasoned meat-eater as I am <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> You can see some <a
href="http://gallery.bogdan.org.ua/v/ICSB-2008/Carlsten-Fortress-on-Marstrand-Island-Viking-Dinner/" rel="nofollow" >excursion photos</a> in my <a
href="http://gallery.bogdan.org.ua/main.php" rel="nofollow" >gallery</a>.</p><p>The next ICSB-2009 will be held at Stanford in California, USA; ICSB-2010 is scheduled to be held in Edinburgh, Scotland, UK. The place for ICSB-2011 is already being negotiated <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> (it's getting similar to the Olympic Games country scheduling procedures).</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/09/12/the-9th-icsb-2008-in-gothenburg-sweden.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/09/12/the-9th-icsb-2008-in-gothenburg-sweden.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F09%2F12%2Fthe-9th-icsb-2008-in-gothenburg-sweden.html&amp;title=The%209th%20ICSB-2008%20in%20Gothenburg%2C%20Sweden" id="wpa2a_60"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/09/12/the-9th-icsb-2008-in-gothenburg-sweden.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>New bioinformatics term: (high-) throughputomics</title><link>http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html</link> <comments>http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html#comments</comments> <pubDate>Wed, 16 Jul 2008 13:29:50 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[conference]]></category> <category><![CDATA[high-throughputomics]]></category> <category><![CDATA[ICSB 2008]]></category> <category><![CDATA[made-up]]></category> <category><![CDATA[term]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=314</guid> <description><![CDATA[Just made it up for convenience, stimulated by reading workshop descriptions for the upcoming ICSB 2008 in Gothenburg, Sweden. Here is formal definition: high-throughputomics the term is used to denote/mention any or all of the modern high-throughput techniques (in all of, but not limited to: genomics, transcriptomics, proteomics, ...), together with derived/applicable data-processing approaches. All [...]]]></description> <content:encoded><![CDATA[<p>Just made it up for convenience, stimulated by reading workshop descriptions for the upcoming <a
href="http://www.icsb-2008.org/" rel="nofollow" >ICSB 2008</a> in Gothenburg, Sweden.</p><p>Here is formal definition:</p><dl><dt><strong>high-throughputomics</strong></dt><dd>the term is used to denote/mention any or all of the modern high-throughput techniques (in all of, but not limited to: genomics, transcriptomics, proteomics, ...), together with derived/applicable data-processing approaches. All the "networks" things also conveniently fall into the high-throughputomics definition</dd></dl><p>As this is a general term, it might be even suitable as a conference title (but NOT for <a
href="http://www.icsb-2008.org/" rel="nofollow" >ICSB</a>, which I'm waiting for eagerly).</p><p>For a shorter and informal (spoken-only) term, <strong>putomics</strong> can be used:</p><dl><dt><strong>putomics</strong></dt><dd>spoken-only, informal short form of <strong>high-throughputomics</strong></dd></dl><p><strong>Putomics</strong> is also conveniently similar to "computation" (<strong>computomics</strong>):</p><dl><dt><strong>computomics</strong></dt><dd>application of computer hardware and software for the analysis of massive amounts of data, obtained using high-throughput methods; this is a research sub-field of <strong>high-throughputomics</strong></dd></dl><p>P.S. <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /><br
/> For easier citing:</p><blockquote><p>Tokovenko, Bogdan. New bioinformatics term: high-throughputomics. 2008-07-16. URL:http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html. Accessed: 2008-07-16. (Archived by WebCite® at <a
href="http://www.webcitation.org/5ZMD9DITU" rel="nofollow" >http://www.webcitation.org/5ZMD9DITU</a>)</p></blockquote><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F07%2F16%2Fnew-bioinformatics-term-high-throughputomics.html&amp;title=New%20bioinformatics%20term%3A%20%28high-%29%20throughputomics" id="wpa2a_64"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html/feed</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>BGRS-2008 conference in Novosibirsk, Russia</title><link>http://bogdan.org.ua/2008/07/10/bgrs-2008-conference-in-novosibirsk-russia.html</link> <comments>http://bogdan.org.ua/2008/07/10/bgrs-2008-conference-in-novosibirsk-russia.html#comments</comments> <pubDate>Wed, 09 Jul 2008 23:57:33 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Misc]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[2008]]></category> <category><![CDATA[6th]]></category> <category><![CDATA[Akademgorodok]]></category> <category><![CDATA[BGRS]]></category> <category><![CDATA[conference]]></category> <category><![CDATA[international]]></category> <category><![CDATA[Novosibirsk]]></category> <category><![CDATA[Russia]]></category> <category><![CDATA[sixth]]></category> <category><![CDATA[summer school]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=307</guid> <description><![CDATA[... was held on June 22-28, 2008, in Akademgorodok (Novosibirsk), Russia. It was the sixth conference held. The International Conference on Bioinformatics of Genome Regulation and Structure is the bi-annual event. It features several bioinformatics sections, which IMO cover most of bioinformatics sub-fields. The Sixth conference, BGRS-2008, was well-organized and had something to offer to [...]]]></description> <content:encoded><![CDATA[<p><img
src="http://bogdan.org.ua/wp-content/uploads/2008/07/ymbly2.gif" alt="International Conference on Bioinformatics of Genome Regulation and Structure logo" title="BGRS" width="198" height="198" class="alignleft size-full wp-image-308" />... was held on June 22-28, 2008, in Akademgorodok (Novosibirsk), Russia. It was the sixth conference held.</p><p>The <a
href="http://www.bionet.nsc.ru/meeting/bgrs2008/index.html" rel="nofollow" >International Conference on Bioinformatics of Genome Regulation and Structure</a> is the bi-annual event. It features several bioinformatics sections, which <abbr
title="in my opinion">IMO</abbr> cover most of bioinformatics sub-fields.</p><p>The Sixth conference, BGRS-2008, was well-organized and had something to offer to everyone. By far the largest section was Genomics and Transcriptomics (at least if judging by the abstracts book and by the posters presented; talks given were distributed more equally between sections). As I did some work in genomics (namely, our <a
href="http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html">COTRASIF tool</a>), I had quite a load of info to digest, and many new potentially fruitful contacts to establish (which I did quite good).</p><p>The second section on my scale of priorities was "COMPUTER ANALYSIS AND IMAGE RECOGNITION IN SYSTEMS BIOLOGY", which had several interesting researches presented in the field of spatial/developmental modelling. There was a very good talk on model reduction (with an actual example) for the purposes of both comparing different models and decreasing the model complexity without sacrificing model-predicted outcomes.</p><p>As for the other sections, I didn't find them interesting enough. Fortunately, there were social program events scheduled for every day, so I visited the <a
href="http://gallery.bogdan.org.ua/v/BGRS-2008/Zoo/" rel="nofollow" >Novosibirsk zoo</a> and the <a
href="http://gallery.bogdan.org.ua/v/BGRS-2008/Museum/" rel="nofollow" >Archaeological museum</a>. I did not do as many <a
href="http://gallery.bogdan.org.ua/v/BGRS-2008/" rel="nofollow" >pictures</a> as I usually do at conferences/schools, because there were two photographers at the conference, and their photos can be freely seen <a
href="http://www.bionet.nsc.ru/meeting/bgrs2008photo/" rel="nofollow" >here</a> and <a
href="http://www.bionet.nsc.ru/meeting/bgrs2008photo2/" rel="nofollow" >here</a>.</p><p>Most of the <a
href="http://www.bionet.nsc.ru/meeting/bgrs2008/participants.php" rel="nofollow" >conference participants</a> could speak Russian (I'd estimate the group of Russian-speaking participants at 90% of the number of participants), even though they were coming from e.g. Singapore or USA. But the official conference language was English, and the 10% of non-speakers were far not underprivileged, which goes well with the international status of the conference.</p><p>After the conference, there was a <a
href="http://www.bionet.nsc.ru/meeting/bgrs_school_2008/index.html" rel="nofollow" >BGRS-2008 summer school</a>. As I stayed for some extra days, I managed to attend up to 90% of the school's events (including the <a
href="http://gallery.bogdan.org.ua/v/BGRS-2008/Summer-school/Novosibirsk-excursion/" rel="nofollow" >guided tour to Novosibirsk</a> <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> ). For me, summer school was somewhat less useful than the conference, but nevertheless such presentations as on Petri nets and about SABIO-RK/Sycamore were informative and will be used in my future work.</p><p>There were 3 prizes for the <a
href="http://gallery.bogdan.org.ua/v/BGRS-2008/Summer-school/Student-presentations/" rel="nofollow" >student presentations</a>; winners are at the end of the <a
href="http://www.bionet.nsc.ru/meeting/bgrs_school_2008/presentations.html" rel="nofollow" >page</a>.</p><p><a
href="http://www.bionet.nsc.ru/meeting/bgrs_school_2008/sertifikat.pdf" rel="nofollow" >Certificates</a> were given after successful school completion. As I wasn't registered, I can now only print out the <a
href="http://www.bionet.nsc.ru/meeting/bgrs_school_2008/sertifikat.pdf" rel="nofollow" >empty certificate</a>, which is to signify that I did not attend the last day of the school and thus was disqualified <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> .</p><p>Just found that there are also some <a
href="http://www.bionet.nsc.ru/meeting/bgrs_school_2008/photos01/index.html" rel="nofollow" >photos</a> from the organizers.</p><p>There was also a <a
href="http://www.bionet.nsc.ru/meeting/bgrs_school_2008/photos02/" rel="nofollow" >football (soccer) game</a> between the <abbr
title="Institute of Cytology and Genetics">ICG</abbr> team and the school participants team. I'm a fan of neither watching nor playing football, so I skipped this event altogether.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/07/10/bgrs-2008-conference-in-novosibirsk-russia.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/07/10/bgrs-2008-conference-in-novosibirsk-russia.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F07%2F10%2Fbgrs-2008-conference-in-novosibirsk-russia.html&amp;title=BGRS-2008%20conference%20in%20Novosibirsk%2C%20Russia" id="wpa2a_68"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/07/10/bgrs-2008-conference-in-novosibirsk-russia.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Gene regulatory network reconstruction from microarray data</title><link>http://bogdan.org.ua/2008/05/20/gene-regulatory-network-reconstruction-from-microarray-data.html</link> <comments>http://bogdan.org.ua/2008/05/20/gene-regulatory-network-reconstruction-from-microarray-data.html#comments</comments> <pubDate>Tue, 20 May 2008 17:51:55 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[data]]></category> <category><![CDATA[gene regulatory network]]></category> <category><![CDATA[microarray]]></category> <category><![CDATA[networks]]></category> <category><![CDATA[reconstruction]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=296</guid> <description><![CDATA[The title of this post is my current - "forthcome", as in "done" - field of interest. First article on topic: Fast network component analysis (FastNCA) for gene regulatory network reconstruction from microarray data. Another one, on combining different high-throughput data sources to get higher-quality results: Uncovering signal transduction networks from high-throughput data by integer [...]]]></description> <content:encoded><![CDATA[<p>The title of this post is my current - "forthcome", as in "done" - field of interest.</p><p>First article on topic: <a
href="http://biomed.org.ua/en/node/44" rel="nofollow" >Fast network component analysis (FastNCA) for gene regulatory network reconstruction from microarray data</a>.</p><p>Another one, on combining different high-throughput data sources to get higher-quality results: <a
href="http://biomed.org.ua/en/node/45" rel="nofollow" >Uncovering signal transduction networks from high-throughput data by integer linear programming</a>.</p><p>I'm especially interested in time-series network reconstruction algorithms. If you have a good advice to share with a newcomer to the networks field - don't hesitate <img
src='http://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /></p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/05/20/gene-regulatory-network-reconstruction-from-microarray-data.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/05/20/gene-regulatory-network-reconstruction-from-microarray-data.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F05%2F20%2Fgene-regulatory-network-reconstruction-from-microarray-data.html&amp;title=Gene%20regulatory%20network%20reconstruction%20from%20microarray%20data" id="wpa2a_72"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/05/20/gene-regulatory-network-reconstruction-from-microarray-data.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>COTRASIF: conservation-aided transcription factor binding site finder</title><link>http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html</link> <comments>http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html#comments</comments> <pubDate>Tue, 20 May 2008 17:33:47 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Links]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[Web]]></category> <category><![CDATA[COTRASIF]]></category> <category><![CDATA[eukaryotic]]></category> <category><![CDATA[HMM]]></category> <category><![CDATA[PFM]]></category> <category><![CDATA[PWM]]></category> <category><![CDATA[search]]></category> <category><![CDATA[tool]]></category> <category><![CDATA[web-based]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/?p=295</guid> <description><![CDATA[With this post, I'm finally announcing the opening of the (mostly) functional COTRASIF web-tool, created for the genome-wide identification of promoter regulatory sequences (transcription factor binding sits, TFBS). You can learn more from the About and Help pages. For an example of use, see the Supplement page (article is currently being prepared; as soon as [...]]]></description> <content:encoded><![CDATA[<p>With this post, I'm finally announcing the opening of the (mostly) functional <a
href="http://biomed.org.ua/COTRASIF/" rel="nofollow" >COTRASIF</a> web-tool, created for the <a
href="http://biomed.org.ua/COTRASIF/" rel="nofollow" >genome-wide identification of promoter regulatory sequences</a> (transcription factor binding sits, TFBS). You can learn more from the <a
href="http://biomed.org.ua/COTRASIF/about.html" rel="nofollow" >About</a> and <a
href="http://biomed.org.ua/COTRASIF/help.html" rel="nofollow" >Help</a> pages. For an example of use, see the <a
href="http://biomed.org.ua/COTRASIF/supplement.html" rel="nofollow" >Supplement</a> page (article is currently being prepared; as soon as it's ready, I'll make it available).</p><p>If you are interested - have a look at the <a
href="http://biomed.org.ua/COTRASIF/news.html" rel="nofollow" >News</a> page, where there is information on joining <a
href="http://groups.google.com/group/cotrasif" rel="nofollow" >COTRASIF Google group</a>. For non-public enquiries, please use my <a
href="http://bogdan.org.ua/contact">contact page</a>.</p><p>Note: the problem of identifying eukaryotic transcription factor binding sites stays acute for many years in a row - see e.g. the most recent <a
href="http://biomed.org.ua/en/blogs/bogdan/eukaryotic-transcription-factor-binding-sites-modeling-and-integrative-search-method" rel="nofollow" >Eukaryotic transcription factor binding sites - modelling and integrative search methods</a>.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2008%2F05%2F20%2Fcotrasif-conservation-aided-transcription-factor-bindin-site-finder.html&amp;title=COTRASIF%3A%20conservation-aided%20transcription%20factor%20binding%20site%20finder" id="wpa2a_76"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2008/05/20/cotrasif-conservation-aided-transcription-factor-bindin-site-finder.html/feed</wfw:commentRss> <slash:comments>6</slash:comments> </item> <item><title>Using Cytoscape from behind an HTTP proxy which requires authentication (authorization)</title><link>http://bogdan.org.ua/2007/09/14/using-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html</link> <comments>http://bogdan.org.ua/2007/09/14/using-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html#comments</comments> <pubDate>Fri, 14 Sep 2007 14:44:48 +0000</pubDate> <dc:creator>Bogdan</dc:creator> <category><![CDATA[Bioinformatics]]></category> <category><![CDATA[Notepad]]></category> <category><![CDATA[Science]]></category> <category><![CDATA[Software]]></category> <guid
isPermaLink="false">http://bogdan.org.ua/2007/09/14/using-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html</guid> <description><![CDATA[Cytoscape 2.5.1 supports proxies, including HTTP proxies, but there is no support for HTTP proxies requiring authentication/authorization. It's easy to use Cytoscape in the authentication-requiring proxy scenario; below is one possible method. Note, that exactly the same method can be used to allow any software, which supports proxies but not proxies with authentication, to be [...]]]></description> <content:encoded><![CDATA[<p>Cytoscape 2.5.1 supports proxies, including HTTP proxies, but there is no support for HTTP proxies requiring authentication/authorization. It's easy to use Cytoscape in the authentication-requiring proxy scenario; below is one possible method.</p><p>Note, that exactly the same method can be used to allow <strong>any</strong> software, which supports proxies but not proxies with authentication, to be able to access the internet.<br
/> <span
id="more-221"></span><br
/> First, let's outline what we will do:</p><ol><li>get a proxy which supports on-the-fly modifying of HTTP headers. I used <a
href="http://www.proxomitron.info/files/index.html" rel="nofollow" title="Proxomitron download page" >Proxomitron</a> (which is a great tool, BTW!), and recommend using <strong>4.5 June release</strong>. You need to have privileges sufficient for installing your local proxy of choice.</li><li>you need to know the ip (or hostname) of your proxy, proxy port, and both user and password for that proxy</li></ol><p>First, install your proxy of choice. In this example, I installed Proxomitron.</p><p>Second, you need to configure your local proxy and browser. With Proxomitron:</p><ol><li>launch proxomitron</li><li>an eye in a triangle will appear in your taskbar. click it once to bring up the main window: <img
src="http://bogdan.org.ua/wp-content/uploads/2007/09/proxomitron_main.png" alt="Proxomitron main window" /></li><li>you need to set the checkboxes as I did on the screenshot above</li><li>click the 'proxy' button, and enter your proxie's host and port (semicolon-separated): <img
src="http://bogdan.org.ua/wp-content/uploads/2007/09/proxomitron_proxy.png" alt="Proxomitron remote proxy window" /></li><li>optionally, click 'config' in the main window and set desired options, including the default port for local connections (which is 8080 by default)</li><li>configure your browser to use Proxomitron. Set proxy host to <strong>localhost</strong>, port to 8080 (default). More info on configuring browser proxy is available either in browser manual, or <a
href="http://www.proxomitron.info/45/help/Installation.html" rel="nofollow" >here</a> for older browsers.</li></ol><p>Third, you need to know the authentication string sent to your organisation's proxy server by your browser. You can see that string in the headers your browser sends out at each request. There are multiple ways to see headers. I recommend either using Live HTTP Headers FireFox add-on, or the Proxomitron's "Log window". You can also just calculate the hash.</p><p>The string you are looking for should be similar to:<br
/> <strong>Proxy-Authorization: Basic Yi5OLNrVa242Zw5rBzPpZxl1ZxBhDf==</strong><br
/> (it is the hash of your proxy user and password).</p><p>Here is what you need to do to see this string using Proxomitron (assuming Proxomitron is already launched and main window open):</p><ol><li>launch Proxomitron</li><li>click the 'Log Window' in Proxomitron main window</li><li>now try accessing e.g. google.com in the browser configured to use Proxomitron. You should get usual HTTP authentication form, asking for username and password. Enter correct data here. Now look into the 'HTTP Message Log' window - you must be able to locate the string we are looking for:<br
/> <a
href="http://bogdan.org.ua/wp-content/uploads/2007/09/proxomitron_http_message_log.png"><img
id="image224" src="http://bogdan.org.ua/wp-content/uploads/2007/09/proxomitron_http_message_log.thumbnail.png" alt="http log" /></a></li></ol><p>The last thing you need to do is to add a rule to your proxy to send the authentication header with every request.</p><p>With Proxomitron, do this:</p><ol><li>click the 'headers' button:<br
/> <a
href="http://bogdan.org.ua/wp-content/uploads/2007/09/proxomitron_headers.png"><img
id="image225" src="http://bogdan.org.ua/wp-content/uploads/2007/09/proxomitron_headers.thumbnail.png" alt="headers" /></a></li><li>select the Proxy-Authorization filter, and click 'edit'</li><li>in the 'Replacement text' field, paste yours auth-string ("Basic ....")</li><li>OK, Apply. Do not forget to set the checkbox in the 'Out' column for the Proxy-Authorization rule.</li></ol><p>This should be all. Now just open Cytoscape, Edit=>Preferences=>Proxy Server, HTTP, localhost, 8080.</p><p>This is it.</p><p><a
class="a2a_button_google_plusone addtoany_special_service" data-href="http://bogdan.org.ua/2007/09/14/using-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html"></a><a
class="a2a_button_facebook_like addtoany_special_service" data-href="http://bogdan.org.ua/2007/09/14/using-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html"></a><a
class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fbogdan.org.ua%2F2007%2F09%2F14%2Fusing-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html&amp;title=Using%20Cytoscape%20from%20behind%20an%20HTTP%20proxy%20which%20requires%20authentication%20%28authorization%29" id="wpa2a_80"><img
src="http://bogdan.org.ua/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share"/></a></p>]]></content:encoded> <wfw:commentRss>http://bogdan.org.ua/2007/09/14/using-cytoscape-from-behind-an-http-proxy-which-requires-authentication-authorization.html/feed</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>
