Autarchy of the Private Cave

Tiny bits of bioinformatics, [web-]programming etc

  • Exits

  • Categories

  • Archives

  • Tags list

    Archive for the 'Bioinformatics' Category

    Bioinformatics is a general term which refers to using computers and computational/math methods in applications to biology.

    R script to filter probesets with log-expression values below the lowest spike-in

    27th January 2010

    Sometimes there is a need to remove all the probesets, which have expression values below the minimal spike-in intensity on the Affymetrix microarray. The reasoning behind this procedure is simple: minimal-expression spike-ins represent the bottom margin of microarray sensitivity, and anything below that margin cannot be reliably quantified – which also means that both fold-change and p-value of expression variance will be unreliable for these probesets.

    Here’s a simple R script to do just that. It is abundantly commented, and also contains an optional (commented out) fragment which allows the removal of more low-variance, low-intensity probesets.

    Read the rest of this entry »

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Programming, Science | No Comments »

    GNU make for bioinformatics presentation

    5th June 2009

    Giovanni Dall’olio has recently posted a presentation on using make.

    Although it has “bioinformatics” on the title page, this is a good and very easy to understand make intro.

    Original post is here.

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Links, Programming | 1 Comment »

    Ensembl Genomes launches Protists, Bacteria and Metazoa

    21st April 2009

    The following sites are available:

    http://bacteria.ensembl.org
    http://protists.ensembl.org
    http://metazoa.ensembl.org

    During summer, two more sites – for Fungi and Plants – should be made available.

    Learn more about Ensembl Genomes project.

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Links, Science, Software, Web | No Comments »

    BioGPS: by the makers of Gene Expression Atlas

    22nd November 2008

    BioGPS logo If you were a frequent user of GNF SymAtlas, then you’d better bookmark BioGPS – Your Gene Portal System. BioGPS is basically the same gene expression atlas, but with a completely different interface, and more flexible ideology (e.g. expression atlas is now just a “plugin”, and more of those can be plugged in).

    There are also some easter-egg-like features: try hovering the BioGPS logo in the top left corner several times…

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Links, Science, Software | 3 Comments »

    Ensembl release 51 is out!

    20th November 2008

    ensembl logoThis in itself wouldn’t be so exciting, if it were not for the new webcode! If you do visit Ensembl now, you will be definitely surprised with the page loading speed – it is amazingly faster than it used to be! Also, as you dig deeper and deeper, you’ll see that there are a lot more other differences – starting with the new design, and not forgetting the changed page organization logic.

    To cut the long story short, here’s the list of changes in the new Ensembl 51 webcode release. Other changes to Ensembl in release 51 are also available.

    One of the new features which caught my attention is the ability to add custom tracks in Ensembl (which is a long-available feature in UCSC Genome Browser). Interestingly, you do not even have to be logged in to use this feature. We shall be considering providing the TFBS custom track for several species, as predicted de-novo by our evolutionary conserved tfbs search tool (binding site finder), but this is a long shot, given the already published COTRASIF development roadmap.

    There is one more great news which is kinda insufficiently highlighted: the brand-new Ensembl Genome Browser website, which (as planned, it hasn’t yet started operations) will provide access not only to vertebrates, but also to other taxonomy groups. The full list is:

    • Metazoa
    • Protists
    • Bacteria
    • Plants
    • Fungi
    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Science, Software | No Comments »

    CLANS: CLuster ANalysis of Sequences

    25th October 2008

    CLANS (CLuster ANalysis of Sequences).
    Read the rest of this entry »

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Links, Science | No Comments »

    Weird orthology species names in Ensembl

    30th September 2008

    For the COTRASIF tool, I’ve been using the Ensembl Compara database (since release 47) to automatically import into COTRASIF gene orthology mappings.

    However, with the E!50 release, the Compara database was dropped.

    Looking for another option to get orthologs from Ensembl (using martservice, via biomart.org), I tried using the standard query – selecting “Homologs” group on the “Attributes” page for a single species database, and then selecting appropriate second species to get orthology mappings.

    Imagine my surprise, when not only in the interface, but also in the generated XML file I found attribute names like “cow_ensembl_gene” :-O

    I only need 11 species at the moment, and excluding the sufficiently unique name mappings like “zebrafish – danio rerio”, there is a number of questionable mappings: “yeast” for S. cerevisiae (could be S.pombe), “rat” for R. norvegicus (could be R.rattus), “anopheles” for A.gambiae (could be some other Anopheles). Other mappings might be also non-unique, especially for people working with different species of the same genus.

    Am I missing some system in this naming “convention”, or am I the only one who finds it strange?

    Is there a way not to use “common species names” when importing orthology data from Ensembl with the help of martservice?

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Science | No Comments »

    The 9th ICSB-2008 in Gothenburg, Sweden

    12th September 2008

    It has been quite a time since my last serious and long post. On the one hand, summer is vacations time – so I’ve been to one in Turkey; on the other hand – the long-awaited ICSB-2008 conference finally took place in Gothenburg, Sweden, on August 23-27 (or 22-28, counting in tutorials and workshops).

    Synopsis: in this post I present a personal-perspective report on the 9th ICSB (with a condensed ICSB-2008 photo-report in my gallery).

    Read the rest of this entry »

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Science, Systems Biology | No Comments »

    New bioinformatics term: (high-) throughputomics

    16th July 2008

    Just made it up for convenience, stimulated by reading workshop descriptions for the upcoming ICSB 2008 in Gothenburg, Sweden.

    Here is formal definition:

    high-throughputomics
    the term is used to denote/mention any or all of the modern high-throughput techniques (in all of, but not limited to: genomics, transcriptomics, proteomics, …), together with derived/applicable data-processing approaches. All the “networks” things also conveniently fall into the high-throughputomics definition

    As this is a general term, it might be even suitable as a conference title (but NOT for ICSB, which I’m waiting for eagerly).

    For a shorter and informal (spoken-only) term, putomics can be used:

    putomics
    spoken-only, informal short form of high-throughputomics

    Putomics is also conveniently similar to “computation” (computomics):

    computomics
    application of computer hardware and software for the analysis of massive amounts of data, obtained using high-throughput methods; this is a research sub-field of high-throughputomics

    P.S. :) ;)
    For easier citing:

    Tokovenko, Bogdan. New bioinformatics term: high-throughputomics. 2008-07-16. URL:http://bogdan.org.ua/2008/07/16/new-bioinformatics-term-high-throughputomics.html. Accessed: 2008-07-16. (Archived by WebCite® at http://www.webcitation.org/5ZMD9DITU)

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Science | 1 Comment »

    BGRS-2008 conference in Novosibirsk, Russia

    10th July 2008

    International Conference on Bioinformatics of Genome Regulation and Structure logo… was held on June 22-28, 2008, in Akademgorodok (Novosibirsk), Russia. It was the sixth conference held.

    The International Conference on Bioinformatics of Genome Regulation and Structure is the bi-annual event. It features several bioinformatics sections, which IMO cover most of bioinformatics sub-fields.

    The Sixth conference, BGRS-2008, was well-organized and had something to offer to everyone. By far the largest section was Genomics and Transcriptomics (at least if judging by the abstracts book and by the posters presented; talks given were distributed more equally between sections). As I did some work in genomics (namely, our COTRASIF tool), I had quite a load of info to digest, and many new potentially fruitful contacts to establish (which I did quite good).

    The second section on my scale of priorities was “COMPUTER ANALYSIS AND IMAGE RECOGNITION IN SYSTEMS BIOLOGY”, which had several interesting researches presented in the field of spatial/developmental modelling. There was a very good talk on model reduction (with an actual example) for the purposes of both comparing different models and decreasing the model complexity without sacrificing model-predicted outcomes.

    As for the other sections, I didn’t find them interesting enough. Fortunately, there were social program events scheduled for every day, so I visited the Novosibirsk zoo and the Archaeological museum. I did not do as many pictures as I usually do at conferences/schools, because there were two photographers at the conference, and their photos can be freely seen here and here.

    Most of the conference participants could speak Russian (I’d estimate the group of Russian-speaking participants at 90% of the number of participants), even though they were coming from e.g. Singapore or USA. But the official conference language was English, and the 10% of non-speakers were far not underprivileged, which goes well with the international status of the conference.

    After the conference, there was a BGRS-2008 summer school. As I stayed for some extra days, I managed to attend up to 90% of the school’s events (including the guided tour to Novosibirsk ;) ). For me, summer school was somewhat less useful than the conference, but nevertheless such presentations as on Petri nets and about SABIO-RK/Sycamore were informative and will be used in my future work.

    There were 3 prizes for the student presentations; winners are at the end of the page.

    Certificates were given after successful school completion. As I wasn’t registered, I can now only print out the empty certificate, which is to signify that I did not attend the last day of the school and thus was disqualified ;) .

    Just found that there are also some photos from the organizers.

    There was also a football (soccer) game between the ICG team and the school participants team. I’m a fan of neither watching nor playing football, so I skipped this event altogether.

    • Delicious
    • Google Bookmarks
    • Yahoo Bookmarks
    • Windows Live Favorites
    • Technorati Favorites
    • Digg
    • Slashdot
    • StumbleUpon
    • Read It Later
    • Twitter
    • Share/Bookmark

    Posted in Bioinformatics, Misc, Science | No Comments »