Biopython 1.54 beta released

A beta release for Biopython 1.54 is now available for download and testing.

Since Biopython 1.53 was released at the end of last year, several new features and more documentation have been added, plus the usual bug fixes. For full details see the NEWS file.

All the new features have been tested by the dev team but it’s possible there are cases that we haven’t been able to foresee and test, especially for the updated multiple sequence alignment object (which is what you’ll now get when parsing alignments with Bio.AlignIO), the new Bio.Phylo module, and the Bio.SeqIO support for Standard Flowgram Format (SFF) files.

[Read More]

BioPerl at GMOD Meeting 2010

BioPerl developers and users attended the BioPerl satellite meeting on January 13th, just prior to the GMOD Meeting.  Several items were covered on the agenda:

  • In order to start addressing whole genome data with more lightweight objects, we are planning on setting up a lightweight Bio::SeqI object that has a flexible DB backend (i.e. Bio::DB::SeqFeature::Store or similar).  We are also contemplating adding lazy parsing for some parsers, possibly using the Bio::PullParserI methods (or similar) that Sendu Bala created.
  • After a final  1.6 branch point release, we may ‘freeze’ BioPerl in a maintenance mode, primarily so that we can reorganize core into several more easily installed subdistributions on a branch.  New modules will essentially be additional separate repos that will depend on BioPerl core.  This reorganization has been discussed for a few years now, and as we edge closer to starting this (probably this spring) we’ll announce more details.
  • Some initial thoughts on how to handle circular genomes more efficiently.  We essentially do this already, but it isn’t full-proof.
  • Need some significant time dedicated towards GFF3-based coding (reimplement FeatureIO but allow some flexibility).  Rob Buels had started the initial run at splitting out FeatureIO, so next step is a true reimplementation.
  • We don’t plan on including Moose support for the immediate future, feeling that it would be better to reimplement some of the classes from scratch using Moose and similar as a BioPerl 2.0, or possibly await the impending Rakudo Perl 6 alpha and start afresh using that instead of Moose.

Anything we missed?  Anything you would like to address?  Please add comments and we’ll discuss them on list.

[Read More]

BioRuby 1.4.0 released

We are pleased to announce the release of BioRuby 1.4.0. This new release contains many new features, in addition to bug fixes and improvements.

PhyloXML support: Support for reading and writing PhyloXML file format is added, developed by Diana Jaunzeikare, mentored by Christian M Zmasek and co-mentors, supported by Google Summer of Code 2009 in collaboration with the National Evolutionary Synthesis Center (NESCent).

FASTQ file format support: Support for reading and writing FASTQ file format is added. All of the three FASTQ format variants are supported. The code is written by Naohisa Goto, with the help of discussions in the open-bio-l mailing list. The prototype of Bio::Fastq class was first developed during the BioHackathon 2009 held in Okinawa.

[Read More]

Biopython 1.53 released

We are pleased to announce the availability of Biopython 1.53, a new stable release of the Biopython library, three months after the release of Biopython 1.52. This is our first release since migrating from CVS to git for source code control.

There have been some additions to our core objects - the Seq (and related UnknownSeq) objects gained upper and lower methods (like the string methods of the same name but alphabet aware) plus a new ungap method. The SeqFeature object now has an extract method to get the region of sequence it describes (useful for getting CDS nucleotide sequences from GenBank files). Also SeqRecord objects now support addition, giving a new SeqRecord with the combined sequence, all the SeqFeatures, and any common annotation.

[Read More]

BioPerl interview in latest FLOSS Weekly

Two of the core BioPerl developers, Jason Stajich and Chris Fields, were interviewed by FLOSS Weekly.  The interview is now available as an MP3 on the FLOSS Weekly website; several streaming versions (including podcast) are also available.

First 1.6.1 alphas of BioPerl-Run, BioPerl-DB, BioPerl-Network

Running a bit late on this, so just a quick note that the first alphas for BioPerl-Run, BioPerl-DB, and BioPerl-Network have been uploaded to CPAN:

They can also be downloaded from the BioPerl website:

http://bioperl.org/DIST/RC/

This is the first run where we’ve switched to a regular Module::Build installation, so expect some initial bumps! There are a few initial problems that I plan on addressing soon, the main one being none of the modules are assigned version numbers (this may be a consequence of not pulling the version from a specific module). The other, more serious one, is that the Build.PL script checks for DBI but isn’t checking for any compatible DBD::* adaptors for BioPerl-DB (so it fails tests if DBI is installed). We have code in core to check for DBI drivers, so I may adapt that for BioPerl-DB.

[Read More]

Working with FASTQ files in Biopython when speed matters

Biopython 1.51 onward includes support for Sanger, Solexa and Illumina 1.3+ FASTQ files in Bio.SeqIO, which allows a lot of neat tricks very concisely. For example, the tutorial ( PDF) has examples finding and removing primer or adaptor sequences.

However, because the Bio.SeqIO interface revolves around SeqRecord objects there is often a speed penalty. For example for FASTQ files, the quality string gets turned into a list of integers on parsing, and then re-encoded back to ASCII on writing.

[Read More]

Biopython CVS to git migration

The release of Biopython 1.52 earlier this week marked the end of an era, it was our last release using CVS for source code control.

As of now, Biopython is using a git repository, hosted on github.com who kindly provide git hosting for open source projects free of charge. The BioRuby project have been using github for some time, so we are in good company.

Our existing OBF hosted CVS repository will be maintained in the short to medium term as a backup, but will not be updated.

[Read More]

Biopython 1.52 released

We are pleased to announce the availability of Biopython 1.52, a new stable release of the Biopython library.

It may only have been one month since the last release, but in that time we’ve added enough useful features to warrant a new release. In particular, Biopython 1.52 includes more substantial support for population genetics, and adds new functions that will be useful for people working with next generation sequencing.

Tiago Antao’s work on the Population Genetics module brings a command line wrapper for GenePop which allows the estimation of F-statistics, null allele frequencies and migration rates as well as tests for isolation by distance (IBD) and deviation from Hardy-Weinberg equilibrium.

[Read More]