Please visit our ***NEW*** OBF/BOSC website: https://www.open-bio.org/ |
-
Difference between revisions of "BOSC/liveCD"
Line 66: | Line 66: | ||
Self publishing websites such as lulu.com can turn a PDF into a book and publish on demand - no up front cost to us! We would have to do all the typesetting though. | Self publishing websites such as lulu.com can turn a PDF into a book and publish on demand - no up front cost to us! We would have to do all the typesetting though. | ||
− | Distributing printed copy of the tutorials + the live CD or DVD would be fantastic for training, basically a bioinformatics mres module or three in a box. | + | Distributing printed copy of the tutorials + the live CD or DVD would be fantastic for training, basically a bioinformatics mres module or three in a box. In particular one can make sure that what is said in the book is true with the software on the CD, using the same versions, and without the student facing installing issues before they know what the software does |
==Link to Debian== | ==Link to Debian== |
Revision as of 23:50, 20 July 2007
Contents
BOSC Software Distribution
This page is a summary of the discussion at the Bioinformatics Open Source Conference (BOSC) in Vienna on the 19th July 2007. The focus is the possibility of an 'enriched' software distribution, curated to add extra value and built into a Debian based LiveCD.
Debian's package management and procedures guarantee primarily technical reliability - they ensure that the software will install (so it has dependencies defined etc) and run successfully on a given architecture. They do not, however, guarantee that the software is
- used for the right set of problems
- employed in concert with the best suite of complementary tools
work at its full potential and be fully understood by the user. The 'enrichment' is therefore in terms of a set of added content and metadata for each application that is provided.
Making this available as a LiveCD allows for very quick evaluation of tools. It may particularly appeal to (self-trained?) bioinformaticians working in wet-lab environments. Another major concern is its applicability for the use in training courses - today's students are tomorrow's scientists. Please send in requests for further adaptations.
What we add
- All applications must be categorized
- Who is the software for?
- Which domains does it address?
- What pre-requisites are required (in terms of expertise rather than other software, for example we can assume that BioRuby is only going to be useful if you can first use the base Ruby language)
- All applications have tutorial material, comprising
- usage scenarios
- example inputs, data files
- screen capture video of the tutorial examples
- Meta-level entry to realm of software solutions:
- Expert endorsement, Find a 'foo-omics' expert and get her to pick the set of tools she uses most commonly
- Make sure we include them!
- Make this grouping available somehow and use it as a 'here's how you get started doing foo-omics'
- Have an umbrella tutorial linking to each tool tutorial
- Possibly by creating specific 'foo-omics' user with pre-set application menus configured?
- Expert endorsement, Find a 'foo-omics' expert and get her to pick the set of tools she uses most commonly
Example applications
- Basic command line tools: emboss, fasta, blast
- PDB viewer (preferably one without the MOTIF-1980 look)
Example data
- Full GenBank file for a bacteria (like E. Coli), FASTA of proteins and nuclear types of genetic elements
- SwissProt (in which format?) - discussion: is quite large, in particular text and xml format (1.1GB and 2.1 GB), but FASTA is 121M
- Some nice PDB files for viewing structure in flashing 3d (JMol and friends)
Preferably we should have the same example represented in different forms and shapes.
Aggregate Configuration
If we bundle all the tools certain configuration issues (i.e. tool X needing to know where tool Y is installed) can be solved before the user hits them. This implies we can identify useful sets of applications but see the 'expert endorsement' bit above.
Book
Dead tree technology is still in favour.
Self publishing websites such as lulu.com can turn a PDF into a book and publish on demand - no up front cost to us! We would have to do all the typesetting though.
Distributing printed copy of the tutorials + the live CD or DVD would be fantastic for training, basically a bioinformatics mres module or three in a box. In particular one can make sure that what is said in the book is true with the software on the CD, using the same versions, and without the student facing installing issues before they know what the software does
Link to Debian
- The Debian community has invested considerable effort already in providing packages for Bioinformatics and Scientific Computing in general:
Strong communication between "upstream" BOSC developers and package maintainer expected. Packages that are provided in addition through this BOSC initiative are hoped to find their way into the Debian main distribution. (package needs to comply with Debian Free Software Guidelines and Debian Policy for that)
- overcoming communication barriers with people from the other (biology/computational) sides
Who?
- Steffen Möller <steffen_moeller gmx.de> volunteers as a link to the debian community and to help with expertise in packaging
- Kazuharu Arakawa offers their work on a Knoppix (also Debian)-based Bioinformatics CD that is accompanied by a Japanese book (Knoppix for Bio Project: http://knob.sourceforge.jp) and http://ez-tune-livecd.sourceforge.jp/pukiwiki/ (Live CD creator)
- Darin London <dlondon ebi.ac.uk>
- Tom Oinn <tmo ebi.ac.uk>
- Stian Soiland <ssoiland cs.man.ac.uk>
- Bastien Chevreux <bach chevreux.org>
Getting it started
Seed of 10 packages
To get this started we plan to pick (somewhere around) ten packages drawn from the assorted participants at BOSC this year.
- Tom actively invites
- Tom passively awaits requests for participation from Developers
- Expecting to extend ez-tune-livecd
Design of a web / wiki page
- to announce to the community and ask for contribution
- assignment of packaging tasks to the community of Debian-Developers.
- You have just finished reading it - success!