Computational biology without borders
This is a guest blog post from Aziz Khan, who was supported by the ongoing Open Bioinformatics Foundation travel fellowship program to attend the ISMB/ECCB and BOSC 2019 meeting in Basel, July 2019. The OBF’s Travel Fellowship program aimed at increasing diverse participation at events promoting open source bioinformatics software development and open science in the biological research community. Find more information here.
Computational tools and software are now becoming the core of scientific discovery, and making it open source and sharing it freely with the community helps to take scientific discoveries to the next level. We live in an era where international and interdisciplinary collaborations become very central to answer big scientific questions. Given science is becoming more collaborative and data-intensive, we need intelligent and robust computational algorithms to help us to understand and interpret such big-data.
Biology is no longer limited to PCR or gel electrophoresis. The high-throughput approaches to DNA sequencing at the genome and exome level are the state-of-the-art. For example, genomics and epigenomics are shifting from bulk sequencing to single-cell sequencing, which produces even more and sparse data. The (epi)genomics at single-cell can help us to understand the molecular mechanisms more precisely to understand disease and tumor biology. Integrative methods and software are needed to interpret multi-omics data at bulk and single-cell resolution to better understand the heterogeneity between cells, patients, or multi-region biopsies from a tumor. All these require open-source bioinformatics software and resources that are free and accessible for everyone.
The international community that develops such software and methods need a diverse and multidisciplinary platform to meet and share the latest developments in the field, fostering fresh dialogues and perspectives to learn about and shape the future of bioinformatics. And such a platform is provided by ISCB’s Intelligent Systems for Molecular Biology (ISMB) and European Conference on Computational Biology (ECCB) when they both meet in Europe every second year with diverse Communities of Special Interest (COSI) such as the Bioinformatics Open Source Conference (BOSC).
ISMB/ECCB is the world’s largest bioinformatics/computational biology meeting. I’ve been very lucky to attend three of those so far. I started my ISMB/ECCB journey when I was a Ph.D. student at Tsinghua University, China. In fact, first I joined the ISCB Student Council, a wonderful forum for early career researchers (ECR) in bioinformatics. I was part of the web-committee and later served as co-chair of the committee. I helped the community to organize and review student symposiums. If you’re an ECR in bioinformatics, I would definitely recommend joining the community. So, back to ISMB/ECCB, it was July 2015, I was accepted to present a poster and also to volunteer at ISMB/ECCB 2015 in Dublin. Of course, I saved the registration fee by volunteering but it also gave me an excellent opportunity to network. I met some of the great people in that meeting and we’re still connected. My second ISMB/ECCB was in 2017 in the beautiful city of Prague. Again, I signed up to volunteer and had a wonderful meeting and presented one of our open-source projects both at the Student Symposium and at the Regulatory and Systems Genomics (RegSys) COSI.
This year, 2019, the ISMB/ECCB came to Basel, a city in the heart of Europe, which shares its borders with France and Germany.
The tripoint (Dreiländereck) where the borders of France, Germany and Switzerland meet.
I knew I wanted to attend, especially my favorite ISMB/ECCB COSI on the Regulatory and Systems Genomics (RegSys). And this year, my PI Anthony Mathelier was one of the organizers of RegSys. I always wanted to attend BOSC, so I submitted our community initiative, ECRcentral, that aims to bring ECRs together to discuss funding opportunities, share experiences, and create impact through community engagement. ECRcentral was accepted for a long talk and on top of that, I got a travel fellowship from the Open Bioinformatics Foundation (OBF) to attend #ISMBECCB2019 and present @ECRcentral at #BOSC2019. What else do you need, right?
Welcome to Basel! We arrived in Basel on July 21st from Oslo, Norway.
Basel was on fire (~40°C), compared with 20°C in Oslo. While walking from the train station towards the hotel, we passed through a bridge (Basel has many bridges over the river Rhine) and saw many people floating down the Rhine to escape from the heat. For a while, it felt like liquid-liquid phase separation in biology, or maybe solid-liquid!
People are escaping from the heat by floating down the Rhine. Picture credit
I quickly checked-in, got fresh and ran to the conference venue. The conference kicked off in the San Francisco hall at Congress Center Basel with over 2,000 participants from all over the world, mainly from developed countries. I love the idea of naming conference rooms of Congress Center after different cities around the world like you could bump in Montreal from Delhi without border control. Anyway, I am not sure how many researchers couldn’t make it due to visa denials or solely due to lack of funding. I made it because as a European resident I don’t need a visa to enter Switzerland.
I found that at this year’s ISMB/ECCB “3” was my lucky number. As I mentioned earlier, this was my 3rd ISMB/ECCB meeting with 3 participations (2 posters and 1 talk). And surprisingly, three of my mentors were in Basel for the meeting who came from three different continents, Asia, Europe, and North America. My former Ph.D. advisor Xuegong Zhang from Tsinghua University, my current Postdoc PI Anthony Mathelier from the University of Oslo and my next PI Christina Curtis from Stanford University. This is the beauty of ISMB/ECCB, which connects us without borders although many of us have to cross several man-made borders. By the way, I am joining the Curtis lab this fall at Stanford as a research scientist. And congratulations to Xuegong on getting elected as ISCB Board of Directors. Back to lucky number 3, I visited three countries (Switzerland, Germany, and France) in one day with a lunch in Switzerland, dinner in Germany and went for a walk to France.
Anyway, the first day of the conference started with an exciting keynote talk by Nikolaus Rajewsky from MDC Berlin on the principles of gene regulation in space and time at single-cell resolution. He also shared a great European research initiative, The LifeTime Initiative which aims to use single-cell multi-omics and machine learning techniques to map disease progression to improve medicine.
We started the second day with a fascinating keynote by William Stafford Noble of the University of Washington on traveling across spaces: the power of embedding genomic and proteomic data into a latent space. He shared three powerful applications of machine learning and deep learning to make sense of complex genomic or proteomic data into latent representation.
After the keynote, participants started traveling to different “cities” to attend their favorite COSIs. My favorite COSI at ISMB has been SysReg. But this time, I stayed in San Francisco (the main hall) to attend the first keynote of HiTSeq COSI by Christina Curtis from Stanford University on quantifying the rates and routes of metastasis.
After Christina’s talk, I was back to RegSys which was on in the Singapore room with a great line of keynote speakers in the field of gene regulation. The exceptional keynote talks demonstrated the power of using model organisms for important discoveries including interpretation of genetic variants and characterization of different roles and usages of cis-regulatory regions. Other talks highlighted novel experimental and computational methods for characterizing the 3D organization of the genome, TF co-binding, chromatin accessibility, and linking them to different cellular and molecular outcomes. Transcriptional enhancers and their diverse roles in gene regulation were common themes across many talks highlighting the importance of studying these non-coding regulatory regions.
The Singapore room is full on the 2nd day of RegSys’s morning session on single-cell. Photo by @ferhatay
The second day of RegSys started in the Singapore room with sessions on the bleeding edge developments in the field of single-cell and transcription factor binding. The three amazing keynotes and other selected talks were full of exciting discoveries enabled by comparing single-cell data from different organisms or by integrating multi-omics such as scRNA-seq and scATAC-seq. Other talks demonstrated the power of machine learning and deep learning techniques at the base-pair resolution, in enabling a better understanding of gene regulatory programs through refined characterization of transcription factor binding.
On the 4th day, BOSC 2019, the 20th annual Open Source Bioinformatics Conference, was kicked off in Delhi room by BOSC Chair Nomi Harris followed by an introduction to the Open Bioinformatics Foundation by Heather Wiencko. This was my first official BOSC meeting although I dropped by at the BOSC 2015 in Dublin and BOSC 2017 in Prague while volunteering at ISMB/ECCB. This year the BOSC community presented some excellent open-source bioinformatics resources, tools, and workflows through talks and poster sessions. Nicola Mulder of the University of Cape Town gave a keynote talk on building infrastructure for responsible open science in Africa. She discussed the technical, ethical and social challenges of sharing data in Africa. She said, “It’s really hard to convince people to share their data and their tools when they have such a history of being exploited”. She and her team under the umbrella of H3ABioNet consortium are making progress in building a Pan-African bioinformatics network.
And then came the fifth and last day of ISMB/ECCB 2019. It was also the second and last day of BOSC 2019 which started with interesting talks on open-source, open science and reproducible research using containers and workflows. The last session of BOSC 2019 was on Building Open Source Communities, where I was given the opportunity to introduce our community initiative, ECRcentral. I made my slides and poster publicly available on F1000 Research.
ECRcentral is a community-driven initiative for early career researchers to find research fellowships and travel grants and to share experiences, resources and get feedback.Read more here…
I co-founded ECRcentral as part of eLife Ambassadors program in collaboration with other eLife Ambassadors. We are a group of ECRs advocating for best practices in science to make science open and reproducible. As an advocate for open science and open-source, I do share all of my code on GitHub and also the source code for the ECRcentral platform is openly available on GitHub.
My talk about ECRcentral was well received by the community!
And ISMB/ECCB 2019 concluded after an inspiring keynote talk by Bonnie Berger of Massachusetts Institute of Technology on Biomedical Data Sharing and Analysis at Scale. Bonnie Berger started her talk with a powerful statement to emphasize the importance of computational biology. She said:
“Biology used to be biology. Now biology is inseparable from computer science.”Bonnie Berger
I loved it and tweeted right away!
And I would say that this inseparable biology now needs to be more open and reproducible without borders.
Along the lines of openness to science, ISMB/ECCB and BOSC are setting an example for many other conferences and organizations by recording talks and making those publicly available with the consent of speakers licensed under CC-BY 4.0. I have noticed that most of the talks at several other conferences are recorded but sadly these are put behind the paywalls and community need subscriptions to watch these talks later. There is a great need to digitize conferences and make the recorded talks freely available to everyone, and especially for researchers from developing countries who can not attend these events due to lack of funding or visa restrictions.
To conclude, I would like to thank the OBF for providing the travel fellowship to participate in ISMB/ECCB and BOSC 2019. This travel fellowship was a great opportunity to connect with open source and open science enthusiasts, and I strongly believe that this community will push the boundaries even further to grow and diversify in the years to come.
Aziz is a computational biologist interested in gene regulation, machine learning, cancer, and regulatory genomics and epigenomics. He is a strong advocate of open science, open-source, preprints, and reproducible research and often tweets about it. He is also an eLife and ASAPbio Ambassador and co-founded ECRcentral, which is powered by eLife.