Tree for a Tango

Logo

Table of contents

Workshop in Phylogeny and Cophylogeny - 1st Edition

General presentation

Dates: November 4-5, 2024

Location: This workshop will be held in a virtual way

This workshop, will be on phylogeny and cophylogeny in general, mathematical modelling and algorithms, and on biology/health. The workshop is open to the community in general.

Keynote lecturers

Alt text

Francisco Pinto, BioISI/FC, University of Lisbon, Portugal.

Title: Evolution and regulation of microbial secondary metabolism [slides here] [presentation online here]

Abstract: Microbes have disproportionate impacts on the macroscopic world. This is in part due to their ability to grow to large populations that collectively secrete massive amounts of secondary metabolites and alter their environment. Yet, the conditions favoring secondary metabolism despite the potential costs for primary metabolism remain unclear. Here we investigated the biosurfactants that the bacterium Pseudomonas aeruginosa makes and secretes to decrease the surface tension of surrounding liquid. Using a combination of genomics, metabolomics, transcriptomics, and mathematical modeling we show that the ability to make surfactants from glycerol varies inconsistently across the phylogenetic tree; instead, lineages that lost this ability are also worse at reducing the oxidative stress of primary metabolism on glycerol. Experiments with different carbon sources support a link with oxidative stress that explains the inconsistent distribution across the P. aeruginosa phylogeny and suggests a general principle: P. aeruginosa lineages produce surfactants if they can reduce the oxidative stress produced by primary metabolism and have excess resources, beyond their primary needs, to afford secondary metabolism. These results add a new layer to the regulation of a secondary metabolite unessential for primary metabolism but important to change physical properties of the environments surrounding bacterial populations.

Alt text

Heather Carleton, Enteric Diseases Laboratory Branch DFWED/NCEZID/CDC, US.

Alt text

Joe Wirth, Enteric Diseases Laboratory Branch DFWED/NCEZID/CDC, US.

Molecular surveillance in public health using phylogenetics [presentation online here]


Abstract: Whole Genome Sequencing (WGS) technologies have revolutionized how Public Health and Regulatory Agencies respond to foodborne, waterborne, and one-health related surveillance, detection, and response activities. WGS data is comprehensive: antimicrobial resistance, virulence markers, serotype markers, plasmids, and core and accessory genes are all characterized as part of the WGS workflow. Currently, PulseNet USA sequences approximately 65,000 Salmonella, Escherichia, Listeria, Vibrio, Campylobacter, and Shigella isolates and uses these data to detect over 400 potential outbreaks annually. Allele calls from core genome and WGS multi-locus sequence typing (MLST) are used to generate single linkage trees. These trees are evaluated to identify potential outbreaks using a set of epidemiologic and phylogenetic criteria. Other algorithms such as maximum likelihood and Bayesian inference can be used to infer evolutionary relationships and to link the emergence of clades to specific times.

Alt text

Lars Arvestad, Stockholm University, Sweden.

Scalable distance-based phylogeny inference using divide-and-conquer[presentation online here]


Abstract: Inferring a phylogenetic tree is a common task in Computational Biology, and the choice of method depends on the analytical objectives and the characteristics of the dataset. Statistical methods are preferred when constructing a phylogenetic tree is the primary goal, due to their reliability, rich model specification options, and ability to evaluate alternative solutions. Distance-based methods are often used as subroutines for other methods because of their ease of use, straightforward implementation, and adequate speed in many applications.

Neighbor-Joining (NJ) has become the most popular distance-based method for several reasons. It is easy to implement, easy to understand, and have proven to be a practical method with good results. However, the appeal of NJ is often overshadowed by its cubic time complexity, making it challenging to use on even medium-sized datasets by modern standards.

We propose a new divide-and-conquer approach, implemented in two algorithms, DNCTREE and DNCTREE-k. Given a set S of sequences of length L and a distance-estimation method D as input, the algorithms compute an unrooted tree with the elements of S labeling the leaves. The methods partition S into three subproblems using a small set of sequences as guides, and only the distances to these guiding sequences are needed initially. The resulting trees from the subproblems are then connected to form the final tree. Base cases are computed using NJ. The DNCTREE algorithm has Atteson’s convergence radius, ensuring correct phylogenies when distance estimation yields accurate results.

Alt text

Laurent Jacob, LCQB lab of Sorbonne Université, France.

Neural networks for likelihood-free inference in evolutionary genomics [slides here] [presentation online here]

Abstract: Statistical inference in evolutionary genomics allows to estimate key parameters such as phylogenetic trees, diversification rates or reproductive numbers of pathogens. This inference relies on probabilistic models that represent how observed homologous sequences evolved given these parameters of interest. Standard approaches such as maximum likelihood or Bayesian inference are well-suited to this problem but require to compute likelihoods, which can become very costly or impossible under realistic (and typically complex) probabilistic models. I will present an alternative likelihood-free approach exploiting samples rather than evaluations of the model. The methods I will present rely on neural network architecture that are specifically tailored to account for the symmetries of each inference problem.

Alt text

Mariana Braga, Swedish University of Agricultural Sciences, Sweden.

Inferring host repertoire evolution[slides here] [presentation online here]

Abstract: In this talk I will describe the model for Bayesian inference of host repertoire evolution, give examples of how we have used it to investigate the evolution of species interactions, with a special focus on butterflies and their larval host plants. The talk will include results produced with software that have been available for a couple years, e.g. for reconstruction of network evolution, as well as software that is currently in development.

Alt text

Ran “RON” Libeskind-Hadas, Claremont McKenna College, US.

Adventures in Cophylogenetics

Abstract: This talk will explore the problem of phylogenetic tree reconciliation to provide insights into the evolutionary histories of pairs of linked taxa such as hosts and parasites, hosts and symbionts, and others. We will discuss algorithmic approaches for tree reconciliation, demonstrate them using software tools developed in our research group, and discuss challenges and opportunities for further research in this area.


Program

In order to cover a larger audience, the workshop will take place in the afternoon, CET time (French time), on both days.

  Monday 4 Tuesday 5
14h00 - 14h10 Introduction to the workshop Introduction to the second day
14h10 - 14h40 Talk1: Francisco Pinto Talk4: Laurent Jacob
14h40 - 14h55 Questions and discussion on Talk 1 Questions and discussion on Talk 4
14h55 - 15h05 Short Break Short Break
15h05 - 15h35 Talk2: Lars Arvestad Talk5: Heather Carleton and Joe Wirth
15h35 - 15h50 Questions and discussion on Talk 2 Questions and discussion on Talk 5
15h50 - 16h00 Short break Short break
16h00 - 16h30 Discussion 1 Discussion 2
16h30 - 17h00 Talk3: Mariana Braga Talk6: Ran “RON” Libeskind-Hadas
17h00 - 17h15 Questions and discussion on Talk 3 Questions and discussion on Talk 6
17h15 - 17h30 Final discussion and conclusion of the first day Final discussion and conclusion of the workshop

Registration

The registration is free but compulsory. A Zoom link will be sent just before the workshop starts. Registration can be done here.

Organization

Alexandre Francisco - INESC-ID/IST, University of Lisbon in Portugal.
Blerina Sinaimeri - Luiss University, Rome, Italy.
Cátia Vaz – INESC-ID/ISEL, Polytechnic Institute of Lisbon in Portugal.
Francisco Pinto - BioISI/FC, University of Lisbon in Portugal.
João Carriço.
Marie-France Sagot – INRIA/Université Claude Bernard (Lyon 1).
Susana Vinga - INESC-ID/IST, University of Lisbon in Portugal.

Participants

We will indicate later the total number of participants.

The H2020 Twinning Project Olissipo has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 951970.

Alt text Alt text Alt text Alt text Alt text
Alt text Alt text

Alt text