Phyloformer: Fast, accurate and versatile phylogenetic reconstruction with deep neural networks - Fédération de Recherche BioEnviS, BioEnvironnement et Santé (Université de Lyon)
Pré-Publication, Document De Travail Année : 2024

Phyloformer: Fast, accurate and versatile phylogenetic reconstruction with deep neural networks

Luca Nesterenko
  • Fonction : Auteur
  • PersonId : 1445639
Luc Blassel
Philippe Veber

Résumé

Phylogenetic inference aims at reconstructing the tree describing the evolution of a set of sequences descending from a common ancestor. The high computational cost of state-of-the-art Maximum likelihood and Bayesian inference methods limits their usability under realistic evolutionary models. Harnessing recent advances in likelihood-free inference and geometric deep learning, we introduce Phyloformer, a fast and accurate method for evolutionary distance estimation and phylogenetic reconstruction. Sampling many trees and sequences under an evolutionary model, we train the network to learn a function that enables predicting the former from the latter. Under a commonly used model of protein sequence evolution and exploiting GPU acceleration, it outpaces fast distance methods while matching maximum likelihood accuracy on simulated and empirical data. Under more complex models, some of which include dependencies between sites, it outperforms other methods. Our results pave the way for the adoption of sophisticated realistic models for phylogenetic inference.

Fichier principal
Vignette du fichier
2024.06.17.599404v1.full.pdf (1.1 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)
licence

Dates et versions

hal-04795764 , version 1 (21-11-2024)

Licence

Identifiants

Citer

Luca Nesterenko, Luc Blassel, Philippe Veber, Bastien Boussau, Laurent Jacob. Phyloformer: Fast, accurate and versatile phylogenetic reconstruction with deep neural networks. 2024. ⟨hal-04795764⟩
0 Consultations
0 Téléchargements

Altmetric

Partager

More