Phylogenetic Model Selection via Machine Learning
dc.contributor.author | Dong, Yanghe | |
dc.date.accessioned | 2024-10-27T22:36:35Z | |
dc.date.available | 2024-10-27T22:36:35Z | |
dc.date.issued | 2024 | |
dc.description | Deposited by the author 27.10.24 | |
dc.description.abstract | Phylogenetic inference, which reconstructs evolutionary trees from DNA or amino acid sequences, is crucial for understanding the evolutionary histories of species on Earth. Model selection is a fundamental step in this process, determining the best-fit model for the data. However, classic maximum likelihood-based methods for model selection are computationally intensive. This study introduces a machine learning-based framework for amino acid model selection, consisting of three components: protFinder for selecting the best-fit substitution model, RHASFinder for identifying the appropriate rate heterogeneity model, and protFFinder for determining the use of empirical pre-estimated frequencies. Our framework is an order of magnitude faster than the widely used ModelFinder, while maintaining comparable accuracy. | |
dc.identifier.uri | https://hdl.handle.net/1885/733721968 | |
dc.language.iso | en | |
dc.subject | phylogenetics | |
dc.subject | amino acid | |
dc.subject | model selection | |
dc.subject | rate heterogeneity | |
dc.subject | neural network | |
dc.title | Phylogenetic Model Selection via Machine Learning | |
dc.type | Thesis (Masters) | |
dcterms.valid | 2024 | |
local.contributor.affiliation | ANU School of Computing, Australian National University | |
local.contributor.authoremail | u7533843@anu.edu.au | |
local.contributor.supervisor | Bui, Minh | |
local.contributor.supervisorcontact | m.bui@anu.edu.au | |
local.identifier.doi | 10.25911/G27V-Q356 | |
local.mintdoi | mint | |
local.type.degree | Master by research (Masters) |