Phylogenetic Model Selection via Machine Learning

dc.contributor.authorDong, Yanghe
dc.date.accessioned2024-10-27T22:36:35Z
dc.date.available2024-10-27T22:36:35Z
dc.date.issued2024
dc.descriptionDeposited by the author 27.10.24
dc.description.abstractPhylogenetic inference, which reconstructs evolutionary trees from DNA or amino acid sequences, is crucial for understanding the evolutionary histories of species on Earth. Model selection is a fundamental step in this process, determining the best-fit model for the data. However, classic maximum likelihood-based methods for model selection are computationally intensive. This study introduces a machine learning-based framework for amino acid model selection, consisting of three components: protFinder for selecting the best-fit substitution model, RHASFinder for identifying the appropriate rate heterogeneity model, and protFFinder for determining the use of empirical pre-estimated frequencies. Our framework is an order of magnitude faster than the widely used ModelFinder, while maintaining comparable accuracy.
dc.identifier.urihttps://hdl.handle.net/1885/733721968
dc.language.isoen
dc.subjectphylogenetics
dc.subjectamino acid
dc.subjectmodel selection
dc.subjectrate heterogeneity
dc.subjectneural network
dc.titlePhylogenetic Model Selection via Machine Learning
dc.typeThesis (Masters)
dcterms.valid2024
local.contributor.affiliationANU School of Computing, Australian National University
local.contributor.authoremailu7533843@anu.edu.au
local.contributor.supervisorBui, Minh
local.contributor.supervisorcontactm.bui@anu.edu.au
local.identifier.doi10.25911/G27V-Q356
local.mintdoimint
local.type.degreeMaster by research (Masters)

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
u7533843_thesis.pdf
Size:
3.01 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
882 B
Format:
Item-specific license agreed upon to submission
Description: