Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Computational analysis of genetic variation

dc.contributor.authorField, Matthew Arnell
dc.date.accessioned2016-07-17T23:58:49Z
dc.date.available2016-07-17T23:58:49Z
dc.date.issued2015
dc.description.abstractHigh throughput sequences are generating increasingly detailed catalogues of genetic variation both in human disease and within the larger population. To effectively utilise this rich data set for maximum research benefit, as a discipline we require robust, flexible, and reproducible analysis pipelines capable of accurately detecting and prioritising variants. While data-specific computational algorithms aimed at deriving accurate data from these technologies have reached maturity, two major challenges remain in order to realise the goals of elucidating the underlying genetic causes of disease as a means of developing custom treatment options. The first challenge is the creation of high-throughput variant detection pipelines able to reliably detect sample variation from a variety of sequence data types. Such a system needs to be scalable, flexible, robust, highly automated, and able to support reproducible analyses in order to support both default and custom variant detection workflows. The second challenge is the effective prioritisation of the huge number of variants detected in each sample, a task required to reduce the large search space for causal variants down to variant lists suitable for manual interrogation. This thesis describes six publications describing components of the larger informatics framework I have developed over the last four years to address these challenges, a framework designed from the onset to effectively manage and process large data sets with an end goal of utilising computational analysis of sequence data to further understand the relationship between genetic variation and human disease. The first publication “Reliably detecting clinically important variants requires both combined variant calls and optimized filtering strategies” describes a variant detection strategy designed to minimize false negative variants as is desired when utilising patient variation data in the clinic. The next four publications describe custom workflows developed for detecting variants in sequence data from different sample types, namely paired cancer samples (“Tumour procurement, DNA extraction, coverage analysis and optimisation of mutation-calling algorithms for human melanoma genomes”), pedigrees (“Reducing the search space for causal genetic variants with VASP: Variant Analysis of Sequenced Pedigrees”), mixed cell populations containing ultra-rare mutations (“DeepSNVMiner: A sequence analysis tool to detect emergent, rare mutations in sub-sets of cell populations”) and mouse exome data containing ENU mutations (“Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models”) . The last publication, “Comparison of predicted and actual consequences of missense mutations” focuses on the validation of computational tools that predict functional impact of missense mutations and further attempts to explain why many missense mutations predicted to be damaging do not result in an observable phenotype as might be expected. Collectively these publications detail efforts to reliably detect and prioritise variants across a wide variety of data types, efforts all based around the significant underlying software framework I have developed to better elucidate the link between genetic variation and disease.en_AU
dc.identifier.otherb3990541x
dc.identifier.urihttp://hdl.handle.net/1885/106398
dc.language.isoenen_AU
dc.subjectbioinformaticsen_AU
dc.subjectgenetic variationen_AU
dc.subjecthuman diseaseen_AU
dc.subjectVASPen_AU
dc.subjectDeepSNVMineren_AU
dc.subjectmelanomaen_AU
dc.subjectvariant prioritisationen_AU
dc.subjectmissense mutationen_AU
dc.subjecthigh-throughputen_AU
dc.subjectgenomeen_AU
dc.subjectvariant prioritizationen_AU
dc.titleComputational analysis of genetic variationen_AU
dc.typeThesis (PhD)en_AU
dcterms.valid2015en_AU
local.contributor.affiliationJohn Curtin School of Medical Research (CMBE)en_AU
local.contributor.supervisorGoodnow, Chris
local.description.notesThesis deposited by author 18/7/2016.en_AU
local.identifier.doi10.25911/5d778acb1cc8f
local.mintdoimint
local.type.degreeDoctor of Philosophy (PhD)en_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Field M A Thesis 2015.pdf
Size:
11.26 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
884 B
Format:
Item-specific license agreed upon to submission
Description:
abcd