B2312 - Whole genome sequence based analysis of liver function within the UK10K project - 23/10/2014

B number: 
B2312
Principal applicant name: 
Dr Josine Min (Univeristy of Bristol, UK)
Co-applicants: 
Lydia Quaye (King's College London, UK), Scott Wilson (Not used 0, Not used 0), Dr Nic Timpson (Univeristy of Bristol, UK)
Title of project: 
Whole genome sequence based analysis of liver function within the UK10K project.
Proposal summary: 

The UK10K project represents one of the first large scale applications of next generation sequencing to population based epidemiological samples and the examination of complex phenotypes. The objectives of this work are to record whole genome sequence variation at and below 1% minor allele frequency, to provide an imputation reference and to use this, not only to provide a resource for the scientific community (both genotypes and phenotypes), but also to examine genetic associations across a spectrum of genetic variation.

The study is composed of two samples drawn from European population based epidemiological studies (The Avon Longitudinal Study of Parents and children and Twins UK) and forms a collection of nearly 4000 participants now with whole genome sequence data and phenotypes (along with complementary imputed internal replication data sets). A total of 1,990 individuals from TwinsUK and 2,040 individuals from ALSPAC were consented for sequencing. Variant sites and genotype likelihoods were called using samtools and genotypes were refined and phased using BEAGLE, with similar procedures to the 1000 Genomes Project.

In addition, both ALSPAC and TwinsUK consented to release phenotype data related to cardiovascular disease as a public resource for the association analyses. For TwinsUK, liver phenotypes including bilirubin, alkaline phosphatase, GGT and albumin were released. The association analyses for the liver phenotypes are currently reaching a critical phase for the singlepoint associations, and also for those more rare variants with analyses being undertaken using variant aggregation techniques such as SKAT. Consequently, we are following up possible avenues for increasing our sample sizes, in particular data sets with existing whole genome sequence data such as ALSPAC. To this end, we are writing to seek a collaborative arrangement efforts to continue our analysis of both single point and rare variant associations. These phenotypes would not be part of the UK10K public release.

Date proposal received: 
Wednesday, 1 October, 2014
Date proposal approved: 
Thursday, 23 October, 2014
Keywords: 
Primary keyword: 
UK10K