B1379 - Data mining for robust identification of causal hypotheses - 07/06/2012

B number: 
B1379
Principal applicant name: 
Louise Millard (University of Bristol, UK)
Co-applicants: 
George Davey-Smith (University of Bristol, UK), Nic Timpson (University of Bristol, UK)
Title of project: 
Data mining for robust identification of causal hypotheses
Proposal summary: 

The aim is to identify methods for generation of causal hypotheses, using data mining techniques. Therefore, this will not involve testing specific hypotheses, but rather identifying new hypotheses from a large search space. This search space consists of an exposure subset and outcome subset, and the approaches will be flexible in the types of associations which can be found (e.g. non-linear). The exposure variables will be a set of Mendelian randomisation indicator variables - scores derived from genetic variants which have be previously identified as associated with a particular trait. We will use a large subset of available ALSPAC data as the outcome variable set. This project will include ensuring robustness against the issues of multiple hypothesis testing and also issues related to using a BMI score such as ensuring the association is not due to pleiotropy.

Date proposal received: 
Thursday, 7 June, 2012
Date proposal approved: 
Thursday, 7 June, 2012
Keywords: 
Data mining
Primary keyword: