Feb 17, 2016 | Atlanta, GA
A study from School of Biology Professor Greg Gibson’s group, just published in the American Journal of Human Genetics, argues that we should be looking not just at the structural parts of genes, but also at the regulatory regions around them. The paper demonstrates that there is a burden of rare genetic variants in these regions that is associated with abnormal gene expression. It does not show that these variants cause birth defects, but it does suggest that they need to be seriously considered as whole-genome sequencing (WGS) technology develops.
Gibson explains it in the form of a metaphor about building a house. He says there are two critical components: the bricks and mortar, and the plans for where to put them. If there is a defect in the glass or a crack in a piece of wood, then sooner or later the structure may fall apart. This is what current approaches focus on: the so-called protein-coding regions. But if the architect’s plans call for more windows than the beams can support, or the contractor doesn’t deliver enough concrete, then the consequences can be just as bad.
We now know that a lot more of the genetic component of differences in the way we look and behave, or of what makes us susceptible to different diseases, is in the planning than in the structural components. This insight is based on studies of common polymorphisms, namely the millions of genetic differences that we all share. The new study argues that it will also be true of rare genetic variants, including new mutations that are specific to a single person.
Graduate student Jing Zhao sequenced the regulatory regions of almost 500 genes from 500 participants in the Georgia Tech-Emory Predictive Health Institute study, and added up the number of rare mutations in people whose expression of those genes was toward the extremes. The result is what she calls a smile plot, because the curve has a high number at either end and a low number in the middle. It means that the plans can be off in either direction, making too little or too much transcript for each gene. As Gibson explains, it is as if all the houses with crooked window frames are that way not because of the wood quality, but because each builder made different mistakes when putting the frames in.
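For readers who want to see the counting idea concretely, here is a minimal Python sketch of how a smile plot could be tabulated: rank people by expression for each gene, group the ranks into bins, and average the rare-variant counts in each bin. It is an illustration under stated assumptions, not the method used in the paper; the function name, the data layout, and the toy values are all hypothetical.

```python
def burden_by_expression_bin(expression, rare_counts, n_bins=5):
    """Average rare-variant count per expression-rank bin, pooled over genes.

    expression:  {gene: {person: expression level}}      (hypothetical layout)
    rare_counts: {gene: {person: number of rare variants in the
                  regulatory region of that gene}}       (hypothetical layout)
    Returns a list of length n_bins, ordered from lowest to highest expression.
    """
    totals = [0] * n_bins
    sizes = [0] * n_bins
    for gene, levels in expression.items():
        # People ordered from lowest to highest expression of this gene.
        ranked = sorted(levels, key=levels.get)
        n = len(ranked)
        for rank, person in enumerate(ranked):
            bin_index = rank * n_bins // n
            totals[bin_index] += rare_counts.get(gene, {}).get(person, 0)
            sizes[bin_index] += 1
    return [t / s if s else 0.0 for t, s in zip(totals, sizes)]


# Toy values invented purely for illustration; they are not study data.
toy_expression = {
    "GENE_A": {"p1": 0.2, "p2": 1.0, "p3": 1.1, "p4": 0.9, "p5": 2.5},
    "GENE_B": {"p1": 5.0, "p2": 3.0, "p3": 3.1, "p4": 0.5, "p5": 2.9},
}
toy_rare_counts = {
    "GENE_A": {"p1": 2, "p5": 1},
    "GENE_B": {"p4": 1, "p1": 2, "p2": 1},
}

burden = burden_by_expression_bin(toy_expression, toy_rare_counts)
print(burden)
```

In this toy example the averages are highest in the first and last bins, which is the shape the smile plot describes. A real analysis would also need a definition of "rare" and a statistical test of the excess at the extremes, neither of which is sketched here.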
Furthermore, there seem to be specific subsets of genes where these events are more or less likely to happen. This is important, because it implies that we may be able to develop algorithms that identify the most likely places for regulation to go wrong, based on the evolutionary conservation of different parts of genes.
Projects such as the President’s Precision Medicine Initiative aim to use genomics to help us decipher individual causes of disease. In the next few years, Gibson expects that much larger datasets of tens and eventually hundreds of thousands of people, in many different tissues, will appear. The challenges are as much in the bioinformatics as in the technology.
A Burden of Rare Variants Associated with Extremes of Gene Expression in Human Peripheral Blood.
Zhao J, Akinsanmi I, Arafat D, Cradick TJ, Lee CM, Banskota S, Marigorta UM, Bao G, Gibson G.
Am J Hum Genet. 2016 Feb 4;98(2):299-309. doi: 10.1016/j.ajhg.2015.12.023. PMID: 26849112