Cladistic structure within the human Lipoprotein lipase gene and its implications for phenotypic association studies

A R Templeton, K M Weiss, D A Nickerson, E Boerwinkle, C F Sing, A R Templeton, K M Weiss, D A Nickerson, E Boerwinkle, C F Sing

Abstract

Haplotype variation in 9.7 kb of genomic DNA sequence from the human lipoprotein lipase (LPL) gene was scored in three populations: African-Americans from Jackson, Mississippi (24 individuals), Finns from North Karelia, Finland (24), and non-Hispanic whites from Rochester, Minnesota (23). Earlier analyses had indicated that recombination was common but concentrated into a hotspot and that recurrent mutations at multiple sites may have occurred. We show that much evolutionary structure exists in the haplotype variation on either side of the recombinational hotspot. By peeling off significant recombination events from a tree estimated under the null hypothesis of no recombination, we also reveal some cladistic structure not disrupted by recombination during the time to coalescence of this variation. Additional cladistic structure is estimated to have emerged after recombination. Many apparent multiple mutational events at sites still remain after removing the effects of the detected recombination/gene conversion events. These apparent multiple events are found primarily at sites identified as highly mutable by previous studies, strengthening the conclusion that they are true multiple events. This analysis portrays the complexity of the interplay among many recombinational and mutational events that would be needed to explain the patterns of haplotype diversity in this gene. The cladistic structure in this region is used to identify four to six single-nucleotide polymorphisms (SNPs) that would provide disequilibrium coverage over much of this region. These sites may be useful in identifying phenotypic associations with variable sites in this gene. Evolutionary considerations also imply that the SNPs in the 3' region should have general utility in most human populations, but the 5' SNPs may be more population specific. Choosing SNPs at random would generally not provide adequate disequilibrium coverage of the sequenced region.

References

    1. Hum Mol Genet. 1996;5 Spec No:1405-10
    1. J Virol. 1996 Oct;70(10):6947-54
    1. Genetics. 1998 Jun;149(2):983-98
    1. Genetics. 1998 Jun;149(2):999-1017
    1. Nat Genet. 1998 Jul;19(3):233-40
    1. Hum Mutat. 1998;12(2):75-82
    1. Am J Hum Genet. 1998 Aug;63(2):595-612
    1. Hum Mol Genet. 1998 Oct;7(11):1745-51
    1. Hum Genet. 1998 Aug;103(2):168-72
    1. Science. 1998 Oct 23;282(5389):682-9
    1. Hum Genet. 1998 Sep;103(3):311-8
    1. Am J Hum Genet. 2000 Jan;66(1):69-83
    1. Genetics. 1973 Sep;75(1):199-212
    1. Am J Hum Genet. 1984 Nov;36(6):1239-58
    1. Mol Biol Evol. 1987 May;4(3):315-23
    1. Genetics. 1988 Dec;120(4):1145-54
    1. Mol Biol Evol. 1990 Mar;7(2):111-22
    1. Hum Genet. 1991 Mar;86(5):425-41
    1. Hum Genet. 1991 Aug;87(4):457-61
    1. Bioessays. 1992 Jan;14(1):33-6
    1. Genetics. 1992 Oct;132(2):619-33
    1. Mol Cell Biol. 1994 Jun;14(6):4225-32
    1. Circulation. 1994 Jul;90(1):583-612
    1. Mol Phylogenet Evol. 1994 Jun;3(2):102-13
    1. Hum Genet. 1995 Aug;96(2):142-6
    1. Genetics. 1995 Jun;140(2):767-82
    1. Hum Mutat. 1997;9(6):537-47

Source: PubMed

3
Tilaa