Publications 2002








Roy-Engel, A., A.-H. Salem, T. Oyeniran, L. Deininger, D. J. Hedges, G. E. Kilroy, M. A. Batzer and P. L. Deininger (2002) Active Alu element “A-tails”: size does matter. Genome Research 12: 1333-1344


Abstract:
Long and short interspersed elements (LINEs and SINEs) are retroelements that make up almost half of the human genome. L1 and Alu represent the most prolific human LINE and SINE families, respectively. Only a few Alu elements are able to retropose, and the factors determining their retroposition capacity are poorly understood. The data presented in this paper indicate that the length of Alu "A-tails" is one of the principal factors in determining the retropositional capability of an Alu element. The A stretches of the Alu subfamilies analyzed, both old (Alu S and J) and young (Ya5), had a Poisson distribution of A-tail lengths with a mean size of 21 and 26, respectively. In contrast, the A-tails of very recent Alu insertions (disease causing) were all between 40 and 97 bp in length. The L1 elements analyzed displayed a similar tendency, in which the "disease"-associated elements have much longer A-tails (mean of 77) than do the elements even from the young Ta subfamily (mean of 41). Analysis of the draft sequence of the human genome showed that only about 1000 of the over one million Alu elements have tails of 40 or more adenosine residues in length. The presence of these long A stretches shows a strong bias toward the actively amplifying subfamilies, consistent with their playing a major role in the amplification process. Evaluation of the 19 Alu elements retrieved from the draft sequence of the human genome that are identical to the Alu Ya5a2 insert in the NF1 gene showed that only five have tails with 40 or more adenosine residues. Sequence analysis of the loci with the Alu elements containing the longest A-tails (7 of the 19) from the genomes of the NF1 patient and the father revealed that there are at least two loci with A-tails long enough to serve as source elements within our model. Analysis of the A-tail lengths of 12 Ya5a2 elements in diverse human population groups showed substantial variability in both the Alu A-tail length and sequence homogeneity. On the basis of these observations, a model is presented for the role of A-tail length in determining which Alu elements are active.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Deininger, P. L. and M. A. Batzer (2002) Mammalian Retroelements. Genome Research 12: 1455-1465


Abstract:
The eukaryotic genome has undergone a series of epidemics of amplification of mobile elements that have resulted in most eukaryotic genomes containing much more of this 'junk' DNA than actual coding DNA. The majority of these elements utilize an RNA intermediate and are termed retroelements. Most of these retroelements appear to amplify in evolutionary waves that insert in the genome and then gradually diverge. In humans, almost half of the genome is recognizably derived from retroelements, with the two elements that are currently actively amplifying, L1 and Alu, making up about 25% of the genome and contributing extensively to disease. The mechanisms of this amplification process are beginning to be understood, although there are still more questions than answers. Insertion of new retroelements may directly damage the genome, and the presence of multiple copies of these elements throughout the genome has longer-term influences on recombination events in the genome and more subtle influences on gene expression.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Kass, D. H. and M. A. Batzer (2002) Pseudogenes. In Genetics, R. Robinson (Ed.), Macmillan Reference USA , New York, NY


Abstract:


View this Publication:


Supplemental Data:


PubMed Link:







Kass, D. H. and M. A. Batzer (2002) Transposable genetic elements. In Genetics, R. Robinson (Ed.), Macmillan Reference USA , New York, NY


Abstract:


View this Publication:


Supplemental Data:


PubMed Link:







Roy-Engel, A. M. and M. A. Batzer (2002) Repetitive DNA sequences . In Genetics, R. Robinson (Ed.), Macmillan Reference USA ,New York, NY


Abstract:


View this Publication:


Supplemental Data:


PubMed Link:







Kayser, M., S. Brauer, S. Willuweit, H. Schadlich, M. A. Batzer , J. Zawacki, M. Prinz, L. Roewer and M. Stoneking (2002) Online Y-chromosomal short tandem repeat (STR) haplotype reference database for U.S. populations. Journal of Forensic Sciences 47: 513-519


Abstract:
We describe here an online Y-chromosomal short tandem repeat haplotype reference database (YHRD) for U.S. populations, which represents 9-locus Y-STR haplotypes for 1705 African-Americans, European-Americans and Hispanics as of October 2001. This database is available online (http://www.ystr. org/usa/), free to access and was generated in order to supply the U.S. forensic DNA community with a valuable resource for frequencies of complete or incomplete 9-locus Y-STR haplotypes, as well as information about typing protocols and population genetic analyses. Pairwise R(ST)-statistics derived from the Y-STR haplotypes indicate no significant substructure among African-American populations from different regions of the U.S., nor (usually) among European-American and Hispanic populations. Thus, pooling of Y-STR haplotype data from regional populations within these three major groups is appropriate in order to obtain larger sample sizes. However, pooling of different major populations is generally not recommended due to statistically significant differences between African-American populations and all European-American/Hispanic populations, as well as between some European-American and Hispanic populations.


View this Publication:


Supplemental Data:


PubMed Link:
Link







Myers, J. S.*, B. J. Vincent* , H. Udall, W. S.Watkins , T. A. Morrish, G. E. Kilroy, G. D. Swergold , J. Henke , L. Henke , J. V. Moran , L. B. Jorde and M. A. Batzer (2002) A Comprehensive Analysis of Recently Integrated Human L1 Ta L1 Elements. American Journal of Human Genetics 71: 312-326


Abstract:
The Ta (transcribed, subset a) subfamily of L1 LINEs (long interspersed elements) is characterized by a 3-bp ACA sequence in the 3' untranslated region and contains approximately 520 members in the human genome. Here, we have extracted 468 Ta L1Hs (L1 human specific) elements from the draft human genomic sequence and screened individual elements using polymerase-chain-reaction (PCR) assays to determine their phylogenetic origin and levels of human genomic diversity. One hundred twenty-four of the elements amenable to complete sequence analysis were full length ( approximately 6 kb) and have apparently escaped any 5' truncation. Forty-four of these full-length elements have two intact open reading frames and may be capable of retrotransposition. Sequence analysis of the Ta L1 elements showed a low level of nucleotide divergence with an estimated age of 1.99 million years, suggesting that expansion of the L1 Ta subfamily occurred after the divergence of humans and African apes. A total of 262 Ta L1 elements were screened with PCR-based assays to determine their phylogenetic origin and the level of human genomic variation associated with each element. All of the Ta L1 elements analyzed by PCR were absent from the orthologous positions in nonhuman primate genomes, except for a single element (L1HS72) that was also present in the common (Pan troglodytes) and pygmy (P. paniscus) chimpanzee genomes. Sequence analysis revealed that this single exception is the product of a gene conversion event involving an older preexisting L1 element. One hundred fifteen (45%) of the Ta L1 elements were polymorphic with respect to insertion presence or absence and will serve as identical-by-descent markers for the study of human evolution.


View this Publication:
Publication


Supplemental Data:
Supplemental Table 1
Supplemental Table 2
Supplemental data 1
Supplemental data 2
Supplemental data 3
Supplemental data 4
Supplemental data 5


PubMed Link:
Link







Battilana, J., S. L. Bonatto, L. B. Freitas, M. H. Hutz, T. A. Weimer, S. M. Callegari-Jacques, M. A. Batzer , K. Hill, A. M. Hurtado, M. L. Petzl-Erler and F. M. Salzano (2002) Alu insertions vs. blood group plus other protein genetic variability in four Amerindian populations. Annals of Human Biology 29: 334-347


Abstract:
Do the population relationships obtained using DNA or blood group plus protein markers remain the same or do they reveal different patterns, indicating that the factors which influence genetic variation at these two levels of analysis are diverse? Can these markers shed light on the biological classification of the Ache, a Paraguayan tribe which only recently established more permanent contacts with non-Indians? SUBJECTS AND METHODS: To consider these questions we typed 193 individuals from four Amerindian tribes in relation to 12 Alu polymorphisms (five of them never studied in these populations), while 22 blood group plus protein systems were studied among the Ache. These data were then integrated with those previously available (blood groups plus proteins) for the three other populations. DNA extraction and amplification, as well as the other laboratory procedures, were performed using standard methods currently in use in our laboratory. The genetic relationships were obtained using the D(A) distance, and the trees were constructed by the neighbour-joining method, both developed by M. Nei and collaborators. Reliability of the trees was tested by bootstrap replications. Other population variability values were also determined using Nei's methods. RESULTS: Alu polymorphism was observed in all populations and for most of the loci; in the seven systems from which we could compare our results with those of other Amerindian groups agreement was satisfactory. Unusual findings on the blood group plus protein systems of the Ache were a very low (5%) HP*1 frequency and the presence of the C(W) phenotype in the Rh blood group. The intertribal patterns of relationship and other aspects of their variation were remarkably congruent in the two sets (Alu; blood group plus protein) of systems. CONCLUSIONS: The answer to the first question posed above is affirmative. However, the problem of whether the Ache derived from a Ge group that preceded the Guarani colonization of Paraguay, or are just a differentiated Guarani group, could not be answered with the genetic information available; the second hypothesis seems more likely at present, but the point to be emphasized is the striking genetic distinctiveness of the Ache as compared to other Amerindians.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Morrish, T. A., N. Gilbert, J. S. Myers, B .J. Vincent, T. Stamato, G. Taccioli, M. A. Batzer and J. V. Moran (2002) DNA repair mediated by endonuclease-independent LINE-1 retrotransposition. Nature Genetics 31: 159-165


Abstract:
Long interspersed elements (LINE-1s) are abundant retrotransposons in mammalian genomes that probably retrotranspose by target site-primed reverse transcription (TPRT). During TPRT, the LINE-1 endonuclease cleaves genomic DNA, freeing a 3' hydroxyl that serves as a primer for reverse transcription of LINE-1 RNA by LINE-1 reverse transcriptase. The nascent LINE-1 cDNA joins to genomic DNA, generating LINE-1 structural hallmarks such as frequent 5' truncations, a 3' poly(A)+ tail and variable-length target site duplications (TSDs). Here we describe a pathway for LINE-1 retrotransposition in Chinese hamster ovary (CHO) cells that acts independently of endonuclease but is dependent upon reverse transcriptase. We show that endonuclease-independent LINE-1 retrotransposition occurs at near-wildtype levels in two mutant cell lines that are deficient in nonhomologous end-joining (NHEJ). Analysis of the pre- and post-integration sites revealed that endonuclease-independent retrotransposition results in unusual structures because the LINE-1s integrate at atypical target sequences, are truncated predominantly at their 3' ends and lack TSDs. Moreover, two of nine endonuclease-independent retrotranspositions contained cDNA fragments at their 3' ends that are probably derived from the reverse transcription of endogenous mRNA. Thus, our results suggest that LINE-1s can integrate into DNA lesions, resulting in retrotransposon-mediated DNA repair in mammalian cells.


View this Publication:
Publication


Supplemental Data:
Supplemental data


PubMed Link:
Link







Batzer, M. A. and P. L. Deininger (2002) Alu repeats and human genomic diversity. Nature Reviews Genetics 3: 370-379


Abstract:
During the past 65 million years, Alu elements have propagated to more than one million copies in primate genomes, which has resulted in the generation of a series of Alu subfamilies of different ages. Alu elements affect the genome in several ways, causing insertion mutations, recombination between elements, gene conversion and alterations in gene expression. Alu-insertion polymorphisms are a boon for the study of human population genetics and primate comparative genomics because they are neutral genetic markers of identical descent with known ancestral states.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Batzer, M. A. (2002) Review of “Genomics Protocols. Edited by M. P. Starkey and R. Elaswarapu. Humana Press, Totowa, NJ, 2001”. Analytical Biochemistry 304: 138


Abstract:
Book review of Genomics Protocols. Edited by M. P. Starkey and R. Elaswarapu. Humana Press, Totowa, NJ, 2001.


View this Publication:
Publication


Supplemental Data:


PubMed Link:







Romualdi, C., D. Balding, I. S. Nasidze, G. Risch, M. Robichaux, S. T. Sherry, M. Stoneking, M. A. Batzer and G. Barbujani (2002) Patterns of human diversity, within and among continents, inferred from biallelic DNA polymorphisms. Genome Research 12: 602-612


Abstract:
Previous studies have reported that about 85% of human diversity at Short Tandem Repeat (STR) and Restriction Fragment Length Polymorphism (RFLP) autosomal loci is due to differences between individuals of the same population, whereas differences among continental groups account for only 10% of the overall genetic variance. These findings conflict with popular notions of distinct and relatively homogeneous human races, and may also call into question the apparent usefulness of ethnic classification in, for example, medical diagnostics. Here, we present new data on 21 Alu insertions in 32 populations. We analyze these data along with three other large, globally dispersed data sets consisting of apparently neutral biallelic nuclear markers, as well as with a beta-globin data set possibly subject to selection. We confirm the previous results for the autosomal data, and find a higher diversity among continents for Y-chromosome loci. We also extend the analyses to address two questions: (1) whether differences between continental groups, although small, are nevertheless large enough to confidently assign individuals to their continent on the basis of their genotypes; (2) whether the observed genotypes naturally cluster into continental or population groups when the sample source location is ignored. Using a range of statistical methods, we show that classification errors are at best around 30% for autosomal biallelic polymorphisms and 27% for the Y chromosome. Two data sets suggest the existence of three and four major groups of genotypes worldwide, respectively, and the two groupings are inconsistent. These results suggest that, at random biallelic loci, there is little evidence, if any, of a clear subdivision of humans into biologically defined groups.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Savas, S., B. Frischertz, M. Z. Pelias, M. A . Batzer, P. L. Deininger and B. J. B. Keats (2002) The USH1C 216G-->A mutation and the 9-repeat VNTR(t,t) allele are in complete linkage disequilibrium in the Acadian population. Human Genetics 110: 95-97


Abstract:
Recently, mutations in USH1C were shown to be associated with Usher syndrome type IC, and a mutation (216G-->A) in exon 3 was identified in an Acadian family. In addition, a 45-bp variable number of tandem repeat (VNTR) polymorphism was found in intron 5 of USH1C. Polymerase chain reaction amplification of the VNTR region and restriction enzyme analysis of exon 3 of USH1C showed that, of 44 Acadian patients, 43 were homozygous for both the 216G-->A mutation and nine repeats of the VNTR, with a "t" nucleotide replacing a "g" nucleotide at the 8th position of both the eighth and ninth copies of the repeat, viz., 9VNTR(t,t). The remaining Acadian patient was reported to be a compound heterozygote for 216G-->A/9VNTR(t,t) and 238-239insC, a USH1C mutation that has been found in other populations. These data demonstrate that the 9VNTR(t,t) allele is in complete linkage disequilibrium with the 216G-->A mutation in the Acadian population. Among 82 Acadian controls, one was heterozygous for 216G-->A/9VNTR(t,t). The 238-239insC mutation was not found in Acadian controls. Analysis of 340 non-Acadian normal samples showed the presence of a 9-repeat VNTR allele in one Hispanic sample. This individual had neither the 216G-->A mutation nor the Acadian VNTR(t,t) structure. These results suggest that the 216G-->A mutation and the 9VNTR(t,t) allele are restricted to the Acadians and are in complete linkage disequilibrium.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Donaldson, C., J. P. Crapanzano, J. C. Watson, E. A. Levine and M. A. Batzer (2002) PROGINS Alu insertion and human genomic diversity. Mutation Research 501: 137-141


Abstract:
A polymorphic Alu element belonging to the young Ya5 subfamily of Alu repeats located in the progesterone receptor gene has been characterized. Using a polymerase chain reaction (PCR)-based assay, the genetic diversity associated with the PROGINS Alu repeat was determined in a diverse array of human populations. The level of insertion polymorphism associated with PROGINS suggests that it will be a useful marker for the study of human evolution. In addition, we determined the distribution of the PROGINS Alu insertion in two groups of women from greater New Orleans, LA with breast cancer. The PROGINS Alu insertion was not associated with breast cancer in the populations tested.


View this Publication:
Publication


Supplemental Data:


PubMed Link:
Link







Roy-Engel, A. M.*, M. L. Carroll*, M. El-Sawy, A.-H. Salem, R. K. Garber, S. V. Nguyen, P. L. Deininger‡ and M. A. Batzer‡ (2002) Non-traditional Alu evolution and primate genomic diversity. Journal of Molecular Biology 316: 1033-1040. [cover article]


Abstract:
Alu elements belonging to the previously identified "young" subfamilies are thought to have inserted in the human genome after the divergence of humans from non-human primates and therefore should not be present in non-human primate genomes. Polymerase chain reaction (PCR) based screening of over 500 Alu insertion loci resulted in the recovery of a few "young" Alu elements that also resided at orthologous positions in non-human primate genomes. Sequence analysis demonstrated these "young" Alu insertions represented gene conversion events of pre-existing ancient Alu elements or independent parallel insertions of older Alu elements in the same genomic region. The level of gene conversion between Alu elements suggests that it may have a significant influence on the single nucleotide diversity within the genome. All the instances of multiple independent Alu insertions within the same small genomic regions were recovered from the owl monkey genome, indicating a higher Alu amplification rate in owl monkeys relative to many other primates. This study suggests that the majority of Alu insertions in primate genomes are the products of unique evolutionary events.


View this Publication:
Publication


Supplemental Data:
Cover


PubMed Link:
Link