PLoS ONE. 2009; 4(6): e5972.
Published online Jun 22, 2009. doi:  10.1371/journal.pone.0005972
PMCID: PMC2696090

Mapping Insertions, Deletions and SNPs on Venter's Chromosomes

Mark A. Batzer, Editor

Abstract

Background

The very recent availability of fully sequenced individual human genomes is a major revolution in biology which is certainly going to provide new insights into genetic diseases and genomic rearrangements.

Results

We mapped the insertions, deletions and SNPs (single nucleotide polymorphisms) that are present in Craig Venter's genome, more precisely on chromosomes 17 to 22, and compared them with the human reference genome hg17. Our results show that insertions and deletions are almost absent in L1 and generally scarce in L2 isochore families (GC-poor L1+L2 isochores represent slightly over half of the human genome), whereas they increase in GC-rich isochores, largely paralleling the densities of genes, retroviral integrations and Alu sequences. The distributions of insertions/deletions are in striking contrast with those of SNPs which exhibit almost the same density across all isochore families with, however, a trend for lower concentrations in gene-rich regions.

Conclusions

Our study strongly suggests that the distribution of insertions/deletions is due to the structure of chromatin which is mostly open in gene-rich, GC-rich isochores, and largely closed in gene-poor, GC-poor isochores. The different distributions of insertions/deletions and SNPs are clearly related to the two different responsible mechanisms, namely recombination and point mutations.

Introduction

The very recent availability of fully sequenced individual human genomes [1]–[5] is a major revolution in biology which is certainly going to provide new insights into genetic diseases and genomic rearrangements in the near future. In the present work, we looked at the insertions, deletions and SNPs that are present in Craig Venter's genome [1], more precisely on chromosomes 17 to 22 (334 megabases, about 10% of the human genome), and compared them with the human reference genome hg17 from the UCSC website.

The three main reasons for carrying out this investigation were the following: (i) to localize insertions, deletions and SNPs on chromosomes 17 to 22, in connection with the compartmentalization of the human genome into isochores [6], [7]; this was done at two levels, namely localization in isochore families (L1, L2, H1, H2, H3, in order of increasing GC and gene density) and mapping within the isochores; (ii) to correlate insertions, deletions and SNPs with the densities of genes, interspersed repeats and retroviral insertions, since these densities are correlated, in turn, with isochore GC levels [8]–[12], [6], and since they may provide indications for the preference of insertions/deletions for different isochore families; (iii) to prepare the ground for exploring the expression of genes located in the neighborhood of deletions and insertions; indeed it has been postulated [7] that compositional changes due to the accumulation of AT-biased point mutations or to deletions/insertions may be associated with alterations of chromatin structure that, in turn, may affect gene expression.
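The assignment of a DNA segment to one of the five isochore families by its GC level can be sketched as follows. This is a minimal illustration only: the numeric cut-offs and the names `FAMILY_UPPER_BOUNDS` and `isochore_family` are assumptions introduced here, since the text gives the order of the families but not their boundaries.

```python
# Hypothetical upper GC bounds (percent GC) for the four lower isochore families.
# These cut-offs are an assumption for illustration; this article does not state them.
FAMILY_UPPER_BOUNDS = [
    ("L1", 37.0),
    ("L2", 41.0),
    ("H1", 46.0),
    ("H2", 53.0),
]


def isochore_family(gc_percent: float) -> str:
    """Return the isochore family (L1, L2, H1, H2 or H3) for a GC level in percent."""
    if not 0.0 <= gc_percent <= 100.0:
        raise ValueError("GC level must be between 0 and 100")
    # Families are ordered by increasing GC; the first bound above the value wins.
    for family, upper in FAMILY_UPPER_BOUNDS:
        if gc_percent < upper:
            return family
    # Anything at or above the last bound falls in the GC-richest family.
    return "H3"
```

Any other set of boundaries can be substituted by editing the list; the lookup itself only relies on the families being ordered by increasing GC.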

It should be pointed out that the present work only concerns (i) insertions and deletions among structural variations (not including copy-number variations such as segmental duplications; see ref. [13] for a review, and ref. [14]); and (ii) SNPs as detected by pairwise alignment of sequences. It should also be stressed that the Venter genome used in our comparison represents a composite haploid version of the genome where the highest scoring alleles contained are represented in the consensus sequence. The human reference genome hg17 (practically identical to the latest hg18 version for the chromosomes under consideration) is a composite genome resulting from several individuals. Insertions and deletions, as well as SNPs, reported in this article are, therefore, the result of the comparison of one genome, the Venter genome, with several individual genomes. In other words, each insertion and deletion in Venter is derived from a comparison with another individual, but not necessarily the same individual. Obviously, this also applies to SNPs. We thought that our approach was acceptable in view of the fact that our primary aim was to look for the localization of insertions/deletions and SNPs on isochores.

Focusing on chromosomes 17–22 is justified by considering that these chromosomes are representative, in terms of isochores, of the whole human genome. A detailed comparison of the full Venter genome with the human reference genome was not warranted at the time of our investigations, because the human reference genome, as already mentioned, is a composite genome. Obviously, a comparison of full individual genomes will be of interest as soon as this becomes possible.

Results

The choice of chromosomes 17 to 22 was due to the fact that while these chromosomes exhibit wide differences in their isochore patterns, they cumulatively show an overall similarity with the isochore patterns of the whole human genome [15]. Indeed, as shown in Figure 1, chromosomes 17 and 20 are characterized by a predominance of H1 and H2 isochores, whereas L1 isochores are poorly represented. In contrast, chromosomes 18 and 21 are characterized by abundant L1 isochores (as well as L2 isochores in the case of chromosome 18, which lacks H3 isochores altogether). Chromosomes 19 and 22 completely lack isochore family L1, are very scarce in L2 isochores, and show a great abundance of H1 and, especially, of H2 isochores. It should be noted that while Figure 1 reports the isochore patterns of chromosomes from release hg17, the isochore profiles of hg17 and hg18, the most recent release, are identical as far as chromosomes 17 to 22 are concerned, the only exceptions being three small gaps in hg17 of chromosome 22 which were filled in the hg18 version (see Figure S1).

Figure 1
Distribution of isochores on chromosomes 17 to 22 from the human reference genome.

Figure 2 compares the cumulative isochore pattern of chromosomes 17 to 22 with that of the whole human genome. The former is characterized by an under-representation of GC-poor isochore families L1 and L2 and by an over-representation of GC-rich isochore families H1, H2 and H3. Chromosomes 17 to 22 still provide, however, a fair representation of the isochore pattern of the whole human genome, which is satisfactory for the purpose of this investigation. In addition, these differences are taken care of by the fact that our data on insertions/deletions are presented as densities.

Figure 2
Comparison of the cumulative isochore distribution on chromosomes 17 to 22 and on the whole human genome.

The locations of insertions and deletions, respectively, in the isochore families of Venter's chromosomes 17 to 22 are summarized in Figure 3A,B. The correlation between the number of indels and the proportion of sequence in isochores was determined using the Pearson correlation coefficient: very significant values (P<0.0001) were found. Densities of insertions and deletions in the three size ranges explored were extremely low in L1 isochores. While this is hardly surprising for chromosomes 19 and 22, which comprise few or no L1 isochores, this is also true for chromosomes 18 and 21, which are rich in L1 isochores. The density of insertions/deletions increased with increasing GC of isochore families, essentially paralleling the densities of genes and Alu sequences, except for the lower values of the longest (>1000 bp, base pairs) insertions/deletions in H3 isochores. In addition, in the latter case deletions and insertions showed a parallel behaviour, whereas insertions in Venter's chromosomes were more abundant than deletions in H1 to H3 families for the 10–100 and 100–1000 bp classes. The points made above expectedly appear more clearly on the cumulative plots of Figure 4.
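Because the isochore families cover very different amounts of DNA, counts of insertions/deletions only become comparable once expressed as densities. A minimal sketch of such a calculation is given below; the function name, the half-open interval convention and the toy coordinates are assumptions made for illustration and are not taken from the authors' script.

```python
import bisect
from collections import defaultdict


def indel_density_per_mb(isochores, indel_positions):
    """Return indels per megabase for each isochore family.

    isochores: non-overlapping (start, end, family) tuples, half-open [start, end).
    indel_positions: reference coordinates of the insertions/deletions.
    """
    isochores = sorted(isochores)
    starts = [start for start, _end, _family in isochores]

    total_bp = defaultdict(int)   # DNA covered by each family
    counts = defaultdict(int)     # indels falling in each family
    for start, end, family in isochores:
        total_bp[family] += end - start

    for pos in indel_positions:
        # Locate the last isochore starting at or before this position.
        i = bisect.bisect_right(starts, pos) - 1
        if i >= 0:
            start, end, family = isochores[i]
            if start <= pos < end:
                counts[family] += 1

    return {family: counts[family] / (bp / 1e6) for family, bp in total_bp.items()}


# Toy data (hypothetical): 4 Mb of L1 with one indel, 1 Mb of H2 with three.
toy_isochores = [(0, 2_000_000, "L1"), (2_000_000, 3_000_000, "H2"),
                 (3_000_000, 5_000_000, "L1")]
toy_indels = [2_100_000, 2_500_000, 2_900_000, 4_000_000]
densities = indel_density_per_mb(toy_isochores, toy_indels)
```

Normalizing by the DNA covered by each family is what allows chromosomes with very different isochore compositions to be pooled.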

Figure 3
Insertions/deletions in Venter's chromosomes.

Figure 4
Density of insertions and deletions in isochore families from chromosomes 17 to 22.

It should be pointed out that (i) if the Venter genome contains two contiguous Alu elements (~600 bp), while the human reference genome contains one Alu element (~300 bp) at the orthologous locus, this locus will be assessed as a Venter genome insertion; and (ii) Alu-Alu recombination-mediated deletions (ARMDs) have been shown to occur frequently throughout primate evolution [16], [17]. Therefore, if this locus was created by an ARMD event in the human reference genome, one should discard this locus from the Venter insertion category. While this is correct in our case, ARMDs could only account for about 50 human-specific deletions (10% of the 492 found by Sen et al. [16] for the whole genome, since Venter's chromosomes 17 to 22 represent about 10% of the human genome). This is, however, a negligible number compared to the 3468 insertions in Venter found by us and would therefore not change our conclusions.
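The arithmetic behind this estimate can be made explicit with a short calculation. It rests on the simplifying assumption, implicit in the argument above, that ARMD events are spread over the genome in proportion to sequence length; the variable names are introduced here for illustration.

```python
armd_whole_genome = 492    # human-specific ARMD deletions reported genome-wide [16]
genome_fraction = 0.10     # chromosomes 17 to 22 as a share of the genome (about 10%)
venter_insertions = 3468   # insertions found in Venter's chromosomes 17 to 22

# Expected number of ARMD loci on chromosomes 17 to 22 under a uniform spread.
expected_armd = armd_whole_genome * genome_fraction

# Largest share of the Venter insertions that such loci could explain.
share_of_insertions = expected_armd / venter_insertions

print(f"expected ARMD loci on chromosomes 17-22: {expected_armd:.1f}")
print(f"share of Venter insertions: {share_of_insertions:.2%}")
```

The expected number comes to just under 50 loci, which corresponds to between 1% and 2% of the insertions, consistent with treating the effect as negligible.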

The results in terms of numbers of insertions/deletions located in different isochore families are reported in Table S1, which also presents the corresponding amounts of DNA. The data show (i) that the predominant weight contribution (>90%) expectedly is that of the largest insertions/deletions; (ii) that the total amounts of both insertions and deletions represent 0.6–2.7% of chromosome sizes, except for the much larger levels in the case of chromosome 19 (3.9% and 12.1%, respectively, for insertions and deletions in Venter); and (iii) that, in general, the patterns of deletions and insertions tend to parallel each other, with the exception of the very abundant deletions in Venter's chromosome 19.

The localizations of insertions/deletions larger than 1000 bp in chromosomes 21 and 22 are shown in Figure 5. Two features are outstanding: (i) the practical absence of insertions and deletions in sub-telomeric regions (e.g. positions 40 to 47 megabases on chromosome 21 of hg17), in spite of the fact that these regions are very GC-rich; and (ii) the highest concentrations of insertions/deletions in regions around position 37 megabases in chromosome 21 of hg17, and around position 39 megabases in chromosome 22 of hg17. These regions do not show any noticeable difference, in the present state of knowledge, when compared with compositionally similar regions located elsewhere on the chromosomes. The localizations of insertions/deletions of 10–100 bp and 100–1000 bp on chromosomes 21 and 22 are reported in Figures S2 and S3.

Figure 5
The largest insertions/deletions in chromosomes 21 and 22.

The parallelism between the densities of insertions and Alu sequences prompted a search for Alu sequences in the insertions of the reference human chromosomes that correspond to deletions in Venter's chromosomes. The results, presented in Table 1, indicate that all or most Alu sequences were present at the ends of 10–100 and 100–1000 bp insertions, respectively, whereas only about 30% of the >1000 bp insertions had Alu sequences at their ends, the majority of Alus being located in internal positions.

Table 1
The number and locations of Alu sequences are reported for three classes of insertions (10–100 bp, 100–1000 bp and >1000 bp) in the human reference genome(a).

In sharp contrast with insertions/deletions, the densities of SNPs were largely uniform over all isochore families (Figure 6; see also Table S2; Figure S4 presents the numbers of SNPs on chromosomes). Even if the vast majority of isochores showed relatively constant concentrations of SNPs, which did not vary with the different GC levels of isochores, a small number of them showed very high or very low concentrations (see Figure 6). When these isochores were analyzed individually (see Table S3), the high SNPs concentrations were found to be either distributed over most of the isochore length (as is the case for isochores having the average SNPs concentration) or present in limited regions (see Figure 7, in which five isochores are reported; for the other isochores see Figure S5). Insertions, being much less numerous than SNPs, were expectedly less widespread in their distribution and tended to coincide with SNPs spikes.

Figure 6
SNPs densities in isochores of chromosomes 17–22.

Figure 7
Insertions and SNPs in individual isochores of chromosomes.

Finally, a trend to avoid gene dense regions was evident when comparing gene density and SNPs density (Figure 8). P values <0.0001 were found for the correlation between gene density and SNPs density.

Figure 8
SNPs and gene densities.

Discussion

The most relevant result of the present investigation concerns the large preference for both insertions and deletions to take place in GC-rich isochores, especially in the H2 and H3 families, which only represent together 15% of the human genome.

The increase in insertions and deletions in the H1-H3 isochore families parallels the increase in the concentration of both Alu sequences and genes (see Introduction), as well as in the degree of “openness” of chromatin [18]–[20] and in the frequency of recombination [21]–[25]. The question should therefore be asked which one(s) of these factors is (are) the most biologically significant as an explanation for the distribution of insertions/deletions.

The correlation between the densities of insertions/deletions and Alu sequences is indicated in the most evident way by the terminal distribution of Alu sequences in insertions in the reference human genome (see Table 1). While such terminal distribution is perfect for the 10–100 bp insertions and still predominant for the 100–1000 bp class, this is not, however, the case for the largest insertions, where Alu sequences are in terminal positions of only about 30% insertions. The distribution of insertions/deletions in GC-rich isochores is, however, not simply due to their richness in repeated sequences such as Alu sequences. Indeed, if this were the case, one would expect to have high levels of insertions/deletions also in GC-poor isochores, which are very rich in the other major family of interspersed repeats, the LINE-1 (long interspersed element-1) family, whereas this is not the case.

An overall positive correlation also exists between insertions/deletions and gene density, but (i) the longest insertions/deletions decrease in the most gene-dense isochores of the H3 family, as if this process were not allowed because of its deleterious impact on genes; and (ii) the insertions/deletions of the other size classes are scarce in telomeric regions, which are very gene-rich, as compared with similarly GC-rich, but less gene-rich isochores located elsewhere on chromosomes. At this point, one should conclude that the correlation between insertions/deletions and gene density is only a consequence of the correlation between gene density and GC level [6].

Having ruled out gene concentration as a factor favoring insertions/deletions (in fact, the opposite being true), and considering that Alu sequences are simply used in the recombination process (LINE-1 not favoring insertions/deletions in GC-poor isochores), the possibility remains that the real reason for the distribution of insertions/deletions reported here is the different chromatin structure of GC-poor vs GC-rich isochores [18]–[20]. This possibility is strongly supported by previous work on retroviral integration.

Indeed, Bovine Leukemia Virus (BLV; [26]), Human Hepatitis B Virus (HBV, a DNA virus with some retroviral features; [27]), Rous Sarcoma Virus (RSV; [28]), Human T-cell Leukemia Virus (HTLV-I; [29]) and Murine Leukemia Virus (MuLV; [30]) were all shown to integrate in GC-rich isochores (see [6] for a review). One might, however, argue that, since all the retroviral sequences mentioned so far are GC-rich [31], integration into GC-rich isochores could depend upon the requirement for a compositional match between the retroviral sequence and the isochores of the host genome without being related to chromatin “openness”. Integration into GC-rich isochores was also found, however, for exogenous Mouse Mammary Tumor Virus (MMTV; [32]) and Human Immunodeficiency Virus (HIV-I; [6], [33]–[36]), which are GC-poor. This obviously favors the idea of an integration into open chromatin structures. Moreover, using different approaches, several authors [37]–[42] found high frequencies of RSV, Avian Leukosis Virus (ALV), and MuLV near DNase-hypersensitive sites, transcriptionally active regions and CpG islands. These results are in agreement with our conclusion since GC-rich isochores correspond to open chromatin regions [23] and since DNase-hypersensitive sites are concentrated in GC-rich isochores [24], [25] which are rich in genes and in CpG islands and are transcriptionally active. In conclusion, the results available indicate that the initial integration of retroviral sequences takes place in open chromatin regions (such as those corresponding to GC-rich isochores), whereas stability of integration and transcription requires a matching composition of retroviral and host sequences [6], [18]. Another result in favor of the open chromatin interpretation is that “new” Alu sequences integrate essentially at random in the genome, but this happens in the paternal germ line [43]–[45], where open chromatin is much more widespread over chromosomes.

At this point one should recall that the pattern of insertions/deletions follows the general pattern of chromosomal rearrangements [18] and recombination [20]–[22]. This might be an alternative possible explanation for the pattern of insertions/deletions. It seems, however, much more plausible that the pattern of recombination itself is dependent upon the distribution of open chromatin regions over the genome. Indeed, DNA duplications also occur more frequently in GC-rich compared to GC-poor isochores [44] and chromosomal fission takes place frequently within regions elevated in GC [46]. As already mentioned, in several cases the localizations of insertions/deletions in chromosomes indicate some specific preferences, such as those shown in Figure 5 and Table S1, which correspond to hot spots of recombination.

These observations are important because structural genome variations, such as insertions/deletions, may be involved in genetic diseases. We have already suggested that this may occur not so much through a direct impact on genes, but rather through local changes in chromatin structure that affect gene expression at a distance [7]. This explanation is supported by the fact that non-coding sequences are so overwhelmingly abundant compared to coding sequences in the human genome (98–99% vs 1–2%; [6]).

In sharp contrast with insertions/deletions, SNPs are rather uniformly distributed over all isochore families. The distribution of SNPs is understandable because the main cause of SNPs is point mutations due to errors during DNA replication, which are apparently not very sensitive to the compositional context. Still, even if this applies to the vast majority of isochores, a small number of them showed very high or very low concentrations. Needless to say, the latter isochores deserve further investigation, also because of the coincidence of recombination hot spots and high SNP densities as shown by Figure 7 and Figure S6.

Methods

Venter's chromosomes were downloaded from GenBank (http://www.ncbi.nlm.nih.gov/GenBank; accession number ABBA01000000; [1]) and were aligned with the human reference genome hg17 [47], [48] (UCSC website, http://genome.ucsc.edu). This release, used for the mapping of isochores by Costantini et al. [15], was compared with the most recent release hg18, and found to be identical as far as chromosomes 17 to 21 are concerned, whereas chromosome 22 showed three small gaps, which were filled in the hg18 version. A script implemented by us was used to align the sequences and to extract the insertions/deletions in Venter's chromosomes, considering three size classes (10–100, 100–1000, >1000 bp), as well as the single nucleotide polymorphisms (SNPs). Insertions/deletions of single nucleotides in Venter's genome were also estimated and represented less than 5% of SNPs. Alu sequence coordinates for the human reference genome were downloaded from the UCSC website.
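The extraction step can be illustrated with a minimal sketch that scans an already computed pairwise alignment. This is not the authors' script, which is not described in further detail; the function names, the use of "-" as the gap character, and the handling of the shared class boundaries (100 bp assigned to the 100–1000 class, 1000 bp included in it) are assumptions made for illustration.

```python
def size_class(length):
    """Assign an insertion/deletion length (bp) to a size class."""
    if length == 1:
        return "1"          # single-nucleotide indel, counted separately
    if length < 10:
        return "2-9"        # below the smallest class considered in the text
    if length < 100:
        return "10-100"
    if length <= 1000:
        return "100-1000"
    return ">1000"


def scan_alignment(ref_aln, sample_aln):
    """List (kind, reference_position, length, class) events from two aligned rows.

    A gap run in the reference row is an insertion in the sample; a gap run in
    the sample row is a deletion in the sample; a mismatched column is a SNP.
    """
    if len(ref_aln) != len(sample_aln):
        raise ValueError("aligned rows must have the same length")
    events = []
    ref_pos = 0  # number of reference bases consumed so far
    i, n = 0, len(ref_aln)
    while i < n:
        r, s = ref_aln[i], sample_aln[i]
        if r == "-" and s == "-":
            i += 1  # column with gaps in both rows carries no information
            continue
        if r == "-" or s == "-":
            gap_in_ref = r == "-"
            j = i
            # Extend the run while the same row keeps the gap.
            while (j < n and (ref_aln[j] == "-") == gap_in_ref
                   and (sample_aln[j] == "-") == (not gap_in_ref)):
                j += 1
            length = j - i
            kind = "insertion" if gap_in_ref else "deletion"
            events.append((kind, ref_pos, length, size_class(length)))
            if not gap_in_ref:
                ref_pos += length  # deleted bases exist in the reference
            i = j
            continue
        if r.upper() != s.upper() and "N" not in (r.upper(), s.upper()):
            events.append(("SNP", ref_pos, 1, "SNP"))
        ref_pos += 1
        i += 1
    return events
```

For example, a 12 bp gap run in the reference row is reported as an insertion of the 10–100 bp class at the reference coordinate where the run begins, and mismatches on either side are reported as SNPs.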

The correlations between the number of indels and proportion of sequence in isochores and between gene density and SNPs density were determined using the Pearson correlation coefficient by the statistical program Prism 4 (GraphPad Software San Diego, CA, USA). A value of P<0.05 was considered to be statistically significant.
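For readers without access to Prism, the Pearson coefficient itself can be computed directly, as in the sketch below. It illustrates the statistic only and does not reproduce the Prism workflow; the function names are introduced here, and the P value would still have to be obtained from a t distribution with n − 2 degrees of freedom, which is not implemented.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need two sequences of equal length with at least 3 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    r = sxy / math.sqrt(sxx * syy)
    # Clamp to [-1, 1] to absorb floating-point rounding.
    return max(-1.0, min(1.0, r))


def t_statistic(r, n):
    """t statistic for testing r against zero, with n - 2 degrees of freedom."""
    if abs(r) >= 1.0:
        return math.copysign(math.inf, r)
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# Toy values (hypothetical), not data from this study.
r = pearson_r([1, 2, 3, 4], [1, 3, 2, 4])
t = t_statistic(r, 4)
```

The same two functions apply to both comparisons named above: number of indels against proportion of sequence in isochores, and gene density against SNP density.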

Supporting Information

Figure S1

(0.02 MB PDF)

Figure S2

(0.05 MB PDF)

Figure S3

(0.07 MB PDF)

Figure S4

(0.04 MB PDF)

Figure S5

(0.08 MB PDF)

Table S1

(0.03 MB XLS)

Table S2

(0.02 MB XLS)

Table S3

(0.02 MB XLS)

Acknowledgments

We thank Fabio Auletta for bioinformatic support. We also thank Mark Batzer for communicating to us unpublished data on structural variations in Venter's genome, and an anonymous Referee for a specific comment on Alu-Alu recombination-mediated deletions (ARMDs).

Footnotes

Competing Interests: The authors have declared that no competing interests exist.

Funding: The funding that supported the work came from the authors' Institute. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Levy S, Sutton G, Ng PC, Feuk L, Halpern AL, et al. The diploid genome sequence of an individual human. PLoS Biology. 2007;5:e254. [PMC free article] [PubMed]
2. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. [PMC free article] [PubMed]
3. Ley TJ, Mardis ER, Ding L, Fulton B, McLellan MD, et al. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature. 2008;456:66–72. [PMC free article] [PubMed]
4. Wang J, Wang W, Li R, Li Y, Tian G, et al. The diploid genome sequence of an Asian individual. Nature. 2008;456:60–66. [PMC free article] [PubMed]
5. Wheeler DA, Srinivasan M, Egholm M, Shen Y, Chen L, et al. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–877. [PubMed]
6. Bernardi G. Amsterdam, The Netherlands: Elsevier; 2004, reprinted in 2005. Structural and Evolutionary Genomics. Natural Selection in Genome Evolution.
7. Bernardi G. The neo-selectionist theory of genome evolution. Proc Natl Acad Sci USA. 2007;104:8385–8390. [PMC free article] [PubMed]
8. Meunier-Rotival M, Soriano P, Cuny G, Strauss F, Bernardi G. Sequence organization and genomic distribution of the major family of interspersed repeats of mouse DNA. Proc Natl Acad Sci USA. 1982;79:355–359. [PMC free article] [PubMed]
9. Soriano P, Meunier-Rotival M, Bernardi G. The distribution of interspersed repeats is non-uniform and conserved in the mouse and human genome. Proc Natl Acad Sci USA. 1983;80:1816–1820. [PMC free article] [PubMed]
10. Bernardi G, Olofsson B, Filipski J, Zerial M, Salinas J, et al. The mosaic genome of warm-blooded vertebrates. Science. 1985;228:953–958. [PubMed]
11. Mouchiroud D, D'Onofrio G, Aïssani B, Macaya G, Gautier C, Bernardi G. The distribution of genes in the human genome. Gene. 1991;100:181–187. [PubMed]
12. Zoubak S, Clay O, Bernardi G. The gene distribution of the human genome. Gene. 1996;174:95–102. [PubMed]
13. Bailey JA, Eichler EE. Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet. 2006;7:552–564. [PubMed]
14. Korbel JO, Urban AE, Affourtit JP, Godwin B, Gribert F, et al. Paired-end mapping reveals extensive structural variation in the human genome. Science. 2007;318:420–426. [PMC free article] [PubMed]
15. Costantini M, Clay O, Auletta F, Bernardi G. An isochore map of human chromosomes. Genome Res. 2006;16:536–541. [PMC free article] [PubMed]
16. Sen SK, Han K, Wang J, Lee J, Wang H, et al. Human genomic deletions mediated by recombination between Alu elements. The American Journal of Human Genetics. 2006;79:41–53. [PMC free article] [PubMed]
17. Han K, Lee J, Meyer TJ, Wang J, Sen SK, et al. Alu Recombination-Mediated Structural Deletions in the chimpanzee genome. PLoS Genetics. 2007;3:e184. [PMC free article] [PubMed]
18. Saccone S, Federico C, Bernardi G. Localization of the gene-richest and the gene-poorest isochores in the interphase nuclei of mammals and birds. Gene. 2002;300:169–78. [PubMed]
19. Di Filippo M, Bernardi G. Mapping DNase-I hypersensitive sites on human isochores. Gene. 2008;419:62–65. [PubMed]
20. Di Filippo M, Bernardi G. The early apoptotic DNA fragmentation targets a small number of open chromatin regions. PLoS ONE. 2009;4:e5010. [PMC free article] [PubMed]
21. Bernardi G. The isochore organization of the human genome. Ann Rev Genet. 1989;23:637–661. [PubMed]
22. Holmquist GP. Chromosome bands, their chromatin flavors, and their functional features. Am J Hum Genet. 1992;51:17–37. [PMC free article] [PubMed]
23. Fullerton SM, Carvalho AB, Clark AG. Local rates of recombination are positively correlated with GC content in the human genome. Mol Biol Evol. 2001;18:1139–1142. [PubMed]
24. Kong A, Gudbjartsson DF, Sainz J, Jonsdottir GM, Gudjonsson SA, et al. A high-resolution recombination map of the human genome. Nature Genet. 2002;31:241–247. [PubMed]
25. Nachman MW. Variation in recombination rate across the genome: evidence and implications. Curr Opin Genet Dev. 2002;12:657–663. [PubMed]
26. Kettman R, Meunier-Rotival M, Cortadas J, Cuny G, Ghysdael J, Mammerickx M, Burny A, Bernardi G. Integration of bovine leukemia virus DNA in the bovine genome. Proc Natl Acad Sci USA. 1979;76:4822–4826. [PMC free article] [PubMed]
27. Zerial M, Salinas J, Filipski J, Bernardi G. Genomic localization of hepatitis B virus in a human hepatoma cell line. Nucleic Acid Res. 1986;14:8373–8386. [PMC free article] [PubMed]
28. Rynditch A, Kadi F, Geryk J, Zoubak S, Svoboda J, Bernardi G. The isopycnic, compartmentalized integration of Rous sarcoma virus sequences. Gene. 1991;106:165–172. [PubMed]
29. Zoubak S, Richardson J, Rynditch A, Höllsberg P, Hafler D, Boeri E, Lever AML, Bernardi G. Regional specificity of HTLV-I proviral integration in the human genome. Gene. 1994;143:155–163. [PubMed]
30. Rynditch A, Zoubak S, Tsyba L, Tryapitsina-Guley N, Bernardi G. The regional integration of retroviral sequences into the mosaic genomes of mammals. Gene. 1998;222:1–16. [PubMed]
31. Zoubak S, Rynditch A, Bernardi G. Compositional bimodality and evolution of retroviral genomes. Gene. 1992;119:207–213. [PubMed]
32. Salinas J, Zerial M, Filipski J, Bernardi G. Gene distribution and nucleotide sequence organization in the mouse genome. Eur J Biochem. 1986;160:469–478. [PubMed]
33. Glukhova LA, Zoubak SV, Rynditch A, Miller GG, Titova IV. Localization of HTLV-1 and HIV-1 proviral sequences in chromosomes of persistently infected cells. Chromosome Res. 1999;7:177–183. [PubMed]
34. Elleder D, Pavliceck A, Paces J, Hejnar J. Preferential integration of human immunodeficiency virus type 1 into genes, cytogenetic R bands and GC-rich DNA regions:insight from the human genome sequence. FEBS Lett. 2002;517:285–286. [PubMed]
35. Tsyba L, Rynditch A, Boeri E, Jabbari K, Bernardi G. Distribution of HIV-1 in the genomes of AIDS patients. Cell Mol Life Sci. 2004;61:721–726. [PubMed]
36. Mok HP, Lever AML. Location, location, location. Gene Therapy. 2005;12:1–2. [PubMed]
37. Schubach W, Groudine M. Alteration of c-myc chromatin structure by avian leucosis virus integration. Nature. 1984;307:702–708. [PubMed]
38. Vijaya S, Steffen DL, Kozak C, Robinson HL. Acceptor sites for retroviral integrations map near DNA I-hypersensitive sites in chromatin. J Virol. 1986;60:683–692. [PMC free article] [PubMed]
39. Rohdewhold H, Weinher H, Reik W, Jaenisch R, Breindl M. Retrovirus integration and chromatin structure: Moloney murine leukemia proviral integration sites map near DNase I-hypersensitive sites. J Virol. 1987;61:336–343. [PMC free article] [PubMed]
40. Mooslehner K, Karl U, Harbers K. Retroviral sites in transgenic Mov mice frequently map in the vicinity of transcribed DNA region. J Virol. 1990;64:3056–3058. [PMC free article] [PubMed]
41. Scherdin V, Rhodes K, Brendl M. Transcriptionally active genome regions are preferred targets for retrovirus integration. J Virol. 1990;64:907–912. [PMC free article] [PubMed]
42. Finchman VJ, Wyke JA. Differences between cellular integration sites of transcribed and non transcribed Rous sarcoma proviruses. J Virol. 1991;65:461–463. [PMC free article] [PubMed]
43. Jurka J. Evolutionary impact of human Alu repetitive elements. Current Opinion in Genetics and Development. 2004;14:603–608. [PubMed]
44. Jurka J, Kohany O, Pavliceck A, Kapitonov VV, Jurka MV. Duplication, coclustering and selection of human Alu retrotransposons. Proc Natl Acad Sci USA. 2004;101:1268–1272. [PMC free article] [PubMed]
45. Jurka J, Kohany O, Pavliceck A, Kapitonov VV, Jurka MV. Clustering, duplication and chromosomal distribution of mouse SINE retrotransposons. Cytogenet. Genome Res. 2005;110:117–123. [PubMed]
46. Webber C, Ponting C. Hotspots of mutation and breakage in dog and human chromosomes. Genome Research. 2005;15:1787–1797. [PMC free article] [PubMed]
47. International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature. 2004;431:931–945. [PubMed]
48. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. [PMC free article] [PubMed]
49. Pačes J, Zika R, Pavliček A, Clay O, Bernardi G. Representing GC variation along eukaryotic chromosomes. Gene. 2004;333:135–141. [PubMed]
50. Pavliček A, Pačes J, Clay O, Bernardi G. A compact view of isochores in the draft human genome sequence. FEBS Lett. 2002;511:165–169. [PubMed]

Articles from PLoS ONE are provided here courtesy of Public Library of Science