Abstract
Low copy repeats (LCRs) are reported to trigger and mediate genomic rearrangements and may result in genetic diseases. The detection of LCRs provides help to interrogate the mechanism of genetic diseases. The complex structures of LCRs render existing genomic structural variation (SV) detection and segmental duplication (SD) tools hard to predict LCR copies in full length especially those LCRs with complex SVs involved or in large scale. We developed a de novo computational tool LCR_Finder that can predict large scale (>100Kb) complex LCRs in a human genome. Technical speaking, by exploiting fast read alignment tools, LCR_Finder first generates overlapping reads from the given genome, aligns reads back to the genome to identify potential repeat regions based on multiple mapping locations. By clustering and extending these regions, we predict potential complex LCRs. We evaluated LCR_Finder on human chromosomes, we are able to identify 4 known disease related LCRs, and predict a few more possible novel LCRs. We also showed that existing tools designed for finding repeats in a genome, such RepeatScout and WindowMasker are not able to identify LCRs and tools designed for detecting SDs also cannot report large scale full length complex LCRs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Smit, A.F.A., Hubley, R., Green, P.: unpublished data (2011); Current Version: open-3.3.0 (RMLib: 20110920)
Babcock, M., et al.: Hominoid lineage specific amplification of low-copy repeats on 22q11.2 (LCR22s) associated with velo-cardio-facial/digeorge syndrome. Hum. Mol. Genet. 16, 2560–2571 (2007)
Bailey, J.A., et al.: Recent segmental duplications in the human genome. Science 297, 1003–1007 (2002)
Cheung, V.G., et al.: Integration of cytogenetic landmarks into the draft sequence of the human genome. Nature 409, 953–958 (2001)
Jiang, Z., et al.: Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution. Nat. Genet. 39, 1361–1368 (2007)
Kurotaki, N., et al.: Sotos syndrome common deletion is mediated by directly oriented subunits within inverted Sos-REP low-copy repeats. Hum. Molec. Genet. 14, 535–542 (2005)
Krzywinski, M., et al.: Circos: an Information Aesthetic for Comparative Genomics. Genome Res. 19, 1639–1645 (2009)
Li, H., et al.: Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 18, 1851–1858 (2008)
Liu, C.M., et al.: SOAP3: Ultra-fast GPU-based parallel alignment tool for short reads. Bioinformatics 28, 878–879 (2012)
Morgulis, A., et al.: WindowMasker: window-based masker for sequenced genomes. Bioinformatics 22, 134–141 (2006)
Park, S.S., et al.: Structure and Evolution of the Smith-Magenis Syndrome Repeat Gene Clusters, SMS-REPs. Genome Res. 12, 729–738 (2002)
Price, A.L., et al.: De novo identification of repeat families in large genomes. Bioinformatics 21, i351–i358 (2005)
Saunier, S., et al.: Characterization of the NPHP1 locus: mutational mechanism involved in deletions in familial juvenile nephronophthisis. Am. J. Hum. Genet. 66, 778–789 (2000)
Stankiewicz, P., Lupski, J.R.: Genome architecture, rearrangements and genomic disorders. Trends Genet. 18, 74–82 (2002)
Valero, M.C., et al.: Fine-scale comparative mapping of the human 7q11.23 region and the orthologous region on mouse chromosome 5G: the low-copy repeats that flank the Williams-Beuren syndrome arose at breakpoint sites of an evolutionary inversion(s). Genomics 69, 1–13 (2000)
Zhang, F., et al.: Copy number variation in human health, disease, and evolution. Annu. Rev. Genomics hum. Genet. 10, 451–481 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, X., Cheung, D.Wl., Ting, HF., Lam, TW., Yiu, SM. (2013). LCR_Finder: A de Novo Low Copy Repeat Finder for Human Genome. In: Cai, Z., Eulenstein, O., Janies, D., Schwartz, D. (eds) Bioinformatics Research and Applications. ISBRA 2013. Lecture Notes in Computer Science(), vol 7875. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38036-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-38036-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38035-8
Online ISBN: 978-3-642-38036-5
eBook Packages: Computer ScienceComputer Science (R0)