The autosomal recessive genome instability disorder Ataxia-telangiectasia, caused by mutations in ATM kinase, is characterized by the progressive loss of cerebellar neurons. We find that DNA damage associated with ATM loss results in dysfunctional behaviour of human microglia, immune cells of the central nervous system. Microglial dysfunction is mediated by the pro-inflammatory RELB/p52 non-canonical NF-κB transcriptional pathway and leads to excessive phagocytic clearance of neuronal material. Activation of the RELB/p52 pathway in ATM-deficient microglia is driven by persistent DNA damage and is dependent on the NIK kinase. Activation of non-canonical NF-κB signalling is also observed in cerebellar microglia of individuals with Ataxia-telangiectasia. These results provide insights into the underlying mechanisms of aberrant microglial behaviour in ATM deficiency, potentially contributing to neurodegeneration in Ataxia-telangiectasia.
Publications
2022
Pathogenic variants in the SRCAP (SNF2-related CREBBP activator protein) gene, which encodes a chromatin-remodeling ATPase, cause neurodevelopmental disorders including Floating Harbor syndrome (FLHS). Here, we report the discovery of a de novo transposon insertion in SRCAP exon 13 from trio genome sequencing in a 28-year-old female with failure to thrive, developmental delay, mood disorder and seizure disorder. The insertion was a full-length (~2.8 kb), antisense-oriented SVA insertion relative to the SRCAP transcript, bearing a 5’ transduction and hallmarks of target-primed reverse transcription. The 20-bp 5’ transduction allowed us to trace the source SVA element to an intron of a long non-coding RNA on chromosome 12, which is highly expressed in testis. RNA sequencing and qRT-PCR confirmed significant depletion of SRCAP expression and low-level exon skipping in the proband. This case highlights a novel disease-causing structural variant and the importance of transposon analysis in a clinical diagnostic setting.
Somatic mutations are DNA variants that occur after the fertilization of zygotes and accumulate during the developmental and aging processes in the human lifespan. Somatic mutations have long been known to cause cancer, and more recently have been implicated in a variety of non-cancer diseases. The patterns of somatic mutations, or mutational signatures, also shed light on the underlying mechanisms of the mutational process. Advances in next-generation sequencing over the decades have enabled genome-wide profiling of DNA variants in a high-throughput manner; however, unlike germline mutations, somatic mutations are carried only by a subset of the cell population. Thus, sensitive bioinformatic methods are required to distinguish mutant alleles from sequencing and base calling errors in bulk tissue samples. An alternative way to study somatic mutations, especially those present in an extremely small number of cells or even in a single cell, is to sequence single-cell genomes after whole-genome amplification (WGA); however, it is critical and technically challenging to exclude numerous technical artifacts arising during error-prone and uneven genome amplification in current WGA methods. To address these challenges, multiple bioinformatic tools have been developed. In this review, we summarize the latest progress in methods for identification of somatic mutations and the challenges that remain to be addressed in the future.
2021
While oncogenic mutations have been found in non-diseased, proliferative non-neural tissues, their prevalence in the human brain is unknown. Targeted sequencing of genes implicated in brain tumors in 418 samples derived from 110 individuals of varying ages, without tumor diagnoses, detected oncogenic somatic single-nucleotide variants (sSNVs) in 5.4% of the brains, including IDH1 R132H. These mutations were largely present in subcortical white matter and enriched in glial cells, and surprisingly, were less common in older individuals. A depletion of high-allele frequency sSNVs representing macroscopic clones with age was replicated by analysis of bulk RNAseq data from 1,816 non-diseased brain samples ranging from fetal to old age. We also describe large clonal copy number variants, and that sSNVs show mutational signatures resembling those found in gliomas, suggesting that mutational processes of the normal brain drive early glial oncogenesis. This study helps understand the origin and early evolution of brain tumors.
Background
Results
Conclusions
Transposable elements (TEs) significantly contribute to shaping the diversity of the human genome, and lines of evidence suggest TEs as one of driving forces of human brain evolution. Existing computational approaches, including cross-species comparative genomics and population genetic modeling, can be adapted for the study of the role of TEs in evolution. In particular, diverse ancient and archaic human genome sequences are increasingly available, allowing reconstruction of past human migration events and holding the promise of identifying and tracking TEs among other evolutionarily important genetic variants at an unprecedented spatiotemporal resolution. However, highly degraded short DNA templates and other unique challenges presented by ancient human DNA call for major changes in current experimental and computational procedures to enable the identification of evolutionarily important TEs. Ancient human genomes are valuable resources for investigating TEs in the evolutionary context, and efforts to explore ancient human genomes will potentially provide a novel perspective on the genetic mechanism of human brain evolution and inspire a variety of technological and methodological advances. In this review, we summarize computational and experimental approaches that can be adapted to identify and validate evolutionarily important TEs, especially for human brain evolution. We also highlight strategies that leverage ancient genomic data and discuss unique challenges in ancient transposon genomics.
Transposable elements (TEs) help shape the structure and function of the human genome. When inserted into some locations, TEs may disrupt gene regulation and cause diseases. Here, we present xTea (x-Transposable element analyzer), a tool for identifying TE insertions in whole-genome sequencing data. Whereas existing methods are mostly designed for short-read data, xTea can be applied to both short-read and long-read data. Our analysis shows that xTea outperforms other short read-based methods for both germline and somatic TE insertion discovery. With long-read data, we created a catalogue of polymorphic insertions with full assembly and annotation of insertional sequences for various types of retroelements, including pseudogenes and endogenous retroviruses. Notably, we find that individual genomes have an average of nine groups of full-length L1s in centromeres, suggesting that centromeres and other highly repetitive regions such as telomeres are a significant yet unexplored source of active L1s. xTea is available at https://github.com/parklab/xTea.
Alzheimer's disease (AD) is a progressive neurodegenerative disease associated with a complex genetic etiology. Besides the apolipoprotein E ε4 (APOE ε4) allele, a few dozen other genetic loci associated with AD have been identified through genome-wide association studies (GWAS) conducted mainly in individuals of European ancestry. Recently, several GWAS performed in other ethnic groups have shown the importance of replicating studies that identify previously established risk loci and searching for novel risk loci. APOE-stratified GWAS have yielded novel AD risk loci that might be masked by, or be dependent on, APOE alleles. We performed whole-genome sequencing (WGS) on DNA from blood samples of 331 AD patients and 169 elderly controls of Korean ethnicity who were APOE ε4 carriers. Based on WGS data, we designed a customized AD chip (cAD chip) for further analysis on an independent set of 543 AD patients and 894 elderly controls of the same ethnicity, regardless of their APOE ε4 allele status. Combined analysis of WGS and cAD chip data revealed that SNPs rs1890078 (P = 6.64E-07) and rs12594991 (P = 2.03E-07) in SORCS1 and CHD2 genes, respectively, are novel genetic variants among APOE ε4 carriers in the Korean population. In addition, nine possible novel variants that were rare in individuals of European ancestry but common in East Asia were identified. This study demonstrates that APOE-stratified analysis is important for understanding the genetic background of AD in different populations.