Papers by David Ellinghaus
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2015
High-throughput genotyping technologies (such as SNP arrays) allow the rapid collection of up to a few million genetic markers of an individual. Detecting epistasis (based on 2-SNP interactions) in genome-wide association studies is an important but time-consuming operation, since statistical computations have to be performed for each pair of measured markers. Computational methods to detect epistasis therefore suffer from prohibitively long runtimes; e.g., processing a moderately sized dataset consisting of about 500,000 SNPs and 5,000 samples requires several days using state-of-the-art tools on a standard 3 GHz CPU. In this paper, we demonstrate how this task can be accelerated using a combination of fine-grained and coarse-grained parallelism on two different computing systems. The first architecture is based on reconfigurable hardware (FPGAs), while the second architecture uses multiple GPUs connected to the same host. We show that both systems can achieve speedups of around four orders of magnitude compared to the sequential implementation. This reduces the runtimes for detecting epistasis to only a few minutes for moderately sized datasets and to a few hours for large-scale datasets.
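To illustrate why the pairwise search described above is expensive, the following is a minimal sequential sketch in Python. It is not the paper's FPGA or GPU implementation; the function names (`pair_count`, `genotype_table`, `all_pair_tables`) and the 0/1/2 genotype coding are assumptions made for illustration only.

```python
from collections import Counter
from itertools import combinations


def pair_count(n_snps: int) -> int:
    """Number of unordered SNP pairs among n_snps markers: n * (n - 1) / 2."""
    return n_snps * (n_snps - 1) // 2


def genotype_table(snp_a, snp_b):
    """Count joint genotypes of two SNPs across samples.

    Assumption: each genotype is coded as 0, 1 or 2 (minor-allele count).
    """
    return Counter(zip(snp_a, snp_b))


def all_pair_tables(genotypes):
    """Hypothetical sequential baseline: one joint table per SNP pair."""
    return {
        (i, j): genotype_table(genotypes[i], genotypes[j])
        for i, j in combinations(range(len(genotypes)), 2)
    }


if __name__ == "__main__":
    # About 1.25e11 pairs for the 500,000-SNP dataset mentioned in the abstract.
    print(pair_count(500_000))
```

Because every pair needs its own table and test statistic, the work grows quadratically with the number of SNPs, which is the part the abstract reports offloading to parallel hardware.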
Nature Genetics, 2013
Atopic dermatitis is a common inflammatory skin disease with a strong heritable component. Pathogenetic models consider keratinocyte differentiation defects and immune alterations as scaffolds, and recent data indicate a role for autoreactivity in at least a subgroup of patients. FLG (encoding filaggrin) has been identified as a major locus causing skin barrier deficiency. To better define risk variants and identify additional susceptibility loci, we densely genotyped 2,425 German individuals with atopic dermatitis (cases) and 5,449 controls using the Immunochip array followed by replication in 7,196 cases and 15,480 controls from Germany, Ireland, Japan and China. We identified four new susceptibility loci for atopic dermatitis and replicated previous associations. This brings the number of atopic dermatitis risk loci reported in individuals of European ancestry to 11. We estimate that these susceptibility loci together account for 14.4% of the heritability for atopic dermatitis.
Circulation: Cardiovascular Genetics, 2014
Nucleic Acids Research, 2013
Scientists working with single-nucleotide variants (SNVs), inferred by next-generation sequencing software, often need further information regarding true variants, artifacts and sequence coverage gaps. In clinical diagnostics, for example, SNVs must usually be validated by visual inspection or by several independent SNV callers. Here we demonstrate that 0.5-60% of relevant SNVs might not be detected due to coverage gaps, or might be misidentified. Even low error rates can overwhelm the true biological signal, especially in clinical diagnostics, in research comparing healthy with affected cells, in archaeogenetic dating or in forensics. For these reasons, we have developed a package called pibase, which is applicable to diploid and haploid genome, exome or targeted enrichment data. pibase extracts details on nucleotides from alignment files at user-specified coordinates and identifies reproducible genotypes, if present. In test cases pibase identifies genotypes at 99.98% specificity, 10-fold better than other tools. pibase also provides pair-wise comparisons between healthy and affected cells using nucleotide signals (10-fold more accurately than a genotype-based approach, as we show in our case study of monozygotic twins). This comparison tool also solves the problem of detecting allelic imbalance within heterozygous SNVs in copy number variation loci, or in heterogeneous tumor sequences.
Nature Genetics, 2013
Primary sclerosing cholangitis (PSC) is a severe liver disease of unknown etiology leading to fibrotic destruction of the bile ducts and ultimately to the need for liver transplantation [1-3]. We compared 3,789 PSC cases of European ancestry to 25,079 population controls across 130,422 SNPs genotyped using the Immunochip [4]. We identified 12 genome-wide significant associations outside the human leukocyte antigen (HLA) complex, 9 of which were new, increasing the number of known PSC risk loci to 16. Despite comorbidity with inflammatory bowel disease (IBD) in 72% of the cases, 6 of the 12 loci showed significantly stronger association with PSC than with IBD, suggesting overlapping yet distinct genetic architectures for these two diseases. We incorporated association statistics from 7 diseases clinically occurring with PSC in the analysis and found suggestive evidence for 33 additional pleiotropic PSC risk loci. Together with network analyses, these findings add to the genetic risk map of PSC and expand on the relationship between PSC and other immune-mediated diseases.
Nature Genetics, 2010
Chronic kidney disease (CKD) is a significant public health problem, and recent genetic studies have identified common CKD susceptibility variants. The CKDGen consortium performed a meta-analysis of genome-wide association data in 67,093 individuals of European ancestry from 20 predominantly population-based studies in order to identify new susceptibility loci for reduced renal function as estimated by serum creatinine (eGFRcrea), serum cystatin C (eGFRcys) and CKD (eGFRcrea < 60 ml/min/1.73 m²; n = 5,807 individuals with CKD (cases)). Follow-up of the 23 new genome-wide-significant loci (P < 5 × 10⁻⁸) in 22,982 replication samples identified 13 new loci affecting renal function and CKD (in or near LASS2, GCKR, ALMS1, TFDP2, DAB2, SLC34A1, VEGFA, PRKAG2, PIP5K1B, ATXN2, DACH1, UBE2Q2 and SLC7A9) and 7 loci suspected to affect creatinine production and secretion (CPS1, SLC22A2, TMEM60, WDR37, SLC6A13, WDR72 and BCAS3). These results further our understanding of the biologic...
Nature Genetics, 2011
Genome-wide association studies and candidate gene studies in ulcerative colitis have identified 18 susceptibility loci. We conducted a meta-analysis of six ulcerative colitis genome-wide association study datasets, comprising 6,687 cases and 19,718 controls, and followed up the top association signals in 9,628 cases and 12,917 controls. We identified 29 additional risk loci (P < 5 × 10⁻⁸), increasing the number of ulcerative colitis-associated loci to 47. After annotating associated regions using GRAIL, expression quantitative trait loci data and correlations with non-synonymous SNPs, we identified many candidate genes that provide potentially important insights into disease pathogenesis, including IL1R2, IL8RA-IL8RB, IL7R, IL12B, DAP, PRDM1, JAK2, IRF5, GNA12 and LSP1. The total number of confirmed inflammatory bowel disease risk loci is now 99, including a minimum of 28 shared association signals between Crohn's disease and ulcerative colitis.
Nature, Jan 11, 2011
Multiple sclerosis is a common disease of the central nervous system in which the interplay between inflammatory and neurodegenerative processes typically results in intermittent neurological disturbance followed by progressive accumulation of disability. Epidemiological studies have shown that genetic factors are primarily responsible for the substantially increased frequency of the disease seen in the relatives of affected individuals, and systematic attempts to identify linkage in multiplex families have confirmed that variation within the major histocompatibility complex (MHC) exerts the greatest individual effect on risk. Modestly powered genome-wide association studies (GWAS) have enabled more than 20 additional risk loci to be identified and have shown that multiple variants exerting modest individual effects have a key role in disease susceptibility. Most of the genetic architecture underlying susceptibility to the disease remains to be defined and is anticipated to require ...
Gastroenterology, 2010
Gastroenterology, 2010
Primary sclerosing cholangitis (PSC) is a chronic bile duct disease affecting 2.4-7.5% of individuals with inflammatory bowel disease. We performed a genome-wide association analysis of 2,466,182 SNPs in 715 individuals with PSC and 2,962 controls, followed by replication in 1,025 PSC cases and 2,174 controls. We detected non-HLA associations at rs3197999 in MST1 and rs6720394 near BCL2L11 (combined P = 1.1 × 10⁻¹⁶ and P = 4.1 × 10⁻⁸, respectively).
BMC Bioinformatics, 2008
Background: Transposable elements are abundant in eukaryotic genomes and it is believed that they have a significant impact on the evolution of gene and chromosome structure. While there are several completed eukaryotic genome projects, there are only a few high-quality genome-wide annotations of transposable elements. Therefore, there is a considerable demand for computational identification of transposable elements. LTR retrotransposons, an important subclass of transposable elements, are well suited for computational identification, as they contain long terminal repeats (LTRs).
The American Journal of Human Genetics, 2012
Psoriasis (PS) and Crohn disease (CD) have been shown to be epidemiologically, pathologically, and therapeutically connected, but little is known about their shared genetic causes. We performed meta-analyses of five published genome-wide association studies on PS (2,529 cases and 4,955 controls) and CD (2,142 cases and 5,505 controls), followed up 20 loci that showed strongest evidence for shared disease association and, furthermore, tested cross-disease associations for previously reported PS and CD risk alleles in an additional 6,115 PS cases, 4,073 CD cases, and 10,100 controls. We identified seven susceptibility loci outside the human leukocyte antigen region (9p24 near JAK2, 10q22 at ZMIZ1, 11q13 near PRDX5, 16p13 near SOCS1, 17q21 at STAT3, 19p13 near FUT2, and 22q11 at YDJC) shared between PS and CD with genome-wide significance (p < 5 × 10⁻⁸) and confirmed four already established PS and CD risk loci (IL23R, IL12B, REL, and TYK2). Three of the shared loci are also genome-wide significantly associated with PS alone (10q22 at ZMIZ1, p(rs1250544) = 3.53 × 10⁻⁸; 11q13 near PRDX5, p(rs694739) = 3.71 × 10⁻⁹; 22q11 at YDJC, p(rs181359) = 8.02 × 10⁻¹⁰). In addition, we identified one susceptibility locus for CD (16p13 near SOCS1, p(rs4780355) = 4.99 × 10⁻⁸). Refinement of association signals identified shared genome-wide significant associations for exonic SNPs at 10q22 (ZMIZ1), and in silico expression quantitative trait locus analyses revealed that the associations at ZMIZ1 and near SOCS1 have a potential functional effect on gene expression. Our results show the usefulness of joint analyses of clinically distinct immune-mediated diseases and enlarge the map of shared genetic risk loci.
Mechanisms of Ageing and Development, 2011
We conducted a case-control genome-wide association study (GWAS) of human longevity, comparing 664,472 autosomal SNPs in 763 long-lived individuals (LLI; mean age: 99.7 years) and 1,085 controls (mean age: 60.2 years) from Germany. Only one association, namely that of SNP rs4420638 near the APOC1 gene, achieved genome-wide significance (allele-based P = 1.8 × 10⁻¹⁰). However, logistic regression analysis revealed that this association, which was replicated in an independent German sample, is fully explicable by linkage disequilibrium with the APOE allele ε4, the only variant hitherto established as a major genetic determinant of survival into old age. Our GWAS failed to identify any additional autosomal susceptibility genes. One explanation for this lack of success in our study would be that GWAS provide only limited statistical power for a polygenic phenotype with loci of small effect such as human longevity. A recent GWAS in Dutch LLI independently confirmed the APOE-longevity association, thus strengthening the conclusion that this locus is a very, if not the most, important genetic factor influencing longevity.