Molecular estimation of neurodegeneration pseudotime in older brains

  • Sumit Mukherjee ,
  • Laura Heath ,
  • Christoph Preuss ,
  • Suman Jayadev ,
  • Gwenn Garden ,
  • Anna Greenwood ,
  • Solveig Sieberts ,
  • Phillip De Jager ,
  • Nilufer Ertekin-Taner ,
  • Gregory Carter ,
  • Lara Mangravite ,
  • Benjamin Logsdon

Accepted to Nature Communications

The temporal molecular changes that lead to disease onset and progression in Alzheimer’s disease are still unknown. Here we develop a temporal model for these unobserved molecular changes with a manifold learning method applied to RNA-Seq data collected from human postmortem brain samples collected within the ROS/MAP and Mayo Clinic RNA-Seq studies. We define an ordering across samples based on their similarity in gene expression and use this ordering to estimate the molecular disease stage – or disease pseudotime – for each sample. Disease pseudotime is strongly concordant with the burden of tau (Braak score, P = 1.0×10-5), Aβ (CERAD score, P = 1.8×10-5), and cognitive diagnosis (P = 3.5×10-7) of LOAD. Early stage disease pseudotime samples are enriched for controls and show changes in basic cellular functions. Late stage disease pseudotime samples are enriched for late stage AD cases and show changes in neuroinflammation and amyloid pathologic processes. We also identify a set of late stage pseudotime samples that are controls and show changes in genes enriched for protein trafficking, splicing, regulation of apoptosis, and prevention of amyloid cleavage pathways. In summary, we present a method for ordering patients along a trajectory of LOAD disease progression from brain transcriptomic data.