Abstract
Most tissue samples are composed of different cell types. Differential expression analysis without accounting for cell-type composition cannot separate the changes due to cell-type composition or cell type-specific expression. We propose a computational framework to address these limitations: CARseq (cell-type-aware analysis of RNA-seq). CARseq employs a negative binomial distribution that appropriately models the count data from RNA-seq experiments. Simulation studies show that CARseq has substantially higher power than a linear model-based approach and it also provides more accurate estimate of the rankings of differentially expressed genes. We have applied CARseq to compare gene expression of schizophrenia/autism subjects versus controls, and identified the cell types underlying the difference and similarities of these two neuron-developmental diseases. Our results are consistent with the results from differential expression analysis using single-cell RNA-seq data.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 253-261 |
| Number of pages | 9 |
| Journal | Nature Computational Science |
| Volume | 1 |
| Issue number | 4 |
| DOIs | |
| State | Published - Apr 2021 |
| Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Computer Science (miscellaneous)
- Computer Science Applications
- Computer Networks and Communications