TY - JOUR
T1 - Dependency-aware deep generative models for multitasking analysis of spatial omics data
AU - Tian, Tian
AU - Zhang, Jie
AU - Lin, Xiang
AU - Wei, Zhi
AU - Hakonarson, Hakon
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Nature America, Inc. 2024.
PY - 2024/8
Y1 - 2024/8
N2 - Spatially resolved transcriptomics (SRT) technologies have significantly advanced biomedical research, but their data analysis remains challenging due to the discrete nature of the data and the high levels of noise, compounded by complex spatial dependencies. Here, we propose spaVAE, a dependency-aware, deep generative spatial variational autoencoder model that probabilistically characterizes count data while capturing spatial correlations. spaVAE introduces a hybrid embedding combining a Gaussian process prior with a Gaussian prior to explicitly capture spatial correlations among spots. It then optimizes the parameters of deep neural networks to approximate the distributions underlying the SRT data. With the approximated distributions, spaVAE can contribute to several analytical tasks that are essential for SRT data analysis, including dimensionality reduction, visualization, clustering, batch integration, denoising, differential expression, spatial interpolation, resolution enhancement and identification of spatially variable genes. Moreover, we have extended spaVAE to spaPeakVAE and spaMultiVAE to characterize spatial ATAC-seq (assay for transposase-accessible chromatin using sequencing) data and spatial multi-omics data, respectively.
AB - Spatially resolved transcriptomics (SRT) technologies have significantly advanced biomedical research, but their data analysis remains challenging due to the discrete nature of the data and the high levels of noise, compounded by complex spatial dependencies. Here, we propose spaVAE, a dependency-aware, deep generative spatial variational autoencoder model that probabilistically characterizes count data while capturing spatial correlations. spaVAE introduces a hybrid embedding combining a Gaussian process prior with a Gaussian prior to explicitly capture spatial correlations among spots. It then optimizes the parameters of deep neural networks to approximate the distributions underlying the SRT data. With the approximated distributions, spaVAE can contribute to several analytical tasks that are essential for SRT data analysis, including dimensionality reduction, visualization, clustering, batch integration, denoising, differential expression, spatial interpolation, resolution enhancement and identification of spatially variable genes. Moreover, we have extended spaVAE to spaPeakVAE and spaMultiVAE to characterize spatial ATAC-seq (assay for transposase-accessible chromatin using sequencing) data and spatial multi-omics data, respectively.
UR - http://www.scopus.com/inward/record.url?scp=85193988508&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85193988508&partnerID=8YFLogxK
U2 - 10.1038/s41592-024-02257-y
DO - 10.1038/s41592-024-02257-y
M3 - Article
C2 - 38783067
AN - SCOPUS:85193988508
SN - 1548-7091
VL - 21
SP - 1501
EP - 1513
JO - Nature Methods
JF - Nature Methods
IS - 8
ER -