Abstract 摘要
Esophageal squamous cell carcinoma (ESCC) is prevalent in some geographical regions of the world. ESCC development presents a multistep pathogenic process from inflammation to invasive cancer; however, what is critical in these processes and how they evolve is largely unknown, obstructing early diagnosis and effective treatment. Here, we create a mouse model mimicking human ESCC development and construct a single-cell ESCC developmental atlas. We identify a set of key transitional signatures associated with oncogenic evolution of epithelial cells and depict the landmark dynamic tumorigenic trajectories. An early downregulation of CD8+ response against the initial tissue damage accompanied by the transition of immune response from type 1 to type 3 results in accumulation and activation of macrophages and neutrophils, which may create a chronic inflammatory environment that promotes carcinogen-transformed epithelial cell survival and proliferation. These findings shed light on how ESCC is initiated and developed.
食管鳞状细胞癌(ESCC)在世界某些地区较为常见。ESCC 的发展呈现从炎症到侵袭性癌症的多步骤病理过程;然而,在这些过程中什么是关键因素以及它们如何演变仍不清楚,阻碍了早期诊断和有效治疗。在这里,我们创建了一个模拟人类 ESCC 发展的小鼠模型,并构建了一个单细胞 ESCC 发育图谱。我们识别出一组与上皮细胞致癌演变相关的关键过渡特征,并描绘了里程碑式的动态致癌轨迹。早期 CD8 + 反应的下调伴随着免疫反应从 1 型向 3 型的转变,导致巨噬细胞和中性粒细胞的积累和激活,这可能形成一种慢性炎症环境,促进致癌转化的上皮细胞的生存和增殖。这些发现揭示了 ESCC 是如何启动和发展的。
Subject terms: Cancer microenvironment, Oesophageal cancer
主题词:癌微环境,食管癌
The multistep processes involved in the evolution of inflammation to invasive esophageal squamous cell carcinoma (ESCC) is unclear. Here, the authors report a mouse model of ESCC and the role of interplay between carcinogen-transformed epithelial cells and their microenvironment in ESCC development.
炎症演变为侵袭性食管鳞状细胞癌(ESCC)的多步骤过程尚不清楚。在这里,作者报告了一种 ESCC 的小鼠模型以及致癌物转化上皮细胞与其微环境之间的相互作用在 ESCC 发展中的作用。
Introduction 简介
Esophageal squamous cell carcinoma (ESCC) is a common gastrointestinal malignancy in some parts of the world like China, with a poor prognosis and high mortality mainly due to the lack of specific measures for early diagnosis and effective therapies. ESCC represents an extraordinary paradigm of carcinoma development, shaped in a sequential manner from inflammation (INF), hyperplasia (HYP), dysplasia (DYS), carcinoma in situ (CIS) to invasive carcinoma (ICA)1. However, up to date, how ESCC initiates and develops is largely unknown. This long-standing question is the major factor obstructing the early intervention and clinical care improvement of the disease. As such, exploring the mechanisms underlying ESCC formation and identifying biomarkers are the crucial tasks for early detection, diagnosis, and precision treatment of the cancer.
食管鳞状细胞癌(ESCC)是世界上某些地区,如中国,常见的消化道恶性肿瘤,预后较差且死亡率高,主要原因是缺乏早期诊断的特异性措施和有效的治疗方法。ESCC 是一种独特的癌变发展模式,从炎症(INF)、增生(HYP)、不典型增生(DYS)、原位癌(CIS)到浸润癌(ICA)按序发展。然而,到目前为止,ESCC 的起始和发展机制仍不清楚。这一长期未解的问题是阻碍该疾病早期干预和临床护理改善的主要因素。因此,探索 ESCC 形成的机制并识别生物标志物是早期检测、诊断和精准治疗癌症的关键任务。
Recent genome studies, including The Cancer Genome Atlas (TCGA) project, have identified many genome variations in ESCC by using whole-exome or whole-genome sequencing on clinical tissue samples2,3. Although these studies have revealed an important role of the identified genome alterations in ESCC, it remains unresolved that how the normal epithelial cells may transit by the mutations through precancerous lesions to invasive carcinoma because all these previous studies were in cross-sectional design. Another important issue in this regard is that somatic mutations solely might not be sufficient for ESCC initiation and development because such mutations also occur in pathologically normal human esophageal tissues4. These findings imply that other mechanisms such as transcriptome aberrance in ESCC tumorigenesis may merit further investigation. In addition, it has been demonstrated that the complex context of tumor microenvironment (TME) also has important roles in tumor initiation and development. Therefore, elucidating dynamic transcriptomic changes of TME cellular compositions during tumorigenesis are significant and inevitable in discovering how ESCC develops. Recently established single-cell transcriptomic analysis is a promising approach, which allows to analyze the complex cellular compositions and to decipher cell state transition in tissue samples5.
近年来,包括癌症基因组图谱(TCGA)项目在内的基因组研究,通过在临床组织样本中进行全外显子或全基因组测序,已经识别出许多食管鳞状细胞癌(ESCC)的基因组变异 2,3 。尽管这些研究揭示了已识别基因组改变在 ESCC 中的重要作用,但仍然不清楚正常上皮细胞是如何通过突变从癌前病变过渡到侵袭性癌变的,因为所有这些先前的研究都是横断面设计。另一个与此相关的重要问题是,体细胞突变可能不足以引发和促进 ESCC 的发生和发展,因为这些突变也发生在病理上正常的食管组织中 4 。这些发现表明,在 ESCC 肿瘤发生中,转录组异常可能需要进一步研究。此外,已经证明,肿瘤微环境(TME)的复杂背景在肿瘤的发生和发展中也起着重要作用。因此,在肿瘤发生过程中阐明 TME 细胞组成动态转录组变化是至关重要的。 最近建立的单细胞转录组分析是一种有前途的方法,可以分析复杂的细胞组成,并解析组织样本中的细胞状态转换 5 。
Capturing continuous tumorigenic lesions from sole patient over time to perform such study is impossible and, therefore, to address this important issue would be highly dependent on well-established animal models. Fortunately, it has been shown that chemical carcinogen 4-nitroquinoline 1-oxide (4NQO) can induce mouse ESCC development in a manner that mimics the tumorigenic processes of ESCC in humans6,7. The distinct multiple stages of ESCC tumorigenesis induced by the carcinogen provide an excellent opportunity to interrogate cell state transition by single-cell RNA sequencing (scRNA-seq), which would elucidate the dynamic ESCC tumorigenesis.
从单一患者中连续捕捉肿瘤前病变以进行此类研究是不可能的,因此要解决这一重要问题,很大程度上依赖于建立良好的动物模型。幸运的是,已经证明,化学致癌物 4-硝基喹啉-1-氧化物(4NQO)可以诱导小鼠食管鳞状细胞癌(ESCC)的发生,其过程类似于人类 ESCC 的肿瘤发生过程 6,7 。致癌物诱导的 ESCC 肿瘤发生的不同多阶段为通过单细胞 RNA 测序(scRNA-seq)探究细胞状态转换提供了绝佳机会,这将阐明动态的 ESCC 肿瘤发生过程。
Here we report a single cell-based transcriptomic profiling study on various types of cells across every pathogenic stage during ESCC initiation and development in a mouse model induced by 4NQO. We have built a complete atlas and characterized the transcriptomic profiles for the transition of esophageal epithelial cells under the attack of carcinogen and elucidated how they evolve over time in a holistic approach. We have also depicted the transition landscapes of non-epithelial cells, i.e., fibroblasts and immune cells, in the esophageal microenvironments of different stages of tumorigenesis. Furthermore, we have found that some key changes in mice also occur in human esophageal tissue samples. These results shed light on the phenotypes and transition fates of different cell types across the tumorigenic processes of ESCC in animal model and may be implicated in human ESCC.
在这里,我们报告了一项基于单细胞转录组学分析的研究,该研究涵盖了由 4NQO 诱导的小鼠模型中 ESCC 发生和发展过程中各种类型细胞的每个致病阶段。我们构建了一个完整的图谱,并详细描述了致癌物质攻击下食管上皮细胞的转录组学特征,以及它们在整体方法中随时间的变化。我们还描绘了不同肿瘤发生阶段食管微环境中非上皮细胞(如成纤维细胞和免疫细胞)的过渡景观。此外,我们发现小鼠中的一些关键变化也发生在人类食管组织样本中。这些结果揭示了不同细胞类型在 ESCC 致癌过程中的表型和过渡命运,并可能与人类 ESCC 相关。
Results 结果
Bulk RNA-seq and scRNA-seq of mouse esophageal samples
小鼠食管样本的 bulk RNA-seq 和 scRNA-seq
To explore the transcriptomic alterations at various pathological stages during ESCC tumorigenesis, 8-week-old female C57BL/6 mice were treated with 4NQO for 16 weeks, which resulted in five recognizable precancerous and cancerous lesions in the esophagus, i.e., INF, HYP, DYS, CIS, and ICA (Fig. 1a; Supplementary Fig. 1a). We examined mice receiving 4NQO and found they all developed the expected lesions in the esophagus at different time points of experiment. The ESCC number per animal (mean ± SD) at week 26 was 6.0 ± 3.6. We first performed conventional RNA-seq on mini-bulks of normal epithelial samples obtained by laser-capture microdissection (LCM) from control mice at different ages of 1, 2, 8, and 25 months and various precancerous or cancerous lesions from 4NQO-treated mice. Principal component analysis (PCA) of the differentially expressed genes showed that the expression programs in mice exposed to 4NQO were substantially different from that in control mice; however, there existed some overlaps in the expression profiles between 4NQO-exposed and non-exposed mice. For instance, the transcriptomic profile for stage INF in 4NQO-exposed mice was similar to that in control mice aged 25 months (Fig. 1b), indicating that conventional RNA-seq could not precisely clarify the path of malignant cell transition during the development and progression of ESCC due to high intra-tissue heterogeneity.
为了探索食管鳞状细胞癌(ESCC)发生过程中不同病理阶段的转录组学改变,使用 4-硝基偶氮苯(4NQO)处理 8 周龄的雌性 C57BL/6 小鼠 16 周,结果在食管中观察到五种可识别的癌前和癌变病变,即 INF、HYP、DYS、CIS 和 ICA(图 1a ;补充图 1a )。我们检查了接受 4NQO 处理的小鼠,发现它们在实验的不同时间点均在食管中发展出了预期的病变。第 26 周每只动物的 ESCC 数量(均值±标准差)为 6.0±3.6。我们首先对通过激光捕获显微切割(LCM)从对照小鼠不同年龄(1、2、8 和 25 个月)获取的正常上皮样本以及 4NQO 处理小鼠的各种癌前或癌变病变中获得的小批量样本进行了常规 RNA-seq 分析。差异表达基因的主成分分析(PCA)显示,暴露于 4NQO 的小鼠的表达程序与对照小鼠存在显著差异;然而,暴露于 4NQO 和未暴露的小鼠之间存在一些重叠的表达谱。 例如,在 4NQO 处理的小鼠中,INF 期的转录组学谱型与 25 个月大的对照小鼠的谱型相似(图 1b ),这表明由于组织内高度的异质性,常规 RNA-seq 无法精确阐明恶性细胞在 ESCC 的发生和发展过程中过渡的路径。
Fig. 1. Experimental design of RNA-seq on 4NQO-induced esophageal lesions in mice.
图 1. 小鼠 4NQO 诱导食管病变的 RNA-seq 实验设计。
a Induction of esophageal precancerous and cancerous lesions in mice. Mice were treated with 4NQO in drinking water (100 μg/ml) for 16 weeks and then kept without 4NQO treatment for another 10 weeks (upper panel). Mice were killed before (week 0), during (week 12) and after treatment (weeks 20, 22, 24, or 26), respectively. Hematoxylin–eosin (H&E) staining and immunohistochemistry (IHC) analysis of Mki67 on esophageal epithelium slides clearly identified six different pathological lesions, i.e., normal (NOR), inflammation (INF), hyperplasia (HYP), dysplasia (DYS), carcinoma in situ (CIS), and invasive carcinoma (ICA) (lower panel). Similar staining results were observed in over three visual fields from each stage of esophageal lesions (more staining image in Supplementary Fig. 2d). Scale bar, 100 μm. b Plot of principal component analysis (PCA) of mini-bulk tissue RNA-seq on different pathological lesions indicated by different colors. M month of mouse age. c Overview of the experimental design of scRNA-seq. Pathological lesions of the esophagus were dissected and digested into single-cell suspensions for further separation using FITC-CD45 antibody via FACS (1–4). CD45+ and CD45− cells whose numbers in different lesions are shown on right panel were scRNA-sequenced, respectively.
a 诱导小鼠食管癌前和癌变病变。小鼠在饮用水中加入 4NQO(100 μg/ml)处理 16 周,然后在没有 4NQO 处理的情况下再饲养 10 周(上图)。小鼠分别在治疗前(第 0 周)、治疗期间(第 12 周)和治疗后(第 20、22、24 或 26 周)被处死。通过苏木精-伊红(H&E)染色和食管上皮切片的免疫组织化学(IHC)分析,Mki67 染色清晰地识别出六种不同的病理病变,即正常(NOR)、炎症(INF)、增生(HYP)、不典型增生(DYS)、原位癌(CIS)和浸润癌(ICA)(下图)。每个阶段的食管病变从每个视野观察到相似的染色结果(更多染色图像见补充图 2d )。比例尺,100 μm。b 小鼠不同病理病变的 mini-bulk 组织 RNA-seq 主成分分析(PCA)图,不同颜色表示不同病理病变。M 表示小鼠月龄。c 单细胞转录组测序(scRNA-seq)实验设计概述。食管病变被分离并消化成单细胞悬液,通过 FITC-CD45 抗体和 FACS(1-4)进一步分离。 CD45 + 和 CD45 − 细胞的数量在不同病灶中如右图所示,分别进行了单细胞转录组测序。
We therefore conducted a time-ordered single-cell transcriptomic profiling on various esophageal lesions in 4NQO-exposed mice. For stage NOR or stage INF, we used the whole esophagus from 30 or 20 mice. For other stages, we used the lesion foci and the sample numbers were 32 HYP from 25 mice, 24 DYS from 17 mice, 24 CIS from 20 mice and 25 ICA from 23 mice, respectively (Fig. 1c). A total of 66,089 cells including 29,975 CD45+ immune cells and 36,114 CD45− non-immune cells were obtained across various pathological stages (Fig. 1c; Supplementary Fig. 1b). The median unique molecular identifier (UMI) per cell was 7748 in immune cells and 9370 in non-immune cells (Supplementary Fig. 1c); with a median signal detect ability of 1936 genes for immune cells and 2620 genes for non-immune cells (Supplementary Fig. 1c), respectively. Based on the expressions of canonical markers, we classified immune cells into T cells, B cells, myeloid cells and natural killer cells (Fig. 1c; Supplementary Fig. 1d, e) and identified four clusters of non-immune cells including epithelial cells, fibroblasts, endothelial cells and myocytes by using t-distributed stochastic neighbor embedding (tSNE) (Fig. 1c; Supplementary Fig. 1f, g).
因此,我们在 4NQO 处理的小鼠不同食管病灶中进行了时间顺序的单细胞转录组测序。对于 NOR 阶段或 INF 阶段,我们使用了 30 只或 20 只小鼠的整个食管。对于其他阶段,我们使用了病灶区域,样本数量分别为:32 个 HYP(25 只小鼠),24 个 DYS(17 只小鼠),24 个 CIS(20 只小鼠)和 25 个 ICA(23 只小鼠)(图 1c )。总共获得了 66,089 个细胞,包括 29,975 个 CD45 + 免疫细胞和 36,114 个 CD45 − 非免疫细胞(图 1c ;补充图 1b )。免疫细胞的中位唯一分子标识符(UMI)为 7748,非免疫细胞为 9370(补充图 1c );免疫细胞的中位信号检测能力为 1936 个基因,非免疫细胞为 2620 个基因(补充图 1c )。根据经典标记物的表达,我们将免疫细胞分类为 T 细胞、B 细胞、髓系细胞和自然杀伤细胞(图 1c ;补充图 1d, e 并通过 t 分布随机邻居嵌入(tSNE)识别了四种非免疫细胞簇,包括上皮细胞、成纤维细胞、内皮细胞和肌细胞(图 1c ;补充图 1f, g )。
Identifying epithelial cell types during ESCC tumorigenesis
在 ESCC 肿瘤发生过程中识别上皮细胞类型
To discover how normal esophageal epithelium develops into invasive carcinoma, we next examined the expression alterations and functional changes in epithelial cells during the transition from normal to precancer or cancer. We identified 1756 epithelial cells across all six stages that were classified into six subtypes designated as EpiC 1 to EpiC 6 (Fig. 2a; Supplementary Fig. 2a and Supplementary Table 1). Through the analysis of pathway activities (Fig. 2b; Supplementary Fig. 2b), we found that EpiC 1 (n = 339) had higher expression of genes (e.g., Birc5, Mki67, Top2a, and Ube2c) in mitosis and proliferation8–11 compared with other cell clusters, probably reflecting the ability of routine self-renewal of the esophageal epithelium. EpiC 2 (n = 463) had significantly higher expression of genes such as Gstm1, Gstm2, Adh7, and Aldh3a1, which were involved in the xenobiotic detoxification, likely due to the 4NQO exposure12–14. EpiC 3 (n = 421) had substantially higher expression of genes in response to extracellular carcinogen stimuli, pro-inflammation and Hif-1 signaling (e.g., Jun, Junb, Zfp36, and Atf3)15–18. EpiC 4 (n = 110) had a higher expression program for keratinization (e.g., Krt4, Krt13, S100a8, and S100a9)19,20. EpiC 5 (n = 213) featured by the upregulation of genes in the epithelial–mesenchymal transition (EMT) pathways (e.g., Mmp13 and Itga6)21,22 and angiogenesis (e.g., Vegfa)23. EpiC 6 (n = 210) had significantly upregulated expression in the genes controlling cancer invasiveness (e.g., Mmp14 and Ecm1)24,25 and metastasis (e.g., Tm4sf1 and Vim)26,27. Besides, EpiC 5 cells also showed an expression pattern resembling the combination of EpiC 3, EpiC 4, and EpiC 6.
为了发现正常食管上皮如何发展为浸润性癌,我们接下来检查了从正常到癌前或癌变过程中上皮细胞的表达变化和功能变化。我们识别出了六个阶段中的 1756 个上皮细胞,并将其分为六种亚型,分别命名为 EpiC 1 到 EpiC 6(图 2a; 补充图 2a 和补充表 1 )。通过通路活性分析(图 2b ;补充图 2b ),我们发现 EpiC 1(n=339)在有丝分裂和增殖(例如,Birc5、Mki67、Top2a 和 Ube2c)基因的表达上高于其他细胞簇,这可能反映了食管上皮的常规自我更新能力。EpiC 2(n=463)在异物解毒基因(例如,Gstm1、Gstm2、Adh7 和 Aldh3a1)的表达上显著高于其他细胞簇,这可能归因于 4NQO 暴露。EpiC 3(n=421)在对细胞外致癌刺激、促炎和 Hif-1 信号(例如,Jun、Junb、Zfp36 和 Atf3)的基因表达上显著增加。 EpiC 4(n=110)在角化方面具有更高的表达程序(例如 Krt4、Krt13、S100a8 和 S100a9) 19,20 。EpiC 5(n=213)在上皮-间质转化(EMT)途径相关基因(例如 Mmp13 和 Itga6) 21,22 和血管生成相关基因(例如 Vegfa) 23 的上调表达中表现出特征。EpiC 6(n=210)在控制癌症侵袭性(例如 Mmp14 和 Ecm1) 24,25 和转移(例如 Tm4sf1 和 Vim) 26,27 的基因表达上调方面显著。此外,EpiC 5 细胞还表现出类似于 EpiC 3、EpiC 4 和 EpiC 6 的表达模式。
Fig. 2. Distinct epithelial cell populations and their expression signatures.
Fig. 2. 不同的上皮细胞群体及其表达特征。
a tSNE plot of 1756 epithelial cells based on their different expression. b Heatmap showing pathway activities scored per cluster by GSVA. c Stacked histogram showing epithelial cell composition across the six pathological stages. d Diffusion map of all epithelial cells using the first two diffusion components. Dot represents single cell and arrows indicate five branches starting from stage NOR to the other stages. e Histograms of scale normalized expression levels of six representative genes in each pathological stage. f Left: IHC staining of protein levels produced by the six genes in mouse esophageal tissues with different lesion. Scale bars, 100 μm. Right: stacked histograms showing quantification of IHC staining scores (0, negative; 1+, weak positive; 2+, median positive; 3+, strong positive). Each column was summarized from at least three visual fields.
a tSNE 图基于不同表达的 1756 个上皮细胞。b 每个聚类由 GSVA 评分的通路活性热图。c 六个病理阶段上皮细胞组成堆叠直方图。d 使用前两个扩散成分对所有上皮细胞进行扩散图分析。每个点代表单个细胞,箭头表示从 NOR 阶段到其他阶段的五个分支。e 每个病理阶段六种代表性基因的缩放归一化表达水平直方图。f 左:不同病变小鼠食管组织中六种基因蛋白水平的 IHC 染色。比例尺,100 μm。右:每个柱状图汇总了至少三个视野的 IHC 染色评分(0,阴性;1+,弱阳性;2+,中度阳性;3+,强阳性)。
Three subtypes, EpiC 1–3, were all present in normal esophageal epithelium and at the various stages of 4NQO-induced abnormal epithelia, although the proportions of cells at each stage were different (Fig. 2c), suggesting that these subtypes of epithelial cells were the background components of esophageal tissues. EpiC 4 and EpiC 5 appeared from INF and HYP, respectively, while EpiC 6 was present uniquely at ICA as the major cell subtype (Fig. 2c). These cell subtype dynamics well concorded with the transitional role of EpiC 5 and the malignancy of EpiC 6 and prompted us to examine the cell status transition more closely. A single-cell diffusion map based on the gene expression similarity revealed a clear cell status transition across all pathological stages (Fig. 2d; Supplementary Fig. 2c). Gene expression vectors pointed from stage NOR cells to the centroid of cells from each stage. It was clear that major directional changes happened before the HYP and ICA stages, whereas the transition from stage HYP to stage CIS was mainly the expansion along the same direction. Together, these results indicated an important role of INF to HYP transition in ESCC development.
三个亚型 EpiC 1-3 在正常食管上皮和 4NQO 诱导的异常上皮的各个阶段中均存在,尽管每个阶段的细胞比例不同(图 2c ),这表明这些上皮细胞亚型是食管组织的背景成分。EpiC 4 和 EpiC 5 分别从 INF 和 HYP 出现,而 EpiC 6 仅在 ICA 阶段作为主要细胞亚型存在(图 2c )。这些细胞亚型的变化与 EpiC 5 的过渡作用以及 EpiC 6 的恶性程度高度一致,促使我们更仔细地研究细胞状态的转变。基于基因表达相似性的单细胞扩散图显示,在所有病理阶段中均存在明显的细胞状态转变(图 2d ;补充图 2c )。基因表达向量从 NOR 阶段的细胞指向每个阶段细胞的质心。很明显,在 HYP 和 ICA 阶段之前发生了主要的方向变化,而从 HYP 阶段到 CIS 阶段的转变主要是沿着相同方向的扩展。综上所述,这些结果表明 INF 到 HYP 的转变在食管癌的发展中起着重要作用。
Further analysis revealed that the six representative genes selected from each epithelial cluster were differentially distributed among disease stages (Fig. 2e). Aldh3a1 and Atf3 were expressed across all stages and the levels were significantly higher at stage INF than that at the advancing stages. High level of S100a8 appeared at stage HYP and covered all precancerous and ICA stages whereas the highest levels of Itga6 and Mmp14 were seen at stage ICA although their expressions were also detected in cells across all stages. The dynamic expressions of these genes at protein level in mice esophageal tissues with different disease stages were compared by using immunohistochemistry and the results were generally in line with their RNA expression despite of some disparity (Fig. 2f; Supplementary Fig. 2d). The abrupt upregulation of S100a8 in cells at stage HYP suggests a dramatic transition related to immune response.
进一步分析发现,从每个上皮簇中选择的六个人代表基因在不同疾病阶段中的分布情况不同(图 2e )。Aldh3a1 和 Atf3 在所有阶段均表达,且在 INF 阶段的表达水平显著高于进展阶段。S100a8 在 HYP 阶段的表达水平较高,并覆盖了所有癌前病变和 ICA 阶段,而 Itga6 和 Mmp14 在 ICA 阶段的表达水平最高,尽管它们在所有阶段的细胞中也有检测到。通过免疫组化比较不同疾病阶段小鼠食管组织中这些基因在蛋白水平上的动态表达,结果与 RNA 表达情况总体一致,尽管存在一些差异(图 2f; 补充图 2d )。S100a8 在 HYP 阶段细胞中的突然上调表明与免疫反应相关的剧烈转变。
Identifying cell fates of epithelial cell status transitions
识别上皮细胞状态转换的命运
We performed pseudotime and PCA analysis and found two evolution fates of esophageal epithelial cells during ESCC tumorigenesis both starting from EpiC 1 cells that had the lowest pseudotime value. Some cells transformed from proliferative EpiC 1 to normal differentiated EpiC 4 while other cells transformed to malignant EpiC 6, processing through EpiC 2 to EpiC 5 cells (Fig. 3a; Supplementary Fig. 3a, b). The evolution of EpiC 1 to EpiC 6 was mainly along component 1. Gene set variation analysis (GSVA) of component 1 revealed a significant enrichment of genes related to cell invasiveness, EMT and angiogenesis (Fig. 3b, c; Supplementary Fig. 3c), which was concordant with the expression programs of EpiC 6 cells (Fig. 2b). As EpiC 6 cells appeared only at the ICA stage, these results implied that component 1 might be the underlying molecular mechanism for malignant transition of the esophageal tissues (Supplementary Fig. 3d).
我们在假时间分析和 PCA 分析中发现,在 ESC 癌变过程中,从具有最低假时间值的 EpiC 1 细胞开始,存在两种进化命运。一些细胞从增殖的 EpiC 1 转变为正常分化 EpiC 4,而其他细胞则转变为恶性 EpiC 6,经过 EpiC 2 到 EpiC 5 细胞(图 3a ;补充图 3a, b )。从 EpiC 1 到 EpiC 6 的进化主要沿着成分 1。成分 1 的基因集变异分析(GSVA)揭示了与细胞侵袭性、EMT 和血管生成相关的基因显著富集(图 3b, c; 补充图 3c ),这与 EpiC 6 细胞的表达程序一致(图 2b )。由于 EpiC 6 细胞仅在 ICA 阶段出现,这些结果表明成分 1 可能是食管组织恶性转化的潜在分子机制(补充图 3d )。
Fig. 3. Characterization of epithelial cell transitions and key pathway changes.
图 3. 上皮细胞过渡及关键途径变化的表征
a Pseudotime trajectory over epithelial cells in a two-dimensional statespace. Cell orders are inferred from the expression of the most dispersed genes across epithelial cell populations. b Violin plots of the distribution of the component 1 values across epithelial clusters. c Correlation between EMT pathway enrichment scores and component 1 values of single cells. d Normalized expression of six selected ESCC driver genes, methylation dysregulation genes, and transcription factors, smoothed over pseudotime component 1 using LOESS regression. Shaded regions indicate 95% confidence interval with a line indicating the mean gene expression. e Violin plots of the distribution of the component 2 values among sub-clusters. f Correlation between G2/M pathway enrichment scores and component 2 values of single cells. g Bubble plot showing expression levels of the genes related to response to 4NQO treatment across six cluster. Size of dots represents the percentage of cells expressing the gene; color scale shows the average expression level. h Heatmap displaying scale normalized expression level of genes in NF-κB signaling across the six epithelial clusters.
a 二维状态空间中上皮细胞的伪时间轨迹。细胞顺序是从上皮细胞群体中最具分散性的基因表达中推断出来的。b 来自上皮簇的成分 1 值的小提琴图。c EMT 通路富集评分与单个细胞成分 1 值之间的相关性。d 使用 LOESS 回归平滑的六个选定的 ESCC 驱动基因、甲基化失调基因和转录因子的归一化表达,按伪时间成分 1 显示。阴影区域表示 95%的置信区间,线条表示平均基因表达。e 各亚簇中成分 2 值的分布的小提琴图。f G2/M 通路富集评分与单个细胞成分 2 值之间的相关性。g 与 4NQO 治疗反应相关的基因表达水平的气泡图。点的大小表示表达该基因的细胞百分比;颜色刻度显示平均表达水平。h 显示六个上皮簇中 NF-κB 信号通路基因缩放归一化表达水平的热图。
We then examined whether the alterations of any transcription factors (TFs), well-documented ESCC-related mutation, or methylation dysregulation were included in the oncogenic evolution along component 1. We found that the expression levels of Pitx1, Trp53, and Bclaf1, which are known tumor suppressor TFs28–30, were substantially decreased while TFs promoting EMT (e.g., Ets1 and Snai3)31,32, immunosuppression (e.g., Eomes)33, cell migration and invasion (e.g., Creb3 and Elk3)34,35 were significantly upregulated. We also found two previously reported methylation-regulated genes in human ESCC, Rab2536, and Met37, were substantially down- or upregulated, respectively, along component 1. Furthermore, the expression levels of Notch1, a tumor suppressor frequently mutated in human ESCC38, displayed a gradual decrease over time along the evolution of component 1 (Fig. 3d; Supplementary Fig. 3e).
我们随后检查了任何转录因子(TFs)的改变,这些 TFs 是已知与食管鳞状细胞癌(ESCC)相关的突变,或甲基化失调是否包括在沿成分 1 的致癌进化过程中。我们发现已知的肿瘤抑制 TFs Pitx1、Trp53 和 Bclaf1 的表达水平显著降低,而促进上皮-间质转化(EMT)的 TFs(例如,Ets1 和 Snai3) 31,32 、免疫抑制(例如,Eomes) 33 、细胞迁移和侵袭的 TFs(例如,Creb3 和 Elk3) 34,35 的表达水平显著上调。我们还发现两个之前在人类 ESCC 中报道的甲基化调节基因 Rab25 36 和 Met 37 在沿成分 1 的进化过程中分别显著下调或上调。此外,肿瘤抑制基因 Notch1 在人类 ESCC 中经常发生突变 38 ,其表达水平在成分 1 的进化过程中逐渐下降(图 3d ;补充图 3e )。
Genes in component 2 were mainly related to cell proliferation (e.g., Birc5, Mki67, and Top2a). Cell cycle regression analysis and GSVA scoring revealed a correlation between proliferative ability and value of component 2 (Fig. 3e, f). We found that EpiC 6 cells had the highest expression of proliferation-related genes and the highest proportion of cells at G2/M status as compared with other clusters except EpiC 1 cells (Fig. 3e, f; Supplementary Fig. 3f–h), indicating increased proliferative ability of this cell cluster. On the other hand, EpiC 4 cells were at the G1 and S status (Supplementary Fig. 3g, h), showing reduced proliferative capacity. Together, these unsupervised analyses depicted two clear cell fates during ESCC carcinogenesis and indicated that attacked by the carcinogen, a portion of esophageal epithelial cells went into oncogenic route (from EpiC 1 and EpiC 2 to EpiC 6) but not into normal differentiation route (from EpiC 1 and EpiC 2 to EpiC 4). To exclude the impact of potential uncertainty of pathologically staging of early lesions such as hyperplasia and dysplasia, we also performed a sensitive analysis by excluding cells in week 20 or week 22. As a result, the cell state transition was similar to that without the exclusion (Supplementary Fig. 3i, j).
组件 2 中的基因主要与细胞增殖相关(例如,Birc5、Mki67 和 Top2a)。细胞周期回归分析和 GSVA 评分揭示了增殖能力和组件 2 值之间的关联(图 3e, f )。我们发现,EpiC 6 细胞在与增殖相关的基因表达和 G2/M 期细胞比例方面最高,与其他簇相比,除了 EpiC 1 细胞(图 3e, f ;补充图 3f–h ),表明该细胞簇的增殖能力增强。另一方面,EpiC 4 细胞处于 G1 和 S 期(补充图 3g, h ),显示出降低的增殖能力。综上所述,这些无监督分析描绘了 ESCC 致癌过程中两种明确的细胞命运,并表明致癌物质攻击时,一部分食管上皮细胞进入了致癌途径(从 EpiC 1 和 EpiC 2 到 EpiC 6),而不是正常分化途径(从 EpiC 1 和 EpiC 2 到 EpiC 4)。为了排除早期病变如增生和异型增生的病理分期潜在不确定性的影响,我们还进行了敏感性分析,排除了第 20 周或第 22 周的细胞。 因此,细胞状态的转换与不排除情况下的结果相似(补充图 3i, j )。
We further investigated the key pathways driving epithelial cells from stage INF to stage HYP and found that non-malignant EpiC 1–4 epithelial cells had significantly elevated expressions of the genes involving in carcinogen detoxification such as Nqo1, Aldh3a1, and Gstp1 (Fig. 3g), which reflected normal cellular response to the damage induced by 4NQO. The continuous damage might induce immune response via the stimulator of interferon genes (STING), AIM2 and NF-κB signaling because the expression levels of Tmem173, Aim2, Nfkb1, and Nfkb2 were significantly elevated (Fig. 3g). In addition, we observed substantial differences in the expressions of NF-κB downstream genes in epithelial cell clusters (Fig. 3h). EpiC 3 cells had an increased expression of some transcription-related genes such as Junb and Fos, whereas EpiC 5 cells showed higher expression levels of inflammation genes (e.g., Cxcl1, Il6, Tnf, and Csf3) compared with other EpiCs, implying that they had crosstalks with immune cells. EpiC 5 cells also had high level of Hif1a, which may lead to metabolism remodeling that promoted cell transformation. As malignant cells, EpiC 6 showed strong activation of survival-related genes (e.g., Bcl2 and Xiap) (Fig. 3g, h). These results indicated that under 4NQO attack, some epithelial cells might endure the damage and activate the inflammatory response, resulting in consequent cell survival and transformation.
我们进一步研究了从 INF 阶段到 HYP 阶段上皮细胞的关键通路,发现非恶性 EpiC 1-4 上皮细胞中涉及致癌物解毒的基因(如 Nqo1、Aldh3a1 和 Gstp1)的表达显著升高(图 3g ),这反映了这些细胞对 4NQO 诱导的损伤的正常细胞反应。持续的损伤可能通过干扰素基因刺激器(STING)、AIM2 和 NF-κB 信号通路引发免疫反应,因为 Tmem173、Aim2、Nfkb1 和 Nfkb2 的表达水平显著升高(图 3g )。此外,我们观察到上皮细胞簇中 NF-κB 下游基因的表达存在显著差异(图 3h )。EpiC 3 细胞中一些与转录相关的基因(如 Junb 和 Fos)的表达增加,而 EpiC 5 细胞则表现出更高的炎症基因(如 Cxcl1、Il6、Tnf 和 Csf3)的表达水平,表明它们与免疫细胞存在相互作用。EpiC 5 细胞中 Hif1a 的水平也较高,这可能导致代谢重编程,促进细胞转化。 作为恶性细胞,EpiC 6 显示出强烈的生存相关基因(例如,Bcl2 和 Xiap)激活(图 3g, h )。这些结果表明,在 4NQO 攻击下,一些上皮细胞可能耐受损伤并激活炎症反应,从而导致随后的细胞生存和转化。
Transcriptomic changes in fibroblasts promote tumorigenesis
成纤维细胞的转录组变化促进肿瘤发生
We next examined the transcriptomic alterations of cells in the tissue microenvironment and identified eight fibroblast clusters (FibCs) by different gene expression patterns (Fig. 4a; Supplementary Fig. 4a, b) that showed similar expression pathways and dynamics to epithelial cells (Fig. 4b–d). FibC 1 was the dominant composition in stage NOR and then markedly decreased with the processing of tumorigenesis (Fig. 4c, d). FibC 3 cells were the major fibroblasts in stage INF and had a strong interferon response and MHC-mediated antigen presentation activity (Fig. 4b), representing the initial immune response to tissue damage. FibC 4 and FibC 6, the major fibroblasts in stage DYS, had the elevated expression of genes involved in the TGF-β, STAT5 induced by IL-2 or STAT3 induced by IL-6 signaling pathways. FibC 8 cells increased along the tumorigenic stages with high expressions of genes in Myc and angiogenesis pathways. The replacement of FibC 3 by FibC 4 and FibC 6 during INF to HYP transition further confirmed the shift of immune response during early ESCC development. Specifically, beginning from stage HYP, fibroblasts actively recruited immune cells through increasing the expressions of complements and chemokines (Fig. 4e). For example, we found that FibC 6 and FibC 8 had significantly elevated expressions of various chemokine-related genes (e.g., Ccl7 in FibC6 and Cxcl12 in FibC 8, Supplementary Fig. 4c), which were well-known molecules that recruit immune cells. The dynamic changes of fibroblast clusters suggested that the immune response is modulated during ESCC tumorigenesis.
我们接下来检查了组织微环境中细胞的转录组变化,并通过不同的基因表达模式识别出了八种成纤维细胞簇(FibCs)(图 4a; 补充图 4a, b ),这些簇显示出与上皮细胞相似的表达途径和动态变化(图 4b–d )。在 NOR 阶段,FibC 1 是主要组成成分,然后随着肿瘤发生的发展显著减少(图 4c, d )。在 INF 阶段,FibC 3 是主要的成纤维细胞,具有强烈的干扰素反应和 MHC 介导的抗原呈递活性(图 4b ),代表了对组织损伤的初始免疫反应。在 DYS 阶段,FibC 4 和 FibC 6 是主要的成纤维细胞,这些细胞中与 TGF-β、IL-2 诱导的 STAT5 信号或 IL-6 诱导的 STAT3 信号通路相关的基因表达升高。FibC 8 细胞在肿瘤发生阶段中表达 Myc 和血管生成途径相关基因的水平升高。INF 到 HYP 过渡期间 FibC 3 被 FibC 4 和 FibC 6 取代进一步证实了早期食管癌发展中免疫反应的转变。 特别是在从 HYP 阶段开始,成纤维细胞通过增加补体和趋化因子的表达积极招募免疫细胞(图 4e )。例如,我们发现 FibC6 和 FibC8 显著上调了多种趋化因子相关基因的表达(例如,FibC6 中的 Ccl7 和 FibC8 中的 Cxcl12,补充图 4c ),这些是已知能够招募免疫细胞的分子。成纤维细胞簇动态变化表明,在食管鳞状细胞癌(ESCC)发生过程中,免疫反应被调节。
Fig. 4. Identification of fibroblast clusters and their expression features.
图 4. 纤维细胞簇的识别及其表达特征
a tSNE plot of 31,654 fibroblasts, colored by cluster. b Expression-based pathway activities scored by GSVA per fibroblast cluster. c Stacked histogram showing fibroblasts composition across the 6 pathological stages. d Line chart displaying changing trend of proportion of the four selected clusters across the six pathological stages. e Bubble plot showing scale normalized expression of representative genes involved in cytokine secretion, complement activity, and antigen presentation. Size of dots represents percentage of cells expressing corresponding genes in the cluster.
a tSNE 图显示 31,654 个成纤维细胞,按簇着色。b 每个成纤维细胞簇的 GSVA 评分通路活性。c 显示成纤维细胞在 6 个病理阶段组成比例的堆叠直方图。d 线图显示四个选定簇在六个病理阶段比例的变化趋势。e 气泡图显示代表性参与细胞因子分泌、补体活性和抗原呈递基因的标准化表达水平。气泡的大小代表簇中表达相应基因的细胞百分比。
Turndown of adaptive anticancer immune during tumorigenesis
肿瘤发生期间适应性抗肿瘤免疫的下调
We then explored the T cell status during ESCC tumorigenesis and identified 4 clusters designated as CD8+ T cell, CD4+ T cell, CD4−CD8− T cell 1 and CD4−CD8− T cell 2, respectively (Fig. 5a). We further classified CD8+ T cells into 7 clusters (Fig. 5b) and CD4+ T cells into 7 clusters (Fig. 5c), respectively. During 4NQO-induced tumorigenesis, all CD8+ T cells expressed naive and memory marker genes (e.g., Tcf7, Il7r, and Ccr7; Supplementary Fig. 5a)39–41 rather than cytotoxic markers. In stage NOR, the major type was naive cells (CD8-TN1) while in stage INF, the major cell type was replaced by memory T cells, mainly CD8+ tissue resident memory T (CD8-TRM) cells and effector memory T (CD8-TEM) cells (Fig. 5d), suggesting that the immune responses were active in these two stages. Furthermore, we observed a high expression of the exhaustion markers (e.g., Pdcd1 and Lag3)42,43 in addition to the effective markers (e.g., Gzmk, Gzmb, and Prf1)44,45 in these memory T cells (Fig. 5e); but the proportion of these memory T cells were declined along with tumorigenic processing after stage INF, indicating a non-effective CD8+ T cell dominant microenvironment throughout precancerous stages.
我们随后探索了食管鳞状细胞癌(ESCC)肿瘤发生过程中 T 细胞的状态,并识别出 4 个簇,分别命名为 CD8 + T 细胞、CD4 + T 细胞、CD4 − CD8 − T 细胞 1 和 CD4 − CD8 − T 细胞 2(图 5a )。我们进一步将 CD8 + T 细胞分为 7 个簇(图 5b ),将 CD4 + T 细胞分为 7 个簇(图 5c )。在 4NQO 诱导的肿瘤发生过程中,所有 CD8 + T 细胞均表达幼稚和记忆标记基因(例如,Tcf7、Il7r 和 Ccr7;补充图 5a )而非细胞毒性标记基因。在 NOR 阶段,主要类型是幼稚细胞(CD8-TN1),而在 INF 阶段,主要细胞类型被记忆 T 细胞取代,主要是 CD8 + 组织驻留记忆 T 细胞(CD8-T RM )和效应记忆 T 细胞(CD8-T EM )(图 5d ),表明这两个阶段的免疫反应是活跃的。此外,我们观察到这些记忆 T 细胞中除了有效标记基因(例如,Gzmk、Gzmb 和 Prf1) 44,45 的高表达外,还存在耗竭标记基因(例如,Pdcd1 和 Lag3) 42,43 的高表达(图 5e );但在肿瘤发生阶段 INF 之后,这些记忆 T 细胞的比例下降,表明整个癌前阶段存在非有效的 CD8 + T 细胞主导微环境。
Fig. 5. Characterization of multiple changes in T cell subtypes.
图 5. 多种 T 细胞亚型变化的表征。
a–c tSNE plots of 8032 T cells, 3812 CD8+ T cells, and 2635 CD4+ T cells. Color indicates cluster. d Histogram showing CD8+ T cell composition across the six pathological stages. Color indicates cluster designated in b. e Violin plots of marker gene expression among CD8+ T cell clusters. f Histogram showing CD4+ T cell composition across the six pathological stages. Color indicates cluster designated in c. g Line chart displaying changing trend of the four selected cluster proportions across the six pathological stages. h Bubble plot showing scale normalized expression of three representative genes involved in type 1, 2, and 3 immune response, respectively, from stage INF to stage ICA. Size of dots represents percentage of cells expressing corresponding genes in the stage.
a–c 8032 T 细胞、3812 CD8 + T 细胞和 2635 CD4 + T 细胞的 tSNE 图。颜色表示簇。d 显示六种病理阶段中 CD8 + T 细胞组成的直方图。颜色表示 b 中指定的簇。e CD8 + T 细胞簇中标志基因表达的小提琴图。f 显示六种病理阶段中 CD4 + T 细胞组成的直方图。颜色表示 c 中指定的簇。g 显示四个选定簇比例在六种病理阶段中变化趋势的折线图。h 从阶段 INF 到阶段 ICA,分别显示参与 1 型、2 型和 3 型免疫反应的三种代表性基因的规模归一化表达的气泡图。圆点的大小表示相应基因在该阶段表达细胞的百分比。
Considering CD4+ T cells (Fig. 5f; Supplementary Fig. 5b), we noticed an imbalance among CD4-Th1-like, CD4-Th2, and CD4-Th17 cells, involving in three major kinds of cell-mediated effector immunity, respectively46. Beginning at stage HYP, the proportion of CD4-Th1-like cells was decreased and replaced by CD4-Th2 and CD4-Th17 cells (Fig. 5f, g). CD4-Th1-like cells expressing high levels of Tbx21, Gzmb, and Ifng (Fig. 5h; Supplementary Fig. 5c) had been shown to have anti-tumor capability through activating monocytes and responding to interferon-γ46. Differential gene expression analysis also revealed that some genes relative to immune activation (e.g., Ifitm1, Ifitm2, and Rora)47,48 were significantly higher in CD4-Th1-like cells than in CD4-Th2 cells (Supplementary Fig. 5d), a type of immune cells that their role in tumorigenesis was uncertain. The activation of CD4-Th17 produced high levels of Il22, Il17a, and Rorc at stage HYP (Fig. 5h) that may activate monocytes and recruit neutrophils to respond to epithelial injuries46. The proportion of CD4-Th17 cells increased sharply after stage INF (Fig. 5f). The reduced anti-tumor role of CD4+ T cells and increased inflammatory response suggest an immune response Type 1 to Type 3 transition46 in the esophageal microenvironment during tumorigenesis.
考虑到 CD4 + T 细胞(图 1# 补充图 2#),我们注意到 CD4-Th1 样、CD4-Th2 和 CD4-Th17 细胞之间存在不平衡,分别涉及三种主要的细胞介导效应免疫类型。从 HYP 阶段开始,CD4-Th1 样细胞的比例下降,并被 CD4-Th2 和 CD4-Th17 细胞所取代(图 4#)。表达高水平 Tbx21、Gzmb 和 Ifng 的 CD4-Th1 样细胞(图 5#;补充图 6#)已被证明具有抗肿瘤能力,通过激活单核细胞并响应干扰素-γ发挥作用 46 。差异基因表达分析还显示,一些与免疫激活相关的基因(如 Ifitm1、Ifitm2 和 Rora) 47,48 在 CD4-Th1 样细胞中的表达显著高于 CD4-Th2 细胞(补充图 9#),而 CD4-Th2 细胞在肿瘤发生中的作用尚不确定。CD4-Th17 的激活在 HYP 阶段产生了高水平的 Il22、Il17a 和 Rorc(图 10#),可能激活单核细胞并招募中性粒细胞以响应上皮损伤 46 。CD4-Th17 细胞的比例在 INF 阶段后急剧增加(图 12#)。 CD4+ T 细胞的抗肿瘤作用减弱和炎症反应增加表明在肿瘤发生过程中食管微环境中免疫反应从类型 1 向类型 3 过渡。
Activated myeloid cells create inflammatory microenvironment
激活的髓系细胞创建炎症微环境
We found that, among the 11 clusters of myeloid cells (Fig. 6a; Supplementary Fig. 6a), the proportions of monocyte/macrophage (Mo/Mφ)-C 1, Mo/Mφ-C 3, cDC2-C 1, tolerogenic dendritic cell (tDC), and neutrophils changed substantially along the stages of tumorigenesis (Fig. 6b, c). After stage INF, Mo/Mφ-C 1 cells were replaced by Mo/Mφ-C 3 cells, featured by decreased expression levels of the genes (e.g., H2-Ab1 and C1qa) relative to antigen presentation and complement (Supplementary Fig. 6b)49,50. Meanwhile, the expression level of genes involved in immunosuppression (e.g., Il10, Arg1, and Adora2b, Fig. 6d)51–53 elevated along with the increase in Mo/Mφ-C 3 proportion. Furthermore, the proportion of cDC2-C 1 sharply decreased since stage NOR with the downregulation of Icosl, a gene representing the immune stimulation activity43 (Fig. 6d). In contrast, the immune suppressive tDC cells with high expressions of Cd274, Pdcd1lg2, and Ido1 were persisted throughout all the tumorigenic stages (Fig. 6d). In addition, we found that the number of neutrophils continuously increased, and these cells expressed high levels of Mmp9, Prok2, Vegfa, and Nos2, but not tumor suppressors (Supplementary Fig. 6c). These results suggested that myeloid cells were activated during the processes of ESCC tumorigenesis, resulting in an inflammatory microenvironment and CD8+ T-associated adaptive immune suppression, which might allow initiated ESCC cells to survive and proliferate. Transcriptomic analysis also revealed that the immune suppression microenvironment might be created by the complex chemotaxis among various types of cells in this microenvironment. For example, neutrophils recruitment could be mediated through its high expression of Cxcr1/2 receptors by their ligands Cxcl1/2/5 from other cell types or by their brother neutrophils through the simultaneous expression of Ccl3 and Ccr1. Similarly, tDC, Th17, and CD4−CD8− T cells could be recruited by chemokines secreted by different cell types (Fig. 6e).
我们发现,在 11 个髓系细胞簇(图 6a; 补充图 6a )中,肿瘤发生不同阶段中单核细胞/巨噬细胞(Mo/Mφ)-C1、Mo/Mφ-C3、cDC2-C1、耐受性树突状细胞(tDC)和中性粒细胞的比例发生了显著变化(图 6b, c )。在 INF 阶段之后,Mo/Mφ-C1 细胞被 Mo/Mφ-C3 细胞取代,后者表现出与抗原呈递和补体相关的基因(如 H2-Ab1 和 C1qa)表达水平降低(补充图 6b ) 49,50 。同时,与免疫抑制相关的基因(如 Il10、Arg1 和 Adora2b,图 6d ) 51–53 的表达水平随着 Mo/Mφ-C3 比例的增加而升高。此外,cDC2-C1 的比例自 NOR 阶段以来急剧下降,伴随 Icosl 基因表达下调,该基因代表了免疫刺激活性(图 6d ) 43 。相比之下,高表达 Cd274、Pdcd1lg2 和 Ido1 的免疫抑制性 tDC 细胞在整个肿瘤发生阶段持续存在(图 6d )。 此外,我们发现中性粒细胞的数量持续增加,这些细胞表达了高水平的 Mmp9、Prok2、Vegfa 和 Nos2,但不表达肿瘤抑制基因(补充图 6c )。这些结果表明,在食管鳞状细胞癌(ESCC)肿瘤发生过程中,髓系细胞被激活,导致炎症微环境和 CD8 + T 细胞相关的适应性免疫抑制,这可能使启动的 ESCC 细胞得以存活和增殖。转录组分析还揭示,这种免疫抑制微环境可能是由微环境中各种细胞复杂的趋化作用共同创造的。例如,中性粒细胞的招募可能是通过其高表达的 Cxcr1/2 受体由其他细胞类型的 Cxcl1/2/5 配体或通过其兄弟中性粒细胞同时表达 Ccl3 和 Ccr1 而介导的。同样,tDC、Th17 和 CD4 − CD8 − T 细胞可以通过不同细胞类型分泌的趋化因子被招募(图 6e )。
Fig. 6. Compositional changes of myeloid cells and their interactions with other cells.
图 6. 神经胶质细胞组成的变化及其与其他细胞的相互作用。
a tSNE plots of 6177 myeloid cells, colored by cluster. b Histogram showing myeloid cells composition across the six pathological stages. Color indicates cluster designated in a. c Line chart displaying changing trend of the five selected cluster proportions across the six pathological stages. d Heatmap showing scale normalized expression of the immune co-stimulation (left panel) or suppression (right panel) genes in the 11 clusters of myeloid cells. e Heatmap showing scale normalized expression of selected Ccl and Cxcl chemokines, and their corresponding receptors in representative clusters of epithelial cells, fibroblasts, myeloid cells, and T cells. f Bubble plot showing scale normalized expression of the selected genes along tumorigenesis process in various cell types. Size of dots represents percentage of cells expressing corresponding genes across pathological stages.
a tSNE 图显示了 6177 个髓系细胞,按簇着色。b 六个病理阶段髓系细胞组成的直方图。颜色表示 a 中标记的簇。c 线图显示了五个选定簇的比例随六个病理阶段的变化趋势。d 热图显示了 11 个髓系细胞簇中免疫共刺激(左图)或抑制(右图)基因的规模归一化表达。e 热图显示了选定的 Ccl 和 Cxcl 趋化因子及其相应受体在代表性上皮细胞、成纤维细胞、髓系细胞和 T 细胞簇中的规模归一化表达。f 气泡图显示了各种细胞类型在肿瘤发生过程中选定基因的规模归一化表达。气泡的大小表示在各个病理阶段表达相应基因的细胞百分比。
Besides the roles of chemokines, cells in the esophageal microenvironment were also stimulated by each other through various signaling pathways. We found that from stage HYP, the expression levels of Il17a in CD4+ T cells and Il17ra in myeloid cells were elevated (Fig. 6f), which were well-known to have ability to activate macrophages and neutrophils54. On the other hand, we found significant correlations between Il1b and Il23a levels in macrophages and neutrophils and their receptors Il1r1 and Il23r in CD4+ T cells, implying close-loop activation among these cells. The activated immune cells secreted high levels of cytokines (Il1b, Il6, Il18, Tnf) and formed an inflammatory microenvironment starting from stage HYP (Fig. 6f). Analyses of interaction indicated increased interactions between malignant epithelial cells (EpiC 5 and EpiC 6) and inflammatory immune cells, Th17 and neutrophils (Supplementary Fig. 6d, e). The inflammatory microenvironment might further promote carcinogen-initiated epithelial cells to transit along the carcinogenic roadmap as illustrated in Figs. 2b and 4b.
除了趋化因子的作用外,食管微环境中的细胞还通过各种信号通路相互刺激。我们发现从 HYP 阶段开始,CD4 T 细胞中的 Il17a 表达和髓系细胞中的 Il17ra 表达升高(图 1),这两种因子已知能够激活巨噬细胞和中性粒细胞。另一方面,我们发现巨噬细胞和中性粒细胞中的 Il1b 和 Il23a 水平与其受体 Il1r1 和 Il23r 在 CD4 T 细胞中的表达之间存在显著相关性,表明这些细胞之间存在紧密的激活循环。激活的免疫细胞分泌高水平的细胞因子(Il1b、Il6、Il18、Tnf),从 HYP 阶段开始形成炎症微环境(图 4)。相互作用分析表明,恶性上皮细胞(EpiC 5 和 EpiC 6)与炎症免疫细胞(Th17 和中性粒细胞)之间的相互作用增加(补充图 5)。炎症微环境可能进一步促进致癌物引发的上皮细胞沿着致癌路径过渡,如图 6b 和图 7 所示。
Validation of the expression profiles in human ESCC samples
人类 ESCQ 样本中表达谱的验证
We last performed a validation examination of the identified expression profiles with mouse model in tissue samples obtained from four ESCC patients having both precancerous and cancerous lesions in the esophagus. RNA-seq was conducted with mini-bulk tissues of invasive carcinoma (ICA, n = 9), carcinoma in situ (CIS, n = 6), dysplasia (DYS, n = 17) and tumor-adjacent normal epithelia (taNOR, n = 8) obtained by LCM (Supplementary Fig. 7a). PCA showed a clearly different expression profile between cancer tissues (CIS and ICA) and non-cancer tissues (taNOR and DYS); however, an extensive overlap was present between taNOR and DYS (Fig. 7a; Supplementary Fig. 7b). In addition, we found that the six representative genes identified in mouse different epithelial cell clusters were also expressed in the precancerous lesions from ESCC patients (Fig. 7b) and the expression patterns were in line with those in mice. ALDH3A1 and S100A8 were highly expressed in the early precancerous lesions while MMP14 and ITGA6 were mainly present in invasive ESCC, which was concordant with their biological functions. As LCM captured mainly epithelial cells and the samples were not suitable for analyzing the status of other cell types, we performed bulk RNA-seq using biopsy samples. The results also showed a trend of increased neutrophils in the microenvironment, although this trend along the stage progress were marginally significant (trend test, P = 0.066; Fig. 7c). These results suggested that strongly supported our findings in mouse model and provided new clues in understanding ESCC initiation.
我们最后使用来自 4 例同时具有癌前病变和癌变病变的食管鳞状细胞癌(ESCC)患者组织样本的 mini-bulk 组织进行了已鉴定表达谱的验证检查。进行了 RNA-seq 分析,样本包括浸润性癌(ICA,n=9)、原位癌(CIS,n=6)、异型增生(DYS,n=17)和肿瘤相邻正常上皮(taNOR,n=8),这些样本通过激光捕获显微切割(LCM)获得(补充图 7a )。主成分分析(PCA)显示癌组织(CIS 和 ICA)和非癌组织(taNOR 和 DYS)之间有明显不同的表达谱;然而,taNOR 和 DYS 之间存在广泛的重叠(图 7a; 补充图 7b )。此外,我们发现,在小鼠不同上皮细胞簇中鉴定出的 6 个代表性基因也在 ESCC 患者的癌前病变中表达(图 7b ),其表达模式与小鼠中的模式一致。ALDH3A1 和 S100A8 在早期癌前病变中高度表达,而 MMP14 和 ITGA6 主要存在于侵袭性 ESCC 中,这与其生物学功能一致。 LCM 主要捕获了上皮细胞,且样本不适合分析其他细胞类型的状态,因此我们使用活检样本进行了 bulk RNA-seq。结果也显示微环境中中性粒细胞的趋势增加,尽管这种趋势沿阶段进展仅边缘显著(趋势检验,P=0.066;图 7c )。这些结果支持了我们在小鼠模型中的发现,并提供了理解食管癌起始的新线索。
Fig. 7. Validation of expression features in human epithelial tissues with different lesions.
图 7. 不同病变的人类上皮组织中表达特征的验证。
a PCA plot of mini-bulk tissue RNA-seq on human esophageal tissues with different lesions as indicated by different colors and each dot represents a sample. b Immunofluorescence image (upper panel; scale bar, 100 μm) and quantification of selected gene expression levels (lower panel) for human epithelia tissue samples with different lesions. Independent samples from stage INF to ICA are n = 5, 13, 8, 3, and 10, respectively. Data represent mean ± S.E.M. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 for two-sided Wilcoxon rank-sum test, compared with stage INF. P-values for TOP2A: 0.0098 (HYP), 0.0295 (DYS), 0.0357 (CIS), 0.0007 (ICA); ALDH3A1: 0.0007 (ICA); ATF3: 0.0007 (ICA); S100A8: 0.0127 (ICA); MMP14: 0.0140 (HYP), 0.0451 (DYS), 0.0357 (CIS), 0.0007 (ICA); ITGA6: 0.0357 (CIS), 0.0007 (ICA). c Proportion of neutrophils (relative to stage INF) in different precancerous and cancerous lesions based on bulk tissue RNA-seq using CIBERSORT. Each dot represents a sample, data represent mean ± S.D. P = 0.066, determined by two-sided Mantel–Haenszel chi-square test for linear trend. Independent patient samples from stage INF to ICA are n = 6, 15, 11, 6 and 7, respectively.
a PCA 图展示了不同病变阶段的人类食管组织 mini-bulk RNA-seq 数据,不同颜色代表不同病变阶段,每个点代表一个样本。b 免疫荧光图像(上图;比例尺,100 μm)和不同病变阶段人类上皮组织样本中选定基因表达水平的定量分析(下图)。从阶段 INF 到 ICA 的独立样本数量分别为 5、13、8、3 和 10。数据表示均值±标准误差。*P < 0.05,**P < 0.01,***P < 0.001,****P < 0.0001,与阶段 INF 进行双侧 Wilcoxon 秩和检验。TOP2A:HYP:0.0098,DYS:0.0295,CIS:0.0357,ICA:0.0007;ALDH3A1:ICA:0.0007;ATF3:ICA:0.0007;S100A8:ICA:0.0127;MMP14:HYP:0.0140,DYS:0.0451,CIS:0.0357,ICA:0.0007;ITGA6:CIS:0.0357,ICA:0.0007。c 基于 bulk RNA-seq 使用 CIBERSORT 计算的不同癌前和癌变病变中中性粒细胞的比例(相对于阶段 INF)。每个点代表一个样本,数据表示均值±标准差。P = 0.066,通过双侧 Mantel–Haenszel 卡方检验确定线性趋势。从阶段 INF 到 ICA 的独立患者样本数量分别为 6、15、11、6 和 7。
Discussion 讨论
In this study, we employed a high-throughput scRNA-seq analysis and have depicted a complete cell atlas of transition fates, characterized the transcriptomic features of cell clusters at each pathological stage, and elucidated the possible evolutionary trajectories of cells in the multistep processes of ESCC development. We majorly focused on the expression transitions of squamous epithelial and TME cells associated with tumorigenesis, because how they transform from normal to malignant phenotypes is a long-standing question. In this regard, our comprehensive findings shed light on the underlying molecular mechanism of carcinogen-induced esophageal squamous carcinoma.
在本研究中,我们采用了高通量单细胞转录组测序(scRNA-seq)分析,并绘制了完整的细胞转换图谱,描述了每个病理阶段细胞簇的转录组特征,并阐明了细胞在 ESCC 多步发展过程中的可能进化轨迹。我们主要关注与肿瘤发生相关的鳞状上皮细胞和肿瘤微环境(TME)细胞的表达转换,因为它们从正常状态转变为恶性表型的过程一直是一个长期未解的问题。在这方面,我们的全面发现为致癌物诱导的食管鳞状细胞癌的分子机制提供了新的见解。
Many previous studies have shown that human ESCC is a complex disease largely attributed to the exposure to carcinogens from the environment and lifestyles55. 4NQO represents a chemical carcinogen that can specifically induce ESCC in mice and this animal cancer model has been shown to mimic ESCC development in humans6,7. Thus, as it is impossible to obtain continuous tumorigenic lesions from the same human individuals over time, our results from this mouse model demonstrate its value for precisely understanding the gene expressions that may drive cell evolution from normal into malignancy in the multistep development of ESCC in humans. Indeed, we examine and validate the transcriptomic characters of epithelium and TME from patients bearing precancerous lesions.
许多先前的研究表明,人类食管鳞状细胞癌(ESCC)是一种复杂的疾病,主要归因于环境和生活方式中的致癌物质的暴露 55 。4NQO 是一种化学致癌物,可以特异性地在小鼠中诱导 ESCC,并且这种动物癌症模型已被证明可以模拟人类 ESCC 的发展 6,7 。因此,由于无法从同一人类个体中获得连续的肿瘤发生病变,我们从这种小鼠模型中获得的结果证明了其在精确理解基因表达方面的重要性,这些基因表达可能驱动正常细胞向恶性转化,在人类多步骤发展的 ESCC 中。事实上,我们检查并验证了携带癌前病变患者的上皮组织和肿瘤微环境(TME)的转录组特征。
Another interesting and important finding of this study is that the transcriptomic expression pattern of esophageal epithelial cells in 4NQO-induced early lesions is similar to that of esophageal normal epithelial cells in aged mice, while aging is a known risk factor for cancer development in humans. Previous genomic mutation studies in human esophagus4 have shown that although the exonic mutation burden in normal esophageal epithelium derived from ESCC-free human individuals was increased with age, no cancer-related lesions occurred in view of histopathology, suggesting that the transcriptomic expression alterations and other factors such as immune suppression driven by carcinogen insults may show crucial effects in ESCC tumorigenesis.
本研究的另一个有趣且重要的发现是,在 4NQO 诱导的早期病变中,食管上皮细胞的转录组表达模式与老年小鼠的正常食管上皮细胞相似,而衰老是人类癌症发展的一个已知风险因素。先前对人类食管的基因组突变研究 4 表明,虽然正常食管上皮细胞(来自无食管鳞状细胞癌个体)的外显子突变负担随年龄增加而增加,但从病理学角度来看,没有发生癌症相关病变,这表明转录组表达改变和其他因素,如致癌物刺激引起的免疫抑制,可能在食管鳞状细胞癌的发生中起关键作用。
Given that epithelial cells have two different fates, including oncogenic and differentiation, the deviation from the default fates during tumorigenesis implies a permissive or even supportive microenvironment for tumor development. Indeed, by analyze the dynamic changes of non-epithelial cell types in the microenvironment, we found decreased antigen presentation in both fibroblasts and myeloid cells reduced early downregulation of CD8+ T cell response during tumorigenesis. After inflammatory stage, no active cytotoxic responses by CD8+ T cells were detected indicating that neither epithelial cells nor fibroblasts in this pathogenic stage and hereafter had the activity tends to stimulate and activate cytotoxic T cells, whose activation has been considered to be a crucial mechanism against tumorigenesis. In addition, CD8+ T cell suppression may also result from a substantial change of myeloid cells because after pathogenic stage INF, MoMφ-C 3 and tDC cells, both of which are well-known as immune suppressors, became the main myeloid cell types in the tumor microenvironment. In addition, starting from hyperplasia stage, the macrophages and neutrophils activated by CD4-Th17 cells may form an inflammatory microenvironment during tumorigenesis. These findings strongly support our notion that the general immune suppression and chronic inflammatory microenvironment have crucial roles in promoting tumor development. Increased inflammatory responses and decreased cytotoxic T cell activities were also reported in the multistep process during cancer initiation for gastric cancer56 and pancreatic cancer57 depicted by single-cell analysis.
鉴于上皮细胞有两种不同的命运,包括致癌和分化,肿瘤发生过程中偏离默认命运意味着为肿瘤发展提供一种宽容甚至支持的微环境。确实,通过分析微环境中非上皮细胞类型的动态变化,我们发现成纤维细胞和髓系细胞中抗原呈递的减少降低了肿瘤发生早期 CD8 + T 细胞反应的早期下调。在炎症阶段之后,未检测到 CD8 + T 细胞的主动细胞毒性反应,表明在这一致病阶段及其之后,上皮细胞和成纤维细胞均没有刺激和激活细胞毒性 T 细胞的活动,而细胞毒性 T 细胞的激活被认为是对抗肿瘤发生的关键机制。此外,CD8 + T 细胞的抑制也可能源于髓系细胞的显著变化,因为在致病阶段 INF 之后,MoMφ-C 3 和 tDC 细胞成为肿瘤微环境中主要的髓系细胞类型,这两种细胞都是已知的免疫抑制细胞。 此外,从增生阶段开始,由 CD4-Th17 细胞激活的巨噬细胞和中性粒细胞可能会在肿瘤发生过程中形成炎症微环境。这些发现强烈支持了我们关于一般免疫抑制和慢性炎症微环境在促进肿瘤发展中的关键作用的观点。单细胞分析显示,在胃癌 56 和胰腺癌 57 的多步骤致癌过程中,也报道了炎症反应增强和细胞毒性 T 细胞活性降低的现象。
Although several important findings have been presented in this study, we acknowledge some limitations. First, the cells in this study were obtained from the mouse esophageal tissues induced by the carcinogen. Although the 4NQO-induced mouse ESCC model may mimic human ESCC development and we validated the expression of some genes and cells in human esophageal samples, caution should be taken in translating these results to humans. Second, although we have identified the carcinogen-induced epithelial cell and microenvironment transition fates, the underlying molecular mechanisms require further investigation. Furthermore, due to small size of esophageal epithelial tissues in mice and the vulnerable nature of esophagus epithelial cells in vitro, our data sets for early lesions were based on relatively small amount of epithelial cells. This limitation could not have biased our conclusion because the major results were derived from the comparison between different clusters of epithelial cells rather than setting the precursor stage data set as a benchmark for the interpretation. However, experimental methods such as stripping the epithelial layer, reducing sample processing time and raising sensitivity of vulnerable cell detection should be improved in the future studies.
尽管本研究中提出了几个重要的发现,但我们也承认一些局限性。首先,本研究中的细胞来源于致癌物质诱导的小鼠食管组织。虽然 4NQO 诱导的小鼠食管鳞状细胞癌(ESCC)模型可能模拟了人类 ESCC 的发展,并且我们已经在人类食管样本中验证了一些基因和细胞的表达,但在将这些结果应用于人类时仍需谨慎。其次,尽管我们已经识别出了致癌物质诱导的上皮细胞和微环境的转变命运,但其背后的分子机制仍需进一步研究。此外,由于小鼠食管上皮组织体积较小且体外培养的食管上皮细胞较为脆弱,我们早期病变的数据集基于相对较少的上皮细胞。这一局限性并未偏倚我们的结论,因为主要结果是基于不同簇的上皮细胞之间的比较,而不是将前体阶段的数据集作为解释的基准。 然而,未来的研究应改进实验方法,如剥离上皮层、缩短样本处理时间并提高脆弱细胞检测的灵敏度。
In conclusion, we have conducted the single-cell transcriptomic analysis of carcinogen-induced ESCC in mice, which mimics human ESCC development. By analyzing epithelial cells in different pathogenic stages, we have depicted a roadmap in carcinogen-induced mouse ESCC development and identified some key expression signatures. Comprehensive analysis of non-epithelial cells in tumor microenvironment revealed the suppression of anticancer immune response and robust expression of chronic inflammation genes. These results shed light on the transcriptomic alterations and transition status of various cell types in the esophagus when ESCC develops. Because the mouse model used in this study mimics human esophageal tumorigenesis, the findings may also represent human ESCC development.
结论,我们对致癌物诱导的小鼠食管鳞状细胞癌(ESCC)进行了单细胞转录组分析,模拟了人类 ESCC 的发展过程。通过分析不同病理阶段的上皮细胞,我们描绘了致癌物诱导的小鼠 ESCC 发展的 roadmap,并识别了一些关键的表达特征。对肿瘤微环境中非上皮细胞的全面分析揭示了抗癌免疫反应的抑制和慢性炎症基因的 robust 表达。这些结果阐明了 ESCC 发展过程中食管各种细胞类型的转录组改变和过渡状态。由于本研究中使用的 mouse model 模拟了人类食管肿瘤发生,这些发现也可能代表人类 ESCC 的发展。
Methods 方法
Human biospecimen collection
人类生物样本采集
ESCC tumor, dysplasia lesions and tumor-adjacent (>5 cm) normal tissues of the same patients (n = 4) used for LCM and esophageal lesions of different pathological stages (n = 45) used for bulk RNA sequencing were collected during surgery or endoscopy in Linzhou Esophageal Cancer Hospital (Henan Province, China) from 2018 to 2019. The various lesions were diagnosed independently by at least two pathologists according to the American Joint Committee on Cancer Eighth edition. No patient had received chemotherapy or radiotherapy before biopsy or surgery. This study was approved by the Institutional Review Boards of Cancer Hospital, Chinese Academy of Medical Sciences and informed consent was obtained from each patient. Clinical information was collected from patients’ medical records.
2018 年至 2019 年,在河南省林州市食管癌医院(中国)进行手术或内镜检查时,收集了 4 名患者的 ESCC 肿瘤、异型增生病变和距肿瘤>5 厘米的正常组织(用于激光捕获显微切割,LCM),以及 45 名不同病理阶段的食管病变组织(用于批量 RNA 测序)。各种病变由至少两位病理学家根据美国癌症联合委员会第八版独立诊断。在活检或手术前,患者未接受化疗或放疗。本研究经中国医学科学院肿瘤医院机构审查委员会批准,并从每位患者处获得了知情同意。临床信息从患者的医疗记录中收集。
Induction of multi-staged ESCC development and sample preparations
诱导多阶段食管鳞状细胞癌的发展及样本准备
Animal experiments in this study were conducted in compliance with approved protocols and guidelines from the Institutional Animal Care and Use Committee of the Chinese Academy of Medical Sciences. Eight-week-old female C57BL/6 mice, purchased from the Beijing Huafukang bioscience company in China, were maintained in local housing facility of a controlled condition (23 ± 1 °C, 50 ± 10% humidity and 12–12 h light-dark cycle). Mice were treated with 4NQO (Sigma-Aldrich) in drinking water (100 μg/ml) for 16 weeks to induce multi-staged ESCC carcinogenesis. Drinking water containing the carcinogen was replaced once a week with freshly prepared one and mice were allowed to access drinking water ad libitum during treatment. After 16 weeks of carcinogen treatment, 4NQO drinking water was replaced by sterile water until the mice were killed.
本研究中的动物实验符合中国医学科学院实验动物护理和使用委员会批准的协议和指南。使用中国北京华阜康生物公司提供的 8 周龄雌性 C57BL/6 小鼠,在受控条件下(温度 23±1℃,湿度 50±10%,12-12 小时光照-黑暗周期)饲养。小鼠通过饮用含 4NQO(Sigma-Aldrich,100μg/ml)的水诱导多阶段食管鳞状细胞癌(ESCC)发生,持续 16 周。每周更换一次含有致癌物的饮用水,并允许小鼠自由饮用。16 周致癌物处理后,用无菌水替代 4NQO 饮用水,直至处死小鼠。
The esophageal lesions were identified by two independent pathologists based on the histopathological criteria described previously58 (Fig. 1a). Briefly, stage NOR was well-oriented stratified epithelium consists of basal zone and superficial zone. Stage INF was normal epithelium with focal aggregates of epithelial lymphocytes. Stage DYS was defined as loss of polarity in the epithelial cells, nuclear pleomorphism, hyperchromatic, and increased or abnormal mitoses. In stage HYP, these abnormalities were confined to the lower third of the epithelium while in DYS they present in lower two thirds of the epithelium. Lesions with such abnormal changes involving the entire thickness of epithelium were considered as carcinoma in situ (stage CIS). Stage ICA was defined as a lesion with invasion into the sub-epithelial tissues. Esophageal samples of 4NQO-induced mice were subjected to single-cell RNA sequencing at six different time points and indicated different pathological stages: stage NOR (at week 0), stage INF (at week 12), stage HYP at (week 20), stage DYS (at week 22), stage CIS (at week 24), and stage ICA (at week 26). A group of control mice treated without 4NQO were killed at month 1, 2, 8, and 25 (n = 2, respectively). The esophagus was removed immediately when the animal was killed. Cross-sections of the esophagus were cut and stored at –80 °C and sections of the frozen tissues were stained with hematoxylin–eosin (H&E) for histopathological examination and microdissection for bulk RNA sequencing.
食管病变根据先前描述的组织病理学标准由两位独立的病理科医师识别 58 (图 1a )。简而言之,NOR 阶段为有基底区和表层区的定向良好分层上皮。INF 阶段为正常上皮,伴有局部上皮淋巴细胞聚集。DYS 阶段定义为上皮细胞失去极性,核多形性,染色深,以及增多或异常的有丝分裂。在 HYP 阶段,这些异常局限于上皮的下三分之一,而在 DYS 阶段则出现在下三分之二。涉及整个上皮厚度的异常变化被认为是原位癌(CIS 阶段)。ICA 阶段定义为侵入上皮下组织的病变。诱导食管癌的 4NQO 小鼠在六个不同时间点进行了单细胞 RNA 测序,并表明不同的病理阶段:NOR 阶段(第 0 周),INF 阶段(第 12 周),HYP 阶段(第 20 周),DYS 阶段(第 22 周),CIS 阶段(第 24 周),ICA 阶段(第 26 周)。 一组未接受 4NQO 处理的对照小鼠在第 1、2、8 和 25 个月时被处死(每组分别有 2 只)。动物被处死后立即取出食管。食管横截面被切成薄片并保存在-80 °C 下,冷冻组织的切片用苏木精-伊红(HE)染色进行病理学检查和 bulk RNA 测序的宏分离。
Bulk RNA sequencing and data analysis
批量 RNA 测序和数据分析
Tissue samples from human or mice were cut into 5–10 consecutive sections (8 μm) and the epithelial layer contained 30–50 cells on each section was micro-dissected with a Leica LMD7000 laser-capture microdissection system. RNA was isolated from the mini-bulk samples and cDNA was prepared for sequencing based on the Geo-seq protocol59. Sequencing libraries were built using the TruePrep DNA Library Prep Kit V2 for Illumina (Vazyme), and evaluated by Bioanalyzer (DNA HS kit, Agilent). RNA-seq data were mapped to GRCh38 human genome and GRCm38 murine genome by HISAT2 (version 2.1.0)60 with default parameter for human and murine samples, respectively. The gene expression matrix of raw reads counts after annotation by HTSeq (version 0.6.1p1) was processed using the DESeq2 (version 1.22.2)61 and visualized by showing the first 3 dimensions calculated by plotPCA function. We used TRIZOL to extract bulk RNA from patient biopsy (n = 45), and constructed sequencing library using NEBNext Ultra II RNA Library Prep Kit for Illumina. Sequencing data were processed by HISAT2 and HTseq as described above. The normalized expression from bulk samples of human precancerous lesions was used to estimated neutrophil fraction with CIBERSORT (version 1.06)62.
人或小鼠的组织样本被切成 5–10 个连续切片(8 μm),每个切片中的上皮层包含 30–50 个细胞,使用 Leica LMD7000 激光捕获显微切割系统进行微切割。从 mini-bulk 样本中提取 RNA,并根据 Geo-seq 协议制备 cDNA 进行测序 59 。使用 Vazyme 的 TruePrep DNA Library Prep Kit V2 for Illumina 构建测序文库,并通过 Bioanalyzer(DNA HS kit,Agilent)进行评估。RNA-seq 数据使用 HISAT2(版本 2.1.0) 60 分别对人和小鼠样本使用默认参数进行比对,映射到 GRCh38 人基因组和 GRCm38 小鼠基因组。使用 HTSeq(版本 0.6.1p1)对注释后的原始读数计数生成基因表达矩阵,并使用 DESeq2(版本 1.22.2) 61 处理,通过 plotPCA 函数计算的前 3 个维度进行可视化。我们使用 TRIZOL 从患者活检样本(n = 45)中提取 bulk RNA,并使用 NEBNext Ultra II RNA Library Prep Kit for Illumina 构建测序文库。测序数据的处理方法如上所述。 从 bulk 样本中的人癌前病变的归一化表达量使用 CIBERSORT(版本 1.06)估算中性粒细胞比例 62 。
Single-cell RNA sequencing (scRNA-seq) and data analysis
单细胞 RNA 测序(scRNA-seq)和数据分析
For mice at stage NOR and INF, the whole esophagi were taken and for mice at other stages, the dysplastic/malignant lesions were taken immediately after killed. The tissue samples of ESCC and various precursor lesions were gently minced into small pieces and digested for with in RPMI-1640 medium (Invitrogen) containing collagenase IV (Gibco) and hyaluronidase (Sigma-Aldrich). CD45-FITC (553080, BD Biosciences, dilution 1:20) antibody staining was performed for fluorescence activated cell sorting (FACS) on a FACSAria sorter (BD Biosciences). Single cells with or without GFP signal, representing immune or non-immune cells, were sorted and captured respectively in nanoliter droplets using Chromium (10× Genomics). scRNA-seq libraries were prepared using Chromium Single Cell 5′ Reagent Kits (10× Genomics) and sequencing was accomplished with an Illumina HiSeq x10 System.
对于 NOR 和 INF 阶段的小鼠,整个食管被取下;而对于其他阶段的小鼠,则在处死后立即取下异常增生/恶性病变组织。ESCC 及其各种前驱病变组织样本被轻轻剪碎成小块,并用含胶原酶 IV(Gibco)和透明质酸酶(Sigma-Aldrich)的 RPMI-1640 培养基(Invitrogen)消化。使用 BD Biosciences 的 CD45-FITC(553080,稀释度 1:20)抗体进行荧光激活细胞排序(FACS)染色,使用 BD Biosciences 的 FACSAria 分选器。带有或不带有 GFP 信号的单个细胞,代表免疫或非免疫细胞,分别被分选并用 Chromium(10× Genomics)捕获在纳升液滴中。使用 10× Genomics 单细胞 5′试剂盒制备 scRNA-seq 文库,并使用 Illumina HiSeq x10 系统进行测序。
Raw gene expression matrices obtained per sample using CellRanger (version 2.1.0, 10× genomics) were combined using the Seurat R package (version 2.3.4)63. Genes detected in <0.1% of all cells were filtered. We further excluded cells with gene counts <500 and cells that had >10% of mitochondrial gene expressions. After quality control, 66,089 cells were further analyzed for their gene expression profiles. The genes with normalized expression between 0.0125 and 3, and dispersion >0.5 were selected as highly variable genes. The resultants were first summarized by principle component analysis (PCA) and then first several PCs were selected for tSNE dimensional reduction using the default settings of the RunTSNE function. The numbers of resulting highly variable genes and the select PCs are shown in Supplementary Table 2. Cell clusters in the resulting two-dimensional representation were annotated as known biological cell types using canonical marker genes.
每个样本使用 CellRanger(版本 2.1.0,10× genomics)获得原始基因表达矩阵,并使用 Seurat R 包(版本 2.3.4)进行合并 63 。过滤掉在所有细胞中检测不到<0.1%的基因。进一步排除基因计数<500 的细胞以及线粒体基因表达>10%的细胞。经过质量控制后,66,089 个细胞进一步分析其基因表达谱。选择归一化表达在 0.0125 到 3 之间且离散度>0.5 的基因作为高度可变基因。首先通过主成分分析(PCA)汇总结果,然后使用 RunTSNE 函数的默认设置进行 tSNE 维度降维,选择前几个主成分。高度可变基因的数量和选择的主成分数量显示在补充表 2 中。在结果的二维表示中,细胞簇被注释为已知的生物细胞类型,使用经典标记基因。
Major cell-type clustering and marker gene identification
主要细胞类型聚类和标志基因识别
We reanalyzed epithelial cells and stromal cells separately to identify their sub-clusters by using highly variable gene identification and dimensional reduction as described above (Supplementary Table 2). Cells with mix features were removed from further analysis (e.g., cells with both Cd3d and Cd19 expression indicating T cell and B cell Multiplet). Clusters were identified using FindClusters function, and the specific gene markers for each cluster were determined using the FindAllMarkers function implanted in Seurat package.
我们分别重新分析了上皮细胞和间质细胞,通过上述方法(见补充表 2 )识别高度可变基因并进行维度减少,以识别其亚簇。具有混合特征的细胞被从进一步分析中移除(例如,同时表达 Cd3d 和 Cd19 的细胞表明是 T 细胞和 B 细胞的 Multiplet)。使用 FindClusters 函数识别簇,并使用 Seurat 包内置的 FindAllMarkers 函数确定每个簇的特定基因标记。
Gene set variation analysis (GSVA)
基因集变异分析(GSVA)
Pathway analyses were predominantly performed on the 50 hallmark pathways described in the molecular signature database, exported using the MSigDB database (version 6.2)64. We also assessed biological process activities using a described biological process of Gene Ontology (GO) dataset. To assign pathway activity estimates to individual cells, we applied the GSVA using standard settings, as implemented in the GSVA package (version 1.30.0)65. To assess differential activities of pathways between sub-cluster of cells, we contrasted the activity scores for each cell using Limma package (version 3.38.3)66. Differential activities of pathways were calculated for each identified cluster. T-values of the results of some significant differential pathways (P < 0.05) in top 10 were visualized using heatmaps with average pathway activity scores of each cluster.
通路分析主要针对分子签名数据库(MSigDB 数据库,版本 6.2)中描述的 50 个标志性通路进行。我们还使用基因本体(GO)数据集描述的生物学过程评估了生物学过程活性。为了将通路活性估计值分配给单个细胞,我们使用 GSVA 包(版本 1.30.0)的标准设置进行了 GSVA 分析 65 。为了评估细胞亚簇之间通路活性的差异,我们使用 Limma 包(版本 3.38.3)对比了每个细胞的活性评分 66 。对于每个识别出的簇,计算了通路活性的差异。对于一些显著差异的通路(P < 0.05),在前 10 个中,使用热图可视化每个簇的平均通路活性评分。
Analysis of transcription factor expression
转录因子表达分析
SCENIC (version 1.1.0)67 was used to assess the transcriptional activity of epithelial cells with high quality (UMI > 5500). The analysis used the motifs database for RcisTarget and GRNboost (corresponding to GENIE3 1.4.3, AUCell 1.4.1 and RcisTarget 1.2.1; with mm10__refseq-r80__10kb_up_and_down_tss.mc9nr). The input matrix was read counts.
SCENIC(版本 1.1.0) 67 用于评估具有高质量(UMI > 5500)的上皮细胞的转录活性。分析使用了 RcisTarget 和 GRNboost 的 motifs 数据库(对应于 GENIE3 1.4.3、AUCell 1.4.1 和 RcisTarget 1.2.1;使用 mm10__refseq-r80__10kb_up_and_down_tss.mc9nr)。输入矩阵为读取计数。
Cell transition trajectory and diffusion map analysis
细胞转换轨迹和扩散图分析
Monocle 2 (version 2.10.1)68 was used for the trajectory analysis on high quality epithelial cells (UMI > 5500). All the top 100 markers of each cluster were used for the cell ordering. Dimensionality reduction and trajectory construction were performed on the selected genes with default methods and parameters. We calculated diffusion components using the RunDiffusion function as implemented in Seurat package with default parameters. The first three dimensions were used to draw diffusion maps. Mean coordinates of all of each cluster’s cells were considered as the center of the cluster. The farthest cell of stage NOR from the total distance of other stages was the start point.
Monocle 2(版本 2.10.1) 68 用于高质量上皮细胞(UMI > 5500)的轨迹分析。每个簇的前 100 个标记基因用于细胞排序。在选定基因上进行降维和轨迹构建,默认方法和参数。我们使用 Seurat 包中的 RunDiffusion 函数计算扩散成分,默认参数。使用前三个维度绘制扩散图。每个簇所有细胞的平均坐标被视为簇的中心。从总距离来看,阶段 NOR 中最远的细胞作为起点。
Analysis of interaction between cell types
细胞类型间的相互作用分析
We used CellPhoneDB (version 2.0.6)69 with default arguments to reveal interaction between cell types. Each cluster analyzed was downsampled to 100 cells since the low cell number of some epithelial cluster. Interactions of epithelial cells with immune cells and immune cells with epithelial cells were demonstrated respectively.
我们使用了 CellPhoneDB(版本 2.0.6) 69 并采用默认参数来揭示不同细胞类型之间的相互作用。由于某些上皮细胞簇的细胞数量较低,每个分析的簇被下采样至 100 个细胞。上皮细胞与免疫细胞之间的相互作用以及免疫细胞与上皮细胞之间的相互作用分别得到了展示。
Immunohistochemistry and immunofluorescent detection
免疫组化和免疫荧光检测
Formalin-fixed paraffin-embedded (FFPE) sections of esophageal precursor lesions were collected from 30 patients between 2016 and 2018 in Linzhou Cancer Hospital, including INF (n = 5), HYP (n = 13), DYS (n = 8) and CIS (n = 3), to validate the results obtained in mice. The protein expression levels of the marker genes were detected by IHC staining for mice tissues and immunofluorescence for human specimens with antibodies (Abcam) shown in Supplementary Table 3. The samples were incubated with antibody against Ki67 (1:50 for IHC, ab16667), Top2a (1:8000 for IHC, 1:10,000 for IF, ab52934), Aldh3a1 (1:200 for IHC, 1:600 for IF, ab76976), Atf3 (1:200 for IHC, 1:600 for IF, ab216569), S100a8 (1:500 for IHC, 1:1500 for IF, ab92331), Mmp14 (1:2000 for IHC, 1:6000 for IF, ab51074), or Itga6 (1:250 for IHC, 1:750 for IF, ab181551). Opal multiplex staining was performed according to the Opal 5-Color Manual IHC Kit (Perkin Elmer). Opal DAPI, Opal 520, Opal 570, Opal 620, and Opal 690 were used to generate different signals. Slides were counterstained with DAPI (1:2000) for nuclei visualization, and subsequently coverslipped using a VectaShield Hardset mounting media. The slides were imaged using Vectra Polaris Automated Quantitative Pathology Imaging System (Perkin Elmer). We used inForm software (Perkin Elmer) to unmix and remove autofluorescence and to analyze the multispectral images.
食管前驱病变的甲醛固定石蜡包埋(FFPE)切片来自 2016 年至 2018 年林州癌症医院的 30 名患者,包括 INF(n=5)、HYP(n=13)、DYS(n=8)和 CIS(n=3),用于验证在小鼠中获得的结果。通过 IHC 染色检测小鼠组织中标志基因的蛋白表达水平,并通过免疫荧光检测人类标本,使用抗体(Abcam),详见补充表 3 。样本用抗体孵育,针对 Ki67(IHC 1:50,ab16667),Top2a(IHC 1:8000,IF 1:10,000,ab52934),Aldh3a1(IHC 1:200,IF 1:600,ab76976),Atf3(IHC 1:200,IF 1:600,ab216569),S100a8(IHC 1:500,IF 1:1500,ab92331),Mmp14(IHC 1:2000,IF 1:6000,ab51074),或 Itga6(IHC 1:250,IF 1:750,ab181551)。根据 Perkin Elmer 的 Opal 5-Color Manual IHC Kit 进行 Opal 多色染色。使用 Opal DAPI、Opal 520、Opal 570、Opal 620 和 Opal 690 生成不同信号。切片用 DAPI(1:2000)复染以观察核,随后使用 VectaShield Hardset 封片介质封片。 切片使用 Perkin Elmer 的 Vectra Polaris 自动定量病理成像系统成像。我们使用 Perkin Elmer 的 inForm 软件进行去混和去除自发荧光,并分析多光谱图像。
Statistical analysis 统计分析
Statistical analyses were conducted by using R v3.5.170 and Prism 7 (Graphpad Software). Pearson’s correlation was calculated with the R function cor() and the significance was determined using two-sided unpaired Wilcoxon rank-sum test. P < 0.05 was considered statistically significant.
统计分析使用了 R v3.5.1 和 Prism 7(Graphpad Software)。使用 R 函数 cor()计算了皮尔森相关系数,并使用双侧非配对 Wilcoxon 秩和检验确定了显著性。P < 0.05 被认为具有统计学意义。
Reporting summary 报告摘要
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
进一步的研究设计信息请参见本文链接的 Nature Research Reporting Summary 。
Supplementary information
补充信息
Acknowledgements 致谢
This project was supported by the National Natural Science Foundation of China (81725015 to C. Wu, 21675098 to J.W., 21927802 to J.W.), Medical and Health Technology Innovation Project of Chinese Academy of Medical Sciences (2016-I2M-3-019 to D.L., 2016-I2M-4-002 to C. Wu., 2019-I2M-2-001 to D.L. and C. Wu), Beijing Outstanding Young Scientist Program (BJJWZYJH01201910023027 to C. Wu), 2018 Beijing Brain Initiative (Z181100001518004 to J.W.), Beijing Advanced Innovation Center for Genomics, and Beijing Advanced Innovation Center for Structural Biology.
本项目得到了中国国家自然科学基金(81725015 致吴春,21675098 致吴杰,21927802 致吴杰)的支持,中国医学科学院医学与健康科技创新工程(2016-I2M-3-019 致杜立中,2016-I2M-4-002 致吴春,2019-I2M-2-001 致杜立中和吴春),北京市优秀青年科学家计划(BJJWZYJH01201910023027 致吴春),2018 北京脑计划(Z181100001518004 致吴杰),北京基因组学高级创新中心和北京结构生物学高级创新中心的支持。
Source data 原始数据
Author contributions 作者贡献
J.W., C. Wu., and D.L. conceptualized and supervised this study. J.Y., Q.C., W.F., Y.C., and T.L. contributed to the study design and performed most experiments. W.G., Y.X., and L.L. were engaged in sample collection and preparation. Y.M., J.Y., X.Z., and Y. Lin contributed to bioinformatics and statistical analysis. C. Wang, L.P., Y. Luo, and A.L. were responsible for animal experiments. J.Y., Q.C., W.F., Y.M., and Y.C. drafted the manuscript. W.T., C. Wu., J.W., and D.L. reviewed and prepared the final manuscript. All authors approved the final manuscript.
J.W.、C.吴.和 D.L.负责构思和监督本研究。J.Y.、Q.C.、W.F.、Y.C.和 T.L.参与了研究设计并完成了大部分实验。W.G.、Y.X.和 L.L.负责样本的收集和准备。Y.M.、J.Y.、X.Z.和林艳贡献了生物信息学和统计分析。C.王.、L.P.、罗艳.和 A.L.负责动物实验。J.Y.、Q.C.、Y.M.和 Y.C.起草了手稿。W.T.、C.吴.、J.W.和 D.L.负责审阅和准备最终手稿。所有作者均批准了最终手稿。
Data availability 数据可用性
The raw sequencing data and processed gene expression matrix of mouse model have been deposited in GSA (Genome Sequence Archive in BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, http://gsa.big.ac.cn) under the accession number CRA002118. The raw sequencing data of human esophageal tissues has been deposited in GSA-Human (https://bigd.big.ac.cn/gsa-human) under the accession number HRA000093. All the other data supporting the findings of this study are available within the article and its supplementary information files and from the corresponding author upon reasonable request. A reporting summary for this article is available as a Supplementary Information file. Source data are provided with this paper.
小鼠模型的原始测序数据和处理后的基因表达矩阵已存放在 GSA(中国科学院北京基因组研究所大数据中心基因序列档案, http://gsa.big.ac.cn )下,存档号为 CRA002118。人类食管组织的原始测序数据已存放在 GSA-Human( https://bigd.big.ac.cn/gsa-human )下,存档号为 HRA000093。本研究的所有其他支持数据均包含在本文及其补充信息文件中,并可应合理要求从相应作者处获得。本文提供了数据报告摘要作为补充信息文件。随文提供了源数据。
Code availability 代码可用性
Example scripts to process and analyze data are available at https://github.com/ESCCemAll/scESCC_mice. Detailed information will be available from the corresponding authors upon reasonable request.
可用于处理和分析数据的示例脚本可在 https://github.com/ESCCemAll/scESCC_mice 处获得。如有合理请求,详细信息将从相应作者处获得。
Competing interests 竞争利益
The authors declare no competing interests.
作者声明无竞争利益。
Footnotes 脚注
Peer review information
Nature Communications thanks Marnix Jansen, Shao Li and the other, anonymous, reviewer for their contribution to the peer review of this work. Peer reviewer reports are available.
同行评审信息 《自然通讯》感谢 Marnix Jansen、邵利和其他匿名评审人对本文同行评审工作的贡献。同行评审报告可供查阅。
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
出版商须知 Springer Nature 对出版地图中的管辖权主张和机构关系保持中立。
These authors contributed equally: Jiacheng Yao, Qionghua Cui, Wenyi Fan, Yuling Ma, Yamei Chen, Tianyuan Liu.
这些作者贡献相当: Yao Jiacheng、Cui Qionghua、Fan Wenyi、Ma Yuling、Chen Yamei、Liu Tianyuan。
Contributor Information 贡献者信息
Dongxin Lin, Email: lindx@cicams.ac.cn.
林东昕, Email: lindx@cicams.ac.cn.
Chen Wu, Email: chenwu@cicams.ac.cn.
吴晨, Email: chenwu@cicams.ac.cn.
Jianbin Wang, Email: jianbinwang@tsinghua.edu.cn.
王 Jianbin,邮箱: jianbinwang@tsinghua.edu.cn。
Supplementary information
补充信息
Supplementary information is available for this paper at 10.1038/s41467-020-17492-y.
补充信息可在 10.1038/s41467-020-17492-y 处获取。
References 参考文献
-
1.Chen XX, et al. Genomic comparison of esophageal squamous cell carcinoma and its precursor lesions by multi-region whole-exome sequencing. Nat. Commun. 2017;8:524. doi: 10.1038/s41467-017-00650-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
1. 陈 XX 等. 多区域全外显子测序比较食管鳞状细胞癌及其前驱病变的基因组. 自然通讯. 2017;8:524. doi: 10.1038/s41467-017-00650-0. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
2.Chang J, et al. Genomic analysis of oesophageal squamous-cell carcinoma identifies alcohol drinking-related mutation signature and genomic alterations. Nat. Commun. 2017;8:15290. doi: 10.1038/ncomms15290. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. 常 J 等. 食管鳞状细胞癌的基因组分析揭示饮酒相关的突变特征和基因组改变. 自然通讯. 2017;8:15290. doi: 10.1038/ncomms15290. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
3.Gao Y, et al. Genetic landscape of esophageal squamous cell carcinoma. Nat. Genet. 2014;46:1097–1102. doi: 10.1038/ng.3076. [DOI] [PubMed] [Google Scholar]
3.高宇等. 食管鳞状细胞癌的遗传景观. 自然基因组学. 2014;46:1097–1102. doi: 10.1038/ng.3076. [ DOI ] [ PubMed ] [ Google Scholar ] -
4.Martincorena I, et al. Somatic mutant clones colonize the human esophagus with age. Science. 2018;362:911–917. doi: 10.1126/science.aau3879. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.马丁科雷纳 I 等. 随着年龄增长,体细胞突变克隆占据人类食管。科学. 2018;362:911–917. doi: 10.1126/science.aau3879. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
5.Navin N, et al. Tumour evolution inferred by single-cell sequencing. Nature. 2011;472:90–94. doi: 10.1038/nature09807. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.纳文 N 等. 肿瘤进化通过单细胞测序推断。自然. 2011;472:90–94. doi: 10.1038/nature09807. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
6.Tang X, et al. Oral cavity and esophageal carcinogenesis modeled in carcinogen-treated mice. Clin. Cancer Res. 2004;10:301–313. doi: 10.1158/1078-0432.ccr-0999-3. [DOI] [PubMed] [Google Scholar]
6.唐 X 等. 化学致癌物处理小鼠的口腔和食管致癌模型。临床癌症研究. 2004;10:301–313. doi: 10.1158/1078-0432.ccr-0999-3. [ DOI ] [ PubMed ] [ Google Scholar ] -
7.Chu J, et al. Metabolic remodeling by TIGAR overexpression is a therapeutic target in esophageal squamous-cell carcinoma. Theranostics. 2020;10:3488–3502. doi: 10.7150/thno.41427. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.朱 J 等. TIGAR 过表达引起的代谢重编程是食管鳞状细胞癌的治疗靶点。诊疗. 2020;10:3488–3502. doi: 10.7150/thno.41427. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
8.Kelly RJ, et al. Impacting tumor cell-fate by targeting the inhibitor of apoptosis protein survivin. Mol. Cancer. 2011;10:35. doi: 10.1186/1476-4598-10-35. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Kelly RJ 等. 靶向凋亡抑制蛋白 survivin 影响肿瘤细胞命运. 分子癌症. 2011;10:35. doi: 10.1186/1476-4598-10-35. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
9.Nitiss JL. DNA topoisomerase II and its growing repertoire of biological functions. Nat. Rev. Cancer. 2009;9:327–337. doi: 10.1038/nrc2608. [DOI] [PMC free article] [PubMed] [Google Scholar]
9. 尼蒂斯 JL. DNA 拓扑异构酶 II 及其不断扩大的生物学功能谱系. 自然评论·癌症. 2009;9:327–337. doi: 10.1038/nrc2608. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
10.Sales Gil R, Vagnarelli P. Ki-67: more hidden behind a ‘classic proliferation marker’. Trends Biochem. Sci. 2018;43:747–748. doi: 10.1016/j.tibs.2018.08.004. [DOI] [PubMed] [Google Scholar]
10. 萨尔斯·吉尔 R, 瓦纳雷利 P. Ki-67:经典增殖标志物背后的更多内涵. 生化与分子生物学趋势. 2018;43:747–748. doi: 10.1016/j.tibs.2018.08.004. [ DOI ] [ PubMed ] [ Google Scholar ] -
11.Xiong, Y. et al. UBE2C functions as a potential oncogene by enhancing cell proliferation, migration, invasion, and drug resistance in hepatocellular carcinoma cells. Biosci. Rep. 39, BSR20182384 (2019). [DOI] [PMC free article] [PubMed]
11.熊, Y. 等. UBE2C 通过增强肝细胞癌细胞的增殖、迁移、侵袭和药物耐药性,在潜在致癌基因中发挥作用. 生物科学报告 39, BSR20182384 (2019). [ DOI ] [ PMC free article ] [ PubMed ] -
12.Bhattacharjee P, et al. Functional compensation of glutathione S-transferase M1 (GSTM1) null by another GST superfamily member, GSTM2. Sci. Rep. 2013;3:2704. doi: 10.1038/srep02704. [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Bhattacharjee P, 等. 通过另一 GST 超家族成员 GSTM2 的功能补偿 GSTM1 缺失。Sci. Rep. 2013;3:2704. doi: 10.1038/srep02704. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
13.Chen CH, et al. Pharmacological recruitment of aldehyde dehydrogenase 3A1 (ALDH3A1) to assist ALDH2 in acetaldehyde and ethanol metabolism in vivo. Proc. Natl Acad. Sci. USA. 2015;112:3074–3079. doi: 10.1073/pnas.1414657112. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. 陈 CH, 等. 通过 ALDH3A1 的药理学招募协助 ALDH2 在体内的乙醛和乙醇代谢。Proc. Natl Acad. Sci. USA. 2015;112:3074–3079. doi: 10.1073/pnas.1414657112. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] -
14.Osier MV, et al. Possible epistatic role of ADH7 in the protection against alcoholism. Am. J. Med. Genet. B Neuropsychiatr. Genet. 2004;126B:19–22. doi: 10.1002/ajmg.b.20136. [DOI] [PubMed] [Google Scholar]
14. Osier MV, 等. ADH7 可能在防止酒精依赖中起表型作用。Am. J. Med. Genet. B Neuropsychiatr. Genet. 2004;126B:19–22. doi: 10.1002/ajmg.b.20136. [ DOI ] [ PubMed ] [ Google Scholar ] -
15.Carr TM, et al. JunB promotes Th17 cell identity and restrains alternative CD4(+) T-cell programs during inflammation. Nat. Commun. 2017;8:301. doi: 10.1038/s41467-017-00380-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Carr TM, 等. JunB 促进 Th17 细胞身份并限制炎症期间的替代 CD4(+) T 细胞程序。Nat. Commun. 2017;8:301. doi: 10.1038/s41467-017-00380-3. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ] - 16.Inoue M, et al. The stress response gene ATF3 is a direct target of the Wnt/beta-catenin pathway and inhibits the invasion and migration of HCT116 human colorectal cancer cells. PLoS ONE. 2018;13:e0194160. doi: 10.1371/journal.pone.0194160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Liu C, et al. Folate receptor alpha is associated with cervical carcinogenesis and regulates cervical cancer cells growth by activating ERK1/2/c-Fos/c-Jun. Biochem. Biophys. Res. Commun. 2017;491:1083–1091. doi: 10.1016/j.bbrc.2017.08.015. [DOI] [PubMed] [Google Scholar]
- 18.Tiedje C, et al. The RNA-binding protein TTP is a global post-transcriptional regulator of feedback control in inflammation. Nucleic Acids Res. 2016;44:7418–7440. doi: 10.1093/nar/gkw474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Nedjadi T, et al. S100A8 and S100A9 proteins form part of a paracrine feedback loop between pancreatic cancer cells and monocytes. BMC Cancer. 2018;18:1255. doi: 10.1186/s12885-018-5161-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sakamoto K, et al. Down-regulation of keratin 4 and keratin 13 expression in oral squamous cell carcinoma and epithelial dysplasia: a clue for histopathogenesis. Histopathology. 2011;58:531–542. doi: 10.1111/j.1365-2559.2011.03759.x. [DOI] [PubMed] [Google Scholar]
- 21.Brooks DL, et al. ITGA6 is directly regulated by hypoxia-inducible factors and enriches for cancer stem cell activity and invasion in metastatic breast cancer models. Mol. Cancer. 2016;15:26. doi: 10.1186/s12943-016-0510-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang R, et al. Golgi membrane protein 1 (GOLM1) promotes growth and metastasis of breast cancer cells via regulating matrix metalloproteinase-13 (MMP13) Med. Sci. Monit. 2019;25:847–855. doi: 10.12659/MSM.911667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Leung DW, et al. Vascular endothelial growth factor is a secreted angiogenic mitogen. Science. 1989;246:1306–1309. doi: 10.1126/science.2479986. [DOI] [PubMed] [Google Scholar]
- 24.Nguyen AT, et al. Organelle specific o-glycosylation drives MMP14 activation, tumor growth, and metastasis. Cancer Cell. 2017;32:639–653 e636. doi: 10.1016/j.ccell.2017.10.001. [DOI] [PubMed] [Google Scholar]
- 25.Wang L, et al. Extracellular matrix protein 1 (ECM1) is over-expressed in malignant epithelial tumors. Cancer Lett. 2003;200:57–67. doi: 10.1016/s0304-3835(03)00350-1. [DOI] [PubMed] [Google Scholar]
- 26.Park YR, et al. MicroRNA-30a-5p (miR-30a) regulates cell motility and EMT by directly targeting oncogenic TM4SF1 in colorectal cancer. J. Cancer Res. Clin. Oncol. 2017;143:1915–1927. doi: 10.1007/s00432-017-2440-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Satelli A, et al. Vimentin in cancer and its potential as a molecular target for cancer therapy. Cell Mol. Life Sci. 2011;68:3033–3046. doi: 10.1007/s00018-011-0735-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Nakabayashi M, et al. PITX1 is a reliable biomarker for predicting prognosis in patients with oral epithelial dysplasia. Oncol. Lett. 2014;7:750–754. doi: 10.3892/ol.2013.1775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kastenhuber ER, et al. Putting p53 in context. Cell. 2017;170:1062–1078. doi: 10.1016/j.cell.2017.08.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lee Y, et al. BCLAF1 is a radiation-induced H2AX-interacting partner involved in γH2AX-mediated regulation of apoptosis and DNA repair. Cell Death Dis. 2012;3:e359. doi: 10.1038/cddis.2012.76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Rodgers JJ, et al. ETS1 induces transforming growth factor beta signaling and promotes epithelial-to-mesenchymal transition in prostate cancer cells. J. Cell Biochem. 2019;120:848–860. doi: 10.1002/jcb.27446. [DOI] [PubMed] [Google Scholar]
- 32.Puisieux A, et al. Oncogenic roles of EMT-inducing transcription factors. Nat. Cell Biol. 2014;16:488–494. doi: 10.1038/ncb2976. [DOI] [PubMed] [Google Scholar]
- 33.Li J, et al. Co-inhibitory molecule B7 superfamily member 1 expressed by tumor-infiltrating myeloid cells induces dysfunction of anti-tumor CD8(+) T cells. Immunity. 2018;48:773–786. doi: 10.1016/j.immuni.2018.03.018. [DOI] [PubMed] [Google Scholar]
- 34.Xiao X, Li B, Mitton B, Ikeda A, Sakamoto KM. Targeting CREB for cancer therapy: friend or foe. Curr. Cancer Drug Targets. 2010;10:384–391. doi: 10.2174/156800910791208535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lee JH, et al. ELK3 promotes the migration and invasion of liver cancer stem cells by targeting HIF-1alpha. Oncol. Rep. 2017;37:813–822. doi: 10.3892/or.2016.5293. [DOI] [PubMed] [Google Scholar]
- 36.Clausen MJ, et al. RAB25 expression is epigenetically downregulated in oral and oropharyngeal squamous cell carcinoma with lymph node metastasis. Epigenetics. 2016;11:653–663. doi: 10.1080/15592294.2016.1205176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ogunwobi OO, et al. Epigenetic upregulation of HGF and c-Met drives metastasis in hepatocellular carcinoma. PLoS ONE. 2013;8:e63765. doi: 10.1371/journal.pone.0063765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Katoh M, et al. Precision medicine for human cancers with Notch signaling dysregulation. Int. J. Mol. Med. 2020;45:279–297. doi: 10.3892/ijmm.2019.4418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Danilo M, et al. Suppression of Tcf1 by inflammatory cytokines facilitates effector CD8 T cell differentiation. Cell Rep. 2018;22:2107–2117. doi: 10.1016/j.celrep.2018.01.072. [DOI] [PubMed] [Google Scholar]
- 40.Forster R, et al. CCR7 and its ligands: balancing immunity and tolerance. Nat. Rev. Immunol. 2008;8:362–371. doi: 10.1038/nri2297. [DOI] [PubMed] [Google Scholar]
- 41.Kaech SM, et al. Selective expression of the interleukin 7 receptor identifies effector CD8 T cells that give rise to long-lived memory cells. Nat. Immunol. 2003;4:1191–1198. doi: 10.1038/ni1009. [DOI] [PubMed] [Google Scholar]
- 42.Barber DL, et al. Restoring function in exhausted CD8 T cells during chronic viral infection. Nature. 2006;439:682–687. doi: 10.1038/nature04444. [DOI] [PubMed] [Google Scholar]
- 43.Blackburn SD, et al. Coregulation of CD8+ T cell exhaustion by multiple inhibitory receptors during chronic viral infection. Nat. Immunol. 2009;10:29–37. doi: 10.1038/ni.1679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kim TD, et al. Human microRNA-27a* targets Prf1 and GzmB expression to regulate NK-cell cytotoxicity. Blood. 2011;118:5476–5486. doi: 10.1182/blood-2011-04-347526. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zhang L, et al. Lineage tracking reveals dynamic relationships of T cells in colorectal cancer. Nature. 2018;564:268–272. doi: 10.1038/s41586-018-0694-x. [DOI] [PubMed] [Google Scholar]
- 46.Annunziato F, et al. The 3 major types of innate and adaptive cell-mediated effector immunity. J. Allergy Clin. Immunol. 2015;135:626–635. doi: 10.1016/j.jaci.2014.11.001. [DOI] [PubMed] [Google Scholar]
- 47.Narayana SK, et al. The interferon-induced transmembrane proteins, IFITM1, IFITM2, and IFITM3 inhibit hepatitis C virus entry. J. Biol. Chem. 2015;290:25946–25959. doi: 10.1074/jbc.M115.657346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Etesam Z, et al. Altered expression of specific transcription factors of Th17 (RORγt, RORα) and Treg lymphocytes (FOXP3) by peripheral blood mononuclear cells from patients with multiple sclerosis. J. Mol. Neurosci. 2016;60:94–101. doi: 10.1007/s12031-016-0789-5. [DOI] [PubMed] [Google Scholar]
- 49.Carroll MC. The complement system in regulation of adaptive immunity. Nat. Immunol. 2004;5:981–986. doi: 10.1038/ni1113. [DOI] [PubMed] [Google Scholar]
- 50.Stables MJ, et al. Transcriptomic analyses of murine resolution-phase macrophages. Blood. 2011;118:e192–e208. doi: 10.1182/blood-2011-04-345330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Arlauckas SP, et al. Arg1 expression defines immunosuppressive subsets of tumor-associated macrophages. Theranostics. 2018;8:5842–5854. doi: 10.7150/thno.26888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Iannone R, et al. Blockade of A2b adenosine receptor reduces tumor growth and immune suppression mediated by myeloid-derived suppressor cells in a mouse model of melanoma. Neoplasia. 2013;15:1400–1409. doi: 10.1593/neo.131748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Mittal SK, et al. Suppression of antigen presentation by IL-10. Curr. Opin. Immunol. 2015;34:22–27. doi: 10.1016/j.coi.2014.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pelletier M, et al. Evidence for a cross-talk between human neutrophils and Th17 cells. Blood. 2010;115:335–343. doi: 10.1182/blood-2009-04-216085. [DOI] [PubMed] [Google Scholar]
- 55.Abnet CC, et al. Epidemiology of esophageal squamous cell carcinoma. Gastroenterology. 2018;154:360–373. doi: 10.1053/j.gastro.2017.08.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zhang P, et al. Dissecting the single-cell transcriptome network underlying gastric premalignant lesions and early gastric cancer. Cell Rep. 2019;27:1934–1947. doi: 10.1016/j.celrep.2019.04.052. [DOI] [PubMed] [Google Scholar]
- 57.Bernard V, et al. Single-cell transcriptomics of pancreatic cancer precursors demonstrates epithelial and microenvironmental heterogeneity as an early event in neoplastic progression. Clin. Cancer Res. 2019;25:2194–2205. doi: 10.1158/1078-0432.CCR-18-1955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Dawsey SM, et al. Squamous esophageal histology and subsequent risk of squamous cell carcinoma of the esophagus. A prospective follow-up study from Linxian, China. Cancer. 1994;74:1686–1692. doi: 10.1002/1097-0142(19940915)74:6<1686::aid-cncr2820740608>3.0.co;2-v. [DOI] [PubMed] [Google Scholar]
- 59.Chen J, et al. Spatial transcriptomic analysis of cryosectioned tissue samples with Geo-seq. Nat. Protoc. 2017;12:566–580. doi: 10.1038/nprot.2017.003. [DOI] [PubMed] [Google Scholar]
- 60.Kim D, et al. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods. 2015;12:357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Love MI, et al. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Newman AM, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods. 2015;12:453–457. doi: 10.1038/nmeth.3337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Butler A, et al. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 2018;36:411–420. doi: 10.1038/nbt.4096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Subramanian A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Hanzelmann S, et al. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7. doi: 10.1186/1471-2105-14-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Ritchie ME, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Aibar S, et al. SCENIC: single-cell regulatory network inference and clustering. Nat. Methods. 2017;14:1083–1086. doi: 10.1038/nmeth.4463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Trapnell C, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 2014;32:381–386. doi: 10.1038/nbt.2859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Vento-Tormo R, et al. Single-cell reconstruction of the early maternal–fetal interface in humans. Nature. 2018;563:347–353. doi: 10.1038/s41586-018-0698-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.R Development Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, 2018).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequencing data and processed gene expression matrix of mouse model have been deposited in GSA (Genome Sequence Archive in BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, http://gsa.big.ac.cn) under the accession number CRA002118. The raw sequencing data of human esophageal tissues has been deposited in GSA-Human (https://bigd.big.ac.cn/gsa-human) under the accession number HRA000093. All the other data supporting the findings of this study are available within the article and its supplementary information files and from the corresponding author upon reasonable request. A reporting summary for this article is available as a Supplementary Information file. Source data are provided with this paper.
Example scripts to process and analyze data are available at https://github.com/ESCCemAll/scESCC_mice. Detailed information will be available from the corresponding authors upon reasonable request.







