
Systematic identification of the key candidate genes in breast cancer stroma



Tumor microenvironment, in particular the stroma, plays an important role in breast cancer cell invasion and metastasis. Investigation of the molecular characteristics of breast cancer stroma may reveal targets for future study.


The transcriptome profiles of breast cancer stroma and normal breast stroma were compared to identify differentially expressed genes (DEGs) using the GSE26910 and GSE10797 datasets. Common DEGs were identified, and analyses of enriched pathways and hub genes were then performed.


A total of 146 DEGs were common to GSE26910 and GSE10797. The enriched pathways were associated with “extracellular matrix (ECM) organization”, “ECM-receptor interaction” and “focal adhesion”. Network analysis identified six key genes: JUN, FOS, ATF3, STAT1, COL1A1 and FN1. Notably, COL1A1 and FN1 were identified for the first time as cancer stromal key genes associated with breast cancer invasion and metastasis. Oncomine analysis showed that high expression levels of COL1A1 and FN1 correlated with an advanced stage of breast cancer and poor clinical outcomes.


We found that several conserved tumor stromal genes might regulate breast cancer invasion through ECM remodeling. The clinical outcome analyses of COL1A1 and FN1 suggest these two genes are promising targets for future studies.


Breast cancer develops from primary solid tumors that invade locally. Subsequently, distant metastasis occurs. It is this distant impact that causes more than 90% of cancer-related deaths [1].

The normal mammary gland is composed of a central lumen, a well-defined layer of epithelial cells, a layer of contractile myoepithelial cells, the basement membrane and the interstitial matrix or stroma, which consists of randomly organized fibrillar collagen [2]. As the tumor microenvironment is known to affect tumor invasion and metastasis, it is important to consider how these elements interact with tumors. Within the tumor microenvironment, the stroma, which contains fibroblasts and immune cells, is often altered as cancer progresses. Alterations include the activation of fibroblasts, the remodeling of the extracellular matrix (ECM) and angiogenesis [3, 4].

The ECM regulates breast development and differentiation. It also provides structural support for the cells and mediates epithelial–stromal communication. It undergoes constant remodeling during normal mammary development, and an imbalance in this process causes ECM dysregulation and disorganization, which in turn contributes to breast cancer [2]. In particular, the balance between the deposition of ECM components and their concomitant degradation by matrix metalloproteinases (MMPs) is closely associated with tumor growth and invasion [5]. Studies of ECM dynamics show that deposition and cross-linking of collagen lead to increased collagen fiber linearization and thickening. The orientation of the collagen fibers is also profoundly altered: they align perpendicular to the tumor boundary, forming migration tracks for invasive tumor cells to exit the tumor tissue and enter the bloodstream [2].

During this process, stromal composition is also altered. Cancer-associated fibroblasts and immune cells appear and participate in the activation of local macrophages, fibroblasts and endothelial cells and the recruitment of a variety of leukocyte subsets [6, 7]. The activated macrophages and recruited leukocytes secrete their own repertoire of cytokines, chemokines and proteolytic enzymes, and then ECM degradation occurs.

The whole process facilitates the infiltration of invasive tumor cells through the basement membrane ECM into the surrounding breast stroma from the established migration tracks [8, 9]. Therefore, alterations in the activation of fibroblasts, remodeling of the extracellular matrix (ECM) and angiogenesis in cancer stroma can be considered as parts of the tumor invasion processes. In addition, the molecular markers regulating these cancer stromal alteration processes are important.

ECM deposition and degradation also play critical roles in breast cancer and its metastasis. ECM component degradation is primarily induced by MMPs, which are classified in six groups: collagenases, gelatinases, stromelysins, matrilysins, membrane-type MMPs, and other non-classified MMPs [10]. Importantly, the members of the MMP family are significantly associated with tumor progression. For example, MMP-2 and MMP-9 regulate the degradation of collagen type IV, a major element of the basement membrane [11]. MMP-1, MMP-2, MMP-9 and MMP-14 promote breast cancer growth and metastasis [12,13,14,15]. Some MMPs, including MMP-1, MMP-7 and MMP-12, have also been reported as markers associated with poor prognosis in breast cancer [16].

Lysyl oxidase (LOX), an amine oxidase enzyme, catalyzes cross-linking between collagens and elastins in the ECM. It increases ECM stiffness, thereby promoting invasive behavior in breast cancer cells [11]. Experiments in vitro showed that LOX overexpression promotes breast cancer invasion, metastasis and epithelial-to-mesenchymal transition (EMT) [17]. Silencing LOX suppresses breast cancer metastasis [18].

Besides MMPs and LOX, other ECM-modifying enzymes, such as the urokinase plasminogen activator (uPA) system and cathepsins, are also associated with breast cancer and metastatic progression. Additionally, matricellular proteins, a group of ECM glycoproteins, are greatly upregulated during tissue remodeling, e.g., during mammary gland involution and in pathological conditions such as cancer [19]. Although these matrix proteins may have a limited effect on the mechanical structure of the ECM, they are important cell regulators and modulators of signaling pathways [20]. Proteins of the matricellular family include tenascin C (TNC), osteopontin/secreted phosphoprotein 1 (SPP1), periostin (POSTN), secreted protein acidic and rich in cysteine (SPARC) and thrombospondin (THBS). They play important roles in breast cancer and metastasis.

The roles of proteins related to ECM remodeling and cell invasion have been studied well. However, the context-dependent functions of these genes and pathways in the tumor microenvironment are unclear.

The stromal tissue within the tumor microenvironment includes fibroblasts, adipocytes, and blood and lymph vessels. Alterations in these components influence tumor progression [9]. Here, we investigated the molecular characteristics of the tumor stroma, and identified the biomarker candidates related to breast cancer progression and metastasis. We discuss their potential roles in early tumor detection and tumor therapy. We analyzed the GSE26910 and GSE10797 datasets and compared the transcriptome profiles of stroma surrounding invasive breast primary tumors and normal breast stroma. We then overlapped the differentially expressed genes (DEGs) from the comparisons to identify commonality. The common DEGs were functionally enriched, and a protein–protein interaction (PPI) network was constructed to identify the key cancer stroma genes. Our results reveal the dysregulated stromal genes in invasive breast cancer and may provide novel targets for therapy against breast cancer invasion and metastasis.

Materials and methods

Derivation of genetic data

The gene expression profile datasets GSE26910 and GSE10797 were downloaded from the Gene Expression Omnibus (GEO) database, a repository of gene expression data from high-throughput hybridization arrays, chips and microarrays. GSE26910 contains six samples of stroma surrounding invasive breast primary tumors and six matched samples of normal stroma obtained using the GPL570 platform (Affymetrix Human Genome U133 Plus 2.0 Array). GSE10797 contains 28 samples of invasive breast cancer stroma and five samples of normal stroma obtained using the GPL571 platform (Affymetrix Human Genome U133A 2.0 Array).

Data processing

Series matrix files (SMFs) and an annotation SOFT table were collected for GSE26910 and GSE10797. The expression values of the genes in each sample were extracted from the SMFs. R 3.2.2 was used to pre-process the downloaded raw data via background correction and quantile normalization. Using Perl [21], probes were mapped to gene names. For genes corresponding to more than one probe, gene expression levels were determined using the average of the probe values. The “impute” package [22] was applied to fill in missing expression values from adjacent values.
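
The probe-averaging step described above can be illustrated with a minimal sketch. It is written in Python purely for illustration (the study itself used Perl and R), and the probe identifiers, values and the function name `collapse_probes` are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def collapse_probes(probe_values, probe_to_gene):
    """Average the expression vectors of all probes that map to the same gene."""
    grouped = defaultdict(list)
    for probe, values in probe_values.items():
        gene = probe_to_gene.get(probe)
        if gene:  # probes without a gene annotation are skipped
            grouped[gene].append(values)
    # zip(*rows) walks the samples column by column, so each gene gets
    # one averaged value per sample
    return {gene: [mean(col) for col in zip(*rows)] for gene, rows in grouped.items()}


# Hypothetical probe IDs and expression values (two samples), for illustration only
probe_values = {
    "probe_a": [8.0, 9.0],
    "probe_b": [10.0, 11.0],
    "probe_c": [5.0, 6.0],
    "probe_d": [1.0, 1.0],  # has no annotation below, so it is dropped
}
probe_to_gene = {"probe_a": "COL1A1", "probe_b": "COL1A1", "probe_c": "FN1"}

gene_values = collapse_probes(probe_values, probe_to_gene)
```

In this toy input, the two probes assigned to COL1A1 are averaged sample by sample, while the unannotated probe is discarded.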

Screening of DEGs

We performed two comparisons: one between breast cancer stroma and normal stroma from GSE26910, and the other between the same tissue types from GSE10797. The limma package [23] was used to screen DEGs with p < 0.05 and |log2(fold change)| ≥ 1. The DEGs were grouped into upregulated and downregulated genes, and cluster software was used to cluster the samples based on their gene expression values. In addition, the samples were subtyped based on the clustering analysis, and heat maps of gene expression values across samples were generated. A volcano plot displaying the log-fold change against the –log10(p-value) was also generated using all the genes that differed in invasive breast cancer stroma. All DEGs from the GSE26910 dataset were matched with the DEGs in the GSE10797 dataset using the DIDS algorithm to identify the common DEGs that were consistently downregulated or upregulated in the two microarray datasets.
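
As a rough illustration of the screening logic, the sketch below applies the two thresholds stated above and then keeps genes that change in the same direction in both datasets. It is a plain threshold filter and dictionary intersection in Python, not the limma model or the DIDS algorithm used in the study; the function names and all numbers are made up.

```python
def screen_degs(stats, p_cut=0.05, lfc_cut=1.0):
    """Return {gene: 'up' | 'down'} for genes passing both thresholds.

    stats maps gene -> (log2 fold change, p-value).
    """
    degs = {}
    for gene, (log_fc, p) in stats.items():
        if p < p_cut and abs(log_fc) >= lfc_cut:
            degs[gene] = "up" if log_fc > 0 else "down"
    return degs


def common_degs(degs_a, degs_b):
    """Keep genes that are DEGs in both datasets with the same direction."""
    return {gene: d for gene, d in degs_a.items() if degs_b.get(gene) == d}


# Made-up statistics for two hypothetical datasets
set_a = {
    "COL1A1": (2.1, 0.001),
    "FN1": (1.4, 0.01),
    "JUN": (-1.2, 0.03),
    "GENE_X": (0.4, 0.001),  # significant, but fold change too small
}
set_b = {
    "COL1A1": (1.6, 0.02),
    "FN1": (1.1, 0.04),
    "JUN": (1.3, 0.01),      # opposite direction to set_a
    "GENE_X": (1.5, 0.2),    # large fold change, but not significant
}

shared = common_degs(screen_degs(set_a), screen_degs(set_b))
```

Requiring the same direction in both datasets is what excludes a gene such as the toy JUN entry here, which passes both thresholds in each dataset but with opposite signs.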

Functional annotation and pathway enrichment analysis

To functionally annotate the common DEGs, R packages, such as GOstats and clusterProfiler [24], were used to analyze the significantly enriched Gene Ontology (GO) biological processes and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (p < 0.05).
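
Over-representation analyses of this kind are commonly based on the hypergeometric test. The sketch below shows that calculation in Python under stated assumptions: the function name and the toy counts are hypothetical, and it does not reproduce the exact settings of GOstats or clusterProfiler.

```python
from math import comb


def hypergeom_pvalue(k, n, K, N):
    """Upper-tail hypergeometric p-value P(X >= k).

    N: genes in the background, K: background genes annotated to the term,
    n: genes in the query list, k: query genes annotated to the term.
    """
    upper = min(n, K)
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total


# Toy numbers only: 20 background genes, 5 in the term, 4 query genes, 2 overlap
p_value = hypergeom_pvalue(k=2, n=4, K=5, N=20)
```

A term is then reported as enriched when this p-value falls below the chosen cutoff (p < 0.05 in the text above).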

Protein–protein interaction network construction

PPI data from HPRD [25], BIOGRID [26] and PIP [27] were downloaded to extract 562,252 pairwise interactions. The common DEGs identified above were directly mapped to the PPI databases to acquire significant PPI pairs from a range of sources, including data from experimental studies and data retrieved by text mining and homology searches. Cytoscape 3.2.1 [28] was used to construct the interaction network by importing the interacting gene pairs found in our curated PPI database. Highly connected nodes could then be identified based on the degree of each node.
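
The degree-based ranking of nodes can be sketched as follows. This is a minimal Python illustration rather than the Cytoscape workflow; the edge list is invented for the example and is not the network reported in this study, and the degree cutoff of 2 is an arbitrary assumption.

```python
from collections import Counter


def node_degrees(edges):
    """Count distinct interaction partners per gene in an undirected network."""
    # frozenset makes (A, B) and (B, A) the same edge; self-loops are dropped
    unique = {frozenset(edge) for edge in edges if edge[0] != edge[1]}
    degree = Counter()
    for pair in unique:
        for gene in pair:
            degree[gene] += 1
    return degree


# Invented edges, for illustration only
edges = [
    ("JUN", "FOS"),
    ("FOS", "JUN"),      # duplicate of the first edge
    ("JUN", "ATF3"),
    ("JUN", "STAT1"),
    ("COL1A1", "FN1"),
    ("FN1", "FN1"),      # self-loop, ignored
]

degree = node_degrees(edges)
hubs = [gene for gene, d in degree.most_common() if d >= 2]
```

Genes with the highest degree are the candidates for key nodes; in practice the cutoff would be chosen after inspecting the degree distribution of the real network.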

Oncomine analysis

The expression levels of selected genes in invasive breast cancer were analyzed using Oncomine gene expression array datasets. Oncomine is an online cancer microarray database designed to facilitate discovery from genome-wide expression analyses. In this study, associations of mRNA expression levels with invasive breast tumor presence, advanced stage, recurrence and metastatic events were analyzed using Student’s t-test to generate the p-value. The significance threshold was set at p = 0.05 and the fold-change threshold at 1.5. In many instances, several significant correlations were found for each clinical event, but only one or two representative examples are shown.
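
For readers unfamiliar with the two statistics named above, the following Python sketch computes a pooled-variance Student's t statistic and a fold change for two groups. It assumes the expression values are on a log2 scale, uses invented numbers, and stops short of the p-value, which requires the t distribution (for example via `scipy.stats.ttest_ind`). Oncomine performs these calculations internally, so this is not the tool's implementation.

```python
from math import sqrt
from statistics import mean, variance


def student_t(group_a, group_b):
    """Two-sample Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(group_a), len(group_b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / df
    t = (mean(group_a) - mean(group_b)) / sqrt(pooled * (1 / na + 1 / nb))
    return t, df


def fold_change(log2_a, log2_b):
    """Linear fold change from two groups of log2 expression values (assumption)."""
    return 2 ** (mean(log2_a) - mean(log2_b))


# Invented log2 expression values for one gene
tumor = [3.0, 4.0, 5.0]
normal = [1.0, 2.0, 3.0]

t_stat, dof = student_t(tumor, normal)
fc = fold_change(tumor, normal)
passes_fc = fc >= 1.5  # fold-change threshold stated in the text
```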


Differentially expressed genes between stroma surrounding invasive breast primary tumors and normal breast stroma cells

The GSE26910 and GSE10797 datasets were downloaded and analyzed. First, we analyzed the gene expression profiles for stroma surrounding invasive breast primary tumors and for normal breast stroma cells. A total of 3708 DEGs were identified from the GSE26910 dataset (Fig. 1) and 665 DEGs from the GSE10797 dataset (Fig. 2). Second, to identify common genes between these two datasets, we overlapped the DEGs from the two comparisons, revealing 146 common DEGs with consistent changes (Table 1).

Fig. 1

Heat map and volcano plot for differentially expressed genes (DEGs) in breast cancer stroma cells compared with normal breast stroma from GSE26910. a – Volcano plot of DEGs in breast cancer stroma cells from GSE26910. x-axis: logFC (points far from zero indicate large-magnitude fold changes); y-axis: -log10 of p-value (higher values indicate greater statistical significance). Red and green points: |log2(fold change)| ≥ 1 and p value < 0.05. Black points: |log2(fold change)| < 1 or p value > 0.05. b – Dendrogram of DEGs identified via cluster analysis. Each column represents one sample. The green header indicates normal stromal cells and red indicates breast cancer stromal cells. Each row represents one gene, with red representing high relative expression, and green representing low relative expression

Fig. 2

Heat map and volcano plot for differentially expressed genes (DEGs) in breast cancer stroma cells compared with normal breast stroma from GSE10797. a – Volcano plot of DEGs in breast cancer stroma cells from GSE10797. x-axis: logFC (points far from zero indicate large-magnitude fold changes); y-axis: -log10 of p-value (higher values indicate greater statistical significance). Red and green points: |log2(fold change)| ≥ 1 and p value < 0.05. Black points: |log2(fold change)| < 1 or p value > 0.05. b – Dendrogram of DEGs identified via cluster analysis. Each column represents one sample. The green header indicates normal stromal cells and red indicates breast cancer stromal cells. Each row represents one gene, with red representing high relative expression, and green representing low relative expression

Table 1 The common genes altered in both the GSE26910 and GSE10797 datasets

MMP1, a member of the MMP family that induces ECM degradation, was highly expressed in breast cancer stroma. POSTN and SPARC, which encode periostin and secreted protein acidic and rich in cysteine, respectively, are members of the matricellular family and were also upregulated. These findings suggest that the dysregulated genes in invasive breast cancer stroma are involved in encoding ECM components.

Gene ontologies for biological processes and KEGG pathway enrichment analysis

To investigate the pathways enriched among the common genes, the GO and KEGG databases were used for the pathway analyses. In the GO biological process analysis, 870 processes were enriched (Table 2). “Extracellular matrix organization” was the top process, enriched with 24 genes. These genes included POSTN and SPARC as well as COL1A1, COL5A1, COL5A2, COL10A1 and COL11A1, which encode collagens, the most abundant proteins in the ECM [29]. The significantly enriched KEGG pathways included “ECM-receptor interaction” (Table 2). Some of the genes involved in this pathway encode ECM remodeling-related proteins, such as collagens (COL1A1, COL5A1, COL5A2, and COL11A1), integrin (ITGA6), tenascin (TNXB) and fibronectin (FN1). In addition, the “Focal adhesion” pathway, which is related to the ECM ontologies, was also significantly enriched.

Table 2 The significant GO biological processes and KEGG pathways enriched by the common genes identified in breast cancer stromal cells

Identification of key genes associated with breast cancer stroma alteration

To identify key genes involved in breast cancer stroma alteration, a network analysis of the 146 common genes was conducted using PPI datasets. In protein interaction networks (PINs), proteins are represented as nodes and their interactions as edges [30]. The resulting biological network contained 66 pairwise interactions (Fig. 3). Based on the degree of each node in the network, we further identified several key genes with many interactions, including JUN, ATF3, COL1A1, FOS, STAT1 and FN1.

Fig. 3

Protein–protein interaction networks of common differentially expressed genes (DEGs) consistently up- or downregulated in both comparisons

JUN/c-jun is a component of the transcription factor activator protein 1 (AP-1) complex and is phosphorylated by jun N-terminal kinase 1 (JNK1) [31]. c-jun also correlates with MMP-9, which degrades the ECM. Overexpression of c-jun has been reported to induce MMP-9 protein expression and activity, and elevated MMP-9 expression increases the number of invasive cells [32]. In breast cancer, activated c-jun is predominantly expressed at the invasive front and plays a role in proliferation and angiogenesis [31].

ATF3 (activating transcription factor 3), a stress response gene, has been reported to affect MMP-13 transcription. Knockdown of ATF3 expression in breast cancer cells decreases cell migration [33, 34].

FOS/c-Fos is a member of the Fos family of AP-1 transcription factors. As with c-jun, high c-Fos protein levels promote high MMP-9 expression, whereas high FosB levels significantly correlate with MMP-1 overexpression [35]. c-Fos also enhances the invasion of the breast cancer cell line MCF7.

STAT1 is the first member of the family of signal transducers and activators of transcription (STATs). Knockdown of STAT1 in cancer-associated fibroblasts co-cultured with human breast cancer cells altered cancer cell proliferation and delayed early breast cancer progression. This indicates a role for STAT1 as a stromal contributor to tumorigenesis [36].

Collagen 1A1 (COL1A1) belongs to the collagen family, members of which are major components of the tumor-stromal environment with important roles in cancer cell behavior [37]. Fibronectin 1 (FN1) is a heterodimeric glycoprotein found at the cell surface and in the ECM, and is associated with cell adhesion, cell migration, wound healing and metastasis [38].

In conclusion, JUN, FOS, ATF3 and STAT1 have been reported to be associated with breast cancer invasion and directly or indirectly participate in ECM remodeling. Of note, the roles of COL1A1 and FN1 related to breast tumor invasion are still unclear.

COL1A1 and FN1 in human breast tumors

We used Oncomine to investigate the expression of COL1A1 and FN1 in relation to human breast tumor invasion and metastasis. COL1A1 expression was significantly increased in breast cancer compared with normal breast tissue (Fig. 4a). Furthermore, we observed a significant increase in the COL1A1 expression level in high-grade and advanced N1+ stage tumors compared with the control group (Fig. 4b). Subsequently, we studied the association between the COL1A1 expression level and clinical outcome. A high expression level of COL1A1 was observed in breast cancer with recurrence or a metastatic event after 1 year (Fig. 4c).

Fig. 4

Validation of COL1A1 expression in breast cancer using Oncomine datasets. a – High COL1A1 expression was observed in breast cancer compared with normal breast tissue. b – High COL1A1 expression was observed in high-grade or advanced-stage breast cancer compared with low-grade or –stage breast cancer. c – High COL1A1 expression was observed in breast cancer with recurrence or a metastatic event at 1 year

Similarly to COL1A1, FN1 was also highly expressed in breast cancer (Fig. 5a) and the advanced N1+ stage of breast cancer (Fig. 5b) compared with their control groups. An elevated FN1 level in breast cancer patients with recurrence or a metastatic event was observed in comparison with controls (Fig. 5c).

Fig. 5

Validation of FN1 expression in breast cancer using Oncomine datasets. a – High FN1 expression was observed in breast cancer compared with normal breast tissue. b – High FN1 expression was observed in high-grade or advanced-stage breast cancer. c – High FN1 expression was observed in breast cancer with recurrence or a metastatic event at 3 years

These high expression levels of COL1A1 and FN1 correlated with late-stage breast cancer and poor clinical outcomes.


Tumor angiogenesis and destruction of the extracellular matrix (ECM) are two essential factors for tumor invasion and metastasis. Previous studies concentrated on the tumor characteristics but there has been a lack of study of the tumor microenvironment. Genes altered in the tumor microenvironment may play important roles in the destruction process and be further involved in tumor progression.

To contribute to the understanding of tumor stroma roles in breast progression, we compared transcriptome profiles between cancer stroma (stroma surrounding invasive breast primary tumors) and normal breast stroma. A total of 3708 differentially expressed genes (DEGs) were identified in the GSE26910 dataset, whereas 665 DEGs were found in GSE10797. By overlapping the sets, 146 common DEGs were found.

Among the common DEGs, genes encoding matrix components were significantly dysregulated, including MMP1, POSTN, SPARC, COL1A1, COL5A1, COL5A2, COL10A1 and COL11A1. These altered genes were significantly associated with the pathways of “extracellular matrix organization”, “ECM-receptor interaction” and “focal adhesion”. Our findings indicated that these genes and pathways are related to the establishment of the tumor microenvironment.

The degradation of the basement membrane and the ECM results from proteolysis. Most members of the proteolytic system are regulated by AP-1 transcription factors through binding to TPA-responsive elements (TREs) in the promoter or enhancer regions of the target genes [35]. The AP-1 factor is a heterodimer consisting of proteins belonging to the Fos, Jun, ATF and JDP families [39]. In our PPI network analysis of the common genes, JUN, FOS and ATF3 were recognized as key nodes. This suggests that they may play a critical role in the tumor microenvironment via the AP-1 factor.

JUN/c-Jun is a member of the Jun family, FOS/c-Fos is a member of the Fos family and ATF3 belongs to the ATF family. AP-1 regulates several cytological processes, including differentiation, cell death, proliferation, oncogenic transformation, apoptosis and cell migration [40]. Due to its constitutive activation and the diverse responses it induces, AP-1 plays a crucial role in promoting tumor invasion and migration [41].

In our study, JUN, FOS and ATF3 were involved in 12 of the top 50 significant biological processes. These included “negative regulation of cellular process” and “negative regulation of biological process”.

Besides JUN, FOS and ATF3, the genes STAT1, COL1A1 and FN1 were also identified as key nodes. STAT1 mediates signaling by both type I (alpha and beta) and type II (gamma) interferons (IFNs), which are associated with cell growth regulation and with antiviral and immune defense [36]. Stromal STAT1 expression promotes tumor progression in breast cancer. COL1A1 and FN1 are ECM genes that have been reported to be associated with tumor invasion and metastasis. Increased extracellular levels of COL1A1 promote tumor cell invasiveness in culture and metastasis in animal models [37]. Furthermore, a high level of COL1A1 increases the likelihood of clinical metastasis of multiple human solid tumors [42, 43]. Elevated expression of FN1 promotes lymph node metastasis in human oral squamous cell carcinoma by promoting vascular endothelial growth factor-C expression and the epithelial–mesenchymal transition (EMT) [38]. However, to our knowledge, no study on the associations of COL1A1 and FN1 with breast cancer has been reported. Therefore, we performed an Oncomine analysis to investigate their expression in breast cancer. High expression levels of COL1A1 and FN1 were associated with high-grade or advanced-stage breast cancer and poor prognosis.

In summary, breast cancer stroma-related genes, including JUN, FOS, ATF3, STAT1, COL1A1 and FN1, were all associated with tumor invasion and metastasis. This reveals that cancer stroma-related genes have important roles in tumor progression.

Despite the improvement of tumor detection tools (including imaging techniques like MRI and PET; and analysis of blood biomarkers shed by the tumor, such as proteins and cell free nucleic acids) and tumor therapies, there are still limitations. Clinical detection of tumors is limited to masses 1 cm in diameter. Resistance to therapy has been one of the major challenges in treating cancer [44].

The tumor microenvironment has recently been considered a prospective breakthrough point in early tumor detection and tumor treatment. Based on two hallmarks of the tumor microenvironment, acidity and low oxygenation, a macromolecular near-infrared poly(ethylene glycol)-conjugated iridium(III) complex was designed to respond successively to acidity and hypoxia while amplifying detection sensitivity via signal propagation. Primary tumors and metastatic tumor nodules as small as 1 mm in mice have been successfully detected [45]. In addition, tumor microenvironment biomarkers contribute to tumor detection and treatment. MMP-9, a major regulator inducing ECM component degradation, is highly expressed in many human cancer types compared with normal controls. Therefore, an exogenously administered tumor-penetrating nanosensor with an MMP-9 peptide substrate on its surface has been created [44]. Importantly, this nanosensor is reported to detect nodules with median diameters smaller than 2 mm in an orthotopic model of ovarian cancer. In terms of drug resistance, different components of the microenvironment have been found to participate in the development of chemoresistance and inhibit MMPs, although chemokine signaling is effective when combined with traditional chemotherapies [46].

In this study, COL1A1 and FN1 were both ECM genes with elevated mRNA levels in tumor stromal cells. Therefore, they might be used as key candidate genes for tumor detection and treatment.

The strength of our study is that we combined gene expression data for stroma surrounding invasive breast primary tumors and normal stroma from two GEO datasets and obtained conserved genes. It is known that as breast tumors progress from ductal carcinoma in situ (DCIS) to invasive ductal carcinoma (IDC), the ECM undergoes increased collagen fiber linearization and thickening due to deposition and cross-linking of the collagen. In IDC, the collagen fibers are aligned perpendicularly to the tumor boundary, forming migration tracks for invasive tumor cells to exit the tumor tissue and enter the bloodstream [2]. Thus, the alteration of stroma composition from DCIS to IDC is important for tumor invasion. Further studies may focus on the gene expression of cancer stroma in this process.

Our study also has several limitations. First, the analysis of additional similar datasets or larger datasets may be necessary to identify the conserved key genes. Second, experimental validation of the expression of these conserved genes is needed. Third, the roles of these critical genes should be investigated in the future.


Our results show several key genes in breast cancer stroma, including JUN, FOS, ATF3, STAT1, COL1A1 and FN1. These genes were involved in “extracellular matrix organization”, “ECM-receptor interaction” and “focal adhesion”. Oncomine analyses of COL1A1 and FN1 in different human breast cancer datasets suggest that these two genes may be promising targets for future studies.



BH: Benjamini and Hochberg’s

DEGs: Differentially expressed genes

ECM: Extracellular matrix

FDR: False discovery rate

GEO: Gene Expression Omnibus

GO: Gene ontology

HPRD: Human Protein Reference Database

KEGG: Kyoto Encyclopedia of Genes and Genomes

PIP: Protein-protein interaction prediction database

PPI: Protein–protein interaction

  1. 1.

    Chaffer CL, Weinberg RA. A perspective on cancer cell metastasis. Science. 2011;331(6024):1559–64.

    CAS  Article  Google Scholar 

  2. 2.

    Kaushik S, Pickup MW, Weaver VM. From transformation to metastasis: deconstructing the extracellular matrix in breast cancer. Cancer Metastasis Rev. 2016;35(4):655–67.

    CAS  Article  Google Scholar 

  3. 3.

    Noel A, Foidart JM. The role of stroma in breast carcinoma growth in vivo. J Mammary Gland Biol Neoplasia. 1998;3(2):215–25.

    CAS  Article  Google Scholar 

  4. 4.

    Tuxhorn JA, McAlhany SJ, Dang TD, Ayala GE, Rowley DR. Stromal cells promote angiogenesis and growth of human prostate tumors in a differential reactive stroma (DRS) xenograft model. Cancer Res. 2002;62(11):3298–307.

    CAS  PubMed  Google Scholar 

  5. 5.

    Fata JE, Werb Z, Bissell MJ. Regulation of mammary gland branching morphogenesis by the extracellular matrix and its remodeling enzymes. Breast Cancer Res. 2004;6(1):1–11.

    CAS  Article  Google Scholar 

  6. 6.

    Bhowmick NA, Chytil A, Plieth D, et al. TGF-beta signaling in fibroblasts modulates the oncogenic potential of adjacent epithelia. Science. 2004;303(5659):848–51.

    CAS  Article  Google Scholar 

  7. 7.

    Bierie B, Moses HL. Tumour microenvironment: TGFbeta: the molecular Jekyll and Hyde of cancer. Nat Rev Cancer. 2006;6(7):506–20.

    CAS  Article  Google Scholar 

  8. 8.

    Planche A, Bacac M, Provero P, et al. Identification of prognostic molecular features in the reactive stroma of human breast and prostate cancer. PLoS One. 2011;6(5):e18640.

    CAS  Article  Google Scholar 

  9. 9.

    Casey T, Bond J, Tighe S, et al. Molecular signatures suggest a major role for stromal cells in development of invasive breast cancer. Breast Cancer Res Treat. 2009;114(1):47–62.

    CAS  Article  Google Scholar 

  10. 10.

    Jablonska-Trypuc A, Matejczyk M, Rosochacki S. Matrix metalloproteinases (MMPs), the main extracellular matrix (ECM) enzymes in collagen degradation, as a target for anticancer drugs. J Enzyme Inhib Med Chem. 2016;31(sup1):177–83.

    CAS  Article  Google Scholar 

  11. 11.

    Oskarsson T. Extracellular matrix components in breast cancer progression and metastasis. Breast. 2013;22(Suppl 2):S66–72.

    Article  Google Scholar 

  12. 12.

    Liu H, Kato Y, Erzinger SA, et al. The role of MMP-1 in breast cancer growth and metastasis to the brain in a xenograft model. BMC Cancer. 2012;12:583.

    CAS  Article  Google Scholar 

  13. 13.

    Mehner C, Hockla A, Miller E, Ran S, Radisky DC, Radisky ES. Tumor cell-produced matrix metalloproteinase 9 (MMP-9) drives malignant progression and metastasis of basal-like triple negative breast cancer. Oncotarget. 2014;5(9):2736–49.

    Article  Google Scholar 

  14. 14.

    Ling B, Watt K, Banerjee S, et al. A novel immunotherapy targeting MMP-14 limits hypoxia, immune suppression and metastasis in triple-negative breast cancer models. Oncotarget. 2017;8(35):58372–85.

    Article  Google Scholar 

  15. 15.

    Pereira IT, Ramos EA, Costa ET, et al. Fibronectin affects transient MMP2 gene expression through DNA demethylation changes in non-invasive breast cancer cell lines. PLoS One. 2014;9(9):e105806.

    Article  Google Scholar 

  16. 16.

    Finak G, Bertos N, Pepin F, et al. Stromal gene expression predicts clinical outcome in breast cancer. Nat Med. 2008;14(5):518–27.

    CAS  Article  Google Scholar 

  17. 17.

    El-Haibi CP, Bell GW, Zhang J, et al. Critical role for lysyl oxidase in mesenchymal stem cell-driven breast cancer malignancy. Proc Natl Acad Sci U S A. 2012;109(43):17460–5.

    CAS  Article  Google Scholar 

  18. 18.

    Liu JL, Wei W, Tang W, et al. Silencing of lysyl oxidase gene expression by RNA interference suppresses metastasis of breast cancer. Asian Pac J Cancer Prev. 2012;13(7):3507–11.

    Article  Google Scholar 

  19. 19.

    Schedin P, O'Brien J, Rudolph M, Stein T, Borges V. Microenvironment of the involuting mammary gland mediates mammary cancer progression. J Mammary Gland Biol Neoplasia. 2007;12(1):71–82.

    Article  Google Scholar 

  20. 20.

    Chong HC, Tan CK, Huang RL, Tan NS. Matricellular proteins: a sticky affair with cancers. J Oncol. 2012;2012:351089.

    Article  Google Scholar 

  21. 21.

    Stajich JE, Block D, Boulez K, et al. The Bioperl toolkit: Perl modules for the life sciences. Genome Res. 2002;12(10):1611–8.

    CAS  Article  Google Scholar 

  22. 22.

    Hastie T, Tibshirani R, Narasimhan B, Chu G. Impute: Imputation for microarray data. Oral History Rev. 2011;1:128–30.

    Google Scholar 

  23. 23.

    Smyth GK. limma: Linear Models for Microarray Data. New York: Springer; 2013. p. 397–420.

    Google Scholar 

  24. 24.

    Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284.

    CAS  Article  Google Scholar 

  25. 25.

    Keshava Prasad TS, Goel R, Kandasamy K, et al. Human protein reference database--2009 update. Nucleic Acids Res. 2009;37(Database):D767–72.

    CAS  Article  Google Scholar 

  26. 26.

    Chatr-Aryamontri A, Breitkreutz BJ, Heinicke S, et al. The BioGRID interaction database: 2013 update. Nucleic Acids Res. 2013;41(Database issue):D816–23.

  27.

    McDowall MD, Scott MS, Barton GJ. PIPs: human protein-protein interaction prediction database. Nucleic Acids Res. 2009;37(Database issue):D651–6.

  28.

    Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011;27(3):431–2.

  29.

    Li QS, Meng FY, Zhao YH, Jin CL, Tian J, Yi XJ. Inhibition of microRNA-214-5p promotes cell survival and extracellular matrix formation by targeting collagen type IV alpha 1 in osteoblastic MC3T3-E1 cells. Bone Joint Res. 2017;6(8):464–71.

  30.

    Habibi I, Emamian ES, Abdi A. Quantitative analysis of intracellular communication and signaling errors in signaling networks. BMC Syst Biol. 2014;8:89.

  31.

    Vleugel MM, Greijer AE, Bos R, van der Wall E, van Diest PJ. c-Jun activation is associated with proliferation and angiogenesis in invasive breast cancer. Hum Pathol. 2006;37(6):668–74.

  32.

    Crowe DL, Tsang KJ, Shemirani B. Jun N-terminal kinase 1 mediates transcriptional induction of matrix metalloproteinase 9 expression. Neoplasia. 2001;3(1):27–32.

  33.

    Chan CM, Macdonald CD, Litherland GJ, et al. Cytokine-induced MMP13 expression in human chondrocytes is dependent on activating transcription factor 3 (ATF3) regulation. J Biol Chem. 2017;292(5):1625–36.

  34.

    Gokulnath M, Partridge NC, Selvamurugan N. Runx2, a target gene for activating transcription factor-3 in human breast cancer cells. Tumour Biol. 2015;36(3):1923–31.

  35.

    Milde-Langosch K, Roder H, Andritzky B, et al. The role of the AP-1 transcription factors c-Fos, FosB, Fra-1 and Fra-2 in the invasion process of mammary carcinomas. Breast Cancer Res Treat. 2004;86(2):139–52.

  36.

    Zellmer VR, Schnepp PM, Fracci SL, Tan X, Howe EN, Zhang S. Tumor-induced stromal STAT1 accelerates breast cancer via deregulating tissue homeostasis. Mol Cancer Res. 2017;15(5):585–97.

  37.

    Willis CM, Kluppel M. Chondroitin sulfate-E is a negative regulator of a pro-tumorigenic Wnt/beta-catenin-collagen 1 axis in breast cancer cells. PLoS One. 2014;9(8):e103966.

  38.

    Wang J, Du Q, Li C. Bioinformatics analysis of gene expression profiles to identify causal genes in luminal B2 breast cancer. Oncol Lett. 2017;14(6):7880–8.

  39.

    Cardozo Gizzi AM, Prucca CG, Gaveglio VL, Renner ML, Pasquare SJ, Caputto BL. The catalytic efficiency of Lipin 1beta increases by physically interacting with the proto-oncoprotein c-Fos. J Biol Chem. 2015;290(49):29578–92.

  40.

    Tewari D, Nabavi SF, Nabavi SM, et al. Targeting activator protein 1 signaling pathway by bioactive natural agents: possible therapeutic strategy for cancer prevention and intervention. Pharmacol Res. 2017;128:366–75.

  41.

    Kamide D, Yamashita T, Araki K, et al. Selective activator protein-1 inhibitor T-5224 prevents lymph node metastasis in an oral cancer model. Cancer Sci. 2016;107(5):666–73.

  42.

    Koenig A, Mueller C, Hasel C, Adler G, Menke A. Collagen type I induces disruption of E-cadherin-mediated cell-cell contacts and promotes proliferation of pancreatic carcinoma cells. Cancer Res. 2006;66(9):4662–71.

  43.

    Shintani Y, Maeda M, Chaika N, Johnson KR, Wheelock MJ. Collagen I promotes epithelial-to-mesenchymal transition in lung cancer cells via transforming growth factor-beta signaling. Am J Respir Cell Mol Biol. 2008;38(1):95–104.

  44.

    Kwon EJ, Dudani JS, Bhatia SN. Ultrasensitive tumour-penetrating nanosensors of protease activity. Nat Biomed Eng. 2017;1:0054.

  45.

    Zheng X, Mao H, Huo D, Wu W, Liu B, Jiang X. Successively activatable ultrasensitive probe for imaging tumour acidity and hypoxia. Nat Biomed Eng. 2017;1(4):0057.

  46.

    Nakasone ES, Askautrud HA, Kees T, et al. Imaging tumor-stroma interactions during chemotherapy reveals contributions of the microenvironment to resistance. Cancer Cell. 2012;21(4):488–503.



Not applicable.

Availability of data and materials

Not applicable.

Author information




#These authors contributed equally to this work. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Zhenling Qiu or Zaijun Lin.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Wang, Y., Xu, H., Zhu, B. et al. Systematic identification of the key candidate genes in breast cancer stroma. Cell Mol Biol Lett 23, 44 (2018).

Keywords

  • Differentially expressed genes
  • COL1A1
  • FN1
  • Breast cancer