Introduction
Materials and methods
SAGE libraries
Histology | Library name | Tag count | Unique tags |
---|---|---|---|
Normal breast tissue | |||
Normal 1 | SAGE breast normal AP Br Na | 37,419 | 15,886 |
Normal 2 | SAGE breast normal epithelium AP 1a | 49,021 | 18,276 |
Normal 3 | SAGE breast normal organoid Ba | 58,326 | 19,602 |
Normal 4 | SAGE breast normal organoid B2a | 59,481 | 20,391 |
Ductal carcinoma in situ | |||
DCIS 1 | SAGE breast carcinoma MD DCISa | 42,174 | 14,237 |
DCIS 2 | SAGE breast carcinoma AP DCIS 3a | 57,924 | 31,142 |
DCIS 3 | SAGE breast carcinoma B DCIS 4a | 60,699 | 20,224 |
DCIS 4 | SAGE breast carcinoma B DCIS 5a | 43,118 | 15,935 |
DCIS 5 | SAGE breast carcinoma epithelium AP DCIS 6a | 73,409 | 30,256 |
DCIS 6 | SAGE breast carcinoma B BWHT18a | 50,879 | 19,182 |
DCIS 7 | MDACC 22Tb | 102,533 | 33,305 |
Invasive ductal carcinoma | |||
IDC 1 | MDACC 09Tb | 91,647 | 37,863 |
IDC 2 | MDACC 14Tb | 100,255 | 26,422 |
IDC 3 | MDACC 15Tb | 90,198 | 27,653 |
IDC 4 | MDACC 17Tb | 100,386 | 29,300 |
IDC 5 | MDACC 18Tb | 101,543 | 29,936 |
IDC 6 | MDACC 19Tb | 100,334 | 28,498 |
IDC 7 | MDACC 20Tb | 100,047 | 28,903 |
IDC 8 | MDACC 21Tb | 103,825 | 31,412 |
IDC 9 | MDACC 24Tb | 99,546 | 30,363 |
IDC 10 | MDACC 25Tb | 100,501 | 30,778 |
IDC 11 | SAGE breast carcinoma B IDC 3a | 68,937 | 22,732 |
IDC 12 | SAGE breast carcinoma B IDC 5a | 60,476 | 20,457 |
Total | 23 breast libraries | 1,752,678 | |
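The totals row can be checked directly from the per-library tag counts. The following is a minimal sketch, not part of the original analysis: it copies the tag counts from the table above, groups them by histology, and sums them. The variable names (`tag_counts`, `group_totals`, `grand_total`) are illustrative assumptions.

```python
# Tag counts per library, copied from the table above and grouped by histology.
# The grouping keys and variable names are illustrative, not from the paper.
tag_counts = {
    "Normal": [37_419, 49_021, 58_326, 59_481],
    "DCIS": [42_174, 57_924, 60_699, 43_118, 73_409, 50_879, 102_533],
    "IDC": [91_647, 100_255, 90_198, 100_386, 101_543, 100_334,
            100_047, 103_825, 99_546, 100_501, 68_937, 60_476],
}

# Sum tags within each histological group, then across all libraries.
group_totals = {group: sum(counts) for group, counts in tag_counts.items()}
n_libraries = sum(len(counts) for counts in tag_counts.values())
grand_total = sum(group_totals.values())

for group, total in group_totals.items():
    print(f"{group}: {len(tag_counts[group])} libraries, {total:,} tags")
print(f"Total: {n_libraries} libraries, {grand_total:,} tags")
```

The library count and the grand total computed this way agree with the totals row of the table (23 libraries, 1,752,678 tags).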
SAGE methodology
SAGE data processing
Statistical analysis of SAGE libraries
Results and discussion
Generation and analysis of SAGE libraries
Tag | Gene | Description | Locus link | Fold change | P value |
---|---|---|---|---|---|
DCIS overexpressed genes | |||||
GTATTTAACT | PKD1-like | Polycystic kidney disease 1-like | 79932 | 13.7 | 0.0100 |
CGGACTCACT | STARD10 | START domain containing 10 | 10809 | 11.2 | 0.0086 |
GTGTTGGGGG | EPS8L2 | EPS8-like 2 | 64787 | 9.6 | 0.0099 |
TTTCTGGAGG | KIAA0545 | KIAA0545 protein | 23094 | 8.6 | 0.0100 |
GATAAATTAA | FLJ14153 | Hypothetical protein | 64747 | 8.5 | 0.0055 |
GAGAAATATC | NP220 | Nuclear protein | 27332 | 8.0 | 0.0088 |
CCCTCTTTGG | LOC118487 | mRNA similar to RIKEN cDNA 1110001019 | 118487 | 7.4 | 0.0037 |
CTGGGACTGA | LSM4 | U6 small nuclear RNA associated (S. cerevisiae) | 25804 | 6.4 | 0.0055 |
CTGGGCCAGC | VAMP5 | Vesicle-associated membrane protein 5 | 10791 | 6.4 | 0.0068 |
GCCCTTTCTC | MRC2 | Mannose receptor, C type 2 | 9902 | 5.8 | 0.0015 |
TCTTGATTTA | A2M | Alpha-2-macroglobulin | 2 | 5.8 | 0.0083 |
TAGTTTGTGG | MSH2 | MutS homolog 2, colon cancer | 4436 | 5.6 | 0.0091 |
TCAGTGAACT | HPS4 | Hermansky–Pudlak syndrome 4 | 89781 | 5.6 | 0.0100 |
GTTTATTCTT | FOXA1 | Forkhead box A1 | 3169 | 5.3 | 0.0044 |
GCCGCTGCCA | PPP1R13B | Protein phosphatase 1 | 23368 | 4.3 | 0.0054 |
TAAAGTGTCT | PIGS | Phosphatidylinositol glycan, class S | 94005 | 3.9 | 0.0100 |
DCIS underexpressed genes | |||||
GGGACGAGTG | TM4SF1 | Transmembrane 4 superfamily member 1 | 4071 | -442.6 | 0.0083 |
TAACAGCCAG | NFKBIA | Nuclear factor kappa light polypeptide gene | 4792 | -158.6 | 2.4 × 10⁻⁶ |
CAACTAATTC | CLU | Clusterin | 1191 | -63.9 | 0.0036 |
GCCTTAACAA | PBEF | Pre-B-cell colony-enhancing factor | 10135 | -44.3 | 0.0020 |
GGGTTTTTAT | NSEP1 | Nuclease sensitive element binding protein 1 | 4904 | -36.2 | 0.0001 |
GACACGAACA | RASD1 | RAS, dexamethasone-induced 1 | 51655 | -31.4 | 0.0095 |
AAGATTGGTG | CD9 | CD9 antigen (p24) | 928 | -29.6 | 0.0003 |
ACCAAATTAA | TNFRSF10B | Tumor necrosis factor receptor superfamily | 8795 | -29.4 | 0.0003 |
CTGGGCCTGA | LITAF | Lipopolysaccharide-induced tumor necrosis factor | 9516 | -28.9 | 0.0076 |
CTGCCATAAC | SBDS | Shwachman–Bodian–Diamond syndrome | 51119 | -24.2 | 0.0005 |
CACAGGCAAA | BZW1 | Basic leucine zipper and W2 domains 1 | 9689 | -22.1 | 0.0056 |
GTTCCCTGGC | FAU | Finkel–Biskis–Reilly murine sarcoma virus | 2197 | -22.1 | 0.0028 |
GTCTGCACCT | DKFZp547C1 | Hypothetical protein | 254851 | -21.9 | 0.0087 |
TACGTTGCAG | GC20 | Translocation factor sui1 homolog | 10289 | -21.8 | 0.0079 |
TGTAAAGATT | CCNL1 | Cyclin L1 | 57018 | -21.2 | 0.0008 |
TGTTAAGTTC | CRY1 | Cryptochrome 1 (photolyase-like) | 1407 | -18.7 | 0.0091 |
GAAATAAAGT | FLJ21657 | Hypothetical protein | 64417 | -18.5 | 0.0032 |
ATGGGCTTGA | SQRDL | Sulfide quinone reductase-like (yeast) | 58472 | -17.3 | 0.0061 |
TCAAGAAATT | PSME3 | Proteasome activator subunit 3 | 10197 | -15.7 | 0.0028 |
CCGTGGTCGT | FBL | Fibrillarin | 2091 | -15.3 | 0.0100 |
TGGAACAGGA | TGIF | Transforming growth factor beta-induced factor (TALE family homeobox) | 7050 | -12.2 | 0.0030 |
AATGCTGGCA | DNAJB6 | DnaJ homolog, subfamily B, member 6 | 10049 | -11.7 | 0.0065 |
AATGAGCAAC | GBP2 | Guanylate binding protein 2, interferon-inducible | 2634 | -11.1 | 0.0082 |
GACCTATCTC | KIAA0992 | Paladin | 23022 | -10.9 | 0.0092 |
AACTCTTGAA | EIF3S3 | Eukaryotic translation initiation factor 3 | 8667 | -10.6 | 0.0082 |
GGGATTTTGT | PMAIP1 | Phorbol-12-myristate-13-acetate-induced protein 1 | 5366 | -10.6 | 0.0091 |
AAAGCAAAAA | PTPN4 | Protein tyrosine phosphatase, non-receptor type 4 | 5775 | -10.4 | 0.0040 |
ACTGACTATC | NEU1 | Sialidase 1 (lysosomal sialidase) | 4758 | -10.3 | 0.0095 |
TTCCAGTTCA | PDE4B | Phosphodiesterase 4B, cAMP-specific | 5142 | -9.9 | 0.0087 |
GAATGATTTC | ORF1-FL49 | Putative nuclear protein | 84418 | -9.5 | 0.0070 |
GACTCGCTCC | HSJ001348 | cDNA for differentially expressed CO16 gene | 54742 | -9.5 | 0.0072 |
TGGTTACAAA | NDEL1 | Nude nuclear distribution gene E homolog like 1 | 81565 | -8.9 | 0.0100 |
AGTATGAGGA | TNFAIP3 | Tumor necrosis factor, alpha-induced protein 3 | 7128 | -8.1 | 0.0083 |
CAGTTTAAAA | CRSP6 | Cofactor required for Sp1 transcriptional activation | 9440 | -7.7 | 0.0100 |
Tag | Gene | Description | Locus link | Fold change | P value |
---|---|---|---|---|---|
IDC overexpressed genes | |||||
TGGAAATGAC | COL1A1 | Collagen type I, alpha 1 | 1277 | 315.4 | 0.0054 |
ATGTGAAGAG | SPARC | Secreted protein, cysteine-rich (osteonectin) | 6678 | 286.8 | 0.0003 |
TTTGGTTTTC | COL1A2 | Collagen type I, alpha 2 | 1278 | 210.9 | 0.0084 |
TTGCTGACTT | COL6A1 | Collagen type VI, alpha 1 | 1291 | 73.9 | 0.0023 |
TTATGTTTAA | LUM | Lumican | 4060 | 56.7 | 0.0011 |
TTGGAGATCT | NDUFA4 | NADH dehydrogenase (ubiquinone) | 4697 | 56.4 | 0.0065 |
CCACAGGGGA | COL3A1 | Collagen type III, alpha 1 | 1281 | 49.4 | 0.0056 |
ATCTTGTTAC | FN1 | Fibronectin 1 | 2335 | 44.3 | 0.0031 |
TTGTAATCGT | OAZ1 | Ornithine decarboxylase antizyme 1 | 4946 | 38.6 | 0.0038 |
TGTAATCAAT | HNRPA1 | Heterogeneous nuclear ribonucleoprotein A1 | 3178 | 38.2 | 0.0039 |
GGAAGCTAAG | OSF-2 | Osteoblast specific factor 2 (fasciclin I-like) | 10631 | 36.3 | 0.0005 |
ACCTGTATCC | IFITM3 | Interferon induced transmembrane protein 3 | 10410 | 34.3 | 0.0021 |
GGAAATGTCA | MMP2 | Matrix metalloproteinase 2 | 4313 | 29.1 | 0.0008 |
TGCACTTCAA | SPARCL1 | SPARC-like 1 (mast9, hevin) | 8404 | 21.7 | 0.0050 |
GGAACTTTTA | SULF2 | Sulfatase 2 | 55959 | 19.8 | 0.0017 |
CTGTTAGTGT | MDH1 | Malate dehydrogenase 1 | 4190 | 18.5 | 0.0026 |
TATGAATGCT | CSPG2 | Chondroitin sulfate proteoglycan 2 (versican) | 1462 | 18.2 | 0.0017 |
TCCAAATCGA | VIM | Vimentin | 7431 | 17.7 | 0.0014 |
TGTAGTTTGA | SKP1A | S-phase kinase-associated protein 1A | 6500 | 16.9 | 0.0013 |
TAATAAACAG | ASAH1 | N-acylsphingosine amidohydrolase | 427 | 16.8 | 0.0085 |
GCCTCCTCCC | EIF3k | Eukaryotic translation initiation factor 3 | 27335 | 16.8 | 0.0098 |
GAAACAAGAT | PGK1 | Phosphoglycerate kinase 1 | 5230 | 15.8 | 0.0006 |
TGCTTTGGGA | TTC11 | Tetratricopeptide repeat domain 11 | 51024 | 15.8 | 0.0018 |
GAAATCAAAA | SIGLEC5 | Sialic acid binding Ig-like lectin 5 | 8778 | 14.7 | 0.0098 |
ATGTAGTAGT | HNRPD | Heterogeneous nuclear ribonucleoprotein D | 3184 | 14.7 | 0.0025 |
GACCACCTTT | MFAP2 | Microfibrillar-associated protein 2 | 4237 | 14.4 | 0.0000 |
ACTTATTATG | DCN | Decorin | 1634 | 13.9 | 0.0007 |
TCTCTACCCA | APLP2 | Amyloid beta (A4) precursor-like protein 2 | 334 | 13.7 | 0.0076 |
TGCAATATGC | FBN1 | Fibrillin 1 | 2200 | 13.5 | 0.0037 |
ATTTCTTCAA | SFRP2 | Secreted frizzled-related protein 2 | 6423 | 13.4 | 0.0030 |
ATAAAAAGAA | CTSK | Cathepsin K (pycnodysostosis) | 1513 | 13.0 | 0.0003 |
GTACATTGTA | MGC15737 | Hypothetical protein | 85012 | 12.6 | 0.0011 |
TGATGTTTGA | DAZAP2 | DAZ associated protein 2 | 9802 | 12.5 | 0.0013 |
TCCGTGGTTG | BASP1 | Membrane attached signal protein 1 | 10409 | 12.1 | 0.0055 |
ACTGCTTTAC | DKFZp564I1922 | Adlican | 25878 | 12.0 | 0.0086 |
TCTGCAATGA | TINP1 | Transforming growth factor beta-inducible nuclear protein 1 | 10412 | 12.0 | 0.0033 |
GTTTCTTCCC | SELH | Selenoprotein H | 2880636 | 11.8 | 0.0037 |
AATATGCTTT | ATP6V1E1 | ATPase | 529 | 11.6 | 0.0009 |
TTATGGATCT | SPON2 | Spondin 2, extracellular matrix protein | 10417 | 11.3 | 0.0003 |
AAAATAAAGA | APEX1 | Nuclease, multifunctional DNA repair enzyme | 328 | 11.3 | 0.0099 |
TGTGTGTTTG | H1F0 | H1 histone family | 3005 | 11.2 | 0.0029 |
TATGTTTCAG | PTPN12 | Protein tyrosine phosphatase | 5782 | 11.1 | 0.0003 |
ACCAAAGCCC | MGC9651 | Hypothetical protein | 114932 | 10.6 | 0.0075 |
CAAGGATCTA | NICE-3 | NICE-3 protein | 25912 | 10.5 | 0.0057 |
GACGTCTTAA | PSMA4 | Proteasome subunit, alpha type | 5685 | 10.3 | 0.0038 |
CAGATAACAT | TOMM20 | Translocase | 9804 | 10.1 | 0.0064 |
AACTCTTGAA | EIF3S3 | Eukaryotic translation initiation factor 3 | 8667 | 10.0 | 0.0047 |
TTCTTGGTGT | TRPS1 | Trichorhinophalangeal syndrome I | 7227 | 9.9 | 0.0042 |
TGCCTTAGTA | DNAJC1 | DNAJ homolog | 64215 | 9.8 | 0.0040 |
AGACAAGCTG | SFRS5 | Splicing factor | 6430 | 9.4 | 0.0015 |
ACAAGAATTG | SYPL | Synaptophysin-like protein | 6856 | 9.3 | 0.0027 |
TACATCCGAA | MTPN | Myotrophin | 136319 | 9.3 | 0.0013 |