SLIDE 2 2
Gene selection
Gene selection with bootstraping for Lymph Node Metastasis:
H0: RatioLNM = RatioNot LNM v.s. H1: RatioLNM ≠ RatioNot LNM
Cluster(Hs.) Name Symbol Mean P-val boot-t Hs.291 glutamyl aminopeptidase (aminopeptidase A) ENPEP 0.727 0.142 Hs.823 hepsin (transmembrane protease, serine 1) HPN 0.542 0.1238 Hs.74861 activated RNA polymerase II transcription cofactor 4 PC4 0.839 0.1602 Hs.60478 ESTs, Moderately similar to S47073 finger protein HZF2 <Hs.60478> 0.6935 0.1414 0.001 Hs.284266 hypothetical protein MGC8471 MGC8471 0.4589 0.1013 0.001 Hs.96 phorbol-12-myristate-13-acetate-induced protein 1 PMAIP1 0.7117 0.1662 0.002 Hs.2025 transforming growth factor, beta 3 TGFB3 0.5842 0.1585 0.002 Hs.83469 nuclear factor (erythroid-derived 2)-like 1 NFE2L1 0.8419 0.1812 0.002 Hs.181046 dual specificity phosphatase 3 (vaccinia virus phosphatDUSP3 0.4843 0.0943 0.002 Hs.331 general transcription factor IIIC, polypeptide 1 GTF3C1 0.2834 0.0864 0.003 Hs.635 calcium channel, voltage-dependent, beta 1 subunit CACNB1 0.5364 0.122 0.003 Hs.1066 small nuclear ribonucleoprotein polypeptide E SNRPE 0.4673 0.1015 0.003 Hs.1098 DKFZp434J1813 protein DKFZP434J1813 0.5033 0.126 0.003 Hs.104481 Nck, Ash and phospholipase C binding protein NAP4 0.6001 0.1276 0.003 Hs.118825 mitogen-activated protein kinase kinase 6 MAP2K6 0.2853 0.0754 0.003 Hs.161 cadherin 2, type 1, N-cadherin (neuronal) CDH2 0.3771 0.1175 0.004 Hs.13063 transcription factor CA150 CA150 0.655 0.1831 0.004 Hs.124029 inositol polyphosphate-5-phosphatase, 40kD INPP5A 0.9106 0.2486 0.004 Hs.170980 KIAA0948 protein KIAA0948 0.683 0.1597 0.004 Hs.211614 chloride channel 6 CLCN6 0.4511 0.114 0.004
Classification
PMAIP1 ENPEP GTF3C1 CACNB1 HPN DKFZP434J1813 TGFB3 MGC8471 ... Class [*, 0.036) [*, -0.046) [*, -0.226) [-0.136, 0.290) [*, -0.288) [*, -0.044) [*, -0.152) [-0.016, 0.318) ... Y [0.036, 0.440) [0.380, *) [0.026, *) [0.290, *) [0.064, *) [0.292, *) [0.108, *) [0.318, *) ... Y [0.440, *) [0.380, *) [0.026, *) [-0.136, 0.290) [-0.288, 0.064) [0.292, *) [-0.152, 0.108) [0.318, *) ... Y [*, 0.036) [*, -0.046) [*, -0.226) [*, -0.136) [*, -0.288) [*, -0.044) [*, -0.152) [*, -0.016) ... N [0.440, *) [0.380, *) [*, -0.226) [0.290, *) [-0.288, 0.064) [-0.044, 0.292) [0.108, *) [0.318, *) ... Y [*, 0.036) [-0.046, 0.380) [-0.226, 0.026) [-0.136, 0.290) [0.064, *) [-0.044, 0.292) [0.108, *) [-0.016, 0.318) ... Y [0.036, 0.440) [*, -0.046) [*, -0.226) [-0.136, 0.290) [*, -0.288) [*, -0.044) [-0.152, 0.108) [-0.016, 0.318) ... N [0.440, *) [0.380, *) [-0.226, 0.026) [0.290, *) [0.064, *) [0.292, *) [0.108, *) [0.318, *) ... Y [0.036, 0.440) [*, -0.046) Undefined [*, -0.136) Undefined [*, -0.044) [*, -0.152) [*, -0.016) ... N Undefined [-0.046, 0.380) Undefined Undefined Undefined Undefined [*, -0.152) Undefined ... N [*, 0.036) [*, -0.046) [-0.226, 0.026) [*, -0.136) [-0.288, 0.064) [-0.044, 0.292) [0.108, *) [*, -0.016) ... N [0.440, *) [-0.046, 0.380) [0.026, *) [0.290, *) [-0.288, 0.064) [0.292, *) [*, -0.152) [0.318, *) ... Y [0.036, 0.440) [-0.046, 0.380) [*, -0.226) [*, -0.136) [*, -0.288) [-0.044, 0.292) [-0.152, 0.108) [*, -0.016) ... N [0.036, 0.440) [-0.046, 0.380) [-0.226, 0.026) [-0.136, 0.290) [-0.288, 0.064) [-0.044, 0.292) [-0.152, 0.108) [-0.016, 0.318) ... Y [0.440, *) [0.380, *) [0.026, *) [0.290, *) [0.064, *) [0.292, *) [-0.152, 0.108) [-0.016, 0.318) ... Y [0.036, 0.440) [-0.046, 0.380) [0.026, *) [-0.136, 0.290) [0.064, *) [-0.044, 0.292) [-0.152, 0.108) [-0.016, 0.318) ... Y [*, 0.036) [*, -0.046) [-0.226, 0.026) [*, -0.136) [*, -0.288) [*, -0.044) [*, -0.152) [*, -0.016) ... N
Decision system: Decision rules:
PMAIP1([*, 0.036)) AND PC4([-0.716, -0.073)) => Class(Y) PMAIP1([*, 0.036)) AND PC4([*, -0.716)) => Class(N) PMAIP1([0.036, 0.440)) AND PC4([*, -0.716)) => Class(N) TGFB3([*, -0.152)) AND MGC8471([-0.016, 0.318)) => Class(Y) TGFB3([0.108, *)) AND MGC8471([-0.016, 0.318)) => Class(Y) CLCN6([-0.209, 0.141)) AND MGC8471([-0.016, 0.318)) => Class(Y) CLCN6([*, -0.209)) AND MGC8471([-0.016, 0.318)) => Class(N)
Prediction Performance
Sample Reducer Discretation Max Genes Sig. lev. Accuracy Sens.
Laurén's histological classification Dynamic Freq.bin (4) 10 0.01 16/17=0.941 1 0.86 0.93 Localization of tumor Dynamic Entropy 20 0.01 17/17=1 1 1 1 Lymph node metastasis Dynamic Freq.bin (3) 20 0.01 14/17=0.824 0.7 1 0.9 Penetration of the stomach wall Holte 1r Entropy 20 0.01 16/17=0.941 1 0.75 0.85 Remote metastasis Holte 1r Entropy 40 0.1 13/13=1 1 1 1 Serum gastrin Genetic Entropy 10 0.05 11/14=0.786 0.9 0.6 0.66
Sample Rules No. (avg) Rules No. (Range) Total no. of genes in all classifiers Laurén's histological classification 24.1 10-67 17 Localization of tumor 238.1 200-311 72 Lymph node metastasis 388.1 222-523 73 Penetration of the stomach wall 109.6 28-280 75 Remote metastasis 425.1 305-468 161 Serum gastrin 47.9 18-72 42
Validation from biomedical literature
Unkown Sample gastric cancer
gastric cancer
Connection Laurén's histological classification 1 2 Localization of tumor 2 3 22 Lymph node metastasis 1 2 1 26 Penetration of the stomach wall 4 1 1 17 Remote metastasis 3 2 47 Serum gastrin 1 1 18 Known connection to the parameter in Known Connection to