The parallel monitoring of the expression profiles of thousands of genes seems particularly promising for a deeper understanding of cancer biology and to identify molecular signatures supporting the histological classification schemes of neoplastic specimens. A computational procedure for feature extraction and classification of gene expression data through the application of principal component analysis and of the soft independent modeling of class analogy approach (SIMCA) is described. The identified features contain critical information about gene-phenotype relationships observed during changes in cell physiology. They represent a rational and dimensionally reduced base for understanding the basic biology of the onset of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for the identification and classification of pathological states. The proposed method has been tested on the childhood round blue cell tumors study presented by Khan et al [1]. The analysis of the SIMCA model allows the identification of specific phenotype markers and provides the assignment to multiple classes for previously unseen instances.
Disjoint PCA models for marker identification and classification of cancer types using gene expression data / Bicciato, S.; Luchini, A.; Di Bello, C.. - (2002), pp. 98-99. (Intervento presentato al convegno IEEE-EMBS Special Topic Conference on Molecular, Cellular and Tissue Engineering, MCTE 2002 tenutosi a StarHotel President, ita nel 2002) [10.1109/MCTE.2002.1175023].
Disjoint PCA models for marker identification and classification of cancer types using gene expression data
Bicciato S.;
2002
Abstract
The parallel monitoring of the expression profiles of thousands of genes seems particularly promising for a deeper understanding of cancer biology and to identify molecular signatures supporting the histological classification schemes of neoplastic specimens. A computational procedure for feature extraction and classification of gene expression data through the application of principal component analysis and of the soft independent modeling of class analogy approach (SIMCA) is described. The identified features contain critical information about gene-phenotype relationships observed during changes in cell physiology. They represent a rational and dimensionally reduced base for understanding the basic biology of the onset of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for the identification and classification of pathological states. The proposed method has been tested on the childhood round blue cell tumors study presented by Khan et al [1]. The analysis of the SIMCA model allows the identification of specific phenotype markers and provides the assignment to multiple classes for previously unseen instances.Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris