Background The parallel monitoring of the expression profiles of thousands of genes seems particularly promising for a deeper understanding of cancer biology and to identify molecular signatures supporting the histological classification schemes of neoplastic specimens. However, molecular diagnostic based on microarray data presents major challenges due to the complex, multiclass nature and to the overwhelming number of variables characterizing gene expression databases of multiple tumor samples. Thus, the development of multiclass; classification schemes and of marker selection methods, that allow the simultaneous classification of multiple tumor types and the identification of those genes that are most likely to confer high classification accuracy, is of paramount importance.Methods. A computational procedure for marker identification and classification of multiclass gene expression data through the application of disjoint principal component models, based on the Soft independent Modeling of Class Analogy approach (SIMCA), is described. The identified features represent a rational and dimensionally reduced base for understanding the basic biology of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for the identification and classification of multiple pathological states.Results. The method has been tested on 2 different microarray data sets obtained from various human tumor samples: i) acute leukemias, and ii) small round blue-cell tumors.Conclusions. The results demonstrate that the disjoint PCA modeling procedure allows the identification of specific phenotype markers and provides the assignment to multiple classes for previously unseen instances.
Disjoint PCA models for marker identification and classification of cancer types using gene expression data / Bicciato, Silvio; Luchini, A; DI BELLO, C.. - In: MINERVA BIOTECNOLOGICA. - ISSN 1120-4826. - STAMPA. - 14:3-4(2002), pp. 281-290.
Disjoint PCA models for marker identification and classification of cancer types using gene expression data
BICCIATO, Silvio;
2002
Abstract
Background The parallel monitoring of the expression profiles of thousands of genes seems particularly promising for a deeper understanding of cancer biology and to identify molecular signatures supporting the histological classification schemes of neoplastic specimens. However, molecular diagnostic based on microarray data presents major challenges due to the complex, multiclass nature and to the overwhelming number of variables characterizing gene expression databases of multiple tumor samples. Thus, the development of multiclass; classification schemes and of marker selection methods, that allow the simultaneous classification of multiple tumor types and the identification of those genes that are most likely to confer high classification accuracy, is of paramount importance.Methods. A computational procedure for marker identification and classification of multiclass gene expression data through the application of disjoint principal component models, based on the Soft independent Modeling of Class Analogy approach (SIMCA), is described. The identified features represent a rational and dimensionally reduced base for understanding the basic biology of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for the identification and classification of multiple pathological states.Results. The method has been tested on 2 different microarray data sets obtained from various human tumor samples: i) acute leukemias, and ii) small round blue-cell tumors.Conclusions. The results demonstrate that the disjoint PCA modeling procedure allows the identification of specific phenotype markers and provides the assignment to multiple classes for previously unseen instances.File | Dimensione | Formato | |
---|---|---|---|
1356-MB.pdf
Accesso riservato
Tipologia:
VOR - Versione pubblicata dall'editore
Dimensione
84.37 kB
Formato
Adobe PDF
|
84.37 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris