Objectives. High-throughput technologies are radically boosting the understanding of living systems, thus creating enormous opportunities to elucidate the biological processes of cells in different physiological states. In particular, the application of DNA microarrays to monitor expression profiles from tumor cells is improving cancer analysis to levels that classical methods have been unable to reach. However, molecular diagnostics based on expression profiling requires addressing computational issues as the overwhelming number of variables and the complex, multi-class nature of tumor samples. Thus, the objective of the present research has been the development of a computational procedure for feature extraction and classification of gene expression data.Methods. The Soft Independent Modeling of Class Analogy (SIMCA) approach has been implemented in a data mining scheme, which allows the identification of those genes that are most likely to confer robust and accurate classification of samples from multiple tumor types.Results: The proposed method has been tested on two different microarray data sets, namely Golub's analysis of acute human leukemia [1] and the small round blue cell tumors study presented by Khan et al. [2]. The identified features represent a rational and dimensionally reduced base for understanding the biology of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for classification of pathological states.Conclusions: The analysis of the SIMCA model residuals allows the identification of specific phenotype markers. At the some time, the class analogy approach provides the assignment to multiple classes, such as different pathological conditions or tissue samples, for previously unseen instances.

Marker identification and classification of cancer types using gene expression data and SIMCA / Bicciato, Silvio; Luchini, A; DI BELLO, C.. - In: METHODS OF INFORMATION IN MEDICINE. - ISSN 0026-1270. - STAMPA. - 43:1(2004), pp. 4-8. [10.1055/s-0038-1633413]

Marker identification and classification of cancer types using gene expression data and SIMCA

BICCIATO, Silvio;
2004

Abstract

Objectives. High-throughput technologies are radically boosting the understanding of living systems, thus creating enormous opportunities to elucidate the biological processes of cells in different physiological states. In particular, the application of DNA microarrays to monitor expression profiles from tumor cells is improving cancer analysis to levels that classical methods have been unable to reach. However, molecular diagnostics based on expression profiling requires addressing computational issues as the overwhelming number of variables and the complex, multi-class nature of tumor samples. Thus, the objective of the present research has been the development of a computational procedure for feature extraction and classification of gene expression data.Methods. The Soft Independent Modeling of Class Analogy (SIMCA) approach has been implemented in a data mining scheme, which allows the identification of those genes that are most likely to confer robust and accurate classification of samples from multiple tumor types.Results: The proposed method has been tested on two different microarray data sets, namely Golub's analysis of acute human leukemia [1] and the small round blue cell tumors study presented by Khan et al. [2]. The identified features represent a rational and dimensionally reduced base for understanding the biology of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for classification of pathological states.Conclusions: The analysis of the SIMCA model residuals allows the identification of specific phenotype markers. At the some time, the class analogy approach provides the assignment to multiple classes, such as different pathological conditions or tissue samples, for previously unseen instances.
2004
43
1
4
8
Marker identification and classification of cancer types using gene expression data and SIMCA / Bicciato, Silvio; Luchini, A; DI BELLO, C.. - In: METHODS OF INFORMATION IN MEDICINE. - ISSN 0026-1270. - STAMPA. - 43:1(2004), pp. 4-8. [10.1055/s-0038-1633413]
Bicciato, Silvio; Luchini, A; DI BELLO, C.
File in questo prodotto:
File Dimensione Formato  
Bicciato_MethInfMed_2004.pdf

Accesso riservato

Tipologia: Versione pubblicata dall'editore
Dimensione 614.19 kB
Formato Adobe PDF
614.19 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/421805
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 8
social impact