Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE3

Leoncini, Mauro; Montangero, Manuela; PANUCIA TILLAN, Karina

doi:10.2174/157489361002150518122428

Ensemble methods represent a relatively new approach to motif discovery that combines the results returned by "third-party" finders with the aim of achieving a better accuracy than that obtained by the single tools. Besides the choice of the external finders, another crucial element for the success of an ensemble method is the particular strategy adopted to combine the finders' results, a.k.a. learning function. Results appeared in the literature seem to suggest that ensemble methods can provide noticeable improvements over the quality of the most popular tools available for motif discovery. With the goal of better understanding potentials and limitations of ensemble methods, we developed a general software architecture whose major feature is the flexibility with respect to the crucial aspects of ensemble methods mentioned above. The architecture provides facilities for the easy addition of virtually any third-party tool for motif discovery whose code is publicly available, and for the definition of new learning functions. We present a prototype implementation of our architecture, called CE3 (Customizable and Easily Extensible Ensemble). Using CE3, and available ensemble methods, we performed experiments with three well-known datasets. The results presented here are varied. On the one hand, they confirm that ensemble methods cannot be just considered as the universal remedy for "in-silico" motif discovery. On the other hand, we found some encouraging regularities that may help to find a general set up for CE3 (and other ensemble methods as well) able to guarantee substantial improvements over single finders in a systematic way.

Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE3 / Leoncini, Mauro; Montangero, Manuela; Panucia Tillan, Karina. - In: CURRENT BIOINFORMATICS. - ISSN 1574-8936. - STAMPA. - 10:2(2015), pp. 124-138. [10.2174/157489361002150518122428]

Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE3

LEONCINI, Mauro;MONTANGERO, Manuela;PANUCIA TILLAN, Karina

2015

Abstract

Ensemble methods represent a relatively new approach to motif discovery that combines the results returned by "third-party" finders with the aim of achieving a better accuracy than that obtained by the single tools. Besides the choice of the external finders, another crucial element for the success of an ensemble method is the particular strategy adopted to combine the finders' results, a.k.a. learning function. Results appeared in the literature seem to suggest that ensemble methods can provide noticeable improvements over the quality of the most popular tools available for motif discovery. With the goal of better understanding potentials and limitations of ensemble methods, we developed a general software architecture whose major feature is the flexibility with respect to the crucial aspects of ensemble methods mentioned above. The architecture provides facilities for the easy addition of virtually any third-party tool for motif discovery whose code is publicly available, and for the definition of new learning functions. We present a prototype implementation of our architecture, called CE3 (Customizable and Easily Extensible Ensemble). Using CE3, and available ensemble methods, we performed experiments with three well-known datasets. The results presented here are varied. On the one hand, they confirm that ensemble methods cannot be just considered as the universal remedy for "in-silico" motif discovery. On the other hand, we found some encouraging regularities that may help to find a general set up for CE3 (and other ensemble methods as well) able to guarantee substantial improvements over single finders in a systematic way.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2015
			
	Rivista
	
				CURRENT BIOINFORMATICS
			
	N° del Volume
	
				10
			
	Fascicolo
	
				2
			
	Pagina iniziale
	
				124
			
	Pagina finale
	
				138
			
	Codice DOI
	
				https://dx.doi.org/10.2174/157489361002150518122428
			
	Codice WoS
	
				WOS:000354786800002
			
	Codice Scopus
	
				2-s2.0-84930509270
			
	Citazione
	
				Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE3 / Leoncini, Mauro; Montangero, Manuela; Panucia Tillan, Karina. - In: CURRENT BIOINFORMATICS. - ISSN 1574-8936. - STAMPA. - 10:2(2015), pp. 124-138. [10.2174/157489361002150518122428]
			
	Tutti gli autori
	
						Leoncini, Mauro; Montangero, Manuela; Panucia Tillan, Karina
					
	Tipologia
	
				Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Panucia_Tex_text.pdf Open Access dal 03/06/2016 Descrizione: Articolo principale Tipologia: Versione dell'autore revisionata e accettata per la pubblicazione Dimensione 12.17 MB Formato Adobe PDF Visualizza/Apri	12.17 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris