Volume 1158, Issue 1 p. 159-195

Lessons from the DREAM2 Challenges

A Community Effort to Assess Biological Network Inference

Gustavo Stolovitzky

Gustavo Stolovitzky

IBM Computational Biology Center, Yorktown Heights, New York, USA

Department of Biomedical Informatics, Columbia University, New York, New York, USA

Search for more papers by this author
Robert J. Prill

Robert J. Prill

IBM Computational Biology Center, Yorktown Heights, New York, USA

Search for more papers by this author
Andrea Califano

Andrea Califano

Department of Biomedical Informatics, Columbia University, New York, New York, USA

Search for more papers by this author
First published: 30 March 2009
Citations: 139
Address for correspondence: Gustavo Stolovitzky, IBM T.J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY 10598. Voice: 914-945-1292; fax: 914-945-4217. [email protected]

Abstract

Regardless of how creative, innovative, and elegant our computational methods, the ultimate proof of an algorithm's worth is the experimentally validated quality of its predictions. Unfortunately, this truism is hard to reduce to practice. Usually, modelers produce hundreds to hundreds of thousands of predictions, most (if not all) of which go untested. In a best-case scenario, a small subsample of predictions (three to ten usually) is experimentally validated, as a quality control step to attest to the global soundness of the full set of predictions. However, whether this small set is even representative of the global algorithm's performance is a question usually left unaddressed. Thus, a clear understanding of the strengths and weaknesses of an algorithm most often remains elusive, especially to the experimental biologists who must decide which tool to use to address a specific problem. In this chapter, we describe the first systematic set of challenges posed to the systems biology community in the framework of the DREAM (Dialogue for Reverse Engineering Assessments and Methods) project. These tests, which came to be known as the DREAM2 challenges, consist of data generously donated by participants to the DREAM project and curated in such a way as to become problems of network reconstruction and whose solutions, the actual networks behind the data, were withheld from the participants. The explanation of the resulting five challenges, a global comparison of the submissions, and a discussion of the best performing strategies are the main topics discussed.