Volume 980, Issue 1 p. 41-64

Pattern Recognition Techniques in Microarray Data Analysis

A Survey

FARAMARZ VALAFAR

Corresponding Author

FARAMARZ VALAFAR

Department of Computer Science, San Diego State University, San Diego, California 92182, USA

Address for correspondence: Faramarz Valafar, Department of Computer Science, San Diego State University, 5500 Campanile Drive, San Diego, CA 92182, USA. Voice: 619-594-3742; fax: 619-594-6746; [email protected].Search for more papers by this author
First published: 24 January 2006
Citations: 90

Abstract

Abstract: Recent development of technologies (e.g., microarray technology) that are capable of producing massive amounts of genetic data has highlighted the need for new pattern recognition techniques that can mine and discover biologically meaningful knowledge in large data sets. Many researchers have begun an endeavor in this direction to devise such data-mining techniques. As such, there is a need for survey articles that periodically review and summarize the work that has been done in the area. This article presents one such survey. The first portion of the paper is meant to provide the basic biology (mostly for non-biologists) that is required in such a project. This part is only meant to be a starting point for those experts in the technical fields who wish to embark on this new area of bioinformatics. The second portion of the paper is a survey of various data-mining techniques that have been used in mining microarray data for biological knowledge and information (such as sequence information). This survey is not meant to be treated as complete in any form, since the area is currently one of the most active, and the body of research is very large. Furthermore, the applications of the techniques mentioned here are not meant to be taken as the most significant applications of the techniques, but simply as examples among many.