Exploring Expression Data: Identification and Analysis of Coexpressed Genes

  1. Laurie J. Heyer1,
  2. Semyon Kruglyak1,2, and
  3. Shibu Yooseph1
  1. Department of Mathematics, University of Southern California, California USA

Abstract

Analysis procedures are needed to extract useful information from the large amount of gene expression data that is becoming available. This work describes a set of analytical tools and their application to yeast cell cycle data. The components of our approach are (1) a similarity measure that reduces the number of false positives, (2) a new clustering algorithm designed specifically for grouping gene expression patterns, and (3) an interactive graphical cluster analysis tool that allows user feedback and validation. We use the clusters generated by our algorithm to summarize genome-wide expression and to initiate supervised clustering of genes into biologically meaningful groups.

Footnotes

  • 1 The authors contributed equally to this work and are listed in alphabetical order.

  • 2 Corresponding author.

  • E-MAIL kruglyak{at}hto.usc.edu; FAX (213) 740-2424.

    • Received May 19, 1999.
    • Accepted September 14, 1999.
| Table of Contents

Preprint Server