Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Blachon, Sylvain; | Pensa, Ruggero G. | Besson, Jérémy | Robardet, Céline | Boulicaut, Jean-François | Gandrillon, Olivier
Affiliations: Equipe "Bases Moléculaires de l'Autorenouvellement et de ses Altérations", Université de Lyon, Lyon, F-69003, France. Université Lyon 1, Villeurbanne, F-69622, France. CNRS UMR5534, CGMC, Villeurbanne, F-69622, France. Tel.: +33 4 72 44 81 90; Fax: +33 4 72 43 26 85; E-mail: [email protected]; [email protected] | INSA-Lyon, LIRIS CNRS UMR5205, Bâtiment Blaise Pascal, F-69621 Villeurbanne cedex, France. Tel.: +33 4 72 43 89 05; Fax: +33 4 72 43 87 13; E-mail: {Ruggero.Pensa,Jeremy.Besson,Celine.Robardet,Jean-Francois.Boulicaut}@insa-lyon.fr
Note: [] Corresponding author
Abstract: The production of high-throughput gene expression data has generated a crucial need for bioinformatics tools to generate biologically interesting hypotheses. Whereas many tools are available for extracting global patterns, less attention has been focused on local pattern discovery. We propose here an original way to discover knowledge from gene expression data by means of the so-called formal concepts which hold in derived Boolean gene expression datasets. We first encoded the over-expression properties of genes in human cells using human SAGE data. It has given rise to a Boolean matrix from which we extracted the complete collection of formal concepts, i.e., all the largest sets of over-expressed genes associated to a largest set of biological situations in which their over-expression is observed. Complete collections of such patterns tend to be huge. Since their interpretation is a time-consuming task, we propose a new method to rapidly visualize clusters of formal concepts. This designates a reasonable number of Quasi-Synexpression-Groups (QSGs) for further analysis. The interest of our approach is illustrated using human SAGE data and interpreting one of the extracted QSGs. The assessment of its biological relevancy leads to the formulation of both previously proposed and new biological hypotheses.
Keywords: Transcriptome, SAGE, pattern discovery, formal concepts, closed sets, clustering
Journal: In Silico Biology, vol. 7, no. 4-5, pp. 467-483, 2007
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]