In Silico Biology - Volume 2, issue 3 - Journals

Abstract: About five years ago, ontology was almost unknown in bioinformatics, even more so in molecular biology. Nowadays, many bioinformatics articles mention it in connection with text mining, data integration or as a metaphysical cure for problems in standardisation of nomenclature and other applications. This article attempts to give an account of what concept ontologies in the domain of biology and bioinformatics are; what they are not; how they can be constructed; how they can be …used; and some fallacies and pitfalls creators and users should be aware of. Show more

Keywords: Domain ontology, biology, bioinformatics, bio-ontologies, design, guidelines, semantics, philosophy

Citation: In Silico Biology, vol. 2, no. 3, pp. 179-193, 2002

Get PDF

AGenDA: Gene Prediction by Comparative Sequence Analysis

Authors: Rinner, Oliver | Morgenstern, Burkhard

Article Type: Research Article

Abstract: Comparative sequence analysis is a powerful approach to identify functional elements in genomic sequences. Herein, we describe AGenDA (Alignment-based GENe Detection Algorithm), a novel method for gene prediction that is based on long-range alignment of syntenic regions in eukaryotic genome sequences. Local sequence homologies identified by the DIALIGN program are searched for conserved splice signals to define potential protein-coding exons; these candidate exons are then used to assemble complete gene structures. The …performance of our method was tested on a set of 105 human-mouse sequence pairs. These test runs showed that sensitivity and specificity of AGenDA are comparable with the best gene- prediction program that is currently available. However, since our method is based on a completely different type of input information, it can detect genes that are not detectable by standard methods and vice versa. Thus, our approach seems to be a useful addition to existing gene-prediction programs. Availability: DIALIGN is available through the Bielefeld Bioinformatics Server (BiBiServ) at http://bibiserv.techfak.uni-bielefeld.de/dialign/ The gene-prediction program AGenDA described in this paper will be available through the BiBiServ or MIPS web server at http://mips.gsf.de. Show more

Keywords: gene prediction, sequence alignment, comparative genome analysis, cross-species sequence comparison, phylogenetic footprinting, genome annotation, dynamic programming

Citation: In Silico Biology, vol. 2, no. 3, pp. 195-205, 2002

Price: EUR 27.50

A System Architecture for Genomic Data Analysis

Authors: Glass, Änne | Gierl, Lothar

Article Type: Short Communication

Abstract: MOTIVATION: Most of diseases are caused by a set of gene defects, which occur in a complex association. The association scheme of expressed genes can be modelled by genetic networks. Genetic networks are efficiently facilities to understand the dynamic of pathogenic processes by modelling molecular reality of cell conditions. In this sense a genetic network consists of first, a set of genes of specified cells, tissues or species and second, causal relations between these genes determining …the functional condition of the biological system, i. e. under disease. A relation between two genes will exist if they both are directly or indirectly associated with disease [8]. Our goal is to characterize diseases (especially autoimmune diseases like chronic pancreatitis CP, multiple sclerosis MS, rheumatoid arthritis RA) by genetic networks generated by a computer system. We want to introduce this practice as a bioinformatic approach for finding targets. Show more

Keywords: genetic networks, model, functional genomics, proteomics, genomic data, expression data, chip data, data mining, analysis , bioinformatics, software system, complex association, causal relation, interaction, targets, artificial intelligence, AI, ART, parser engine

Citation: In Silico Biology, vol. 2, no. 3, pp. 207-211, 2002

Price: EUR 27.50

Building a Genome Database Using an Object-Oriented Approach

Authors: Barbasiewicz, Anna | Liu, Lin | Lang, B. Franz | Burger, Gertraud

Article Type: Research Article

Abstract: GOBASE is a relational database that integrates data associated with mitochondria and chloro-plasts. The most important data in GOBASE, i. e., molecular sequences and taxonomic information, are obtained from the public sequence data repository at the National Center for Biotechnology Information (NCBI), and are validated by our experts. Maintaining a curated genomic database comes with a towering labor cost, due to the shear volume of available genomic sequences and the plethora of annotation errors and omissions …in records re-trieved from public repositories. Here we describe our approach to increase automation of the database population process, thereby reducing manual intervention. As a first step, we used Unified Modeling Language (UML) to construct a list of potential errors. Each case was evaluated independently, and an expert solution was devised, and represented as a diagram. Subsequently, the UML diagrams were used as templates for writing object-oriented automation programs in the Java programming language. Show more

Keywords: automation, curation, GOBASE, Java, population, UML

Citation: In Silico Biology, vol. 2, no. 3, pp. 213-217, 2002

Price: EUR 27.50

The Semantic Metadatabase (SEMEDA): Ontology Based Integration of Federated Molecular Biological Data Sources

Authors: Köhler, Jacob | Schulze-Kremer, Steffen

Article Type: Research Article

Abstract: A system for "intelligent" semantic integration and querying of federated databases is being implemented by using three main components: A component which enables SQL access to integrated databases by database federation (MARGBench), an ontology based semantic metadatabase (SEMEDA) and an ontology based query interface (SEMEDA-query). In this publication we explain and demonstrate the principles, architecture and the use of SEMEDA. Since SEMEDA is implemented as 3 tiered web application database providers can enter …all relevant semantic and technical information about their databases by themselves via a web browser. SEMEDA' s collaborative ontology editing feature is not restricted to database integration, and might also be useful for ongoing ontology developments, such as the "Gene Ontology" [2]. SEMEDA can be found at http://www-bm.cs.uni-magdeburg. de/semeda/. We explain how this ontologically structured information can be used for semantic database integration. In addition, requirements to ontologies for molecular biological database integration are discussed and relevant existing ontologies are evaluated. We further discuss how ontologies and structured knowledge sources can be used in SEMEDA and whether they can be merged supplemented or updated to meet the requirements for semantic database integration. Show more

Keywords: Semantic database integration, molecular biology, meta-database, ontology, knowledge representation, controlled vocabulary

Citation: In Silico Biology, vol. 2, no. 3, pp. 219-231, 2002

Price: EUR 27.50

Construction of Stochastic Context Trees for Genetic Texts

Authors: Orlov, Yuri L. | Filippov, Vladimir P. | Potapov, Vladimir N. | Kolchanov, Nikolay A.

Article Type: Research Article

Abstract: A method has been developed for constructing a tree source model for genetic text generation. Model visualisation in the form of suffix (context) trees provides a new way of context analysis of symbol sequences. Estimation of the stochastic complexity of the data in the frame of the model serves as a criterion for the model's ascertainment. The model and complexity values are used for analysis of genetic texts. The software realisation of this algorithm enables to …reveal statistical properties of genetic sequences based on an information measure. The program developed is available via Internet at http://wwwmgs.bionet.nsc.ru/mgs/programs/complexity/. Show more

Keywords: complexity, information measure, suffix tree visualisation, variable memory Markov model, genetic texts, statistical modelling

Citation: In Silico Biology, vol. 2, no. 3, pp. 233-247, 2002

Price: EUR 27.50

Finding and Decrypting of Promoters Contributes to the Elucidation of Gene Function

Authors: Werner, Thomas

Article Type: Other

Abstract: The combination of full-scale genomic sequencing with high throughput expression analysis provides a new and largely unexploited basis for in silico functional genomics. Recent break through developments in locat-ing and analyzing promoters now allow extending functional genomics in silico far beyond identification of protein sequences into the complex regulatory structures and mechanisms of the genome. However, only first examples of this new type of approach are emerging at present and intensive further developments …of bioinformatics tools will be required before such analysis can become large-scale routine in genomic sequence analysis. Nevertheless, the door to a new dimension of functional analysis of the genomic sequence is open. Finally, only the tight integration of the enormous amount of knowledge gained from proteins sequence analysis with the complementary information about gene regulation will afford us with a more complete picture of the networks than constitute life. Show more

Keywords: genome annotation, gene regulation, promoter prediction, promoter modeling, functional context, transcription factor binding sites, promoter modules , RANTES, chemokine, coexpressed genes

Citation: In Silico Biology, vol. 2, no. 3, pp. 249-255, 2002

Get PDF

Computer System Gene Discovery for Promoter Structure Analysis

Authors: Vityaev, Eugenii E. | Orlov, Yury L. | Vishnevsky, Oleg V. | Pozdnyakov, Mikhail A. | Kolchanov, Nikolay A.

Article Type: Short Communication

Abstract: This paper presents implementation of Data Mining and Knowledge Discovery techniques for search-ing for regularities in tables of context features of DNA sequences involved in regulation of transcription. The goal is to discover regularities that relate nucleotide sequences to the functional classes of these sequences. The search patterns for regularities have been constructed in the first-order logic augmented by probabilistic estimates. To this aim, the PC software system Gene Discovery has been …designed. This system accepts molecular-genetical data retrieved from a database by using SQL queries. Nucleotide sequences of promoters of several functional systems were extracted from the TRRD database (http://wwwmgs.bionet.nsc.ru/mgs/gnw/trrd/) and analysed. The data in-clude nucleotide sequences of erythroid-specific gene promoters, endocrine system gene promoters, promoter regions of the genes controlling cell cycle, promoter of genes regulating lipid metabolism, and muscle-specific gene promoters. Several regularities that relate the nucleotide sequences in the regulatory DNA and their location relative to the transcription start with each functional class have been found. Show more

Keywords: Machine learning, knowledge discovery, data mining, bioinformatics, eukaryotic promoter recognition, transcription factors binding sites, oligonucleotide patterns

Citation: In Silico Biology, vol. 2, no. 3, pp. 257-262, 2002

Price: EUR 27.50

Mining Putative Regulatory Elements in Promoter Regions of Saccharomyces Cerevisiae

Authors: Horng, Jorng-Tzong | Huang, Hsien-Da | Huang, Shir-Ly | Yang, Ueng-Cheng | Chang, Yu-Chang

Article Type: Research Article

Abstract: The availability of genome-wide gene expression data provides a unique set of genes from which we can decipher the mechanisms underlying the common transcriptional response. Transcription factors, which can bind to specific DNA sites, cooperatively regulate the transcription of genes. This study attempts to mine putative binding sites to investigate how combinations of the sites predicted from known sites and over-represented repetitive elements are distributed in the promoter regions of groups of functionally …related genes. The over-represented repetitive elements appearing in the associations are possible transcription factor binding sites. The deduced association rules would facilitate to predict putative regulatory elements and to identify genes which are potentially co-regulated by the putative regulatory elements. Our proposed approach is applied to Saccharomyces cerevisiae and the promoter regions of yeast ORFs. Show more

Keywords: regulatory elements, repetitive oligonucleotide, data mining, promoter

Citation: In Silico Biology, vol. 2, no. 3, pp. 263-273, 2002

Price: EUR 27.50

Protein Similarity Search under mRNA Structural Constraints: Application to Targeted Selenocysteine Insertion

Authors: Backofen, Rolf | Narayanaswamy, N.S. | Swidan, Firas

Article Type: Research Article

Abstract: Selenocysteine is the 21th amino acid, which occurs in all kingdoms of life. Selenocysteine is en-coded by the STOP-codon UGA. For its insertion, it requires a specific mRNA sequence downstream the UGA-codon that forms a hairpin like structure (called Sec insertion sequence (SECIS). We consider the computational problem of generating new amino acid sequences containing selenocysteine. This requires to find an mRNA se-quence that is similar to the SECIS-consensus, is able to form the secondary structure …required for selenocysteine insertion, and whose translation is maximally similar to the original amino acid sequence. We show that the problem can be solved in linear time when considering the hairpin-like SECIS-structure (and, more generally, when consider-ing a structure that does not contain pseudoknots). Show more

Keywords: selenocysteine, SECIS, protein engineering

Citation: In Silico Biology, vol. 2, no. 3, pp. 275-290, 2002

Price: EUR 27.50

An Overview on Predicting the Subcellular Location of a Protein

Authors: Feng, Zhi-Peng

Article Type: Research Article

Abstract: The present paper overviews the issue on predicting the subcellular location of a protein. Five meas-ures of extracting information from the global sequence based on the Bayes discriminant algorithm are reviewed. 1) The auto-correlation functions of amino acid indices along the sequence; 2) The quasi-sequence-order approach; 3) the pseudo-amino acid composition; 4) the unified attribute vector in Hilbert space, 5) Zp parameters extracted from the Zp curve. The actual performance of the predictive accuracy is closely …related to the degree of similarity be-tween the training and testing sets or to the average degree of pairwise similarity in dataset in a cross-validated study. Many scholars considered that the current higher predictive accuracy still cannot ensure that some available algorithms are effective in practice prediction for the higher pairwise sequence identity of the datasets, but some of them declared that construction of the dataset used for developing software should base on the reality determined by the Mother Nature that some subcellular locations really contain only a minor number of proteins of which some even have a high percentage of sequence similarity. Owing to the complexity of the problem itself, some very so-phisticated and special programs are needed for both constructing dataset and improving the prediction. Anyhow finding the target information in mature protein sequence and properly cooperating it with sorting signals in predic-tion may further improve the overall predictive accuracy and make the prediction into practice. Show more

Keywords: subcellular location, N-terminal targeting sequences, sorting signals, targeting information, amino acid composition, quasi-sequence-order-effect, pseudo-amino acid composition, auto-correlation functions, unified attribute vector, Zp curve, Zp parameters, Bayes discriminant algorithm, component-coupled algorithm, k-nearest neighbor method, hidden Markov model, neural networks, Support Vector Machine (SVM), jackknife test, hydro-phobicity, pairwise sequence similarity

Citation: In Silico Biology, vol. 2, no. 3, pp. 291-303, 2002

Price: EUR 27.50

Molecular Dynamics Simulations on the Free and Complexed N-Terminal SH2 Domain of SHP-2

Authors: Wieligmann, Karin | De Castro, Luis Felipe Pineda | Zacharias, Martin

Article Type: Other

Abstract: ABSTRACT: Signal transduction events are often mediated by small protein domains such as SH2 (Src homology 2) domains that recognize phosphotyrosines (pY) and flanking sequences. In case of the SHP-2 receptor tyrosine phosphatase an N-terminal SH2 domain binds and inactivates the phosphatase (PTP) domain. The pY-peptide- binding site on the N-terminal SH2 domain does not overlap with the PTP binding region. Nevertheless, pY-peptide binding causes domain dissociation and phosphatase activation. Comparative multi-nanosecond …molecular dynam-ics simulations on the N-SH2 domain in ligand-bound and free states have been performed to study the allosteric mechanism that leads to domain dissociation upon pY-peptide binding. Significant ligand-dependent differences in the conformational flexibility of regions that are involved in SH2-PTP domain association have been observed. The results support a mechanism of signal transduction where SH2-peptide binding modulates the domain flexibility and reduces its capacity to fit into the entrance of the PTP catalytic domain of SHP-2. Show more

Keywords: allosteric conformational change, , signal transducution, ligand-receptor binding, molecular dynamics, SH2 domains, SHP-2 phosphatase, conformational flexibility

Citation: In Silico Biology, vol. 2, no. 3, pp. 305-311, 2002

Get PDF

ProML - The Protein Markup Language for Specification of Protein Sequences, Structures and Families

Authors: Hanisch, Daniel | Zimmer, Ralf | Lengauer, Thomas

Article Type: Research Article

Abstract: We propose a specification language ProML for protein sequences, structures, and families based on the open XML standard. The language allows for portable, system-independent, machine-parsable and human-readable representation of essential features of proteins. The language is of immediate use for several bioinformatics applications: we discuss clustering of proteins into families and the representation of the specific shared features of the respective clusters. Moreover, we use ProML for specification of data used in …fold recognition bench-marks exploiting experimentally derived distance constraints. Show more

Keywords: Protein Markup Language, ProML, XML, protein properties, protein families, protein structures, distance constraints, protein clusters

Citation: In Silico Biology, vol. 2, no. 3, pp. 313-324, 2002

Price: EUR 27.50

Improving Fold Recognition of Protein Threading by Experimental Distance Constraints

Authors: Albrecht, Mario | Hanisch, Daniel | Zimmer, Ralf | Lengauer, Thomas

Article Type: Research Article

Abstract: We present a comprehensive analysis of methods for improving the fold recognition rate of the threading approach to protein structure prediction by the utilization of few additional distance constraints. The distance constraints between protein residues may be obtained by experiments such as mass spectrometry or NMR spectroscopy. We applied a post-filtering step with new scoring functions incorporating measures of constraint satisfaction to ranking lists of 123D threading alignments. The detailed analysis of the …results on a small representative benchmark set show that the fold recognition rate can be improved significantly by up to 30% from about 54%-65% to 77%-84%, approaching the maximal attainable performance of 90% estimated by structural superposition alignments. This gain in performance adds about 10% to the recognition rate already achieved in our previous study with cross-link constraints only. Additional recent results on a larger benchmark set involving a confidence function for threading predictions also indicate notable improvements by our combined approach, which should be particularly valuable for rapid structure determination and validation of protein models. Show more

Keywords: protein threading, fold recognition, structure prediction, experimental data, distance constraints, cross-linking reagents, mass spectrometry, NOE restraints, NMR

Citation: In Silico Biology, vol. 2, no. 3, pp. 325-337, 2002

Price: EUR 27.50

A Hypergraph-Based Method for Unification of Existing Protein Structure- and Sequence-Families

Authors: Freudenberg, Jan | Zimmer, Ralf | Hanisch, Daniel | Lengauer, Thomas

Article Type: Research Article

Abstract: Classification of proteins is a major challenge in bioinformatics. Here an approach is presented, that unifies different existing classifications of protein structures and sequences. Protein structural domains are repre-sented as nodes in a hypergraph. Shared memberships in sequence families result in hyperedges in the graph. The presented method partitions the hypergraph into clusters of structural domains. Each computed cluster is based on a set of shared sequence family memberships. Thus, the clusters put existing …protein sequence families into the context of structural family hierarchies. Conversely, structural domains are related to their sequence family member-ships, which can be used to gain further knowledge about the respective structural families. Show more

Keywords: sequence analysis, structure analysis, domain boundary delineation, protein databases, protein homology, protein structure prediction, threading, template selection, optimization, protein clustering

Citation: In Silico Biology, vol. 2, no. 3, pp. 339-349, 2002

Price: EUR 27.50

Comparing Bound and Unbound Protein Structures Using Energy Calculation and Rotamer Statistics

Authors: Koch, Kerstin | Zöllner, Frank | Neumann, Steffen | Kummert, Franz | Sagerer, Gerhard

Article Type: Research Article

Abstract: Protein data in the PDB covers only a snapshot of a protein structure. For flexible docking confor-mational changes need to be considered. Rotamer statistics provide the likelihood for side chain conformations, and further comparison of bound and unbound state yields differences in preferred positions. Furthermore, we do a full sampling of selected angles and apply the AMBER force field. Conformation of energy minima complies with the rotamer statistics. Both types of information target the reduction …of search space for enumerative docking algo-rithms and provide parameters for elastic docking. Show more

Keywords: Rotamer library, flexible protein-protein docking, energy calculations, AMBER force field, side chain flexibility, flexibility measure

Citation: In Silico Biology, vol. 2, no. 3, pp. 351-368, 2002

Price: EUR 27.50

Prediction and Uncertainty in the Analysis of Gene Expression Profiles

Article Type: Research Article

Abstract: We have developed a complete statistical model for the analysis of tumor specific gene expression profiles. The approach provides investigators with a global overview on large scale gene expression data, indicating aspects of the data that relate to tumor phenotype, but also summarizing the uncertainties inherent in classification of tumor types. We demonstrate the use of this method in the context of a gene expression profiling study of 27 human breast cancers. The study is aimed …at defining molecular characteristics of tumors that reflect estrogen receptor status. In addition to good predictive performance with respect to pure classification of the expression profiles, the model also uncovers conflicts in the data with respect to the classification of some of the tumors, highlighting them as critical cases for which additional investigations are appropriate. Show more

Keywords: Computational diagnostics, gene expression analysis, expression profiles, micro array, gene chip, breast cancer, estrogen receptor status, Bayesian statistics, Bayesian regularization, binary regression, probit model, G-prior, singular value decomposition, predictive diagnosis, prognosis, tumor classification, uncertainty, factor regression, ridge regression, machine learning

Citation: In Silico Biology, vol. 2, no. 3, pp. 369-381, 2002

Price: EUR 27.50

Impact of Integrating Clinical and Genetic Information

Article Type: Research Article

Abstract: To assess the relevance of molecular markers it is required to combine clinical and genetic information. For reliable assessment of parameters relevant to diagnostics and therapy large patient collectives must be characterized both with respect to phenotype and genotype. Matching of genetic data like gene expression profiles, molecular genetics and cytogenetics with clinical data like follow-up, morphological findings and diagnoses involves integration of complex databases. In the context of a nationwide leukemia …research network in Germany we designed an integrated database covering both genetic and clinical data of patients. The system contains follow-up data and relevant laboratory modalities, i. e. cytomorphology, cytogenetics, molecular genetics, FISH, immunophenotyping and gene expression profiling. So far 13541 cases from 7746 patients treated by 1225 physicians are documented. The data structure consists of up to 888 variables per case. From our experience, integration of clinical and genetic information requires significant efforts - including data protection issues -, but is feasible and improves data quality leading to faster and more reliable research results for the benefit of the patients. Show more

Keywords: data integration, patient data, microarray, gene expression, cytogenetics, molecular genetics, leukemia

Citation: In Silico Biology, vol. 2, no. 3, pp. 383-391, 2002

Price: EUR 27.50

Modeling of Self-Organized Avascular Tumor Growth with a Hybrid Cellular Automaton

Authors: Dormann, Sabine | Deutsch, Andreas

Article Type: Research Article

Abstract: Pattern formation in multicellular spheroids is addressed with a hybrid lattice-gas cellular automaton model. Multicellular spheroids serve as experimental model system for the study of avascular tumor growth. Typically, multicellular spheroids consist of a necrotic core surrounded by rings of quiescent and proliferating tumor cells, respectively. Furthermore, after an initial exponential growth phase further spheroid growth is significantly slowed down even if further nutrient is supplied. The cellular automaton model explicitly takes …into account mitosis, apoptosis and necrosis as well as nutrient consumption and a diffusible signal that is emitted by cells becoming necrotic. All cells follow identical interaction rules. The necrotic signal induces a chemotactic migration of tumor cells towards maximal signal concentrations. Starting from a small number of tumor cells automaton simulations exhibit the self-organized formation of a layered structure consisting of a necrotic core, a ring of quiescent tumor cells and a thin outer ring of proliferating tumor cells. Show more

Keywords: mathematical model, multicellular spheroid, avascular tumor growth, cellular automaton, , self-organization, simulation

Citation: In Silico Biology, vol. 2, no. 3, pp. 393-406, 2002

Price: EUR 27.50

Supporting Genotype-Phenotype Correlation with the Rare Metabolic Diseases Database Ramedis

Article Type: Research Article

Abstract: To gain further knowledge about rare genetic diseases, a world wide method for data collection via the Internet has been established. This new approach will improve collecting valuable data from single case reports. Ramedis saves standardised patient data which will be usable for statistics, longitudinal examinations and cooperative studies in future time. Embedded in the scene of the German Human Genome Project, Ramedis directly will enable phenotype-genotype correlations. Beside the better characterisation of clinical …heterogeneity of rare metabolic diseases, there may be a great benefit for the treatment of these patients in whom prospective studies are otherwise expensive and difficult to perform. This contribution presents the motivation for this system, introduces features, current state and the future of the project. Additionally, first experiences of using Ramedis by health professionals are explained. Show more

Keywords: case study, database, genotype-phenotype correlation, information system, rare metabolic disease, remote data entry

Citation: In Silico Biology, vol. 2, no. 3, pp. 407-414, 2002

Price: EUR 27.50

The System Architecture of the BioPath System

Authors: Forster, Michael | Pick, Andreas | Raitner, Marcus | Schreiber, Falk | Brandenburg, Franz J.

Article Type: Research Article

Abstract: BioPath is a prototype system for the interactive exploration of biochemical pathways. It has been developed as an electronic version of the famous Boehringer Biochemical Pathways map and offers various ways to access information on substances and pathways and to navigate through pathways. This paper describes the main features and the software architecture of BioPath. The companion paper [11] focuses on the advanced visualization incorporated into BioPath.

Keywords: biochemical pathways, metabolic pathways, visualization, exploration

Citation: In Silico Biology, vol. 2, no. 3, pp. 415-426, 2002

Price: EUR 27.50

Rapid Generation of a Representative Ensemble of N-Glycan Conformations

Authors: Frank, Martin | Bohne-Lang, Andreas | Wetter, Thomas | v.d. Lieth, Claus-W.

Article Type: Research Article

Abstract: Glycosylated proteins are ubiquitous components of extracellular matrices and cellular surfaces where their oligosaccharide moieties are implicated in a wide range of cellcell and cellmatrix recognition events. Glycans constitute highly flexible molecules. Only a small number of glycan X-ray structures is available for which sufficient electron density for an entire oligosaccharide chain has been observed. An unambiguous structure deter-mination based on NMR-derived geometric constraints alone is often not possible. Time consuming computational …approaches such as Monte Carlo calculations and molecular dynamics simulations have been widely used to explore the conformational space accessible to complex carbohydrates. The generation of a comprehensive data base for N-glycan fragments based on long time molecular dynamics simulations is presented. The fragments are chosen in such a way that the effects of branched N-glycan structures are taken into account. The prediction database consti-tutes the basis of a procedure to generate a complete set of all possible conformations for a given N-glycan. The constructed conformations are ranked according to their energy content. The resulting conformations are in reason-able agreement with experimental data. A web interface has been established (http://www.dkfz.de/spec/glydict/), which enables to input any N-glycan of interest and to receive an ensemble of generated conformations within a few minutes. Show more

Keywords: conformations of N-glycans, molecular dynamics simulations, database of N-glycan fragments, glycoproteins

Citation: In Silico Biology, vol. 2, no. 3, pp. 427-439, 2002

Price: EUR 27.50

In Silico Biology - Volume 2, issue 3

The German Conference on Bioinformatics 2001

Bioinformatics Research and Education in Germany

Bioinformatics Service, Education and Research: The EMBnet and CBI

Ontologies for Molecular Biology and Bioinformatics

AGenDA: Gene Prediction by Comparative Sequence Analysis

A System Architecture for Genomic Data Analysis

Building a Genome Database Using an Object-Oriented Approach

The Semantic Metadatabase (SEMEDA): Ontology Based Integration of Federated Molecular Biological Data Sources

Construction of Stochastic Context Trees for Genetic Texts

Finding and Decrypting of Promoters Contributes to the Elucidation of Gene Function

Computer System Gene Discovery for Promoter Structure Analysis

Mining Putative Regulatory Elements in Promoter Regions of Saccharomyces Cerevisiae

Protein Similarity Search under mRNA Structural Constraints: Application to Targeted Selenocysteine Insertion

An Overview on Predicting the Subcellular Location of a Protein

Molecular Dynamics Simulations on the Free and Complexed N-Terminal SH2 Domain of SHP-2

ProML - The Protein Markup Language for Specification of Protein Sequences, Structures and Families

Improving Fold Recognition of Protein Threading by Experimental Distance Constraints

A Hypergraph-Based Method for Unification of Existing Protein Structure- and Sequence-Families

Comparing Bound and Unbound Protein Structures Using Energy Calculation and Rotamer Statistics

Prediction and Uncertainty in the Analysis of Gene Expression Profiles

Impact of Integrating Clinical and Genetic Information

Modeling of Self-Organized Avascular Tumor Growth with a Hybrid Cellular Automaton

Supporting Genotype-Phenotype Correlation with the Rare Metabolic Diseases Database Ramedis

The System Architecture of the BioPath System

Rapid Generation of a Representative Ensemble of N-Glycan Conformations

North America

Europe

Asia