Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Kamasawa, Masami; | Horiuchi, Jun-ichi
Affiliations: Department of Biotechnology and Environmental Chemistry, Kitami Institute of Technology, 165 Koen-cho, Kitami, Hokkaido 090-8507, Japan | Toyo Engineering Corporation, 2-8-1 Akanehama, Narashino, Chiba 275-0024, Japan
Note: [] Corresponding author. Tel.: +81 157 26 9415; Fax: +81 157 24 7719; E-mail: [email protected]
Abstract: A large-scale analysis of human polyadenylation signals was carried out in silico. The most canonical AAUAAA hexamer and its 11 single-nucleotide variants that are most frequent in human genes were used to search for polyadenylation signals in the terminal sequences. Out of 18,277 poly(A) sites that were identified from 26,414 human genes, 82.5% of the sites were found to contain at least one of these 12 hexamers as a polyadenylation signal within 40 nucleotides upstream of the poly(A) site. The rest (17.5%) did not contain any of these hexamers, which suggests the existence of yet unknown signals. A total of 20,347 terminal sequences in close proximity to 12 polyadenylation signals were collected using modified EST clustering technique to establish a large-scale database of polyadenylation signals. To characterize the 12 hexamers, the locations of polyadenylation signals that were identified as "authentic" and the uracil contents of the downstream region of the signal were examined. Based on this analysis, the 11 variants of the canonical AAUAAA were identified as possibly forming "functional" signals as AAUAAA. Moreover, the observed frequency of 41.9% for AAUAAA was significantly lower than those of other reports, suggesting that the non-canonical variants are more important in the polyadenylation process than frequently recognized. Since the poly(A) sites processed by those non-canonical variants have not been generally annotated in major gene databases, it is important to determine whether the variant hexamers could work as polyadenylation signals that may be responsible for generating heterogeneity of mRNAs by alternative polyadenylation.
Keywords: Polyadenylation signal, poly(A) site, downstream sequence element, upstream sequence element, single-nucleotide variants, 3'-untranslated region
Journal: In Silico Biology, vol. 8, no. 3-4, pp. 347-361, 2008
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]