nr-2l a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived featuresnr-2l两级预测识别核受体亚科基于sequence-derived特性.pdfVIP

  • 6
  • 0
  • 约6.64万字
  • 约 9页
  • 2017-09-01 发布于上海
  • 举报

nr-2l a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived featuresnr-2l两级预测识别核受体亚科基于sequence-derived特性.pdf

nr-2l a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived featuresnr-2l两级预测识别核受体亚科基于sequence-derived特性

NR-2L: A Two-Level Predictor for Identifying Nuclear Receptor Subfamilies Based on Sequence-Derived Features 1 1,2 2 Pu Wang , Xuan Xiao *, Kuo-Chen Chou 1 Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen, China, 2 Gordon Life Science Institute, San Diego, California, United States of America Abstract Nuclear receptors (NRs) are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1) thyroid hormone like (NR1), (2) HNF4-like (NR2), (3) estrogen like, (4) nerve growth factor IB-like (NR4), (5) fushi tarazu-F1 like (NR5), (6) germ cell nuclear factor like (NR6), and (7) knirps like (NR0). The identification was made by the Fuzzy K nearest neighbor (FK-NN) classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the

您可能关注的文档

文档评论(0)

1亿VIP精品文档

相关文档