English  |  正體中文  |  简体中文  |  Items with full text/Total items : 17918/22933 (78%)
Visitors : 7417043      Online Users : 28
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: https://ir.csmu.edu.tw:8080/ir/handle/310902500/21057


    Title: MADS-Box Gene Classification in Angiosperms by Clustering and Machine Learning Approaches
    Authors: Yu-Ting Chen;Chi-Chang Chang;Chi-Wei Chen;Kuan-Chun Chen;Yen-Wei Chu
    Contributors: 應用資訊科學學系暨碩士班
    Keywords: : ABCDE model, MADS-box gene, phylogenetic tree, support vector machine, machine learning
    Date: 2019-01-08
    Issue Date: 2020-08-10T08:58:09Z (UTC)
    Publisher: Front. Genet.
    Abstract: The MADS-box gene family is an important transcription factor family involved in floral organogenesis. The previously proposed ABCDE model suggests that different floral organ identities are controlled by various combinations of classes of MADS-box genes. The five-class ABCDE model cannot cover all the species of angiosperms, especially the orchid. Thus, we developed a two-stage approach for MADS-box gene classification to advance the study of floral organogenesis of angiosperms. First, eight classes of reference datasets (A, AGL6, B12, B34, BPI, C, D, and E) were curated and clustered by phylogenetic analysis and unsupervised learning, and they were confirmed by the literature. Second, feature selection and multiple prediction models were curated according to sequence similarity and the characteristics of the MADS-box gene domain using support vector machines. Compared with the BindN and COILS features, the local BLAST model yielded the best accuracy. For performance evaluation, the accuracy of Phalaenopsis aphrodite MADS-box gene classification was 93.3%, which is higher than 86.7% of our previous classification prediction tool, iMADS. Phylogenetic tree construction – the most common method for gene classification yields classification errors and is time-consuming for analysis of massive, multi-species, or incomplete sequences. In this regard, our new system can also confirm the classification errors of all the random selection that were incorrectly classified by phylogenetic tree analysis. Our model constitutes a reliable and efficient MADS-box gene classification system for angiosperms.
    URI: https://ir.csmu.edu.tw:8080/ir/handle/310902500/21057
    Relation: Front. Genet. 9:707.
    Appears in Collections:[ Department of Medical Informatics (including MS Program) ] Journal paper

    Files in This Item:

    File Description SizeFormat
    4-7.pdf5824KbAdobe PDF250View/Open


    SFX Query

    All items in CSMUIR are protected by copyright, with all rights reserved.


    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback