    Please use this identifier to cite or link to this item: https://ir.csmu.edu.tw:8080/ir/handle/310902500/3690


    Title: A parallel and incremental algorithm for efficient unique signature discovery on DNA databases
    Authors: Hsiao Ping Lee; Tzu-Fang Sheu; Chuan Yi Tang
    Contributors: Chung Shan Medical University: Department of Applied Information Sciences
    Date: 2010
    Issue Date: 2011-05-06T02:40:16Z (UTC)
    Abstract: Background
    DNA signatures are distinct short nucleotide sequences that provide valuable information for various purposes, such as the design of Polymerase Chain Reaction primers and microarray experiments. Biologists usually use a discovery algorithm to find unique signatures in DNA databases, and then apply the signatures to microarray experiments. Such discovery algorithms require input factors to be set, such as signature length l and mismatch tolerance d, which affect the discovery results. However, guidance on how to select proper factor values is rare, especially when an unfamiliar DNA database is used. In most cases, biologists select factor values based on experience, or even by guessing. If the discovered result is unsatisfactory, they change the input factors of the algorithm to obtain a new result. This process is repeated until a proper result is obtained. Implicit signatures under the discovery condition (l, d) are defined as the signatures of length ≤ l with mismatch tolerance ≥ d. A discovery algorithm that could discover all implicit signatures, so that those meeting the requirements can be selected from the results, would be more helpful than one that depends on trial and error. However, existing discovery algorithms do not address the need to discover all implicit signatures.
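    To make the roles of l and d concrete, here is a minimal brute-force sketch. It assumes a common reading of "unique signature" that the abstract itself does not spell out: a length-l window is a signature under (l, d) if every other length-l window in the database differs from it in more than d positions. The names hamming and discover_signatures are hypothetical, and this is not the authors' algorithm.

```python
def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def discover_signatures(database, l, d):
    """Brute-force discovery under the assumed definition of (l, d).

    A window is kept if every other window in the database
    differs from it in more than d positions.
    """
    # Every length-l window, tagged with (sequence index, offset).
    windows = [
        (seq[i:i + l], (s, i))
        for s, seq in enumerate(database)
        for i in range(len(seq) - l + 1)
    ]
    signatures = set()
    for pattern, loc in windows:
        if all(
            other_loc == loc or hamming(pattern, other) > d
            for other, other_loc in windows
        ):
            signatures.add(pattern)
    return signatures


toy_db = ["AAAAA", "CCGTT"]  # toy data, not from the paper
print(discover_signatures(toy_db, 3, 1))
print(discover_signatures(toy_db, 3, 2))
```

    On this toy database, raising d from 1 to 2 empties the result set, which illustrates why a poorly chosen (l, d) forces the trial-and-error loop described above.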
    Results
    This work proposes two discovery algorithms - the consecutive multiple discovery (CMD) algorithm and the parallel and incremental signature discovery (PISD) algorithm. The PISD algorithm is designed for efficiently discovering signatures under a certain discovery condition. The algorithm finds new results by using previously discovered results as candidates, rather than by using the whole database. The PISD algorithm further increases discovery efficiency by applying parallel computing. The CMD algorithm is designed to discover implicit signatures efficiently. It uses the PISD algorithm as a kernel routine to discover implicit signatures efficiently under every feasible discovery condition.
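    The idea of reusing earlier results as candidates can be sketched as follows. This is one plausible reading, not the published PISD or CMD procedure: it assumes that a signature under (l, d) is a length-l window with no other window within d mismatches, so every signature at tolerance d + 1 is already a signature at tolerance d. The function names are hypothetical, and only the tolerance dimension is covered (shorter lengths are not).

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def windows(database, l):
    """All length-l windows of all sequences, duplicates included."""
    return [seq[i:i + l] for seq in database for i in range(len(seq) - l + 1)]


def refine(candidates, all_windows, d):
    """Keep the candidates that are still signatures at tolerance d."""
    kept = set()
    for pattern in candidates:
        # Count windows within d mismatches; the pattern's own
        # occurrence always contributes exactly one to this count.
        close = sum(1 for w in all_windows if hamming(pattern, w) <= d)
        if close == 1:
            kept.add(pattern)
    return kept


def discover_incrementally(database, l, max_d):
    """Signatures for d = 0..max_d, each step seeded by the previous result."""
    all_windows = windows(database, l)
    candidates = set(all_windows)
    results = {}
    for d in range(max_d + 1):
        # Candidates come from the previous result, not the whole database,
        # but each one is still checked against every database window.
        candidates = refine(candidates, all_windows, d)
        results[d] = candidates
    return results


print(discover_incrementally(["AAAAA", "CCGTT"], 3, 2))  # toy data
```

    Each candidate in refine is checked independently of the others, so the loop body could be divided among processing cores; the sketch keeps it sequential for clarity.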
    Conclusions
    The proposed algorithms discover implicit signatures efficiently. In experiments using eight processing cores, the CMD algorithm took up to 97% less execution time than typical sequential discovery algorithms to discover implicit signatures.
    URI: https://ir.csmu.edu.tw:8080/handle/310902500/3690
    http://dx.doi.org/10.1186/1471-2105-11-132
    Relation: BMC Bioinformatics 2010, 11:132. doi:10.1186/1471-2105-11-132
    Appears in Collections:[ Department of Medical Informatics (including MS Program) ] Journal paper


    All items in CSMUIR are protected by copyright, with all rights reserved.

