English  |  正體中文  |  简体中文  |  Items with full text/Total items : 90429/105609 (86%)
Visitors : 10279723      Online Users : 105
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version

    Please use this identifier to cite or link to this item: http://asiair.asia.edu.tw/ir/handle/310904400/4208

    Title: he Query of Bioinformatics Literatures By Document Similarity
    Authors: Kuan-Yen Tseng
    Contributors: Department of Computer Science and Information Engineering
    Keywords: Information Retrieval、Document Query、Cosine Similarity
    Date: 2008
    Issue Date: 2009-11-18 21:14:30 (UTC+8)
    Publisher: Asia University
    Abstract: It is an obstacle for the beginner to use keywords to search for related documents from the literatures, especailly for the one who was not familar with the concept of what he was looking for.
    This reasearch includes two processes, 『the computation of document similarity』 and 『document vectorization』.
    Regarding 『document vectorization』, we transfer each document into one vector by appropriate pattern weighting according to the distribution of the patterns.
    『the computation of document similarity』 means to compute the similarities between the vector of query document and the one of each document in the literature after transfering the query docuemnt into one vector, and to give the order of the documents for user's reference by sorting the values of these similarities.
    We have two approaches to have the set of the patterns, including『dictionary』 and 『content』, and use cosine similarity to evaluate the similiarity of two vectors.
    Experimental results showed that the value of precision achieved by the『dictionary』was higher than that achieved by the 『content』.
    Appears in Collections:[資訊工程學系] 博碩士論文

    Files in This Item:

    File SizeFormat

    All items in ASIAIR are protected by copyright, with all rights reserved.

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback