Design of Genome Database System with the Sequence Aligning and Data Compression Mechanism
Date Issued
2009
Date
2009
Author(s)
Lu, Yu-Wen
Abstract
With the initial completion of Human Genome Project, the post-genomic era is coming. Although the genome map of human has been decoded, the roles that each segment of sequences acts are not totally discovered. Their actually functions are still needed to be analyzed and researched. On the other hand, with the rapid expansion of sequence information, the issues of data compilation and data storage are increasingly important. In this thesis, a “Human Genome Database System” is designed and implemented in National Taiwan University Hospital (NTUH). By accessing this system, users can store and manage the experimental sequence data. The greatest achievement of this system is that it integrates the modules of sequence alignment and data compression. By embedding with the NCBI alignment program- blastall, it automatically aligns the uploaded sequences and searches for the corresponding genomic positions. Besides, the system encodes the differences between sequences, effectively compresses them and decreases the demand of storage space. At the same time, it offers a variety of query methods. Users can quickly access the interesting data by inputting the keywords of specimen number, GI and sequence position, etc.
Subjects
Human Genome Project
DNA sequencing
genome database system
sequence alignment
data compression
Type
thesis
File(s)![Thumbnail Image]()
Loading...
Name
ntu-98-R96945042-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):e689cfa7cb0ebe6614764aaa6daad699