https://scholars.lib.ntu.edu.tw/handle/123456789/625576
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhang Z | en_US |
dc.contributor.author | Leipzig J | en_US |
dc.contributor.author | Sasson A | en_US |
dc.contributor.author | ANGELA YU-CHEN LIN | en_US |
dc.contributor.author | Perin J.C | en_US |
dc.contributor.author | Xie H.M | en_US |
dc.contributor.author | Sarmady M | en_US |
dc.contributor.author | Warren P.V | en_US |
dc.contributor.author | White P.S. | en_US |
dc.creator | Zhang Z;Leipzig J;Sasson A;Yu A.M;Perin J.C | - |
dc.creator | Xie H.M;Sarmady M;Warren P.V;White P.S. | - |
dc.date.accessioned | 2022-11-16T08:54:15Z | - |
dc.date.available | 2022-11-16T08:54:15Z | - |
dc.date.issued | 2013 | - |
dc.identifier.issn | 14712105 | - |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84886859902&doi=10.1186%2f1471-2105-14-S11-S3&partnerID=40&md5=53964e3b25847507a21d462c88af73d2 | - |
dc.identifier.uri | https://scholars.lib.ntu.edu.tw/handle/123456789/625576 | - |
dc.description.abstract | Background: High-throughput sequencing (HTS) technologies are spearheading the accelerated development of biomedical research. Processing and summarizing the large amount of data generated by HTS presents a non-trivial challenge to bioinformatics. A commonly adopted standard is to store sequencing reads aligned to a reference genome in SAM (Sequence Alignment/Map) or BAM (Binary Alignment/Map) files. Quality control of SAM/BAM files is a critical checkpoint before downstream analysis. The goal of the current project is to facilitate and standardize this process.Results: We developed bamchop, a robust program to efficiently summarize key statistical metrics of HTS data stored in BAM files, and to visually present the results in a formatted report. The report documents information about various aspects of HTS data, such as sequencing quality, mapping to a reference genome, sequencing coverage, and base frequency. Bamchop uses the R language and Bioconductor packages to calculate statistical matrices and the Sweave utility and associated LaTeX markup for documentation. Bamchop's efficiency and robustness were tested on BAM files generated by local sequencing facilities and the 1000 Genomes Project. Source code, instruction and example reports of bamchop are freely available from https://github.com/CBMi-BiG/bamchop.Conclusions: Bamchop enables biomedical researchers to quickly and rigorously evaluate HTS data by providing a convenient synopsis and user-friendly reports. © 2013 Zhang et al; licensee BioMed Central Ltd. | - |
dc.relation.ispartof | BMC Bioinformatics | - |
dc.subject.other | Base frequencies; Biomedical research; Current projects; High-throughput sequencing; Large amounts; Non-trivial; Sequence alignments; Source codes; Bioinformatics; Genes; Data visualization; article; chromosome; computer program; exon; genome; high throughput sequencing; methodology; nucleotide sequence; reproducibility; sequence alignment; Base Sequence; Chromosomes; Exons; Genome; High-Throughput Nucleotide Sequencing; Reproducibility of Results; Sequence Alignment; Software | - |
dc.title | Efficient digest of high-throughput sequencing data in a reproducible report | en_US |
dc.type | journal article | en |
dc.identifier.doi | 10.1186/1471-2105-14-S11-S3 | - |
dc.identifier.pmid | 24564231 | - |
dc.identifier.scopus | 2-s2.0-84886859902 | - |
dc.relation.journalvolume | 14 | - |
dc.relation.journalissue | SUPPL11 | - |
item.fulltext | no fulltext | - |
item.cerifentitytype | Publications | - |
item.openairetype | journal article | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.grantfulltext | none | - |
crisitem.author.dept | Environmental Engineering | - |
crisitem.author.orcid | 0000-0003-4236-7233 | - |
crisitem.author.parentorg | College of Engineering | - |
Appears in Collections: | 環境工程學研究所 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.