https://scholars.lib.ntu.edu.tw/handle/123456789/590356
標題: | KTU: K-mer Taxonomic Units improve the biological relevance of amplicon sequence variant microbiota data | 作者: | Liu, Po Yu SHAN-HUA YANG Yang, Sung Yin |
關鍵字: | amplicon sequencing | k-mer-based taxonomy unit | microbiome-associated studies | microbiota | 公開日期: | 1-一月-2021 | 出版社: | British Ecological Society | 卷: | 13 | 期: | 1 | 起(迄)頁: | 560-568 | 來源出版物: | Methods in Ecology and Evolution | 摘要: | Amplicon sequencing is widely implemented in microbiome-associated studies. In recent years, microbial ecologists have switched to new algorithms for taxonomic identification and quantification. The amplicon sequence variant (ASV) denoising algorithm of unbiased sequence picking has replaced the OTU clustering methods. ASV can be used to detect and distinguish biological variations to the species OTU level (≥97% similarity). However, the ASV quantification among samples is sparse and less prevalent within the same batch. Here, we present a k-mer based, alignment-free algorithm—‘KTU’ (K-mer Taxonomic Unit)—to iteratively re-cluster ASVs into optimal biological taxonomic units. The ‘KTU’ algorithm comprises four parts: (a) The k-mer frequency calling is sliding window counted by tetranucleotide frequencies from both ends of the DNA sequence. (b) The similarities in k-mer frequencies among the sequences are measured by cosine dissimilarity. (c) The KTUs are detected from the cosine dissimilarity matrix using the partition around medoids (PAM) clustering algorithm. The iterative PAM-KTU detecting process searches for the numbers of KTU convergent clusters according to the maximum silhouette coefficient. (d) Finally, the ASVs are aggregated into the corresponding KTUs. KTU re-clustered every 1.38–4.53 ASVs into a feature with >99% sequence similarity on average and 1% cosine divergence for each KTU. Additionally, the re-clustering procedure improved biological explanations for correlations and significances of clinical and environmental factors. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/590356 | ISSN: | 2041210X | DOI: | 10.1111/2041-210X.13758 |
顯示於: | 漁業科學研究所 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。