FEI-MAN HSUHorton, PaulPaulHorton2026-03-042026-03-042024-10-09https://www.scopus.com/pages/publications/85205997376?origin=resultslisthttps://scholars.lib.ntu.edu.tw/handle/123456789/736083Article number 326Background: Some transcription factors, MYC for example, bind sites of potentially methylated DNA. This may increase binding specificity as such sites are (1) highly under-represented in the genome, and (2) offer additional, tissue specific information in the form of hypo- or hyper-methylation. Fortunately, bisulfite sequencing data can be used to investigate this phenomenon. Method: We developed MethylSeqLogo, an extension of sequence logos which includes new elements to indicate DNA methylation and under-represented dimers in each position of a set binding sites. Our method displays information from both DNA strands, and takes into account the sequence context (CpG or other) and genome region (promoter versus whole genome) appropriate to properly assess the expected background dimer frequency and level of methylation. MethylSeqLogo preserves sequence logo semantics—the relative height of nucleotides within a column represents their proportion in the binding sites, while the absolute height of each column represents information (relative entropy) and the height of all columns added together represents total information Results: We present figures illustrating the utility of using MethylSeqLogo to summarize data from several CpG binding transcription factors. The logos show that unmethylated CpG binding sites are a feature of transcription factors such as MYC and ZBTB33, while some other CpG binding transcription factors, such as CEBPB, appear methylation neutral. Conclusions: Our software enables users to explore bisulfite and ChIP sequencing data sets—and in the process obtain publication quality figures.trueDNA methylationTranscription Factor Binding SitesVisualizationMethylSeqLogo: DNA methylation smart sequence logosjournal article10.1186/s12859-024-05896-22-s2.0-85205997376