[HTML][HTML] refTSS: a reference data set for human and mouse transcription start sites

I Abugessaisa, S Noguchi, A Hasegawa… - Journal of molecular …, 2019 - Elsevier
I Abugessaisa, S Noguchi, A Hasegawa, A Kondo, H Kawaji, P Carninci, T Kasukawa
Journal of molecular biology, 2019Elsevier
Transcription starts at genomic positions called transcription start sites (TSSs), producing
RNAs, and is mainly regulated by genomic elements and transcription factors binding
around these TSSs. This indicates that TSSs may be a better unit to integrate various data
sources related to transcriptional events, including regulation and production of RNAs.
However, although several TSS datasets and promoter atlases are available, a
comprehensive reference set that integrates all known TSSs is lacking. Thus, we constructed …
Abstract
Transcription starts at genomic positions called transcription start sites (TSSs), producing RNAs, and is mainly regulated by genomic elements and transcription factors binding around these TSSs. This indicates that TSSs may be a better unit to integrate various data sources related to transcriptional events, including regulation and production of RNAs. However, although several TSS datasets and promoter atlases are available, a comprehensive reference set that integrates all known TSSs is lacking. Thus, we constructed a reference dataset of TSSs (refTSS) for the human and mouse genomes by collecting publicly available TSS annotations and promoter resources, such as FANTOM5, DBTSS, EPDnew, and ENCODE. The data set consists of genomic coordinates of TSS peaks, their gene annotations, quality check results, and conservation between human and mouse. We also developed a web interface to browse the refTSS (http://reftss.clst.riken.jp/). Users can access the resource for collecting and integrating data and information about transcriptional regulation and transcription products.
Elsevier