Genome-wide identification of DNaseI hypersensitive sites using active chromatin sequence libraries

PJ Sabo, R Humbert, M Hawrylycz… - Proceedings of the …, 2004 - National Acad Sciences
PJ Sabo, R Humbert, M Hawrylycz, JC Wallace, MO Dorschner, M McArthur…
Proceedings of the National Academy of Sciences, 2004National Acad Sciences
Comprehensive identification of sequences that regulate transcription is one of the major
goals of genome biology. Focal alteration in chromatin structure in vivo, detectable through
hypersensitivity to DNaseI and other nucleases, is the sine qua non of a diverse cast of
transcriptional regulatory elements including enhancers, promoters, insulators, and locus
control regions. We developed an approach for genome-scale identification of DNaseI
hypersensitive sites (HSs) via isolation and cloning of in vivo DNaseI cleavage sites to …
Comprehensive identification of sequences that regulate transcription is one of the major goals of genome biology. Focal alteration in chromatin structure in vivo, detectable through hypersensitivity to DNaseI and other nucleases, is the sine qua non of a diverse cast of transcriptional regulatory elements including enhancers, promoters, insulators, and locus control regions. We developed an approach for genome-scale identification of DNaseI hypersensitive sites (HSs) via isolation and cloning of in vivo DNaseI cleavage sites to create libraries of active chromatin sequences (ACSs). Here, we describe analysis of >61,000 ACSs derived from erythroid cells. We observed peaks in the density of ACSs at the transcriptional start sites of known genes at non-gene-associated CpG islands, and, to a lesser degree, at evolutionarily conserved noncoding sequences. Peaks in ACS density paralleled the distribution of DNaseI HSs. ACSs and DNaseI HSs were distributed between both expressed and nonexpressed genes, suggesting that a large proportion of genes reside within open chromatin domains. The results permit a quantitative approximation of the distribution of HSs and classical cis-regulatory sequences in the human genome.
National Acad Sciences