Abstract:

BACKGROUND: Nucleosome distribution along chromatin dictates genomic DNA accessibility and thus profoundly influences gene expression. However, the underlying mechanism of nucleosome formation remains elusive. Here, taking a structural perspective, we systematically explored the nucleosome formation potential of genomic sequences and its effect on chromatin organization and gene expression in S. cerevisiae.

RESULTS: We analyzed twelve structural features related to the flexibility, curvature and energy of DNA sequences. The results showed that some structural features, such as DNA denaturation, DNA-bending stiffness, stacking energy, Z-DNA, propeller twist and free energy, were highly correlated with in vitro and in vivo nucleosome occupancy. Specifically, they fall into two classes, one positively and the other negatively correlated with nucleosome occupancy. These two kinds of structural features facilitated nucleosome binding in centromere regions and repressed nucleosome formation in the promoter regions of protein-coding genes to mediate transcriptional regulation. Based on these analyses, we integrated all twelve structural features into a model that predicts nucleosome occupancy in vivo more accurately than existing methods, which depend mainly on sequence compositional features. Furthermore, we developed a novel approach, named DLaNe, that locates nucleosomes by detecting peaks of structural profiles, and built a meta predictor to integrate information from different structural features. For comparison, we also constructed a hidden Markov model (HMM) to locate nucleosomes based on the profiles of these structural features. The results showed that the meta DLaNe and the HMM-based method performed better than existing methods, demonstrating the power of these structural features in predicting nucleosome positions.

CONCLUSIONS: Our analysis revealed that DNA structures contribute significantly to nucleosome organization and influence chromatin structure and gene expression regulation. The results indicated that our proposed methods are effective in predicting nucleosome occupancy and positions, and that these structural features are highly predictive of nucleosome organization. The implementation of our DLaNe method based on structural features is available online.
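
The abstract describes locating nucleosomes by detecting peaks in structural profiles but gives no algorithmic detail. The following is a minimal sketch of that general idea, not the DLaNe implementation: it scores each dinucleotide step of a sequence, smooths the profile, and keeps the highest local maxima that are at least a minimum distance apart. The function names, the threshold and spacing parameters, and the `TOY_SCORES` table are all hypothetical; the score values are placeholders, not published structural parameters.

```python
from typing import Dict, List

# Hypothetical per-dinucleotide scores: placeholder values for illustration,
# NOT published structural parameters.
TOY_SCORES: Dict[str, float] = {
    "AA": -1.0, "AT": -0.8, "TA": -0.9, "TT": -1.0,
    "GC": 1.0, "CG": 0.9, "GG": 0.7, "CC": 0.7,
}


def structural_profile(seq: str, scores: Dict[str, float],
                       default: float = 0.0) -> List[float]:
    """Score every dinucleotide step; unknown steps get `default`."""
    seq = seq.upper()
    return [scores.get(seq[i:i + 2], default) for i in range(len(seq) - 1)]


def smooth(profile: List[float], window: int) -> List[float]:
    """Centered moving average; the window is truncated at the ends."""
    half = window // 2
    out = []
    for i in range(len(profile)):
        lo, hi = max(0, i - half), min(len(profile), i + half + 1)
        out.append(sum(profile[lo:hi]) / (hi - lo))
    return out


def call_peaks(profile: List[float], min_spacing: int,
               threshold: float) -> List[int]:
    """Return local maxima above `threshold`, strongest first, while
    enforcing at least `min_spacing` positions between accepted peaks."""
    candidates = [
        i for i in range(1, len(profile) - 1)
        if profile[i] > threshold
        and profile[i] >= profile[i - 1]
        and profile[i] >= profile[i + 1]
    ]
    # Greedy selection: strongest candidates claim their neighborhood first.
    candidates.sort(key=lambda i: profile[i], reverse=True)
    chosen: List[int] = []
    for i in candidates:
        if all(abs(i - j) >= min_spacing for j in chosen):
            chosen.append(i)
    return sorted(chosen)
```

On a genomic sequence, one might call `call_peaks(smooth(structural_profile(seq, TOY_SCORES), 147), min_spacing=147, threshold=0.0)`, using the roughly 147 bp of DNA wrapped by a nucleosome core as an assumed window and spacing. A meta predictor in the spirit of the abstract would combine peak calls from several such profiles, which this sketch does not attempt.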

iNuc-PhysChem: a sequence-based predictor for identifying nucleosomes via physicochemical properties.

Genome-wide Nucleosome Detection Based on the Dinucleotide Position Frequencies

iNuc-PseKNC: a Sequence-Based Predictor for Predicting Nucleosome Positioning in Genomes with Pseudo K-Tuple Nucleotide Composition.

NucPosPred: Predicting Species-Specific Genomic Nucleosome Positioning Via Four Different Modes of General PseKNC.

Nuc-PLoc: a New Web-Server for Predicting Protein Subnuclear Localization by Fusing PseAA Composition and PsePSSM.

DNA physical properties outperform sequence compositional information in classifying nucleosome-enriched and -depleted regions

An integrative analysis of nucleosome occupancy and positioning using diverse sequence dependent properties.

DeepNup: Prediction of Nucleosome Positioning from DNA Sequences Using Deep Neural Network

Structural Features Based Genome-Wide Characterization and Prediction of Nucleosome Organization

Improving the Prediction of Protein-Nucleic Acids Binding Residues Via Multiple Sequence Profiles and the Consensus of Complementary Methods

iEnhancer-2L: a Two-Layer Predictor for Identifying Enhancers and Their Strength by Pseudo K-Tuple Nucleotide Composition

PDNAsite: Identification of DNA-Binding Site from Protein Sequence by Incorporating Spatial and Sequence Context

Prediction of Nucleosome Positions in the Yeast Genome Based on Matched Mirror Position Filtering.

Sc-ncDNAPred: A Sequence-Based Predictor for Identifying Non-coding DNA in Saccharomyces Cerevisiae

iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.

SNBRFinder: A Sequence-Based Hybrid Algorithm For Enhanced Prediction Of Nucleic Acid-Binding Residues

DNA Physical Parameters Modulate Nucleosome Positioning in the Saccharomyces Cerevisiae Genome

iRSpot-PseDNC: Identify Recombination Spots with Pseudo Dinucleotide Composition

DPNuc: Identifying Nucleosome Positions Based on the Dirichlet Process Mixture Model

iRNA(m6A)-PseDNC: Identifying N6-methyladenosine Sites Using Pseudo Dinucleotide Composition.

iORI-PseKNC: A Predictor for Identifying Origin of Replication with Pseudo K-Tuple Nucleotide Composition