ChEC-seq and other MNase based approaches have several advantages over ChIP-seq. ChEC-seq is not biased by non-specific crosslinking to highly transcribed regions, by DNA sequences that do not efficiently crosslink with formaldehyde and does not require specific antibodies to the transcription factor (TF) of interest. In practical terms, ChEC-seq experiments are significantly faster and cheaper (no costs associated with antibodies and antibody resins). If cells are ready for harvesting in the morning, the full experiment can be done in one day for multiple samples (alternatively two stopping points are available). In the yeast system, ChEC-seq has also an advantage over the other MNase-based methods developed recently such as Cut&Run and Cut&Tag since ChEC-seq does not require manipulations that may affect gene expression or protein-DNA binding such as digestion of the cell wall and subsequent nuclei isolation. That being said, there are several important considerations associated with ChEC-seq that are discussed below. Since no available genomic technique is free of potential pitfalls, we believe that, for S. cerevisiae, ChEC-seq provides a fast, sensitive and robust alternative to ChIP-seq. Both methods complement each other in certain aspects. Depending on the experimental questions being asked, available resources, and other considerations, one or both methods may be used to probe genome-wide binding and function of TFs.
A requirement for ChEC-seq is the need to prepare a yeast strain harboring a fusion between the protein of interest and micrococcal nuclease. In our experience, construction and use of such strains is rapid and straightforward since the tag does not compromise protein function in almost all cases. Even though ChIP-seq experiments can be carried out with a factor-specific antibody (if available), in practice, many researchers still prefer to use one of the epitope tags like Flag, Myc or HA, which also requires strain construction. One-step tagging of yeast strains with a C-terminal MNase fusion is described in (Zentner et al. 2015).
The DNA cleavage signal generated from a strain expressing MNase with a nuclear import signal not fused to any protein factor serves as an important control in ChEC-seq experiments, which is used in a similar way as the input control in ChIP-seq. The free MNase cleavage pattern provides a basis for peak calling as we expect that the specific interaction of the TF-MNase fusion with chromatin will generate a signal significantly stronger than free MNase. In practice, free MNase generates a cleavage pattern dictated solely by local chromatin accessibility. Thus, free MNase preferentially cleaves exposed DNA including nucleosome depleted regions, but this cleavage shows relatively little variation. When MNase is fused to a TF or coactivator that is localized to specific promoter regions, the cleavage pattern may resemble free MNase when only signal location is considered but the signal intensity at specific and non-specific loci differs significantly.
The free MNase used in ChEC experiments is tagged with a nuclear localization signal for efficient import to the nucleus and should be expressed from a promoter of equal or greater strength compared to the factor of interest (see Donczew et al. 2020 for gene-specific mRNA levels in normal growth conditions). In practice we recommend using a promoter with sufficient activity to ensure that a strain carrying the free MNase will be suitable as a control for mapping a wide range of factors. With the ChEC conditions used here, the free MNase signal is very low compared to that obtained for a TF-MNase fusion with specific DNA localization, even if the promoter of the studied factor is less active than the one driving the expression of free MNase.
Earlier applications of ChEC-seq were based on collecting multiple samples for a single experiment, corresponding to different times of MNase treatment. This approach may provide information about the kinetics of the interaction of a TF with DNA, but it is not clear how such data can be used in practice, except to compare the kinetics of factors and free MNase. A drawback of this approach is generation of a large number of samples and increased cost for a limited gain in experimental insights. Based on numerous experiments done in our lab, an efficient alternative is to use a fixed time point after calcium addition (5 minutes) with digestion kinetics limited by a low calcium concentration in the reaction. Such conditions favor the signal from TF-DNA interactions versus random free MNase diffusion. These modified conditions make it feasible to process multiple cultures simultaneously. In practice, we add calcium to consecutive samples every 30 seconds, which allows us to process up to 10 cultures during a single experiment.
We recommend collecting at least two biological replicates for ChEC-seq experiments. For mapping of factor binding sites we routinely collect three replicates and use two out of three criterium to identify bound genomic regions. For quantitative analysis, especially when comparing different experimental conditions, it may be advantageous to use an even higher number of replicate experiments. In our experience both ChEC-seq and ChIP-seq sometimes show a significant variation in signal intensity between replicate samples, which makes it hard to identify relatively small changes in binding when using a limited number of replicate samples.
We use MNase digested D. melanogaster DNA as a spike-in for quantitative analysis (e.g. treatment/control experiments). For simple mapping of factor binding sites spike-in addition is not necessary because the commonly used peak calling algorithms utilize RPM normalization. OurD. melanogasterDNA stock has a concentration of 1 ng/ml. In a typical experiment we supplement Stop Solution with an amount correlated to the OD600 measurement of the S. cerevisiae culture (volume = OD600 x 8 μl).
Limitations related to MNase substrate specificity
When analyzing a ChEC-seq data it is important to realize that MNase cleavage activity can be biased by the local chromatin environment and DNA sequence. Due to this property, data for different factors generated by ChEC-seq carry some qualitative resemblance to free MNase and to each other and it is the signal intensity that primarily discriminates specific versus non-specific interactions. The position of the peak in ChEC-seq data corresponds to the actual binding site as long as it is located clearly in the nucleosome depleted region. For example, we were able to successfully identify known consensus binding motifs in the vicinity of a significant fraction of peaks called for the TFs Abf1 and Rap1 (Donczew et al, submitted to Mol Cell). In other cases, (eg., the TF binding site is adjacent to a nucleosome) the cleavage peak may be shifted some distance from the actual binding site. This can also occur if the binding site is located in a region where nucleosomes are not well positioned. For example, the performance of ChEC-seq within a gene body is significantly decreased compared to ChIP-seq (unpublished results). Consequently, we do not recommend ChEC-seq for mapping transcribing polymerase and elongation factors.