Peak calling is a crucial step in ChIP-seq analysis to identify enriched genomic regions due to protein binding. Peak calling and annotation enhance ChIP-seq data analysis, providing insights into protein binding and gene regulation mechanisms The normR Bioconductor package, utilizing a binomial mixture model, is employed for peak detection and normalization across ChIP and Input samples.
Short regions, typically from transcription factors or localized histone modifications like H3K4me3
Wide genomic domains, often linked with histone modifications like H3K36me3.
A combination of sharp and broad regions, often from RNA Polymerase 2.
ChIP vs. Input samples compared, and regions visualized for enrichment.
Tiling window size adjusted for broad domains, with resulting signal visualized.
Peak authenticity is confirmed by checking read percentages in peaks and verifying alignment with known transcription factor binding motifs.
To assign identified peaks to functional genomic regions, adding context to their biological significance.
Human gene models are downloaded to reference genomic regions like exons, introns, and promoters.
Disjoint Peak Regions: Each peak is treated independently.
📚Overlapping Annotations: Nearest functional annotation like promoter, exon, intron, is assigned.
🔗Hierarchical Prioritization: Overlapping annotations are resolved.
📝Summary Statistics: Distribution of peaks across genomic regions is calculated.
📌Annotation Function Application: The annotatePeaks() function is applied to CTCF and H3K36me3 peaks using the lapply() function.
🧮Combining and Visualizing Results: Results are combined using dplyr::bind_rows() and visualized as a bar plot showing H3K36me3 peaks in gene bodies and CTCF peaks near promoters.
H3K36me3 peaks’ localization in gene bodies suggests a role in transcriptional regulation, while CTCF’s enrichment near promoters highlights its involvement in chromatin architecture and transcription regulation.
🪢For more follow me here: https://lnkd.in/gpsrVrat
🔭Learn more here: https://lnkd.in/ewr4M-Ut
#ComputationalGenomics #Bioinformatics #R #PeakCalling #ChIPSeq #GeneRegulation #Genomics #BioinformaticsTools #Hiring