Thesis Defence: Msc. Project 2024
Stud. MSc. Bioinformatics, BiRC
Associate Professor, BiRC
January 30, 2025
All code, figures, analyses, and slides are available at
https://munch-group.org/hic-spermatogenesis/
Analyzed the 3D chromatin structure on X during spermatogenesis in rhesus macaque and correlating to low-diversity regions in baboons and humans, leveraging a comprehensive reproducible computational framework.
Megabase-spanning regions of reduced diversity across great apes indicate strong selection, reflecting evolutionary dynamics that possibly interplay with structural features.
Architecture modulates gene regulation and meiosis. Could these high-level structural features underlie the low-diversity regions?
Figure from (Misteli 2020)
Figure from (Misteli 2020)
Figure from (Misteli 2020)
From (WikiCommons:Prakrutiuday)
Jaccard test
Proximity test
tissue | Fb | Spa | Pac | RS | Sperm | ||
---|---|---|---|---|---|---|---|
viewframe | test | query | |||||
10Mb | jaccard | ECH90 | 0.021 | 0.006 | nan | nan | 0.048 |
proximity | olive | nan | nan | nan | nan | 0.016 | |
olivehama | nan | 0.019 | 0.012 | nan | 0.001 | ||
arms | jaccard | ECH90 | 0.017 | 0.006 | 0.005 | 0.028 | nan |
proximity | olivehama | nan | nan | 0.036 | nan | nan |
\(\leftarrow 2\times 40\) tests. What about multiple testing?
Uncorrected:
tissue | Fb | Spa | Pac | RS | Sperm | ||
---|---|---|---|---|---|---|---|
viewframe | test | query | |||||
10Mb | jaccard | ECH90 | 0.021 | 0.006 | nan | nan | 0.048 |
proximity | olive | nan | nan | nan | nan | 0.016 | |
olivehama | nan | 0.019 | 0.012 | nan | 0.001 | ||
arms | jaccard | ECH90 | 0.017 | 0.006 | 0.005 | 0.028 | nan |
proximity | olivehama | nan | nan | 0.036 | nan | nan |
Benjamini-Hochberg (FDR) corrected
tissue | Fb | Spa | Pac | RS | Sperm | ||
---|---|---|---|---|---|---|---|
viewframe | test | query | |||||
10Mb | jaccard | ECH90 | 0.043 | 0.020 | nan | nan | nan |
proximity | olivehama | nan | nan | nan | nan | 0.038 | |
arms | jaccard | ECH90 | 0.041 | 0.020 | 0.020 | 0.046 | nan |
Tool | Description |
---|---|
Jupyter | Interactive coding environment for analysis and development (notebooks are natively rendered with Quarto) |
Quarto | A Quarto Manuscript project nested inside a Quarto Book for rendering html (website) and PDF (manuscript) from Markdown via Pandoc. Supports direct embedding of output from Jupyter Notebook cells (plots, tables). |
Conda | For managing software requirements and dependency versions reproducibly. |
git | Version control and gh-pages branch for automated render of Quarto project |
GitHub | Action was triggered on push to render the project and host on munch-group.org |
gwf | Workflow manager to automate the analysis on a HPC cluster, wrapped in Python code. workflow.py currently does everything from .fastq to .cool , but notebooks can be set to run sequentially as part of the workflow as well. |
Call TADs using interaction data summarized into a 1D genomic profile
Many methods exist for calling TADs \(\rightarrow\) different results from different approaches
“[…] an example of how adjacent boundaries calculated with cooltools can specify a set of intervals that could be analyzed as TADs, […]“
—cooltools documentation
Chromatin Architecture and Selection on X • Søren Jørgensen • MSc. Bioinformatics • munch-group.org/hic-spermatogenesis