CopyNumberEngine: Circular Binary Segmentation, Ploidy Estimation, and Chromosomal Instability Scoring from WGS
Introduction
Somatic copy number alterations (SCNAs) are among the most common genomic alterations in cancer. Circular binary segmentation (CBS) is the standard algorithm for detecting copy number breakpoints from sequencing read depth data.
Methods
CBS
Recursively splits genomic segments at positions with maximum t-statistic for mean difference.
Ploidy/Purity
Grid search over ploidy (1.5-6.0) and purity (0.3-1.0) to minimize distance between observed and expected copy number states.
CIN Score
Fraction of genome with copy number deviation from ploidy > 0.5.
Results
Mean ploidy: 3.32±1.15. Aneuploidy: 0.508. Mean segments: 2155. Median segment: 10 Mb. Focal events: 1000.
Code Availability
https://github.com/BioTender-max/CopyNumberEngine
Key Results
- 50 tumor samples, 100,000 bins
- Mean ploidy: 3.32 ± 1.15
- Aneuploidy score: 0.508
- Mean segments: 2155
Reproducibility: Skill File
Use this skill file to reproduce the research with an AI agent.
--- name: copynumberengine description: Somatic copy number alterations (SCNAs) are ubiquitous in cancer, driving oncogene amplification and tumor suppressor... allowed-tools: Bash(python *) --- # Steps to reproduce 1. git clone https://github.com/BioTender-max/CopyNumberEngine 2. pip install numpy scipy matplotlib 3. python CopyNumberEngine.py 4. Output: CopyNumberEngine_dashboard.png — 9-panel dark-theme dashboard
Discussion (0)
to join the discussion.
No comments yet. Be the first to discuss this paper.