πŸ†• Advanced Report, now with Y-DNA, mtDNA & ROH analysis Discover your paternal & maternal haplogroups, shown only when AI prediction confidence is high enough to be reliable. Plus ROH endogamy insights, ancient sample matches, all interpreted by Claude AI. Raw DNA file required for these new analyses (not available with G25 coordinates).
Discover Now
🍽️ DNA-Based Nutrition Report: Discover Which Foods Fuel Your Body Upload your 23andMe, AncestryDNA or MyHeritage file and discover exactly which foods fuel your body, based on your personal genetic blueprint.
Try our Free Test

Study Information

2025
World

Abstract

The human Y chromosome enables detailed reconstruction of paternal lineages through haplogroup classification. Existing tools for this purpose typically rely on outdated phylogenies, lack ancient DNA handling, or provide limited confidence metrics. Here I present yallHap, a Y-chromosome haplogroup classifier that integrates the YFull phylogenetic tree (185,780 SNPs) with probabilistic scoring, built-in ancient DNA damage filtering, and parallel processing for population-scale studies. Validation on 1,231 high-coverage gnomAD samples achieved 99.9% accuracy (95% CI: 99.5-100%) on GRCh38, and 1,233 samples from 1000 Genomes Phase 3 achieved 99.8% accuracy (95% CI: 99.3-100%). For ancient DNA with moderate variant density (4-10%), Bayesian ancient mode achieves +19.3 pp improvement over heuristic mode (+12 to +24 pp at 1% increments; see Supplementary Table S3), reaching 60-86% accuracy. On full AADR ancient DNA validation (7,333 samples spanning ~45,000 years), this translates to 90.7% overall accuracy (95% CI: 90.0-91.3%) versus 88.3% for heuristic transversions-only mode. At variant densities =10%, both modes reach 97-99% accuracy. yallHap supports multiple reference genomes (GRCh37, GRCh38, T2T-CHM13v2.0), provides detailed quality metrics including optional ISOGG nomenclature output, and offers multi-threaded batch processing for large-scale studies. The tool is designed for integration into modern bioinformatics pipelines, with example wrappers for nf-core/eager [16,17] and Snakemake [18] workflows. The software is open source, available at https://github.com/trianglegrrl/yallHap, and distributed via pip, Bioconda, and Docker.

We use cookies to enhance your experience. By continuing to visit this site you agree to our use of cookies. Learn more