UK Biobank has filed 110 DMCA takedown notices against GitHub repositories where researchers accidentally published participant health data. The notices target specific files including Jupyter/R notebooks, genetic data files (PLINK, BOLT-LMM, BGEN), and tabular datasets. Developers from at least 14 countries are affected, predominantly from the US and China. The site tracks these notices by parsing GitHub's public DMCA repository, extracting filing dates and repository URLs, and resolving developer locations via the GitHub API. The data exposure is part of a broader series of governance controversies surrounding UK Biobank, including concerns about overseas access, insurance company data sharing, and misuse by fringe groups.
Sort: