A bioinformatician used R to query the UniProt API and calculate the distribution of transmembrane domains (TMDs) across the human proteome and several model organisms. Key findings: about 25% of the 20,659 human proteins have at least one TMD, with single-TMD proteins being most common (2,402 proteins, ~46% of TMD-containing proteins) and nearly 1,000 7TM proteins (GPCRs). The analysis was extended to zebrafish, Drosophila, C. elegans, and yeast, showing 18–29% TMD-containing proteins across species. Full R code using the UniProt REST API with chunked ID mapping and pagination handling is provided.
Sort: