Source: https://github.com/markziemann/GeneNameErrors2020
View the reports: http://ziemann-lab.net/public/gene_name_errors/
Gene name errors result when data are imported improperly into MS Excel and other spreadsheet programs (Zeeberg et al, 2004). Certain gene names like MARCH3, SEPT2 and DEC1 are converted into date format. These errors are surprisingly common in supplementary data files in the field of genomics (Ziemann et al, 2016). This could be considered a small error because it only affects a small number of genes, however it is symptomtic of poor data processing methods. The purpose of this script is to identify gene name errors present in supplementary files of PubMed Central articles in the previous month.
library("XML")
library("jsonlite")
library("xml2")
library("reutils")
library("readxl")
Here I will be getting PubMed Central IDs for the previous month.
Start with figuring out the date to search PubMed Central.
CURRENT_MONTH=format(Sys.time(), "%m")
CURRENT_YEAR=format(Sys.time(), "%Y")
if (CURRENT_MONTH == "01") {
PREV_YEAR=as.character(as.numeric(format(Sys.time(), "%Y"))-1)
PREV_MONTH="12"
} else {
PREV_YEAR=CURRENT_YEAR
PREV_MONTH=as.character(as.numeric(format(Sys.time(), "%m"))-1)
}
DATE=paste(PREV_YEAR,"/",PREV_MONTH,sep="")
DATE
## [1] "2022/1"
Let’s see how many PMC IDs we have in the past month.
QUERY ='((genom*[Abstract]))'
ESEARCH_RES <- esearch(term=QUERY, db = "pmc", rettype = "uilist", retmode = "xml", retstart = 0,
retmax = 5000000, usehistory = TRUE, webenv = NULL, querykey = NULL, sort = NULL, field = NULL,
datetype = NULL, reldate = NULL, mindate = DATE, maxdate = DATE)
pmc <- efetch(ESEARCH_RES,retmode="text",rettype="uilist",outfile="pmcids.txt")
## Retrieving UIDs 1 to 500
## Retrieving UIDs 501 to 1000
## Retrieving UIDs 1001 to 1500
## Retrieving UIDs 1501 to 2000
## Retrieving UIDs 2001 to 2500
## Retrieving UIDs 2501 to 3000
## Retrieving UIDs 3001 to 3500
pmc <- read.table(pmc)
pmc <- paste("PMC",pmc$V1,sep="")
NUM_ARTICLES=length(pmc)
NUM_ARTICLES
## [1] 3051
writeLines(pmc,con="pmc.txt")
Now run the bash script. Note that false positives can occur (~1.5%) and these results have not been verified by a human.
Here are some definitions:
NUM_XLS = Number of supplementary Excel files in this set of PMC articles.
NUM_XLS_ARTICLES = Number of articles matching the PubMed Central search which have supplementary Excel files.
GENELISTS = The gene lists found in the Excel files. Each Excel file is counted once even it has multiple gene lists.
NUM_GENELISTS = The number of Excel files with gene lists.
NUM_GENELIST_ARTICLES = The number of PMC articles with supplementary Excel gene lists.
ERROR_GENELISTS = Files suspected to contain gene name errors. The dates and five-digit numbers indicate transmogrified gene names.
NUM_ERROR_GENELISTS = Number of Excel gene lists with errors.
NUM_ERROR_GENELIST_ARTICLES = Number of articles with supplementary Excel gene name errors.
ERROR_PROPORTION = This is the proportion of articles with Excel gene lists that have errors.
system("./gene_names.sh pmc.txt")
results <- readLines("results.txt")
XLS <- results[grep("XLS",results,ignore.case=TRUE)]
NUM_XLS = length(XLS)
NUM_XLS
## [1] 4780
NUM_XLS_ARTICLES = length(unique(sapply(strsplit(XLS," "),"[[",1)))
NUM_XLS_ARTICLES
## [1] 709
GENELISTS <- XLS[lapply(strsplit(XLS," "),length)>2]
#GENELISTS
NUM_GENELISTS <- length(unique(sapply(strsplit(GENELISTS," "),"[[",2)))
NUM_GENELISTS
## [1] 522
NUM_GENELIST_ARTICLES <- length(unique(sapply(strsplit(GENELISTS," "),"[[",1)))
NUM_GENELIST_ARTICLES
## [1] 265
ERROR_GENELISTS <- XLS[lapply(strsplit(XLS," "),length)>3]
#ERROR_GENELISTS
NUM_ERROR_GENELISTS = length(ERROR_GENELISTS)
NUM_ERROR_GENELISTS
## [1] 368
GENELIST_ERROR_ARTICLES <- unique(sapply(strsplit(ERROR_GENELISTS," "),"[[",1))
GENELIST_ERROR_ARTICLES
## [1] "PMC8800370" "PMC8794705" "PMC8792904" "PMC8789167" "PMC8786741"
## [6] "PMC8782480" "PMC8776252" "PMC8773395" "PMC8769648" "PMC8713813"
## [11] "PMC8713784" "PMC8770739" "PMC8755833" "PMC8752600" "PMC8751407"
## [16] "PMC8748537" "PMC8767772" "PMC8766719" "PMC8762874" "PMC8756499"
## [21] "PMC8692330" "PMC8761743" "PMC8753122" "PMC8738961" "PMC8757482"
## [26] "PMC8744257" "PMC8742043" "PMC8742041" "PMC8744399" "PMC8741763"
## [31] "PMC8741844" "PMC8741300" "PMC8724844" "PMC8733947" "PMC8733896"
## [36] "PMC8725365" "PMC8719881" "PMC8724546" "PMC8724129" "PMC8688513"
## [41] "PMC8674242" "PMC8668906" "PMC8797262" "PMC8792531" "PMC8787225"
## [46] "PMC8756151" "PMC8739494" "PMC8781439" "PMC8754154" "PMC8780294"
## [51] "PMC8777075" "PMC8713981" "PMC8762462" "PMC8758788" "PMC8752785"
## [56] "PMC8748879" "PMC8762886" "PMC8760727" "PMC8759260" "PMC8756637"
## [61] "PMC8732846" "PMC8732282" "PMC8728658" "PMC8762721" "PMC8762282"
## [66] "PMC8746912" "PMC8711282" "PMC8749219" "PMC8728608" "PMC8747517"
## [71] "PMC8750853" "PMC8748700" "PMC8748539" "PMC8742585" "PMC8740005"
## [76] "PMC8739483" "PMC8739361" "PMC8719428" "PMC8722201" "PMC8669064"
## [81] "PMC8722433" "PMC8690928" "PMC8718409"
NUM_ERROR_GENELIST_ARTICLES <- length(GENELIST_ERROR_ARTICLES)
NUM_ERROR_GENELIST_ARTICLES
## [1] 83
ERROR_PROPORTION = NUM_ERROR_GENELIST_ARTICLES / NUM_GENELIST_ARTICLES
ERROR_PROPORTION
## [1] 0.3132075
Here you can have a look at all the gene lists detected in the past month, as well as those with errors. The dates are obvious errors, these are commonly dates in September, March, December and October. The five-digit numbers represent dates as they are encoded in the Excel internal format. The five digit number is the number of days since 1900. If you were to take these numbers and put them into Excel and format the cells as dates, then these will also mostly map to dates in September, March, December and October.
#GENELISTS
ERROR_GENELISTS
## [1] "PMC8800370 /pmc/articles/PMC8800370/bin/NIHMS1769771-supplement-Supplemental_Table_1.xlsx Ggallus 309 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448"
## [2] "PMC8800370 /pmc/articles/PMC8800370/bin/NIHMS1769771-supplement-Supplemental_Table_1.xlsx Hsapiens 22 44256 44256 44256 44256 44446 44531 44531 44256 44259 44256 44256 44446 44446 44263 44263 44263 44259 44259 44259 44446 44531 44531"
## [3] "PMC8800370 /pmc/articles/PMC8800370/bin/NIHMS1769771-supplement-Supplemental_Table_3.xlsx Hsapiens 6 44531 44256 44265 44266 44259 44263"
## [4] "PMC8794705 /pmc/articles/PMC8794705/bin/Table_2.XLSX Hsapiens 1 44442"
## [5] "PMC8794705 /pmc/articles/PMC8794705/bin/Table_2.XLSX Ggallus 1 44442"
## [6] "PMC8794705 /pmc/articles/PMC8794705/bin/Table_2.XLSX Ggallus 13 44442 44442 44448 44443 44443 44443 44443 44443 44443 44445 44445 44445 44445"
## [7] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_4.xlsx Hsapiens 7 44447 44442 44447 44442 44442 44443 44442"
## [8] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_5.xlsx Hsapiens 7 44447 44442 44447 44442 44442 44443 44442"
## [9] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_5.xlsx Hsapiens 159 44261 44449 44445 44445 44447 44443 44260 44256 44262 44261 44447 44259 44446 44261 44447 44266 44441 44531 44260 44257 44442 44454 44258 44445 44260 44450 44262 44447 44260 44262 44449 44260 44256 44446 44449 44447 44257 44446 44261 44449 44445 44447 44257 44260 44262 44454 44446 44440 44445 44453 44266 44453 44446 44261 44449 44447 44441 44531 44450 44256 44446 44261 44447 44450 44442 44447 44442 44447 44441 44260 44264 44445 44441 44257 44260 44262 44454 44261 44440 44263 44445 44447 44453 44266 44441 44257 44262 44446 44261 44258 44449 44447 44443 44442 44259 44262 44446 44445 44531 44450 44446 44440 44531 44444 44449 44445 44447 44441 44260 44262 44258 44449 44447 44256 44442 44262 44447 44256 44450 44259 44262 44443 44260 44258 44258 44445 44443 44262 44453 44442 44264 44447 44531 44256 44454 44256 44440 44262 44261 44258 44447 44257 44256 44450 44442 44454 44449 44443 44454 44443 44257 44258 44449 44453 44441 44260 44256 44447 44441"
## [10] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_5.xlsx Hsapiens 26 44448 44442 44449 44445 44447 44443 44263 44447 44442 44442 44447 44446 44447 44450 44260 44450 44442 44447 44447 44443 44450 44441 44442 44441 44443 44442"
## [11] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_5.xlsx Hsapiens 292 44442 44261 44258 44447 44443 44260 44256 44450 44256 44262 44261 44445 44447 44256 44442 44259 44261 44260 44257 44450 44442 44264 44258 44260 44450 44262 44256 44262 44263 44445 44256 44258 44449 44447 44450 44446 44257 44260 44451 44453 44257 44256 44262 44261 44448 44449 44453 44450 44256 44453 44257 44442 44263 44447 44442 44259 44447 44441 44260 44257 44260 44262 44261 44263 44445 44453 44441 44256 44450 44262 44264 44446 44265 44441 44256 44442 44448 44445 44442 44261 44258 44531 44256 44450 44262 44261 44263 44447 44256 44262 44264 44448 44445 44453 44441 44260 44450 44261 44258 44449 44453 44441 44256 44450 44442 44259 44261 44258 44263 44260 44259 44258 44443 44260 44261 44263 44447 44441 44450 44258 44442 44262 44261 44448 44258 44445 44447 44441 44260 44257 44256 44442 44262 44441 44256 44450 44445 44443 44261 44448 44449 44447 44256 44256 44262 44258 44263 44260 44256 44262 44261 44258 44447 44453 44441 44257 44256 44450 44442 44262 44261 44449 44441 44443 44450 44442 44259 44262 44258 44263 44443 44442 44259 44261 44258 44260 44256 44262 44261 44263 44447 44441 44257 44450 44442 44262 44264 44449 44257 44261 44448 44263 44257 44442 44264 44263 44441 44450 44262 44445 44256 44448 44447 44264 44265 44263 44445 44256 44442 44448 44265 44263 44450 44259 44256 44261 44448 44265 44263 44445 44257 44256 44450 44264 44261 44263 44445 44256 44442 44259 44264 44261 44448 44447 44259 44256 44258 44440 44261 44440 44257 44450 44440 44261 44448 44258 44445 44257 44450 44442 44259 44264 44261 44448 44265 44263 44445 44447 44443 44450 44442 44259 44264 44261 44448 44258 44449 44445 44453 44443 44260 44442 44264 44263 44445 44447 44453 44260 44256 44450 44262 44261 44258 44263 44260 44442 44264 44261 44258 44265 44260 44442"
## [12] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_7.xlsx Hsapiens 1 44443"
## [13] "PMC8792904 /pmc/articles/PMC8792904/bin/Table_9.xlsx Hsapiens 7 44082 44077 44082 44077 44077 44078 44077"
## [14] "PMC8789167 /pmc/articles/PMC8789167/bin/pone.0261293.s010.xlsx Hsapiens 22 44088 44075 44086 44085 43898 43892 44080 43892 43893 44082 43897 43894 44076 44081 43896 44083 43900 44078 44084 43901 43899 43895"
## [15] "PMC8789167 /pmc/articles/PMC8789167/bin/pone.0261293.s011.xlsx Hsapiens 1 43893"
## [16] "PMC8786741 /pmc/articles/PMC8786741/bin/Table1.XLS Hsapiens 4 2022/03/11 2022/03/01 2022/09/03 2022/03/04"
## [17] "PMC8782480 /pmc/articles/PMC8782480/bin/pone.0262051.s015.xlsx Hsapiens 17 41153 41883 38412 39508 39142 38047 39326 37865 36951 40787 39692 37681 40422 40238 40057 40603 38777"
## [18] "PMC8776252 /pmc/articles/PMC8776252/bin/elife-74153-supp3.xlsx Hsapiens 3 37316 36951 37226"
## [19] "PMC8773395 /pmc/articles/PMC8773395/bin/13577_2022_672_MOESM4_ESM.xlsx Hsapiens 1 37834"
## [20] "PMC8769648 /pmc/articles/PMC8769648/bin/elife-68224-fig1-data5.xlsx Drerio 9 44451 44449 44348 11567 22525 44349 44350 44351 44441"
## [21] "PMC8769648 /pmc/articles/PMC8769648/bin/elife-68224-fig1-data5.xlsx Drerio 10 44451 44449 44442 44445 44449 44451 44449 44449 44441 44451"
## [22] "PMC8769648 /pmc/articles/PMC8769648/bin/elife-68224-fig5-data2.xlsx Drerio 7 44080 44089 44076 43898 44445 44454 44454"
## [23] "PMC8713813 /pmc/articles/PMC8713813/bin/pnas.2112836118.sd04.xlsx Scerevisiae 1 44287"
## [24] "PMC8713784 /pmc/articles/PMC8713784/bin/pnas.2111920118.sd04.xlsx Hsapiens 6 44266 44258 44449 44442 44445 44448"
## [25] "PMC8713784 /pmc/articles/PMC8713784/bin/pnas.2111920118.sd04.xlsx Hsapiens 1 44448"
## [26] "PMC8713784 /pmc/articles/PMC8713784/bin/pnas.2111920118.sd04.xlsx Hsapiens 1 44448"
## [27] "PMC8713784 /pmc/articles/PMC8713784/bin/pnas.2111920118.sd04.xlsx Hsapiens 2 44263 44445"
## [28] "PMC8770739 /pmc/articles/PMC8770739/bin/Table1.xls Hsapiens 1 43534"
## [29] "PMC8755833 /pmc/articles/PMC8755833/bin/41467_2021_27704_MOESM3_ESM.xlsx Mmusculus 2 40057 40787"
## [30] "PMC8755833 /pmc/articles/PMC8755833/bin/41467_2021_27704_MOESM4_ESM.xlsx Hsapiens 3 38596 40057 38961"
## [31] "PMC8755833 /pmc/articles/PMC8755833/bin/41467_2021_27704_MOESM4_ESM.xlsx Hsapiens 5 44257 44447 44256 44264 44446"
## [32] "PMC8755833 /pmc/articles/PMC8755833/bin/41467_2021_27704_MOESM4_ESM.xlsx Hsapiens 4 40422 37135 38596 39692"
## [33] "PMC8755833 /pmc/articles/PMC8755833/bin/41467_2021_27704_MOESM6_ESM.xlsx Hsapiens 6 44440 44448 44446 44260 44441 44445"
## [34] "PMC8755833 /pmc/articles/PMC8755833/bin/41467_2021_27704_MOESM6_ESM.xlsx Hsapiens 6 44440 44448 44446 44260 44441 44445"
## [35] "PMC8752600 /pmc/articles/PMC8752600/bin/41467_2021_27865_MOESM3_ESM.xls Mmusculus 23 44075 44081 44082 44084 44078 43901 44077 44086 43895 43898 44076 43897 43893 43891 44089 43896 43899 44080 44085 44083 44079 43900 43892"
## [36] "PMC8752600 /pmc/articles/PMC8752600/bin/41467_2021_27865_MOESM9_ESM.xlsx Mmusculus 17 11-Sep 10-Sep 10-Sep 8-Sep 9-Mar 9-Sep 11-Sep 8-Mar 10-Sep 8-Sep 7-Mar 7-Sep 2-Mar 6-Mar 11-Sep 7-Mar 8-Mar"
## [37] "PMC8752600 /pmc/articles/PMC8752600/bin/41467_2021_27865_MOESM9_ESM.xlsx Mmusculus 23 12-Sep 10-Mar 1-Sep 3-Sep 11-Mar 5-Sep 15-Sep 6-Sep 8-Sep 3-Mar 2-Mar 7-Sep 10-Sep 11-Sep 5-Mar 2-Sep 9-Sep 1-Mar 6-Mar 7-Mar 9-Mar 8-Mar 4-Sep"
## [38] "PMC8751407 /pmc/articles/PMC8751407/bin/supp_mcs.a006113_Supplemental_Table_S2.xlsx Hsapiens 29 37500 40787 40057 39326 40422 39142 39692 38777 38412 38961 37681 37316 38596 36951 39873 39508 37135 38047 38231 37316 36951 40238 37226 37865 39508 41153 41883 40603 40057"
## [39] "PMC8748537 /pmc/articles/PMC8748537/bin/41467_2021_27670_MOESM4_ESM.xlsx Hsapiens 3 42988 42982 42803"
## [40] "PMC8748537 /pmc/articles/PMC8748537/bin/41467_2021_27670_MOESM4_ESM.xlsx Hsapiens 1 42992"
## [41] "PMC8767772 /pmc/articles/PMC8767772/bin/43856_2021_42_MOESM2_ESM.xlsx Hsapiens 21 44448 44261 44261 44448 44256 44444 44441 44448 44262 44256 44448 44444 44259 44261 44266 44256 44448 44266 44259 44445 44446"
## [42] "PMC8767772 /pmc/articles/PMC8767772/bin/43856_2021_42_MOESM3_ESM.xlsx Hsapiens 2 44261 44448"
## [43] "PMC8767772 /pmc/articles/PMC8767772/bin/43856_2021_42_MOESM4_ESM.xlsx Hsapiens 2 44446 44446"
## [44] "PMC8767772 /pmc/articles/PMC8767772/bin/43856_2021_42_MOESM5_ESM.xlsx Hsapiens 961 44448 44261 44261 44448 44256 44444 44441 44448 44262 44256 44448 44444 44259 44261 44266 44256 44448 44266 44259 44445 44446 44448 44448 44266 44266 44266 44265 44448 44258 44448 44448 44256 44257 44266 44256 44259 44448 44448 44266 44448 44258 44448 44441 44256 44448 44450 44256 44447 44444 44446 44448 44448 44257 44447 44266 44448 44265 44448 44262 44256 44256 44531 44259 44448 44257 44258 44256 44448 44440 44448 44445 44449 44449 44265 44448 44448 44448 44262 44448 44256 44531 44256 44265 44264 44259 44257 44443 44256 44448 44265 44256 44262 44449 44448 44449 44448 44258 44446 44445 44445 44448 44256 44448 44448 44261 44448 44448 44531 44263 44257 44448 44445 44445 44531 44448 44448 44448 44531 44256 44448 44442 44445 44258 44448 44448 44258 44446 44256 44446 44256 44444 44448 44265 44265 44256 44258 44257 44448 44259 44445 44450 44263 44448 44448 44448 44448 44449 44448 44448 44531 44450 44258 44454 44262 44263 44258 44446 44450 44256 44443 44445 44266 44448 44445 44259 44448 44260 44265 44265 44442 44256 44442 44447 44448 44256 44451 44531 44258 44263 44259 44448 44448 44256 44258 44263 44258 44448 44256 44256 44450 44256 44266 44263 44448 44265 44448 44256 44258 44262 44449 44448 44258 44448 44448 44257 44256 44258 44259 44448 44451 44256 44256 44258 44256 44531 44445 44448 44448 44259 44256 44266 44445 44443 44442 44256 44450 44450 44440 44256 44444 44448 44531 44263 44447 44447 44450 44265 44448 44261 44257 44448 44256 44256 44448 44259 44264 44450 44447 44258 44258 44260 44259 44448 44259 44256 44445 44259 44265 44448 44448 44448 44257 44265 44449 44448 44262 44262 44450 44256 44261 44448 44256 44448 44448 44448 44448 44450 44256 44445 44257 44448 44260 44450 44266 44259 44448 44259 44452 44265 44450 44265 44265 44447 44263 44256 44445 44257 44263 44261 44448 44259 44446 44447 44263 44259 44447 44447 44452 44445 44531 44260 44452 44448 44256 44259 44259 44265 44444 44447 44447 44446 44440 44261 44259 44256 44266 44446 44448 44257 44259 44531 44448 44258 44448 44448 44259 44265 44448 44448 44448 44257 44448 44448 44448 44444 44448 44259 44448 44448 44266 44448 44441 44531 44448 44448 44450 44258 44448 44448 44531 44448 44448 44258 44448 44447 44265 44261 44263 44256 44257 44258 44448 44258 44265 44531 44263 44448 44263 44444 44450 44260 44263 44450 44450 44262 44445 44448 44448 44448 44445 44450 44265 44445 44451 44448 44261 44258 44262 44442 44256 44448 44441 44449 44448 44443 44448 44256 44256 44266 44444 44448 44448 44454 44448 44450 44448 44265 44447 44448 44448 44261 44258 44451 44442 44262 44261 44262 44259 44448 44441 44448 44531 44261 44259 44259 44256 44448 44448 44448 44448 44448 44257 44258 44256 44448 44450 44447 44263 44256 44448 44441 44261 44260 44263 44448 44266 44442 44448 44266 44447 44450 44263 44448 44448 44261 44258 44448 44448 44256 44261 44531 44264 44263 44263 44448 44448 44262 44452 44256 44445 44266 44256 44263 44265 44450 44453 44259 44453 44445 44450 44441 44447 44448 44263 44448 44256 44448 44259 44257 44448 44448 44450 44448 44445 44443 44257 44261 44258 44448 44261 44531 44443 44451 44441 44531 44448 44263 44260 44262 44265 44450 44256 44442 44447 44449 44259 44259 44446 44260 44447 44263 44265 44443 44258 44266 44448 44448 44531 44448 44446 44258 44256 44256 44259 44450 44259 44449 44257 44259 44449 44266 44264 44448 44448 44453 44448 44442 44265 44265 44261 44265 44256 44440 44443 44258 44448 44448 44448 44266 44257 44450 44264 44256 44449 44531 44262 44261 44257 44256 44441 44266 44448 44448 44447 44442 44259 44450 44445 44450 44442 44531 44448 44531 44447 44448 44262 44531 44266 44256 44445 44259 44262 44448 44266 44448 44448 44443 44261 44448 44444 44448 44256 44258 44531 44442 44259 44265 44256 44256 44448 44448 44450 44445 44259 44450 44448 44445 44450 44452 44261 44265 44447 44450 44448 44257 44454 44258 44262 44531 44447 44444 44263 44259 44256 44447 44257 44257 44445 44258 44257 44450 44266 44266 44450 44450 44264 44450 44448 44448 44446 44448 44448 44446 44264 44448 44441 44441 44258 44446 44448 44261 44445 44447 44256 44446 44259 44259 44444 44450 44448 44448 44259 44448 44442 44257 44256 44260 44265 44258 44448 44450 44448 44447 44448 44258 44258 44447 44445 44261 44450 44448 44260 44448 44442 44263 44443 44263 44450 44445 44259 44448 44448 44259 44261 44256 44259 44448 44448 44444 44445 44531 44257 44263 44448 44263 44443 44263 44448 44448 44531 44531 44261 44450 44259 44448 44450 44447 44256 44447 44257 44448 44263 44447 44450 44447 44448 44444 44448 44263 44446 44256 44263 44256 44448 44440 44258 44445 44256 44445 44440 44265 44447 44266 44448 44258 44261 44445 44444 44444 44256 44259 44259 44448 44448 44261 44531 44257 44265 44261 44263 44264 44256 44448 44445 44450 44262 44448 44263 44258 44256 44261 44260 44448 44443 44441 44447 44445 44448 44444 44256 44262 44449 44256 44448 44257 44261 44260 44447 44261 44448 44445 44448 44258 44448 44448 44448 44262 44448 44265 44257 44265 44256 44264 44448 44450 44447 44256 44448 44448 44448 44448 44259 44450 44259 44450 44448 44257 44258 44261 44448 44266 44258 44263 44448 44449 44448 44447 44446 44448 44257 44448 44261 44448 44260 44266 44443 44448 44258 44448 44265 44258 44256 44451 44447 44445 44259 44440 44259 44450 44259 44531 44259 44259 44446 44450 44447 44444 44444 44450 44263 44448 44261 44258 44265 44443 44450 44450 44258 44256 44261 44445 44259 44531 44450 44259 44448 44261 44442 44448 44441 44446 44256 44261 44265 44263 44448 44444 44258 44448 44258 44261 44265 44262 44448 44259 44446 44256 44445 44445 44257 44448 44531 44446 44257 44447 44447 44258 44445 44443 44447 44258 44259 44256 44450 44448 44256 44264 44440 44262 44448 44261 44447 44444 44447 44264 44263 44445 44450 44448 44448 44531 44448 44265 44263 44256 44265 44450 44262 44442 44448 44448"
## [45] "PMC8766719 /pmc/articles/PMC8766719/bin/Table_2.xlsx Hsapiens 1 44454"
## [46] "PMC8762874 /pmc/articles/PMC8762874/bin/12860_2021_404_MOESM3_ESM.xlsx Hsapiens 2 44448 44448"
## [47] "PMC8762874 /pmc/articles/PMC8762874/bin/12860_2021_404_MOESM3_ESM.xlsx Hsapiens 1 44259"
## [48] "PMC8762874 /pmc/articles/PMC8762874/bin/12860_2021_404_MOESM3_ESM.xlsx Hsapiens 2 44259 44258"
## [49] "PMC8756499 /pmc/articles/PMC8756499/bin/mmc2.xlsx Ggallus 6 44083 44083 44083 44083 44083 44083"
## [50] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM10_ESM.xlsx Hsapiens 11 44089 44089 44089 44089 44089 43894 43894 43894 43894 43894 43891"
## [51] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM10_ESM.xlsx Hsapiens 11 44089 44089 44089 44089 44089 43894 43894 43894 43894 43894 43891"
## [52] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM11_ESM.xlsx Hsapiens 10 44089 44089 44089 44089 44089 44089 43894 43894 43894 43891"
## [53] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM11_ESM.xlsx Hsapiens 10 44089 44089 44089 44089 44089 44089 43894 43894 43894 43891"
## [54] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM12_ESM.xlsx Hsapiens 21 44089 43892 43892 43891 43898 44078 43900 44083 44083 43892 44084 43897 44079 44077 44085 43891 43896 43901 43893 44082 44080"
## [55] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM12_ESM.xlsx Hsapiens 21 44089 43892 43892 43891 43898 44078 43900 44083 44083 43892 44084 43897 44079 44077 44085 43891 43896 43901 43893 44082 44080"
## [56] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM13_ESM.xlsx Hsapiens 22 44089 43892 43892 43891 43898 44078 43900 43900 44083 44083 44083 43892 44084 44079 44077 44085 43891 43896 43901 43893 44082 44080"
## [57] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM13_ESM.xlsx Hsapiens 22 44089 43892 43892 43891 43898 44078 43900 43900 44083 44083 44083 43892 44084 44079 44077 44085 43891 43896 43901 43893 44082 44080"
## [58] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM14_ESM.xlsx Hsapiens 2 44085 43901"
## [59] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM14_ESM.xlsx Hsapiens 2 44085 43901"
## [60] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM15_ESM.xlsx Hsapiens 2 44085 43901"
## [61] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM15_ESM.xlsx Hsapiens 2 44085 43901"
## [62] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM16_ESM.xlsx Hsapiens 1 44089"
## [63] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM16_ESM.xlsx Hsapiens 1 44089"
## [64] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM17_ESM.xlsx Hsapiens 1 44089"
## [65] "PMC8692330 /pmc/articles/PMC8692330/bin/41467_2021_26901_MOESM17_ESM.xlsx Hsapiens 1 44089"
## [66] "PMC8761743 /pmc/articles/PMC8761743/bin/Table_2.xlsx Hsapiens 2 44256 44261"
## [67] "PMC8761743 /pmc/articles/PMC8761743/bin/Table_2.xlsx Hsapiens 2 44256 44266"
## [68] "PMC8753122 /pmc/articles/PMC8753122/bin/mmc3.xlsx Celegans 184 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043 37043"
## [69] "PMC8753122 /pmc/articles/PMC8753122/bin/mmc3.xlsx Celegans 42 38777 37043 38777 38777 37043 38777 38777 37043 38777 38777 37043 38777 37043 37043 38777 38777 37043 38777 37043 38777 37043 38777 38777 38777 37043 38777 38777 37043 38777 38777 38777 38777 37043 38777 38777 38777 37043 38777 37530 37043 38777 38777"
## [70] "PMC8753122 /pmc/articles/PMC8753122/bin/mmc5.xlsx Celegans 23 38777 36982 37530 38777 36982 37135 36982 37530 37043 38047 37043 37043 37165 38777 38412 38777 37681 38777 38047 38412 37681 38047 37681"
## [71] "PMC8738961 /pmc/articles/PMC8738961/bin/mmc3.xlsx Hsapiens 2 44447 44448"
## [72] "PMC8757482 /pmc/articles/PMC8757482/bin/NIHMS1735652-supplement-3.xlsx Hsapiens 27 44450 44452 44446 44266 44261 44451 44260 44447 44256 44263 44531 44259 44440 44448 44443 44442 44264 44265 44444 44441 44454 44445 44453 44258 44257 44449 44262"
## [73] "PMC8744257 /pmc/articles/PMC8744257/bin/12915_2021_1213_MOESM11_ESM.xls Hsapiens 1 37316"
## [74] "PMC8744257 /pmc/articles/PMC8744257/bin/12915_2021_1213_MOESM14_ESM.xls Hsapiens 1 36951"
## [75] "PMC8742043 /pmc/articles/PMC8742043/bin/41598_2021_4042_MOESM2_ESM.xlsx Hsapiens 1 44077"
## [76] "PMC8742043 /pmc/articles/PMC8742043/bin/41598_2021_4042_MOESM2_ESM.xlsx Hsapiens 19 43163 43160 43164 43160 43353 43345 43352 43167 43351 43354 43161 43165 43358 43168 43161 43166 43350 43357 43349"
## [77] "PMC8742041 /pmc/articles/PMC8742041/bin/41598_2021_3848_MOESM4_ESM.xlsx Athaliana 1 37865"
## [78] "PMC8744399 /pmc/articles/PMC8744399/bin/mmc3.xls Hsapiens 3 44531 44531 44453"
## [79] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [80] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [81] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [82] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [83] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [84] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [85] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM6_ESM.xlsx Hsapiens 1 39340"
## [86] "PMC8741763 /pmc/articles/PMC8741763/bin/41598_2021_4015_MOESM7_ESM.xlsx Hsapiens 3 38596 38961 40057"
## [87] "PMC8741844 /pmc/articles/PMC8741844/bin/ijbsv18p0637s2.xlsx Hsapiens 21 44443 44261 44441 44259 44261 44448 44264 44445 44453 44447 44441 44446 44447 44261 44450 44447 44260 44259 44441 44441 44261"
## [88] "PMC8741300 /pmc/articles/PMC8741300/bin/pbio.3001490.s012.xlsx Hsapiens 24 39142 38412 44451 44443 44266 44454 44261 44450 44258 44263 44257 44447 44265 44441 44531 44259 44264 44448 44449 44453 44442 44446 44445 44444"
## [89] "PMC8741300 /pmc/articles/PMC8741300/bin/pbio.3001490.s013.xlsx Hsapiens 24 44263 44258 44257 44266 44450 44449 44264 44261 44531 44448 44265 44442 44445 44451 44443 44453 44262 44259 44260 44444 44454 44446 44441 44447"
## [90] "PMC8741300 /pmc/articles/PMC8741300/bin/pbio.3001490.s013.xlsx Hsapiens 24 44263 44258 44257 44266 44450 44449 44264 44261 44531 44448 44265 44442 44445 44451 44443 44453 44262 44259 44260 44444 44454 44446 44441 44447"
## [91] "PMC8741300 /pmc/articles/PMC8741300/bin/pbio.3001490.s013.xlsx Hsapiens 24 44262 44260 44451 44443 44266 44454 44261 44450 44258 44263 44257 44265 44447 44441 44453 44449 44448 44531 44259 44264 44442 44446 44445 44444"
## [92] "PMC8741300 /pmc/articles/PMC8741300/bin/pbio.3001490.s013.xlsx Hsapiens 24 44262 44260 44451 44443 44266 44454 44261 44450 44258 44263 44257 44265 44447 44441 44453 44449 44448 44531 44259 44264 44442 44446 44445 44444"
## [93] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43898"
## [94] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [95] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44083"
## [96] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44078"
## [97] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43892"
## [98] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 1 43892"
## [99] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 1 43896"
## [100] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [101] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 4 44083 44083 44083 44083"
## [102] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 5 43896 43896 43896 43896 43896"
## [103] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 5 43892 44083 44082 44082 43897"
## [104] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44083 43892 44083"
## [105] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 4 43891 43896 43896 43891"
## [106] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [107] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 5 44082 44086 43897 44083 44083"
## [108] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 4 43895 43896 44083 44081"
## [109] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44083"
## [110] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44083"
## [111] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43896 43897"
## [112] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 4 43897 43895 43896 44081"
## [113] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43895 43897"
## [114] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44080 43897 44086"
## [115] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44083"
## [116] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44082 44083"
## [117] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43897 44078 44084"
## [118] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 4 43895 43897 44086 44075"
## [119] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43897 43896 43897"
## [120] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44082 44083"
## [121] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43897 44080 43896"
## [122] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44082 44081"
## [123] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44083 44084"
## [124] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 43896"
## [125] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44078 44075 43896"
## [126] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 44082"
## [127] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [128] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43896"
## [129] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43896"
## [130] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 3 44083 44080 43895"
## [131] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44075 44082"
## [132] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43895 43896"
## [133] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43896"
## [134] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 5 43896 44083 44083 44088 43897"
## [135] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 44079"
## [136] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [137] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43893"
## [138] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44083"
## [139] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 44079"
## [140] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44079 44083 44076"
## [141] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44086"
## [142] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43898 44081 43897"
## [143] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 4 43897 44083 43895 43896"
## [144] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43893 44081"
## [145] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44076"
## [146] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43897 44079 44079"
## [147] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44079 44078 44078"
## [148] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44079 43897"
## [149] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44086"
## [150] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44088"
## [151] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 44079"
## [152] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 44085"
## [153] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44086 44077"
## [154] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [155] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44082"
## [156] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44076 43893"
## [157] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44083 43895 44081"
## [158] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [159] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43897 44079 43895"
## [160] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44085 43897"
## [161] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44086"
## [162] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44086 44079"
## [163] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44075 43898"
## [164] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44083 44078"
## [165] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44079 44078"
## [166] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44079 44078 44079"
## [167] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [168] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 43892"
## [169] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44078"
## [170] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44079 43895"
## [171] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44086 44083"
## [172] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 43897 43892 44085"
## [173] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [174] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [175] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44084 44081"
## [176] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44077"
## [177] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [178] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44081"
## [179] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [180] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [181] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [182] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43893"
## [183] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44078 44083"
## [184] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 5 44075 43891 44080 44078 44079"
## [185] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 1 44075"
## [186] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 1 44082"
## [187] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [188] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44078"
## [189] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [190] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 43897"
## [191] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44082 43897"
## [192] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44083 43892"
## [193] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 43897 44081"
## [194] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [195] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44081"
## [196] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [197] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44083 44086"
## [198] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44086"
## [199] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44078 44081"
## [200] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [201] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [202] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44085 43900"
## [203] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 2 44080 44080"
## [204] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43895"
## [205] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44081"
## [206] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43897"
## [207] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 43891"
## [208] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 1 43897"
## [209] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 3 44078 44075 43897"
## [210] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44079"
## [211] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44080"
## [212] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Ggallus 1 43897"
## [213] "PMC8724844 /pmc/articles/PMC8724844/bin/bt-30-1-98-supple2.xlsx Hsapiens 1 44081"
## [214] "PMC8733947 /pmc/articles/PMC8733947/bin/Table_1.xlsx Hsapiens 1 44445"
## [215] "PMC8733896 /pmc/articles/PMC8733896/bin/Table3.XLSX Hsapiens 1 43349"
## [216] "PMC8725365 /pmc/articles/PMC8725365/bin/12957_2021_2461_MOESM1_ESM.xlsx Hsapiens 2 43891 44075"
## [217] "PMC8719881 /pmc/articles/PMC8719881/bin/elife-68213-supp1.xlsx Hsapiens 1 41897"
## [218] "PMC8719881 /pmc/articles/PMC8719881/bin/elife-68213-supp2.xlsx Hsapiens 1 43719"
## [219] "PMC8724546 /pmc/articles/PMC8724546/bin/Table1.XLSX Hsapiens 3 44260 44261 44262"
## [220] "PMC8724129 /pmc/articles/PMC8724129/bin/Table_5.xlsx Hsapiens 1 44531"
## [221] "PMC8688513 /pmc/articles/PMC8688513/bin/42003_2021_2942_MOESM5_ESM.xlsx Hsapiens 14 44257 44257 44262 44264 44259 44256 44261 44263 44531 44265 44258 44266 44256 44260"
## [222] "PMC8688513 /pmc/articles/PMC8688513/bin/42003_2021_2942_MOESM6_ESM.xlsx Hsapiens 13 44257 44257 44262 44264 44259 44256 44261 44263 44531 44265 44258 44256 44260"
## [223] "PMC8674242 /pmc/articles/PMC8674242/bin/42003_2021_2930_MOESM4_ESM.xlsx Hsapiens 1 44256"
## [224] "PMC8674242 /pmc/articles/PMC8674242/bin/42003_2021_2930_MOESM4_ESM.xlsx Hsapiens 1 44448"
## [225] "PMC8674242 /pmc/articles/PMC8674242/bin/42003_2021_2930_MOESM4_ESM.xlsx Hsapiens 1 44448"
## [226] "PMC8668906 /pmc/articles/PMC8668906/bin/41467_2021_27432_MOESM11_ESM.xlsx Athaliana 1 44049"
## [227] "PMC8797262 /pmc/articles/PMC8797262/bin/pone.0261691.s001.xlsx Hsapiens 6 37135 40057 40057 39692 40057 41883"
## [228] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Ggallus 24 44266 44446 44449 44441 44447 44260 44444 44263 44258 44531 44451 44262 44443 44257 44448 44454 44264 44261 44445 44265 44259 44442 44450 44453"
## [229] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 28 44440 44256 44447 44531 44265 44263 44445 44262 44453 44258 44450 44449 44257 44260 44443 44442 44261 44441 44451 44259 44266 44454 44264 44257 44448 44256 44446 44444"
## [230] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 28 44449 44256 44264 44531 44448 44260 44263 44454 44450 44445 44262 44258 44453 44451 44441 44447 44444 44443 44259 44265 44257 44256 44442 44261 44257 44440 44266 44446"
## [231] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 24 44446 44262 44266 44258 44261 44265 44441 44451 44445 44260 44447 44531 44453 44449 44450 44264 44448 44257 44454 44443 44263 44259 44444 44442"
## [232] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 24 44450 44441 44266 44448 44443 44444 44454 44261 44442 44262 44449 44264 44259 44445 44453 44531 44447 44446 44260 44451 44263 44265 44258 44257"
## [233] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 28 44531 44451 44264 44265 44453 44257 44256 44258 44263 44447 44445 44443 44449 44266 44262 44257 44256 44450 44454 44259 44448 44444 44261 44441 44442 44260 44446 44440"
## [234] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 1 44446"
## [235] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM10_ESM.xlsx Hsapiens 28 44265 44443 44264 44263 44266 44531 44257 44258 44453 44445 44447 44262 44451 44441 44256 44257 44454 44260 44450 44448 44256 44446 44444 44449 44259 44440 44261 44442"
## [236] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM2_ESM.xlsx Hsapiens 4 44257 44257 44257 44257"
## [237] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM2_ESM.xlsx Hsapiens 4 44257 multiKO_2-Mar_ENSCSAG00000015876 44257 44257"
## [238] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM3_ESM.xlsx Hsapiens 4 44257 44257 multiKO_2-Mar_ENSCSAG00000015876 44257"
## [239] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM3_ESM.xlsx Hsapiens 4 44257 44257 multiKO_2-Mar_ENSCSAG00000015876 44257"
## [240] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM4_ESM.xlsx Hsapiens 2 44257 multiKO_2-Mar_ENSCSAG00000015876"
## [241] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM5_ESM.xlsx Hsapiens 2 44257 multiKO_2-Mar_ENSCSAG00000015876"
## [242] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM5_ESM.xlsx Hsapiens 2 44257 multiKO_2-Mar_ENSCSAG00000015876"
## [243] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM6_ESM.xlsx Hsapiens 28 44531 44259 44265 44257 44262 44451 44257 44440 44260 44264 44442 44444 44261 44446 44449 44454 44258 44443 44447 44266 44445 44263 44450 44256 44441 44453 44448 44256"
## [244] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM6_ESM.xlsx Hsapiens 28 44531 44265 44262 44259 44263 44450 44447 44445 44446 44260 44256 44444 44443 44258 44449 44454 44440 44441 44257 44257 44266 44261 44451 44256 44442 44453 44448 44264"
## [245] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM6_ESM.xlsx Hsapiens 28 44265 44447 44531 44263 44262 44264 44442 44260 44446 44443 44257 44256 44449 44454 44441 44258 44444 44448 44261 44440 44266 44257 44453 44259 44256 44451 44450 44445"
## [246] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM6_ESM.xlsx Hsapiens 28 44531 44263 44262 44442 44259 44265 44264 44260 44446 44256 44257 44258 44440 44261 44444 44449 44454 44257 44443 44445 44447 44441 44451 44453 44450 44256 44448 44266"
## [247] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM6_ESM.xlsx Hsapiens 28 44447 44265 44264 44531 44266 44259 44262 44446 44260 44257 44449 44440 44258 44256 44454 44443 44451 44444 44448 44261 44442 44257 44256 44453 44441 44445 44263 44450"
## [248] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM7_ESM.xlsx Hsapiens 28 44261 44257 44448 44260 44445 44256 44441 44258 44446 44453 44256 44262 44443 44265 44447 44450 44257 44259 44449 44266 44440 44263 44531 44442 44444 44451 44264 44454"
## [249] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM7_ESM.xlsx Hsapiens 28 44440 44260 44257 44453 44448 44441 44444 44261 44445 44265 44258 44447 44256 44263 44259 44531 44450 44257 44443 44262 44442 44256 44266 44264 44451 44449 44454 44446"
## [250] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM8_ESM.xlsx Hsapiens 28 44441 44260 44261 44256 44449 44450 44258 44445 44256 44448 44531 44259 44454 44440 44446 44257 44444 44442 44266 44264 44443 44453 44265 44257 44451 44263 44262 44447"
## [251] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM8_ESM.xlsx Hsapiens 27 44531 44262 44441 44260 44265 44256 44448 44263 44264 44257 44447 44261 44445 44258 44450 44442 44266 44453 44257 44451 44444 44454 44449 44446 44440 44443 44256"
## [252] "PMC8792531 /pmc/articles/PMC8792531/bin/13073_2022_1013_MOESM8_ESM.xlsx Hsapiens 27 44531 44441 44260 44449 44448 44442 44445 44454 44261 44266 44262 44446 44264 44450 44257 44451 44443 44447 44258 44257 44263 44453 44256 44440 44265 44444 44256"
## [253] "PMC8787225 /pmc/articles/PMC8787225/bin/Data_Sheet_1.xlsx Hsapiens 1 44447"
## [254] "PMC8787225 /pmc/articles/PMC8787225/bin/Data_Sheet_1.xlsx Hsapiens 1 44440"
## [255] "PMC8756151 /pmc/articles/PMC8756151/bin/icu-63-107-s004.xls Hsapiens 22 44266 44265 44449 44454 44445 44441 44257 44448 44443 44263 44260 44259 44262 44256 44258 44264 44450 44447 44444 44446 44440 44442"
## [256] "PMC8756151 /pmc/articles/PMC8756151/bin/icu-63-107-s004.xls Hsapiens 14 44256 44257 44260 44440 44449 44450 44441 44442 44443 44444 44445 44446 44447 44448"
## [257] "PMC8756151 /pmc/articles/PMC8756151/bin/icu-63-107-s004.xls Hsapiens 2 44444 44440"
## [258] "PMC8739494 /pmc/articles/PMC8739494/bin/LSA-2021-01134_SdataF5.xlsx Hsapiens 26 44260 44261 44256 44265 44258 44259 44449 44447 44257 44444 44263 44440 44266 44450 44257 44256 44262 44443 44451 44264 44442 44448 44453 44441 44531 44446"
## [259] "PMC8781439 /pmc/articles/PMC8781439/bin/13578_2022_745_MOESM3_ESM.xlsx Rnorvegicus 55 44443 44447 44446 44257 44257 44263 44445 44444 44266 44257 44446 44260 44257 44444 44445 44257 44446 44260 44258 44443 44263 44447 44446 44257 44441 44257 44449 44444 44445 44447 44446 44260 44257 44444 44258 44447 44266 44260 44258 44260 44441 44449 44257 44263 44257 44444 44450 44258 44440 44441 44265 44449 44257 44443 44257"
## [260] "PMC8781439 /pmc/articles/PMC8781439/bin/13578_2022_745_MOESM4_ESM.xlsx Rnorvegicus 4 44444 44266 44443 44441"
## [261] "PMC8781439 /pmc/articles/PMC8781439/bin/13578_2022_745_MOESM5_ESM.xlsx Rnorvegicus 8 44443 44257 44446 44447 44257 44263 44445 44444"
## [262] "PMC8781439 /pmc/articles/PMC8781439/bin/13578_2022_745_MOESM7_ESM.xlsx Mmusculus 7 44266 44263 44441 44257 44447 44443 44444"
## [263] "PMC8754154 /pmc/articles/PMC8754154/bin/MSB-18-e10407-s002.xlsx Mmusculus 1 44257"
## [264] "PMC8780294 /pmc/articles/PMC8780294/bin/12711_2022_696_MOESM3_ESM.xlsx Hsapiens 5 44446 44446 44446 44446 44265"
## [265] "PMC8780294 /pmc/articles/PMC8780294/bin/12711_2022_696_MOESM3_ESM.xlsx Hsapiens 4 44259 44446 44446 44446"
## [266] "PMC8777075 /pmc/articles/PMC8777075/bin/DataSheet8.xlsx Hsapiens 1 44259"
## [267] "PMC8777075 /pmc/articles/PMC8777075/bin/DataSheet8.xlsx Hsapiens 6 44263 44259 44257 44442 44264 44453"
## [268] "PMC8713981 /pmc/articles/PMC8713981/bin/pnas.2117557118.sd08.xlsx Hsapiens 20 44257 44256 44263 44264 44440 44443 44448 44257 44449 44262 44259 44441 44442 44450 44261 44266 44258 44447 44446 44445"
## [269] "PMC8762462 /pmc/articles/PMC8762462/bin/mmc2.xlsx Mmusculus 4 1-Sep 2-Sep 7-Sep 9-Sep"
## [270] "PMC8758788 /pmc/articles/PMC8758788/bin/41467_2021_27869_MOESM5_ESM.xlsx Mmusculus 27 37316 39508 38596 37316 36951 39692 38961 38412 40057 37865 39873 37681 39326 41883 38231 39142 38777 37500 38047 40422 37135 36951 42248 40787 41153 40603 40238"
## [271] "PMC8758788 /pmc/articles/PMC8758788/bin/41467_2021_27869_MOESM6_ESM.xlsx Mmusculus 27 38047 37500 36951 37316 39142 42248 40787 41883 39508 37135 36951 39326 38961 40422 39873 39692 38231 40238 40057 40603 38777 37865 41153 38596 37316 37681 38412"
## [272] "PMC8752785 /pmc/articles/PMC8752785/bin/41467_2021_27924_MOESM3_ESM.xls Mmusculus 1 44262"
## [273] "PMC8752785 /pmc/articles/PMC8752785/bin/41467_2021_27924_MOESM4_ESM.xls Mmusculus 20 44440 44446 44447 44449 44443 44266 44442 44451 44260 44263 44441 44256 44262 44258 44453 44256 44261 44259 44264 44445"
## [274] "PMC8748879 /pmc/articles/PMC8748879/bin/41467_2021_27722_MOESM3_ESM.xlsx Hsapiens 1 44448"
## [275] "PMC8748879 /pmc/articles/PMC8748879/bin/41467_2021_27722_MOESM4_ESM.xlsx Mmusculus 17 44446 44262 44441 44450 44449 44264 44447 44441 44263 44445 44257 44260 44440 44257 44448 44442 44261"
## [276] "PMC8748879 /pmc/articles/PMC8748879/bin/41467_2021_27722_MOESM4_ESM.xlsx Mmusculus 17 44257 44257 44450 44449 44445 44263 44447 44442 44260 44261 44448 44441 44446 44440 44262 44441 44264"
## [277] "PMC8748879 /pmc/articles/PMC8748879/bin/41467_2021_27722_MOESM5_ESM.xlsx Hsapiens 10 44256 44258 44264 44257 44259 44260 44261 44257 44263 44262"
## [278] "PMC8762886 /pmc/articles/PMC8762886/bin/13046_2022_2245_MOESM1_ESM.xlsx Hsapiens 1 44257"
## [279] "PMC8762886 /pmc/articles/PMC8762886/bin/13046_2022_2245_MOESM1_ESM.xlsx Hsapiens 6 44264 44263 44449 44454 44264 44442"
## [280] "PMC8760727 /pmc/articles/PMC8760727/bin/12935_2021_2431_MOESM1_ESM.xlsx Hsapiens 23 44442 44443 44257 44445 44262 44450 44264 44451 44259 44256 44261 44447 44263 44441 44531 44258 44440 44266 44448 44444 44256 44449 44260"
## [281] "PMC8760727 /pmc/articles/PMC8760727/bin/12935_2021_2431_MOESM1_ESM.xlsx Hsapiens 26 44257 44442 44443 44257 44446 44445 44262 44450 44264 44451 44259 44256 44261 44447 44263 44441 44531 44265 44258 44440 44454 44266 44448 44444 44256 44260"
## [282] "PMC8760727 /pmc/articles/PMC8760727/bin/12935_2021_2431_MOESM1_ESM.xlsx Hsapiens 22 44257 44442 44443 44257 44446 44445 44262 44450 44264 44451 44259 44256 44261 44447 44263 44441 44265 44454 44448 44444 44256 44449"
## [283] "PMC8759260 /pmc/articles/PMC8759260/bin/13059_2021_2585_MOESM2_ESM.xlsx Hsapiens 22 37316 36951 42248 39142 37500 40422 40787 36951 38777 37681 39692 39326 38412 39508 39873 37135 40057 38231 40238 37316 38596 37865"
## [284] "PMC8759260 /pmc/articles/PMC8759260/bin/13059_2021_2585_MOESM2_ESM.xlsx Hsapiens 22 39692 37681 36951 37316 36951 42248 39142 37500 40422 40787 38777 39326 38412 39508 39873 37135 40057 38231 40238 37316 38596 37865"
## [285] "PMC8756637 /pmc/articles/PMC8756637/bin/13059_2021_2595_MOESM2_ESM.xlsx Mmusculus 9 44446 44261 44261 44446 44449 44449 44446 44449 44449"
## [286] "PMC8732846 /pmc/articles/PMC8732846/bin/10120_2021_1226_MOESM3_ESM.xls Mmusculus 3 44448 44448 44448"
## [287] "PMC8732846 /pmc/articles/PMC8732846/bin/10120_2021_1226_MOESM3_ESM.xls Mmusculus 4 44257 44448 44448 44448"
## [288] "PMC8732846 /pmc/articles/PMC8732846/bin/10120_2021_1226_MOESM5_ESM.xls Mmusculus 3 44440 44448 44444"
## [289] "PMC8732282 /pmc/articles/PMC8732282/bin/41388_2021_2077_MOESM2_ESM.xlsx Hsapiens 23 44531 44257 44265 44266 44258 44259 44260 44261 44262 44263 44264 44449 44450 44451 44453 44441 44442 44443 44444 44445 44446 44447 44448"
## [290] "PMC8728658 /pmc/articles/PMC8728658/bin/jamaneurol-e214781-s002.xlsx Hsapiens 2 44443 44443"
## [291] "PMC8762721 /pmc/articles/PMC8762721/bin/NIHMS1761779-supplement-Table_S8.xlsx Hsapiens 20 44445 44450 44261 44261 44261 44256 44443 44443 44443 44443 44443 44256 44446 44256 44446 44448 44265 44451 44259 44261"
## [292] "PMC8762721 /pmc/articles/PMC8762721/bin/NIHMS1761779-supplement-Table_S9.xlsx Hsapiens 3 44257 44256 44531"
## [293] "PMC8762282 /pmc/articles/PMC8762282/bin/Table2.XLSX Ggallus 18 44446 44454 44441 44261 44443 44257 44260 44262 44263 44449 44258 44450 44447 44445 44451 44444 44448 44257"
## [294] "PMC8762282 /pmc/articles/PMC8762282/bin/Table2.XLSX Hsapiens 18 44446 44454 44441 44261 44443 44257 44260 44262 44263 44449 44258 44450 44447 44445 44451 44444 44448 44256"
## [295] "PMC8762282 /pmc/articles/PMC8762282/bin/Table2.XLSX Hsapiens 18 44446 44454 44441 44261 44443 44257 44260 44262 44263 44449 44258 44450 44447 44445 44451 44444 44448 44256"
## [296] "PMC8762282 /pmc/articles/PMC8762282/bin/Table2.XLSX Ggallus 19 44446 44454 44441 44261 44443 44257 44260 44262 44263 44449 44258 44450 44447 44445 44451 44444 44448 44442 44257"
## [297] "PMC8762282 /pmc/articles/PMC8762282/bin/Table2.XLSX Hsapiens 19 44446 44454 44441 44261 44443 44257 44260 44262 44263 44449 44258 44450 44447 44445 44451 44444 44448 44442 44256"
## [298] "PMC8762282 /pmc/articles/PMC8762282/bin/Table2.XLSX Hsapiens 19 44446 44454 44441 44261 44443 44257 44260 44262 44263 44449 44258 44450 44447 44445 44451 44444 44448 44442 44256"
## [299] "PMC8762282 /pmc/articles/PMC8762282/bin/Table4.XLSX Hsapiens 1 44447"
## [300] "PMC8762282 /pmc/articles/PMC8762282/bin/Table4.XLSX Hsapiens 2 44265 44448"
## [301] "PMC8762282 /pmc/articles/PMC8762282/bin/Table7.XLSX Hsapiens 1 44442"
## [302] "PMC8762282 /pmc/articles/PMC8762282/bin/Table8.XLSX Hsapiens 16 44449 44454 44450 44448 44444 44441 44257 44446 44261 44257 44262 44263 44260 44445 44258 44447"
## [303] "PMC8746912 /pmc/articles/PMC8746912/bin/cir-145-134-s002.xlsx Hsapiens 26 44441 44453 44446 44442 44263 44260 44256 44257 44447 44444 44264 44440 44261 44449 44443 44256 44448 44531 44257 44265 44451 44262 44258 44266 44259 44450"
## [304] "PMC8711282 /pmc/articles/PMC8711282/bin/peerj-09-12682-s008.xlsx Hsapiens 10 44264 44263 44256 44259 44262 44258 44257 44531 44260 44261"
## [305] "PMC8749219 /pmc/articles/PMC8749219/bin/mmc2.xlsx Mmusculus 6 43899 43892 44075 44082 43891 44079"
## [306] "PMC8749219 /pmc/articles/PMC8749219/bin/mmc3.xlsx Mmusculus 6 43899 43892 44082 44075 44079 43891"
## [307] "PMC8749219 /pmc/articles/PMC8749219/bin/mmc5.xlsx Mmusculus 6 43899 43892 44075 44082 43891 44079"
## [308] "PMC8749219 /pmc/articles/PMC8749219/bin/mmc6.xlsx Mmusculus 6 43899 43892 44082 44075 44079 43891"
## [309] "PMC8728608 /pmc/articles/PMC8728608/bin/EMBR-23-e53054-s003.xlsx Hsapiens 26 44256 44263 44446 44450 44531 44257 44443 44444 44262 44448 44440 44453 44441 44454 44258 44445 44264 44451 44261 44442 44260 44266 44449 44259 44265 44447"
## [310] "PMC8747517 /pmc/articles/PMC8747517/bin/elife-75415-supp2.xlsx Hsapiens 4 43349 43349 43351 43351"
## [311] "PMC8750853 /pmc/articles/PMC8750853/bin/13071_2021_5140_MOESM7_ESM.xls Hsapiens 1 44258"
## [312] "PMC8748700 /pmc/articles/PMC8748700/bin/41598_2021_4346_MOESM1_ESM.xlsx Rnorvegicus 2 44081 43897"
## [313] "PMC8748700 /pmc/articles/PMC8748700/bin/41598_2021_4346_MOESM1_ESM.xlsx Rnorvegicus 2 43900 43891"
## [314] "PMC8748700 /pmc/articles/PMC8748700/bin/41598_2021_4346_MOESM1_ESM.xlsx Rnorvegicus 1 44078"
## [315] "PMC8748539 /pmc/articles/PMC8748539/bin/41598_2021_4208_MOESM2_ESM.xlsx Hsapiens 5 44531 44443 44444 44445 44448"
## [316] "PMC8742585 /pmc/articles/PMC8742585/bin/acmi-3-0282-s001.xlsx Hsapiens 2 38596 40057"
## [317] "PMC8740005 /pmc/articles/PMC8740005/bin/12890_2021_1807_MOESM1_ESM.xlsx Rnorvegicus 5 43347 43347 43347 43160 43351"
## [318] "PMC8739483 /pmc/articles/PMC8739483/bin/Table_1.xlsx Hsapiens 1 44453"
## [319] "PMC8739361 /pmc/articles/PMC8739361/bin/mmc2.xlsx Hsapiens 4 44448 44257 44257 44256"
## [320] "PMC8719428 /pmc/articles/PMC8719428/bin/DAD2-13-e12270-s002.xlsx Hsapiens 2 44443 44442"
## [321] "PMC8719428 /pmc/articles/PMC8719428/bin/DAD2-13-e12270-s002.xlsx Hsapiens 1 44443"
## [322] "PMC8719428 /pmc/articles/PMC8719428/bin/DAD2-13-e12270-s002.xlsx Hsapiens 1 44443"
## [323] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Hsapiens 1 44531"
## [324] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Ggallus 1 44441"
## [325] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Hsapiens 2 44257 44256"
## [326] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Hsapiens 1 44441"
## [327] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Hsapiens 1 44445"
## [328] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Hsapiens 1 44449"
## [329] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM7_ESM.xlsx Hsapiens 2 44257 44256"
## [330] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Hsapiens 1 44448"
## [331] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Hsapiens 1 44257"
## [332] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Ggallus 1 44441"
## [333] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Hsapiens 1 44265"
## [334] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Hsapiens 1 44444"
## [335] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Hsapiens 3 44443 44448 44265"
## [336] "PMC8722201 /pmc/articles/PMC8722201/bin/12920_2021_1143_MOESM9_ESM.xlsx Hsapiens 3 44443 44448 44265"
## [337] "PMC8669064 /pmc/articles/PMC8669064/bin/41467_2021_26862_MOESM21_ESM.xlsx Mmusculus 6 44446 44257 44449 44256 44444 44448"
## [338] "PMC8722433 /pmc/articles/PMC8722433/bin/NIHMS1766114-supplement-Supplementary_materials_3.xlsx Hsapiens 181 44446 44261 44448 44448 44256 44262 44448 44448 44257 44262 44264 44263 44444 44256 44444 44263 44261 44263 44263 44264 44446 44448 44444 44440 44448 44263 44261 44445 44261 44448 44441 44261 44257 44440 44257 44450 44441 44256 44260 44257 44446 44260 44262 44446 44449 44448 44448 44258 44441 44261 44445 44450 44446 44450 44441 44261 44449 44260 44450 44445 44261 44448 44257 44262 44448 44256 44257 44441 44446 44444 44448 44441 44441 44448 44441 44446 44256 44441 44446 44441 44258 44446 44441 44261 44256 44444 44441 44256 44258 44256 44441 44441 44444 44440 44450 44447 44448 44441 44448 44260 44261 44444 44448 44258 44441 44440 44448 44450 44441 44448 44449 44262 44450 44447 44450 44448 44445 44448 44448 44441 44257 44264 44449 44448 44446 44447 44256 44262 44448 44448 44256 44445 44446 44441 44440 44263 44441 44448 44440 44446 44256 44448 44447 44448 44446 44448 44448 44444 44263 44448 44446 44441 44445 44262 44445 44258 44441 44441 44262 44256 44448 44441 44445 44441 44448 44446 44263 44446 44441 44440 44450 44257 44440 44446 44446 44447 44441 44448 44256 44261 44262"
## [339] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s3.xlsx Hsapiens 5 44081 44088 43898 43895 43892"
## [340] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s3.xlsx Hsapiens 5 44081 44088 43898 43895 43892"
## [341] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s4.xlsx Hsapiens 1 44448"
## [342] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s4.xlsx Hsapiens 6 44256 44256 44256 44258 44531 44448"
## [343] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s4.xlsx Hsapiens 9 44257 44259 44256 44256 44256 44258 44258 44258 44531"
## [344] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s4.xlsx Hsapiens 6 44450 44256 44256 44258 44531 44448"
## [345] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 17 44257 44260 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44257 44450 44266 44531"
## [346] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 36 44454 44454 44256 44263 44263 44260 44264 44264 44451 44451 44440 44440 44443 44443 44443 44265 44265 44448 44448 44448 44448 44444 44444 44444 44444 44442 44442 44442 44442 44450 44450 44258 44258 44447 44447 44445"
## [347] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 16 44257 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44257 44442 44450 44266 44531"
## [348] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 5 44263 44263 44448 44262 44262"
## [349] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 20 44257 44260 44443 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44448 44257 44442 44450 44266 44531"
## [350] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 12 44256 44263 44260 44440 44448 44448 44448 44448 44448 44441 44441 44531"
## [351] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 35 44454 44454 44454 44263 44260 44264 44264 44451 44451 44440 44440 44443 44443 44443 44265 44265 44448 44448 44444 44444 44444 44444 44442 44442 44442 44442 44450 44450 44450 44258 44258 44447 44447 44447 44445"
## [352] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 3 44263 44262 44262"
## [353] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 36 44454 44454 44256 44263 44260 44264 44264 44451 44451 44451 44440 44440 44443 44443 44443 44265 44265 44448 44448 44444 44444 44444 44444 44442 44442 44442 44442 44450 44450 44266 44258 44258 44447 44447 44447 44445"
## [354] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 4 44256 44260 44441 44441"
## [355] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 4 44263 44262 44266 44447"
## [356] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 8 44256 44260 44448 44448 44449 44441 44441 44444"
## [357] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 10 44260 44443 44448 44448 44449 44444 44442 44450 44258 44447"
## [358] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 10 44454 44448 44449 44444 44444 44444 44442 44450 44447 44447"
## [359] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 3 44449 44441 44445"
## [360] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 3 44448 44448 44449"
## [361] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 5 44448 44448 44448 44448 44266"
## [362] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 5 44443 44448 44449 44266 44258"
## [363] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 62 44262 44261 44265 44441 44447 44441 44441 44441 44441 44441 44441 44262 44441 44447 44262 44446 44441 44441 44441 44441 44441 44441 44441 44441 44441 44441 44261 44441 44441 44441 44441 44441 44441 44441 44441 44446 44441 44262 44441 44441 44441 44441 44441 44446 44441 44441 44261 44261 44266 44261 44261 44261 44262 44450 44265 44440 44440 44265 44265 44448 44448 44257"
## [364] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s5.xlsx Hsapiens 43 44257 44451 44256 44266 44256 44260 44257 44257 44263 44451 44446 44448 44448 44448 44263 44446 44263 44441 44446 44256 44266 44256 44450 44256 44256 44256 44256 44443 44265 44451 44451 44451 44448 44448 44448 44451 44448 44451 44448 44257 44257 44257 44257"
## [365] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s6.xlsx Hsapiens 24 43892 44077 44078 43892 44081 44080 43897 44085 43899 43894 43896 44082 43898 44076 44166 43900 43893 44075 44089 44083 44079 43891 44084 43895"
## [366] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s6.xlsx Hsapiens 27 43892 44077 44078 43892 44081 44080 43897 44085 43899 44086 43894 43891 43896 44088 44082 43898 44076 44166 43900 43893 44075 43901 44083 44079 43891 44084 43895"
## [367] "PMC8690928 /pmc/articles/PMC8690928/bin/thnov12p0459s6.xlsx Hsapiens 27 43892 44077 44078 43892 44081 44080 43897 44085 43899 44086 43894 43891 43896 44088 44082 43898 44076 44166 43900 43893 44075 43901 44083 44079 43891 44084 43895"
## [368] "PMC8718409 /pmc/articles/PMC8718409/bin/Table_1.xlsx Hsapiens 25 44257 44256 44442 44441 44445 44261 44531 44265 44263 44447 44451 44448 44440 44258 44450 44446 44266 44264 44444 44262 44443 44260 44453 44449 44259"
Let’s investigate the errors in more detail.
# By species
SPECIES <- sapply(strsplit(ERROR_GENELISTS," "),"[[",3)
table(SPECIES)
## SPECIES
## Athaliana Celegans Drerio Ggallus Hsapiens Mmusculus
## 2 3 3 16 314 22
## Rnorvegicus Scerevisiae
## 7 1
par(mar=c(5,12,4,2))
barplot(table(SPECIES),horiz=TRUE,las=1)
par(mar=c(5,5,4,2))
# Number of affected Excel files per paper
DIST <- table(sapply(strsplit(ERROR_GENELISTS," "),"[[",1))
DIST
##
## PMC8668906 PMC8669064 PMC8674242 PMC8688513 PMC8690928 PMC8692330 PMC8711282
## 1 1 3 2 29 16 1
## PMC8713784 PMC8713813 PMC8713981 PMC8718409 PMC8719428 PMC8719881 PMC8722201
## 4 1 1 1 3 2 14
## PMC8722433 PMC8724129 PMC8724546 PMC8724844 PMC8725365 PMC8728608 PMC8728658
## 1 1 1 121 1 1 1
## PMC8732282 PMC8732846 PMC8733896 PMC8733947 PMC8738961 PMC8739361 PMC8739483
## 1 3 1 1 1 1 1
## PMC8739494 PMC8740005 PMC8741300 PMC8741763 PMC8741844 PMC8742041 PMC8742043
## 1 1 5 8 1 1 2
## PMC8742585 PMC8744257 PMC8744399 PMC8746912 PMC8747517 PMC8748537 PMC8748539
## 1 2 1 1 1 2 1
## PMC8748700 PMC8748879 PMC8749219 PMC8750853 PMC8751407 PMC8752600 PMC8752785
## 3 4 4 1 1 3 2
## PMC8753122 PMC8754154 PMC8755833 PMC8756151 PMC8756499 PMC8756637 PMC8757482
## 3 1 6 3 1 1 1
## PMC8758788 PMC8759260 PMC8760727 PMC8761743 PMC8762282 PMC8762462 PMC8762721
## 2 2 3 2 10 1 2
## PMC8762874 PMC8762886 PMC8766719 PMC8767772 PMC8769648 PMC8770739 PMC8773395
## 3 2 1 4 3 1 1
## PMC8776252 PMC8777075 PMC8780294 PMC8781439 PMC8782480 PMC8786741 PMC8787225
## 1 2 2 4 1 1 2
## PMC8789167 PMC8792531 PMC8792904 PMC8794705 PMC8797262 PMC8800370
## 2 25 7 3 1 3
summary(as.numeric(DIST))
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 1.000 1.000 2.000 4.434 3.000 121.000
hist(DIST,main="Number of affected Excel files per paper")
# PMC Articles with the most errors
DIST_DF <- as.data.frame(DIST)
DIST_DF <- DIST_DF[order(-DIST_DF$Freq),,drop=FALSE]
head(DIST_DF,20)
## Var1 Freq
## 18 PMC8724844 121
## 5 PMC8690928 29
## 79 PMC8792531 25
## 6 PMC8692330 16
## 14 PMC8722201 14
## 61 PMC8762282 10
## 32 PMC8741763 8
## 80 PMC8792904 7
## 52 PMC8755833 6
## 31 PMC8741300 5
## 8 PMC8713784 4
## 44 PMC8748879 4
## 45 PMC8749219 4
## 67 PMC8767772 4
## 74 PMC8781439 4
## 3 PMC8674242 3
## 12 PMC8719428 3
## 23 PMC8732846 3
## 43 PMC8748700 3
## 48 PMC8752600 3
MOST_ERR_FILES = as.character(DIST_DF[1,1])
MOST_ERR_FILES
## [1] "PMC8724844"
# Number of errors per paper
NERR <- as.numeric(sapply(strsplit(ERROR_GENELISTS," "),"[[",4))
names(NERR) <- sapply(strsplit(ERROR_GENELISTS," "),"[[",1)
NERR <-tapply(NERR, names(NERR), sum)
NERR
## PMC8668906 PMC8669064 PMC8674242 PMC8688513 PMC8690928 PMC8692330 PMC8711282
## 1 6 3 27 447 140 10
## PMC8713784 PMC8713813 PMC8713981 PMC8718409 PMC8719428 PMC8719881 PMC8722201
## 10 1 20 25 4 2 20
## PMC8722433 PMC8724129 PMC8724546 PMC8724844 PMC8725365 PMC8728608 PMC8728658
## 181 1 3 227 2 26 2
## PMC8732282 PMC8732846 PMC8733896 PMC8733947 PMC8738961 PMC8739361 PMC8739483
## 23 10 1 1 2 4 1
## PMC8739494 PMC8740005 PMC8741300 PMC8741763 PMC8741844 PMC8742041 PMC8742043
## 26 5 120 10 21 1 20
## PMC8742585 PMC8744257 PMC8744399 PMC8746912 PMC8747517 PMC8748537 PMC8748539
## 2 2 3 26 4 4 5
## PMC8748700 PMC8748879 PMC8749219 PMC8750853 PMC8751407 PMC8752600 PMC8752785
## 5 45 24 1 29 63 21
## PMC8753122 PMC8754154 PMC8755833 PMC8756151 PMC8756499 PMC8756637 PMC8757482
## 249 1 26 38 6 9 27
## PMC8758788 PMC8759260 PMC8760727 PMC8761743 PMC8762282 PMC8762462 PMC8762721
## 54 44 71 4 131 4 23
## PMC8762874 PMC8762886 PMC8766719 PMC8767772 PMC8769648 PMC8770739 PMC8773395
## 5 7 1 986 26 1 1
## PMC8776252 PMC8777075 PMC8780294 PMC8781439 PMC8782480 PMC8786741 PMC8787225
## 3 7 9 74 17 4 2
## PMC8789167 PMC8792531 PMC8792904 PMC8794705 PMC8797262 PMC8800370
## 23 485 499 15 6 337
hist(NERR,main="number of errors per PMC article")
NERR_DF <- as.data.frame(NERR)
NERR_DF <- NERR_DF[order(-NERR_DF$NERR),,drop=FALSE]
head(NERR_DF,20)
## NERR
## PMC8767772 986
## PMC8792904 499
## PMC8792531 485
## PMC8690928 447
## PMC8800370 337
## PMC8753122 249
## PMC8724844 227
## PMC8722433 181
## PMC8692330 140
## PMC8762282 131
## PMC8741300 120
## PMC8781439 74
## PMC8760727 71
## PMC8752600 63
## PMC8758788 54
## PMC8748879 45
## PMC8759260 44
## PMC8756151 38
## PMC8751407 29
## PMC8688513 27
MOST_ERR = rownames(NERR_DF)[1]
MOST_ERR
## [1] "PMC8767772"
GENELIST_ERROR_ARTICLES <- gsub("PMC","",GENELIST_ERROR_ARTICLES)
### JSON PARSING is more reliable than XML
ARTICLES <- esummary( GENELIST_ERROR_ARTICLES , db="pmc" , retmode = "json" )
ARTICLE_DATA <- reutils::content(ARTICLES,as= "parsed")
ARTICLE_DATA <- ARTICLE_DATA$result
ARTICLE_DATA <- ARTICLE_DATA[2:length(ARTICLE_DATA)]
JOURNALS <- unlist(lapply(ARTICLE_DATA,function(x) {x$fulljournalname} ))
JOURNALS_TABLE <- table(JOURNALS)
JOURNALS_TABLE <- JOURNALS_TABLE[order(-JOURNALS_TABLE)]
length(JOURNALS_TABLE)
## [1] 50
NUM_JOURNALS=length(JOURNALS_TABLE)
par(mar=c(5,25,4,2))
barplot(head(JOURNALS_TABLE,10), horiz=TRUE, las=1,
xlab="Articles with gene name errors in supp files",
main="Top journals this month")
Congrats to our Journal of the Month winner!
JOURNAL_WINNER <- names(head(JOURNALS_TABLE,1))
JOURNAL_WINNER
## [1] "Nature Communications"
There are two categories:
Paper with the most suplementary files affected by gene name errors (MOST_ERR_FILES)
Paper with the most gene names converted to dates (MOST_ERR)
Sometimes, one paper can win both categories. Congrats to our winners.
MOST_ERR_FILES <- gsub("PMC","",MOST_ERR_FILES)
ARTICLES <- esummary( MOST_ERR_FILES , db="pmc" , retmode = "json" )
ARTICLE_DATA <- reutils::content(ARTICLES,as= "parsed")
ARTICLE_DATA <- ARTICLE_DATA[2]
ARTICLE_DATA
## $result
## $result$uids
## [1] "8724844"
##
## $result$`8724844`
## $result$`8724844`$uid
## [1] "8724844"
##
## $result$`8724844`$pubdate
## [1] "2021 Aug 25"
##
## $result$`8724844`$epubdate
## [1] "2021 Aug 25"
##
## $result$`8724844`$printpubdate
## [1] "2022 Jan 1"
##
## $result$`8724844`$source
## [1] "Biomol Ther (Seoul)"
##
## $result$`8724844`$authors
## name authtype
## 1 Chun KH Author
##
## $result$`8724844`$title
## [1] "Discovery of Cellular RhoA Functions by the Integrated Application of Gene Set Enrichment Analysis"
##
## $result$`8724844`$volume
## [1] "30"
##
## $result$`8724844`$issue
## [1] "1"
##
## $result$`8724844`$pages
## [1] "98-116"
##
## $result$`8724844`$articleids
## idtype value
## 1 pmid 34429388
## 2 doi 10.4062/biomolther.2021.075
## 3 pmcid PMC8724844
##
## $result$`8724844`$fulljournalname
## [1] "Biomolecules & Therapeutics"
##
## $result$`8724844`$sortdate
## [1] "2021/08/25 00:00"
##
## $result$`8724844`$pmclivedate
## [1] "2022/01/07"
MOST_ERR <- gsub("PMC","",MOST_ERR)
ARTICLE_DATA <- esummary(MOST_ERR,db = "pmc" , retmode = "json" )
ARTICLE_DATA <- reutils::content(ARTICLE_DATA,as= "parsed")
ARTICLE_DATA
## $header
## $header$type
## [1] "esummary"
##
## $header$version
## [1] "0.3"
##
##
## $result
## $result$uids
## [1] "8767772"
##
## $result$`8767772`
## $result$`8767772`$uid
## [1] "8767772"
##
## $result$`8767772`$pubdate
## [1] "2021 Oct 26"
##
## $result$`8767772`$epubdate
## [1] "2021 Oct 26"
##
## $result$`8767772`$printpubdate
## [1] "2021"
##
## $result$`8767772`$source
## [1] "Commun Med (London)"
##
## $result$`8767772`$authors
## name authtype
## 1 Konigsberg IR Author
## 2 Barnes B Author
## 3 Campbell M Author
## 4 Davidson E Author
## 5 Zhen Y Author
## 6 Pallisard O Author
## 7 Boorgula MP Author
## 8 Cox C Author
## 9 Nandy D Author
## 10 Seal S Author
## 11 Crooks K Author
## 12 Sticca E Author
## 13 Harrison GF Author
## 14 Hopkinson A Author
## 15 Vest A Author
## 16 Arnold CG Author
## 17 Kahn MG Author
## 18 Kao DP Author
## 19 Peterson BR Author
## 20 Wicks SJ Author
## 21 Ghosh D Author
## 22 Horvath S Author
## 23 Zhou W Author
## 24 Mathias RA Author
## 25 Norman PJ Author
## 26 Porecha R Author
## 27 Yang IV Author
## 28 Gignoux CR Author
## 29 Monte AA Author
## 30 Taye A Author
## 31 Barnes KC Author
##
## $result$`8767772`$title
## [1] "Host methylation predicts SARS-CoV-2 infection and clinical outcome"
##
## $result$`8767772`$volume
## [1] "1"
##
## $result$`8767772`$issue
## [1] "1"
##
## $result$`8767772`$pages
## [1] "42"
##
## $result$`8767772`$articleids
## idtype value
## 1 pmid 35072167
## 2 doi 10.1038/s43856-021-00042-y
## 3 pmcid PMC8767772
##
## $result$`8767772`$fulljournalname
## [1] "Communications Medicine"
##
## $result$`8767772`$sortdate
## [1] "2021/10/26 00:00"
##
## $result$`8767772`$pmclivedate
## [1] "2022/01/19"
To plot the trend over the past 6-12 months.
url <- "http://ziemann-lab.net/public/gene_name_errors/"
doc <- htmlParse(url)
links <- xpathSApply(doc, "//a/@href")
links <- links[grep("html",links)]
links
## href href href
## "Report_2021-02.html" "Report_2021-03.html" "Report_2021-04.html"
## href href href
## "Report_2021-05.html" "Report_2021-06.html" "Report_2021-07.html"
## href href href
## "Report_2021-08.html" "Report_2021-09.html" "Report_2021-10.html"
## href href href
## "Report_2021-11.html" "Report_2021-12.html" "Report_2022-01.html"
unlink("online_files/",recursive=TRUE)
dir.create("online_files")
sapply(links, function(mylink) {
download.file(paste(url,mylink,sep=""),destfile=paste("online_files/",mylink,sep=""))
} )
## href href href href href href href href href href href href
## 0 0 0 0 0 0 0 0 0 0 0 0
myfilelist <- list.files("online_files/",full.names=TRUE)
trends <- sapply(myfilelist, function(myfilename) {
x <- readLines(myfilename)
# Num XL gene list articles
NUM_GENELIST_ARTICLES <- x[grep("NUM_GENELIST_ARTICLES",x)[3]+1]
NUM_GENELIST_ARTICLES <- sapply(strsplit(NUM_GENELIST_ARTICLES," "),"[[",3)
NUM_GENELIST_ARTICLES <- sapply(strsplit(NUM_GENELIST_ARTICLES,"<"),"[[",1)
NUM_GENELIST_ARTICLES <- as.numeric(NUM_GENELIST_ARTICLES)
# number of affected articles
NUM_ERROR_GENELIST_ARTICLES <- x[grep("NUM_ERROR_GENELIST_ARTICLES",x)[3]+1]
NUM_ERROR_GENELIST_ARTICLES <- sapply(strsplit(NUM_ERROR_GENELIST_ARTICLES," "),"[[",3)
NUM_ERROR_GENELIST_ARTICLES <- sapply(strsplit(NUM_ERROR_GENELIST_ARTICLES,"<"),"[[",1)
NUM_ERROR_GENELIST_ARTICLES <- as.numeric(NUM_ERROR_GENELIST_ARTICLES)
# Error proportion
ERROR_PROPORTION <- x[grep("ERROR_PROPORTION",x)[3]+1]
ERROR_PROPORTION <- sapply(strsplit(ERROR_PROPORTION," "),"[[",3)
ERROR_PROPORTION <- sapply(strsplit(ERROR_PROPORTION,"<"),"[[",1)
ERROR_PROPORTION <- as.numeric(ERROR_PROPORTION)
# number of journals
NUM_JOURNALS <- x[grep('JOURNALS_TABLE',x)[3]+1]
NUM_JOURNALS <- sapply(strsplit(NUM_JOURNALS," "),"[[",3)
NUM_JOURNALS <- sapply(strsplit(NUM_JOURNALS,"<"),"[[",1)
NUM_JOURNALS <- as.numeric(NUM_JOURNALS)
NUM_JOURNALS
res <- c(NUM_GENELIST_ARTICLES,NUM_ERROR_GENELIST_ARTICLES,ERROR_PROPORTION,NUM_JOURNALS)
return(res)
})
colnames(trends) <- sapply(strsplit(colnames(trends),"_"),"[[",3)
colnames(trends) <- gsub(".html","",colnames(trends))
trends <- as.data.frame(trends)
rownames(trends) <- c("NUM_GENELIST_ARTICLES","NUM_ERROR_GENELIST_ARTICLES","ERROR_PROPORTION","NUM_JOURNALS")
trends <- t(trends)
trends <- as.data.frame(trends)
CURRENT_RES <- c(NUM_GENELIST_ARTICLES,NUM_ERROR_GENELIST_ARTICLES,ERROR_PROPORTION,NUM_JOURNALS)
trends <- rbind(trends,CURRENT_RES)
paste(CURRENT_YEAR,CURRENT_MONTH,sep="-")
## [1] "2022-02"
rownames(trends)[nrow(trends)] <- paste(CURRENT_YEAR,CURRENT_MONTH,sep="-")
plot(trends$NUM_GENELIST_ARTICLES, xaxt = "n" , type="b" , main="Number of articles with Excel gene lists per month",
ylab="number of articles", xlab="month")
axis(1, at=1:nrow(trends), labels=rownames(trends))
plot(trends$NUM_ERROR_GENELIST_ARTICLES, xaxt = "n" , type="b" , main="Number of articles with gene name errors per month",
ylab="number of articles", xlab="month")
axis(1, at=1:nrow(trends), labels=rownames(trends))
plot(trends$ERROR_PROPORTION, xaxt = "n" , type="b" , main="Proportion of articles with Excel gene list affected by errors",
ylab="proportion", xlab="month")
axis(1, at=1:nrow(trends), labels=rownames(trends))
plot(trends$NUM_JOURNALS, xaxt = "n" , type="b" , main="Number of journals with affected articles",
ylab="number of journals", xlab="month")
axis(1, at=1:nrow(trends), labels=rownames(trends))
unlink("online_files/",recursive=TRUE)
Zeeberg, B.R., Riss, J., Kane, D.W. et al. Mistaken Identifiers: Gene name errors can be introduced inadvertently when using Excel in bioinformatics. BMC Bioinformatics 5, 80 (2004). https://doi.org/10.1186/1471-2105-5-80
Ziemann, M., Eren, Y. & El-Osta, A. Gene name errors are widespread in the scientific literature. Genome Biol 17, 177 (2016). https://doi.org/10.1186/s13059-016-1044-7
sessionInfo()
## R version 4.1.2 (2021-11-01)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 20.04.3 LTS
##
## Matrix products: default
## BLAS: /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.9.0
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.9.0
##
## locale:
## [1] LC_CTYPE=en_AU.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_AU.UTF-8 LC_COLLATE=en_AU.UTF-8
## [5] LC_MONETARY=en_AU.UTF-8 LC_MESSAGES=en_AU.UTF-8
## [7] LC_PAPER=en_AU.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_AU.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] readxl_1.3.1 reutils_0.2.3 xml2_1.3.3 jsonlite_1.7.2 XML_3.99-0.8
##
## loaded via a namespace (and not attached):
## [1] Rcpp_1.0.7 knitr_1.37 magrittr_2.0.1 R6_2.5.1
## [5] rlang_0.4.12 fastmap_1.1.0 stringr_1.4.0 highr_0.9
## [9] tools_4.1.2 xfun_0.29 jquerylib_0.1.4 htmltools_0.5.2
## [13] yaml_2.2.1 digest_0.6.29 assertthat_0.2.1 sass_0.4.0
## [17] bitops_1.0-7 RCurl_1.98-1.5 evaluate_0.14 rmarkdown_2.11
## [21] stringi_1.7.6 compiler_4.1.2 bslib_0.3.1 cellranger_1.1.0