bitr_kegg
clusterProfiler
can convert biological IDs using OrgDb
object via the bitr
function. Now I implemented another function, bitr_kegg
for converting IDs through KEGG API.
library(clusterProfiler)
data(gcSample)
hg <- gcSample[[1]]
head(hg)
## [1] "4597" "7111" "5266" "2175" "755" "23046"
eg2np <- bitr_kegg(hg, fromType='kegg', toType='ncbi-proteinid', organism='hsa')
## Warning in bitr_kegg(hg, fromType = "kegg", toType = "ncbi-proteinid",
## organism = "hsa"): 3.7% of input gene IDs are fail to map...
head(eg2np)
## kegg ncbi-proteinid
## 1 8326 NP_003499
## 2 58487 NP_001034707
## 3 139081 NP_619647
## 4 59272 NP_068576
## 5 993 NP_001780
## 6 2676 NP_001487
np2up <- bitr_kegg(eg2np[,2], fromType='ncbi-proteinid', toType='uniprot', organism='hsa')
head(np2up)
## ncbi-proteinid uniprot
## 1 NP_005457 O75586
## 2 NP_005792 P41567
## 3 NP_005792 Q6IAV3
## 4 NP_037536 Q13421
## 5 NP_006054 O60662
## 6 NP_001092002 O95398
The ID type (both fromType & toType) should be one of ‘kegg’, ‘ncbi-geneid’, ‘ncbi-proteinid’ or ‘uniprot’. The ‘kegg’ is the primary ID used in KEGG database. The data source of KEGG was from NCBI. A rule of thumb for the ‘kegg’ ID is entrezgene
ID for eukaryote species and Locus
ID for prokaryotes.
Continue reading