PubChem 2019 update: improved access to chemical data

Sunghwan Kim, Jie Chen, Tiejun Cheng, Asta Gindulyte, Jia He, Siqian He, Qingliang Li, Benjamin A Shoemaker, Paul A Thiessen, Bo Yu, Leonid Zaslavsky, Jian Zhang, Evan E Bolton

Abstract

PubChem (https://pubchem.ncbi.nlm.nih.gov) is a key chemical information resource for the biomedical research community. Substantial improvements have been made in the past few years. New data content was added, including spectral information, scientific articles mentioning chemicals, and information for food and agricultural chemicals. PubChem released new web interfaces, such as the PubChem Target View page, the Sources page, the Bioactivity dyad pages and the Patent View page. PubChem also released a major update to PubChem Widgets and introduced a new programmatic access interface called PUG-View. This paper describes these new developments in PubChem.
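As a rough illustration of what programmatic access through PUG-View can look like, the sketch below builds a request URL for a single record and, optionally, fetches it. The URL pattern (`/rest/pug_view/data/<record type>/<id>/<format>`) is taken from PubChem's public PUG-View documentation, not from this abstract, and the helper names `pug_view_url` and `fetch_record` are hypothetical. CID 2678 is the compound used in Figure 2.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path of the PUG-View data endpoint (assumption: taken from the public
# PubChem documentation, not from this abstract).
PUG_VIEW_BASE = "https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data"


def pug_view_url(record_type, record_id, output="JSON"):
    """Build a PUG-View URL for one record, e.g. ("compound", 2678)."""
    return "/".join(
        [PUG_VIEW_BASE, quote(str(record_type)), quote(str(record_id)), output]
    )


def fetch_record(record_type, record_id, timeout=30):
    """Download one record as JSON and return it as a Python dict.

    This makes a network request; call it sparingly and respect
    PubChem's usage policies.
    """
    with urlopen(pug_view_url(record_type, record_id), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Building the URL needs no network access.
print(pug_view_url("compound", 2678))
```

Calling `fetch_record("compound", 2678)` would retrieve the annotation data for that compound; the layout of the returned JSON is not described here and should be inspected before relying on any particular field.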

Figures

Figure 1.
Number of unique PubChem users per month (interactive users only).
Figure 2.
PubChem Target View page for the human histamine receptor H1 (HRH1) gene (https://pubchem.ncbi.nlm.nih.gov/target/gene/3269) (bottom right), along with its example entry points from the Compound Summary page for CID 2678 (https://pubchem.ncbi.nlm.nih.gov/compound/2678) and the BioAssay Record page for AID 238823 (https://pubchem.ncbi.nlm.nih.gov/bioassay/238823).
Figure 3.
PubChem Bioactivity dyad page for SID 4247730 (corresponding to CID 3241895) and AID 820 (https://pubchem.ncbi.nlm.nih.gov/bioassay/820#sid=4247730) (right). This page can be accessed from the Substance Record page for SID 4247730 (https://pubchem.ncbi.nlm.nih.gov/substance/4247730), the Compound Summary page for CID 3241895 (https://pubchem.ncbi.nlm.nih.gov/compound/3241895), or the BioAssay Record page for AID 820 (https://pubchem.ncbi.nlm.nih.gov/bioassay/820).
Figure 4.
PubChem Patent View page for US8501698 (https://pubchem.ncbi.nlm.nih.gov/patent/US8501698) (right). This page can be accessed from the ‘Depositor-Supplied Patent Identifiers’ section on the Compound Summary page for CID 2162 (https://pubchem.ncbi.nlm.nih.gov/compound/2162) (left).

Source: PubMed
