Web services for data warehouses: OMOP and PCORnet on i2b2

Jeffrey G Klann, Lori C Phillips, Christopher Herrick, Matthew A H Joss, Kavishwar B Wagholikar, Shawn N Murphy, Jeffrey G Klann, Lori C Phillips, Christopher Herrick, Matthew A H Joss, Kavishwar B Wagholikar, Shawn N Murphy

Abstract

Objective: Healthcare organizations use research data models supported by projects and tools that interest them, which often means organizations must support the same data in multiple models. The healthcare research ecosystem would benefit if tools and projects could be adopted independently from the underlying data model. Here, we introduce the concept of a reusable application programming interface (API) for healthcare and show that the i2b2 API can be adapted to support diverse patient-centric data models.

Materials and methods: We develop methodology for extending i2b2's pre-existing API to query additional data models, using i2b2's recent "multi-fact-table querying" feature. Our method involves developing data-model-specific i2b2 ontologies and mapping these to query non-standard table structure.

Results: We implement this methodology to query OMOP and PCORnet models, which we validate with the i2b2 query tool. We implement the entire PCORnet data model and a five-domain subset of the OMOP model. We also demonstrate that additional, ancillary data model columns can be modeled and queried as i2b2 "modifiers."

Discussion: i2b2's REST API can be used to query multiple healthcare data models, enabling shared tooling to have a choice of backend data stores. This enables separation between data model and software tooling for some of the more popular open analytic data models in healthcare.

Conclusion: This methodology immediately allows querying OMOP and PCORnet using the i2b2 API. It is released as an open-source set of Docker images, and also on the i2b2 community wiki.

Figures

Figure 1.
Figure 1.
Web services on data warehouses. Shown here: the i2b2 database uses i2b2 XML REST services to communicate with the query tool. The bottom two arrows on the left show hypothetical connections to PCORnet and OMOP.
Figure 2.
Figure 2.
Showing the linkage between i2b2 ontology and non-i2b2 data models.
Figure 3.
Figure 3.
A simplified query XML for the query in Figure 4, showing a query for all black, female patients with diabetes mellitus. The “item key” provides the unique ontology link that identifies each data element in the query.
Figure 4.
Figure 4.
i2b2 querying OMOP’s synPUF 1000-patient dataset. This query took 2.2 seconds.

References

    1. OHDSI | Observational Health Data Sciences and Informatics. Accessed July 29, 2015.
    1. Hripcsak G, Duke JD, Shah NH, et al. Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers. Stud Health Technol Inform 2015; 216: 574–8.
    1. OHDSI. resources: data_network. Observational Health Data Sciences and Informatics. Accessed April 18, 2018.
    1. All of Us Research Program. National Institutes of Health (NIH). Accessed March 6, 2017.
    1. Collins FS, Hudson KL, Briggs JP, et al. PCORnet: turning a dream into reality. J Am Med Inform Assoc 2014; 214: 576–7.
    1. PCORnet Common Data Model (CDM). PCORnet. Accessed March 6, 2017.
    1. Murphy SN, Weber G, Mendis M, et al. Serving the enterprise and beyond with Informatics for Integrating Biology and the Bedside (i2b2). J Am Med Inform Assoc 2010; 172: 124–30.
    1. McMurry AJ, Murphy SN, MacFadden D, et al. SHRINE: enabling nationally scalable multi-site disease studies. PLoS One 2013; 83: e55811.
    1. CTSA Consortium Tackling Clinical Trial Recruitment Roadblocks. National Center for Advancing Translational Sciences. 2015. Accessed September 21, 2016.
    1. i2b2 Community Plugins. GitHub. Accessed January 6, 2018.
    1. Klann JG, Abend A, Raghavan VA, et al. Data interchange using i2b2. J Am Med Inform Assoc 2016; 235: 909–15.
    1. Murphy SN, Mendis M, Hackett K, et al. Architecture of the Open-source Clinical Research Chart from Informatics for Integrating Biology and the Bedside. AMIA Annu Symp Proc 2007; 2007: 548–52.
    1. Wagholikar KB, Mandel JC, Klann JG, et al. SMART-on-FHIR implemented over i2b2. J Am Med Inform Assoc 2017; 242: 398.
    1. Klann JG, Murphy SN.. Computing health quality measures using Informatics for Integrating Biology and the Bedside. J Med Internet Res 2013; 154: e75.
    1. Klann JG, Mendis M, Phillips LC, et al. Taking advantage of continuity of care documents to populate a research repository. J Am Med Inform Assoc 2014; 22 (2): 370–9.
    1. Wattanasin N, Porter A, Ubaha S, et al. Apps to display patient data, making SMART available in the i2b2 platform. AMIA Annu Symp Proc 2012; 2012: 960–9.
    1. Kenneth Fisher. UNPIVOT a table using CROSS APPLY. SQL Studies. 2013. Accessed April 23, 2018.
    1. unpivot and PostgreSQL - Stack Overflow. Accessed April 23, 2018.
    1. Accessible Research Commons for Health Github. GitHub. Accessed March 6, 2017.
    1. Phillips L. NCBO Extraction Tool Version 2.0. NCBO Ontology Tools for i2b2. Accessed December 4, 2014.
    1. CommonDataModel: Definition and DDLs for the OMOP Common Data Model (CDM). Observational Health Data Sciences and Informatics 2017. Accessed December 22, 2017.
    1. i2b2 on OMOP - Release 1.7.09. Accessed November 15, 2017.
    1. Klann JG, Joss M. i2b2-on-PCORnet: Scripts and Documentation for Implementing. Accessible Research Commons for Health 2017. Accessed January 4, 2018.
    1. ETL-CMS: Workproducts to ETL CMS datasets into OMOP Common Data Model. Observational Health Data Sciences and Informatics 2017. Accessed January 7, 2018.
    1. pcori/omop i2b2 demo. Accessed November 15, 2017.
    1. Install dockerized i2b2 on PCORnet/OMOP. Accessed November 15, 2017.

Source: PubMed

3
Subscribe