Data Architecture

In this document we describe the conventions used in the processing and storage of data within the OHDSI architecture. Data within the OHDSI data architecture falls into one of four categories: Source Data, Standardized Data, Derived Data and Administrative Data.

  • Source Data : Person-level data that is acquired from different parts of the healthcare system. These data are said to exist in the “native schema”.
  • Standardized Data : Data that has been converted into the OMOP CDM format. These data are said to exist in the “CDM schema”.
  • Derived Data : Data that is the result of some form of processing on data that came from the OMOP CDM format. These data are said to exist in the “results schema”.
  • Administrative Data : Data that is created or maintained to describe a concept, process or system function within the broader OHDSI architecture. These data are said to exist in the “OHDSI schema”.
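
The four categories and their schemas can be sketched as a simple lookup. This is only an illustration of the convention described above: the names `DataCategory` and `schema_for` are hypothetical and are not part of any OHDSI tool.

```python
from enum import Enum


class DataCategory(Enum):
    """The four data categories, each paired with the schema label used above.

    Hypothetical helper for illustration; not an OHDSI API.
    """

    SOURCE = "native"
    STANDARDIZED = "CDM"
    DERIVED = "results"
    ADMINISTRATIVE = "OHDSI"


def schema_for(category: DataCategory) -> str:
    """Return the conventional schema label for a data category."""
    return f"{category.value} schema"


if __name__ == "__main__":
    for category in DataCategory:
        print(f"{category.name}: {schema_for(category)}")
```

A lookup like this keeps the category-to-schema convention in one place, so code that writes derived output, for example, asks for the “results schema” rather than hard-coding a name.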

Data Life-Cycle

Data within the OHDSI data architecture follows a life-cycle. Data primarily originates from person-level data sources, which include electronic health records, administrative claims records, clinical trial data and billing data, among others. These sources are typically received in their own proprietary format, which we refer to as the “Native format”. OHDSI standardizes these sources by performing ETL (extract, transform, load) processes that convert the data from its Native format to the OMOP Common Data Model (CDM) format.
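
As a minimal sketch of the Native-to-CDM conversion step, the example below maps one record from a made-up native layout to a few columns of the CDM PERSON table. The `NativePatient` layout and the `to_cdm_person` function are assumptions for illustration only; a real ETL covers many more tables and columns and resolves concepts through the OMOP vocabulary tables rather than a hard-coded dictionary.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class NativePatient:
    """Hypothetical native-format record; real sources differ per site."""

    patient_key: str
    sex: str
    birth_date: date


@dataclass
class CdmPerson:
    """A small subset of the columns in the OMOP CDM PERSON table."""

    person_id: int
    gender_concept_id: int
    year_of_birth: int
    person_source_value: str
    gender_source_value: str


# Standard OMOP gender concepts (8507 = male, 8532 = female).
# Hard-coded here for brevity; an actual ETL would look these up.
GENDER_CONCEPTS = {"M": 8507, "F": 8532}


def to_cdm_person(record: NativePatient, person_id: int) -> CdmPerson:
    """Transform one native record into a CDM-style person row."""
    return CdmPerson(
        person_id=person_id,
        # Concept 0 means "no matching concept" when the code is unmapped.
        gender_concept_id=GENDER_CONCEPTS.get(record.sex.upper(), 0),
        year_of_birth=record.birth_date.year,
        # Source values preserve the original native codes for traceability.
        person_source_value=record.patient_key,
        gender_source_value=record.sex,
    )


if __name__ == "__main__":
    native = NativePatient(patient_key="A-001", sex="f", birth_date=date(1980, 5, 17))
    print(to_cdm_person(native, person_id=1))
```

Keeping the original codes in the `*_source_value` columns lets a row in the CDM schema be traced back to its record in the native schema.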

development/data_architecture.1471889176.txt.gz · Last modified: 2016/08/22 18:06 by frank_defalco