© 2024 IQVIA - All Rights Reserved

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 10 Next »

Overview

The purpose of this page is to provide high-level guidance one what should be contained in a Read Replica training module. The goal of preparing this material should be to present it to JnJ digitally and with in-person training. After it has been digested by their team, we should refine and package it in a way that can be shared with others by way of our knowledge transfer tools. In addition to the content shared here, we also need to share the full data model in a highly consumable fashion (ie the HTML frame-set generated by ClinSpark builds).

Read Replicas

Show the logical architecture of a ClinSpark deployment.  Highlight the replicas and one reserved for use by customers.  Note encryption.  


Use Cases

ClinSpark has many reports, dashboards and export options available for customers.  These have been built to answer a wide array of the most common shared use cases for ClinSpark customers.

With the Read Replica it's possible to have your users create custom reports, queries or visual analysis.  Examples of customer use cases for this data include

Custom Reports


Ad Hoc Queries


Advanced Business Intelligence Dashboards


Latency 

Describe how AWS RDS populates a replica and what the expected lag time is between master and replica.

Connecting to your Replica

PrivateLink Connection.  We would set this up for you during onboarding.

<diagram showing user computer, ssh tunnel

Share details on what an SSH tunnel is, and how it can be leveraged by customers in order to connect business intelligence tools.

Clinical Data Interchange Standards Consortium Standard (CDISC)

To the maximum extent possible, the ClinSpark data model is based on the CDISC Operational Data Model (ODM) standard. Introduce CDISC ODM.

Links to CDISC documentation.  Identify subject areas driven by CDISC.  Introduce core entities to highlight how the standard represents clinical study design and study data.  Introduction to these tables is given in the context of CDISC, and cleanly segues into the data model below.  Highlights implicit point that the CDISC documentation itself is a valuable guide to the CS data model.

ClinSpark Data Model

to the extent possible and practical, the CS data model is tightly aligned to CDSIC 1.3.2.  There are several benefits to this

  • Standards-based simplifies interoperability 
  • Simplified training and transferable knowledge
  • High quality data foundation vs proprietary makes data more future-proof

The following section highlights a series of key subject areas within the ClinSpark data model.  Where elements are defined within the CDISC ODM standard, the descriptions from this documentation are used.

Study Design

Studies, Sites, StudySites an Subjects


TableFrom CDISC?Notes
1studyYesThis element collects static structural information about an individual study.  A study is related to a given clinical trial protocol.
2study_siteNoAn association between a physical site and a study.  A study site is different than a physical location.  Often, pharma sponsors will specify sites with arbitrary codes and those codes must pass through during data export time. In addition, this domain encapsulates recruitment efforts for a given study / site.
3siteNoA site is a physical place belonging to an organization.  An organization having multiple physical clinical sites will have multiple site rows.
4subjectNoSomeone participating in clinical research within the context of a given study. Creates glue between the volunteer and the participation.

Form Definition


TableFrom CDISC?Notes
1StudyMetaDataYesStudyMetaData (MetadataVersion in CDISC) defines the types of study events, forms, item groups, and items that form the study data.  This is basically an aggregation of all CRF design elements for a study.
2FormYes

A Form (FormDef in CDISC) describes a type of form that can occur in a study. 

A form is basically a container for item groups. This class explicitly excludes the required 'repeating' (Yes|No) attribute from the domain due to the fact that phase 1 studies are different and that it's likely that most of the forms will repeat as they are related to a study event. At ODM build time, we'll check for those forms that repeat and ensure that we create the repeating attribute properly. 

3ItemGroupYes

An ItemGroup (ItemGroupDef in CDISC) describes a type of item group that can occur within a study.

It basically is collection of related items in a given form.

4ItemYes

An Item (ItemDef in CDISC) describes a type of item that can occur within a study. Item properties include name, datatype, measurement units, range or codelist restrictions, and several other properties.

It basically represents the definition of a piece of data collected.

5ItemGroupRefYesA reference to a given ItemGroup.  This reference can hold data about the association.
6ItemRefYesA reference to a given Item.  This reference can hold data about the association.

Activity Plans

An activity plan is a schedule of events for a given cohort.  Activity Plans do not appear in CDSIC, there is no similar construct in the ODM to capture this concept.  Activity Plan, Segment and Scheduled Activity are ClinSpark specific.


TableFrom CDISC?Notes
1StudyYesThis element collects static structural information about an individual study.  A study is related to a given clinical trial protocol.
2Activity PlanNoA schedule of events for a given cohort. Plans can be assigned to multiple cohorts. A timed plan must have a reference time in order to properly provide UI feedback as segments and scheduled activities are set.

Untimed Activity Plan:
  • Reference time must be null
  • Single segment; segment must be root and must have offset second set to zero


Timed Activity Plan:
  • Must have a reference time
  • Can have 1-n segments; always sort by offset seconds
  • Reference segment must have offset seconds of zero
3SegmentPartially

Holds a group of scheduled activities in an activity plan. The segment's offset seconds is essentially the time of the reference event, all scheduled activities are relative to this.

Modeled somewhat of off CDISC SDM:

"Segments are often seen as the basic building blocks of study design. A segment usually specifies a combination of planned observations and interventions, which may or may not involve treatment, during a period of time."

4Scheduled ActivityNoWraps a form, but adds metadata including timing.
5FormYesA form is basically a container for item groups.
6Study EventYesA study event represents a given 'visit'. In phase 1 trials this will commonly simply refer to a 'day'. When scheduling forms for a given schedule, the builder must associate the study event. Note: there are common study events that are typically reserved for special events: unscheduled, common (AE, CM), etc

Epochs and Cohorts


TableFrom CDISC?Notes
1epochNoAn epoch is typically specified in a study protocol and typically signifies some milestone type events within the trial. ie: screening, treatment, followup, etc
2cohortNoA cohort is a way to break up epochs into different groupings. Protocols will occasionally indicate that epochs should be broken up (perhaps in different trial arms to test different dose levels, etc), or this can just purely be an organizational thing
3cohort_assignmentNo

Binds a Volunteer as Subjects within a given cohort along with an Activity Plan. We allow assigning different schedules, and we allow for later scheduling a subject.

4activity_planNoA schedule of events for a given cohort.
6subjectNoSomeone participating in clinical research within the context of a given study.


Clinical Data

Parallel data and definition objects in CDISC

CDISC ODM defines a pattern for modeling the definition of a clinical data element and the data captured.  This pattern has one element describe the definition, and another element record the data for a particular clinical collection of this element type.  ClinSpark adheres to this pattern, though as noted in the table below, the suffix "Def" is removed from the definition elements.  Here are the examples

Data Definition ElementData Capture ElementNotes
Item (ItemDef in CDISC)ItemDataRepresent the definition of a piece of data collected
ItemGroup (ItemGroupDef in CDISC)ItemGroupDataAggregates item data
Form (FormDef in CDISC)FormDataBasically a container for item groups
StudyEvent (StudyEventDef in CDISC)StudyEventDataClinical data for a study event (visit) for a given subject

With this in mind, below is a view of how these elements relate to each other.



TableFrom CDISC?Notes
1formYesBasically a container for item groups.
2form_dataYesForm data represents data collected for a given subject. Instead of storing the scheduled time on this domain, we leverage the relationship to the encapsulated scheduledActivity domain and thus its relationship to the segment. We purposely don't set formRepeatKey in the domain. It is calculated later on when building ODM.
3item_groupYesA collection of related items in a given form
4item_group_refYesA given item group within a form definition.
5item_group_dataYesAggregates item data
6itemYesRepresents the definition of a piece of data collected.
7item_refYesA given item within an item group definition.
8item_dataYesA piece of collected data
9study_eventYesA study event represents a given 'visit'. In phase 1 trials this will commonly simply refer to a 'day'. When scheduling forms for a given schedule, the builder must associate the study event. Note: there are common study events that are typically reserved for special events: unscheduled, common (AE, CM), etc
10study_event_dataYesClinical data for a study event (visit) for a given subject

Study Data



TableFrom CDISC?Notes
1item_dataYesA piece of collected data
2item_group_dataYesAggregates item data
3form_dataYesForm data represents data collected for a given subject. Instead of storing the scheduled time on this domain, we leverage the relationship to the encapsulated scheduledActivity domain and thus its relationship to the segment.
4study_event_dataYesClinical data for a study event (visit) for a given subject
5study_eventYesA study event represents a given 'visit'. In phase 1 trials this will commonly simply refer to a 'day'. When scheduling forms for a given schedule, the builder must associate the study event. Note: there are common study events that are typically reserved for special events: unscheduled, common (AE, CM), etc
6subjectNoSomeone participating in clinical research within the context of a given study.

Lab Data

Lab data can be reached via ItemData


TableFrom CDISC?Notes
1item_dataYesA piece of collected data.  Note that lab orders and all results are associated to a given Item Data.  You will need to join through Item Data when working with lab data.
2base_specimenYesModeled off of CDISC Lab. A specimen is collected from a subject and assigned to a given item data instance. There can be multiple batteries (test groups) associated to a given specimen. Combines Accession Level and Base Specimen from spec.
3base_batteryNoA panel related to a specimen - typically this is just a 1:1.
4base_test_resultPartiallyCombines CDISC Lab BaseTest and BaseResult.  These are the results from the lab.
5lab_orderNoWhen specimens are collected, this domain represents that an order is generated. It causes a manifest file to be created (PDF) and potentially a file order to be dumped on to the file system and made available to web services.
6lab_interfaceNoEncapsulates how to send and receive orders and results from a particular safety lab.  Sites may have multiple labs, and if so each of these will have their own lab interface instance.
7study_lab_panelNoSomething that can be ordered from item level
8specimen_containerNoWhen setting up samples or labs, users can optionally choose a container.

Volunteers

Volunteers are not a part of the CDISC. We need to show a path to volunteer data and associated medical histories, medications, etc.


TableFrom CDISC?Notes
1volunteerNoSomeone who indicates that they are interested in participating in clinical research for the given org.Someone who indicates that they are interested in participating in clinical research for the given org.
2volunteer_medical_conditionNoAssociates a given condition to a given volunteer
3volunteer_noteNoA simple note that can be attached to the volunteer record
4volunteer_correspondenceNoRepresents a call or text to / from a volunteer by way of Twilio
5volunteer_substance_useNoWe purposely don't track SUOCCUR, it allows us to indicate that the vlunteer is not using the substance.
6recruitment_appointmentNoAllows for a given volunteer to be assigned to a given time slot
7study_siteNoAn association between a study and a site
8subjectNoSomeone participating in clinical research within the context of a given study.

Users


TableFrom CDISC?Notes
1application_userNoA user in the system
2studyYesIn this context, these are study restrictions.  Studies which appear here mean the user is whitelisted to working only with data from these studies.
3application_user_roleNoRoles which the user has
4application_user_role_secure_actionsYesSecure Actions are permissions which the role entitles the user to.

Read Replica Usage

Integration to your Existing Data Warehouse

Ad Hoc SQL Queries

Probably all SQL tools connect to MySQL databases.  So whatever SQL tool you typically use should work with very little configuration.  Use the tool that is most familiar to you.  If you do not have a tool preference, one option may be the free MySQL Workbench, which is a fairly full featured tool.  Using a SQL tool like this is useful for creating queries to answer questions on the fly.  It can also be used to generate simple reports, perform data modeling (the above diagrams were created with this tool), etc. 

The connection instructions above show how to connect MySQL Workbench to your replica.  And there is extensive help online and in the app for using the tool from there.

Business Intelligence Tools

The ClinSpark Read Replica can be used with any Business Intelligence tool which operates on relational data.  These are 

Tableau 

Tibco Spotfire

AWS Quicksight

Crystal Reports

SQL Examples

Pick a few reports and show how SQL can produce similar results

Lab Data example

Item Data example


  • No labels