Architecture Overview

The Galileo Elastic Enrichment Cloud (GE2C) provides an extremely high performance data enrichment stack and a “data warehouse”-style data access paradigm that offers a self-service interface to a broad array of third-party analytics applications and development frameworks that require bulk data access. The warehouse, implemented using Amazon Redshift, serves as a gateway to all of CE’s data assets: transactional “HIEBus” data concepts as well as higher-order, reconciled data concepts (i.e. survey data, measures, risk stratification, disease and medication profiles, derived encounters, etc.) produced by GE2C.

A high level view of the full architecture can be summarized as follows. There are three main components of the architecture:

Development and Management Tools that are used to code and test “data transformers” in python and administer the cloud-based data enrichment processes. The developer tools are industry standard applications including Visual Studio Code, Jupyter notebooks, and github. The python development ecosystem has been selected because of its high performance libraries (notably, pandas) and wealth of data modeling and machine learning libraries widely used in high performance data science tech stacks. The administration tooling is developed by CE (as our implementation of “build” has many CE-specific requirements) and is fully managed and serverless.

Learn more about Data Transformer Development.

Cloud Infrastructure in AWS that ingests HIEBus data into a Redshift data warehouse, then implements “Cloud Build” using containerized, dynamically-allocated compute resources in a scale-out, partitioned processing architecture. The build process produces enriched data concepts using “data transformers”, then persists the results back into the Redshift data warehouse in adjacent schemas that serve as data marts.

Learn more about Ingest to AWS or “Cloud Build” using the Galileo Elastic Enrichment Cloud (GE2C).

Platform Data API that reveals transformed, analytics-ready data via data marts to widely-used analytics applications such as Tableau, any AWS service, CE Beacon and other HIEBus applications.

Learn more about our Data Platform API.