For more information on Pharma and Biotech, or on other ways AWS can help your organization, visit us at: aws.amazon.com/health/biotech-pharma
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Establishing a cloud data lake for storage & integration
AWS reference architecture
2
Store: The Store stage eliminates data silos, consolidating data and making it accessible
wherever plant operators need visibility into plant operations. The primary technical outcome is a
shift away from traditional historians toward an architecture in which hot and cold data reside side
by side with the compute resources needed to manage real-time shop floor operations. This pattern
includes loose data coupling with business applications (ERP/CRM/QMS).
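As an illustration of hot and cold data residing side by side, the following is a minimal sketch rather than part of the reference architecture. It assumes a 24-hour "hot" window and a date-partitioned key layout for the raw zone of the data lake. The names `Reading`, `tier_for`, `raw_object_key`, and `HOT_WINDOW`, and the key layout itself, are hypothetical, and the sketch makes no AWS API calls.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Assumption: readings newer than this window are treated as "hot".
HOT_WINDOW = timedelta(hours=24)


@dataclass(frozen=True)
class Reading:
    asset_id: str        # hypothetical asset identifier, e.g. a bioreactor
    timestamp: datetime  # timezone-aware time of the measurement
    value: float


def tier_for(reading: Reading, now: datetime) -> str:
    """Return 'hot' for readings inside HOT_WINDOW, 'cold' otherwise."""
    return "hot" if now - reading.timestamp <= HOT_WINDOW else "cold"


def raw_object_key(reading: Reading) -> str:
    """Build a date-partitioned object key for the raw zone (hypothetical layout)."""
    ts = reading.timestamp.astimezone(timezone.utc)
    return (
        f"raw/asset={reading.asset_id}/"
        f"year={ts:%Y}/month={ts:%m}/day={ts:%d}/"
        f"{ts:%H%M%S}.json"
    )


if __name__ == "__main__":
    now = datetime(2020, 6, 2, 12, 0, tzinfo=timezone.utc)
    recent = Reading("bioreactor-01", datetime(2020, 6, 2, 11, 0, tzinfo=timezone.utc), 7.1)
    print(tier_for(recent, now), raw_object_key(recent))
```

Partitioning keys by asset and date is one common way to keep raw data queryable by downstream ETL jobs; the actual window and layout would depend on the plant's retention and access requirements.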
[Figure: AWS reference architecture diagram. Labels recoverable from the diagram:
Factory: Connected Worker; Bio Reactor/Unit Operation; Cameras; ML Inference; PLC/DCS; Historian; MES; Local Storage.
Protocols: OPC-UA/Modbus; MQTT; OPC-UA; OPC-DA; Ethernet/IP; Protocol Conversion.
Edge and data transfer: Amazon Kinesis; Lambda function; AWS IoT Greengrass; AWS IoT Greengrass Connectors; AWS IoT SiteWise Connector; AWS DataSync Agent; AWS Storage Gateway; AWS Snowball Edge.
AWS Cloud: AWS IoT Core; AWS IoT SiteWise.
Data Lake: Amazon S3 (Raw Data); AWS Glue (ETL Job); Amazon Redshift (Structured data); Amazon Neptune.
VPC (Availability Zone 1, Availability Zone 2, Private subnet): Business Applications (ERP/CRM/QMS); Amazon RDS Primary; Amazon RDS Slave; Amazon EC2; Amazon EC2 Auto Scaling Group.]