Data Strategies for Panintelligence
Panintelligence (pi) communicates with databases using SQL.
The metadata capabilities of pi allow us to access data that may not always be perfectly structured. However, when querying large volumes of data, the structure and location of your data become more important.
Two concepts are worth understanding when querying the data in the production schema is not possible:
ETL / ELT
Extract, Transform and Load (ETL) or, more often today, Extract, Load and then Transform (ELT).
The order differs, but the steps are essentially the same:
Extract: get data from a source(s)
Transform: alter data structure, pre-calculate and validate data
Load: put data into a DB designed for high volume queries
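The three steps above can be sketched in a few lines of Python. This is a minimal illustration only, not how any of the tools listed later work internally: the source rows, the orders table and the function names are all hypothetical, and an in-memory SQLite database stands in for the reporting database.

```python
import sqlite3

# Hypothetical source rows, standing in for an API response or a production extract.
SOURCE_ROWS = [
    {"order_id": 1, "amount": "19.99", "country": "gb"},
    {"order_id": 2, "amount": "5.00", "country": "US"},
    {"order_id": 3, "amount": "not-a-number", "country": "gb"},
]


def extract():
    """Extract: get data from a source (here, an in-memory list)."""
    return list(SOURCE_ROWS)


def transform(rows):
    """Transform: validate and normalise each row, dropping invalid ones."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # validation: skip rows whose amount is not numeric
        clean.append((row["order_id"], amount, row["country"].upper()))
    return clean


def load(conn, rows):
    """Load: write the cleaned rows into the reporting database."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, amount REAL, country TEXT)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(conn):
    """Run the three steps in ETL order."""
    load(conn, transform(extract()))


conn = sqlite3.connect(":memory:")  # stand-in for a real reporting database
run_etl(conn)
```

In an ELT flow the raw rows would be loaded first and the transform step would then run as SQL inside the target database.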
Data warehouse
What is a data warehouse?
When we talk about a data warehouse we often mean two things:
The database used to hold and allow querying of the data.
The fact that we have applied transformations to the data to optimise queries.
The term data lake refers to a database that holds data from multiple data sources.
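As an illustration of a transformation that optimises queries, the sketch below pre-calculates a daily summary table so that a dashboard can read a handful of aggregated rows instead of scanning every transaction. The sales and daily_sales tables and their columns are hypothetical, and SQLite is used only because it ships with Python; the same SQL pattern applies in most warehouses.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical detail table: one row per sale.
conn.execute("CREATE TABLE sales (sale_date TEXT, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("2024-01-01", "North", 100.0),
        ("2024-01-01", "North", 50.0),
        ("2024-01-01", "South", 75.0),
        ("2024-01-02", "North", 20.0),
    ],
)

# Transformation: pre-calculate totals once, rather than on every dashboard query.
conn.execute(
    """
    CREATE TABLE daily_sales AS
    SELECT sale_date, region, SUM(amount) AS total_amount, COUNT(*) AS sale_count
    FROM sales
    GROUP BY sale_date, region
    """
)
# Index the columns that reports are expected to filter on.
conn.execute("CREATE INDEX idx_daily_sales ON daily_sales (sale_date, region)")

summary = conn.execute(
    "SELECT sale_date, region, total_amount, sale_count "
    "FROM daily_sales ORDER BY sale_date, region"
).fetchall()
```

A reporting tool would then be pointed at daily_sales rather than sales; the trade-off is that the summary has to be refreshed whenever new data is loaded.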
Technologies that work well with pi.
Whilst pi is tool-agnostic, here are some examples of tools that we have had success with.
ELT
Rivery - https://rivery.io/
Great, simple tool - particularly good at pulling data from API sources and combining it into Snowflake or BigQuery. Has standard toolkits (Kit Hub) to combine data from common sources, e.g. social media.
https://rivery.io/kits/panintelligence-marketing-visualization/
Stitch - https://www.stitchdata.com/
Simple tool - great at pulling data from API sources and combining it into a large selection of databases. Wide range of connectors.
Fivetran - https://www.fivetran.com/
Sophisticated ETL tool. Often used by Data Engineers.
Segment - https://segment.com/
Sophisticated ETL tool. Often used by Data Engineers.
Data warehouses
Snowflake - The Snowflake AI Data Cloud - Mobilize Data, Apps, and AI
BigQuery - Cloud Computing Services | Google Cloud
MS Azure - https://azure.microsoft.com/en-gb/products/azure-sql/database
Redshift - https://docs.aws.amazon.com/redshift/latest/mgmt/welcome.html
Yellowbrick - Yellowbrick Data Warehouse | The Modern Enterprise Cloud Data Warehouse
Clickhouse - Fast Open-Source OLAP DBMS - ClickHouse
Firebolt - Cloud Data Warehouse For Engineers | Firebolt
pi can connect to any DB that supports JDBC (Java Database Connectivity).
The complete list of connectors used by pi can be found here:
CUSTOMER NEWS - Our August 24 Release Is Now Available - Download It Now!