Updated: Apr 17
This article explains how to use DataLakeHouse.io, a SaaS platform.
DataLakeHouse.io has connectors for many applications and datasources:
Aiven Postgre SQL,
Food Delivery Service,
Google Analytics 4,
Google Big Query,
We will cover a Data Lake created on the Snowflake Data Cloud with data from Harvest.
Step 1 - Create a Source Connection
Connect to your instance on app.datalakehouse.io.
Add a Harvest source connection and add a Name and a Target Schema Prefix. Click to Authorize Your Account with your Harvest account.
Click on Actions and Edit to view the entities you will replicate.
Note all of the Tables and Entities appear on the Harvest data source.
Step 2 - Create a Target.
Open your Snowflake Data Cloud instance and create an empty database:
On DataLakeHouse.io, create a new Snowflake Target Connection and enter a Database, Warehouse and your username credentials.
Click on Save & Test Connection
Step 3 - Create a Sync Bridge which will be used to consume the data from your source and feed your data lake. Select the Harvest and Snowflake connections and Save the Sync Bridge.
Click on Actions and Enable the Sync Bridge
Return to your Snowflake instance and notice a few tables created under the HARVEST schema on DLH database:
DataLakeHouse.io is a powerful technology and it has an intuitive user interface where you can connect to dozens of data sources to populate Snowflake Data Cloud or Google Big Query without writing any code. All of the target tables will be automatically created and populated by a predefined data model. You can also request other sources and customizations directly with DataLakeHouse support team, more details on DLH.io.