GCP

GCS

To set up GCS as a data lake, follow these steps:

1. Navigate to Settings.

2. Click on Destination.

3. Click on Setup Data Lake.

4. Insert all the data lake specific credentials, along with a name and description for the connection.

5. Click on Validate and then Create to save the data lake connection.

Authentication: HMAC Key authentication
Access ID: Access ID linked to the service account
Secret: Secret linked to the corresponding Access ID
GCS Bucket Name: Google Cloud Storage bucket name
GCS Bucket Path: Subdirectory under the bucket to sync the data into
GCS Bucket Region: Region of the GCS bucket
Output Format: Output data format, one of Avro, Parquet, CSV, or JSON
Compression: Whether the output files should be compressed
Normalization: Whether the input JSON should be normalized
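
Before filling in the form, it can help to collect the values in one place and check them. The following is a minimal sketch, not part of the product: the class name, the attribute names, and the assumption that Access ID, Secret, bucket name, and region must be non-empty are all hypothetical. Only the list of output formats comes from the fields above.

```python
from dataclasses import dataclass
from typing import List

# Allowed output formats, taken from the field list above.
OUTPUT_FORMATS = {"Avro", "Parquet", "CSV", "JSON"}


@dataclass
class GcsDataLakeSettings:
    """Hypothetical container mirroring the form fields; not a product API."""

    access_id: str
    secret: str
    bucket_name: str
    bucket_path: str
    bucket_region: str
    output_format: str
    compression: bool = False
    normalization: bool = False

    def problems(self) -> List[str]:
        """Return human-readable problems; an empty list means nothing was flagged."""
        found = []
        # Assumption: these four fields may not be left blank.
        for name in ("access_id", "secret", "bucket_name", "bucket_region"):
            if not getattr(self, name).strip():
                found.append(f"{name} is empty")
        if self.output_format not in OUTPUT_FORMATS:
            found.append(f"output_format must be one of {sorted(OUTPUT_FORMATS)}")
        return found
```

Calling problems() on a settings object returns an empty list when nothing is flagged. This local check does not replace the Validate step in the UI, which tests the connection itself.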

BigQuery

To set up BigQuery as a data warehouse, follow these steps:

1. Navigate to Settings.

2. Click on Destination.

3. Click on Setup Data Warehouse.

4. Insert all the data warehouse specific credentials, along with a name and description for the connection.

5. Click on Validate and then Create to save the data warehouse connection.

Project ID: GCP project ID
Dataset Location: Location of the dataset
Default dataset ID: Default BigQuery dataset ID
Loading method: GCS Staging / Standard inserts
HMAC Access key ID: HMAC access ID
HMAC Key secret: HMAC access key secret
GCS Bucket name: Name of the GCS bucket
GCS Bucket path: Path within the GCS bucket
GCS Tmp files afterward processing: Delete / Keep all
Service account key JSON: JSON service account key
Transformation query run type: Interactive / Batch
Google bigquery chunk size: BigQuery client's chunk size
Raw Table dataset name: Dataset to write raw tables to
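
The fields above can be checked for completeness before they are entered. The sketch below is illustrative only: the function name and dictionary keys are hypothetical, and it assumes that the HMAC and GCS bucket fields are needed only when the loading method is GCS Staging, which the field list suggests but does not state.

```python
from typing import Dict, List

# Loading methods as listed in the field table above.
LOADING_METHODS = {"GCS Staging", "Standard inserts"}

# Assumption: these fields matter only for the GCS Staging loading method.
STAGING_FIELDS = (
    "hmac_access_key_id",
    "hmac_key_secret",
    "gcs_bucket_name",
    "gcs_bucket_path",
)


def missing_bigquery_fields(settings: Dict[str, str]) -> List[str]:
    """Return the hypothetical setting keys that are absent or blank."""
    required = [
        "project_id",
        "dataset_location",
        "default_dataset_id",
        "service_account_key_json",
    ]
    method = settings.get("loading_method", "")
    if method not in LOADING_METHODS:
        # Without a valid loading method the remaining checks are not meaningful.
        return ["loading_method"]
    if method == "GCS Staging":
        required.extend(STAGING_FIELDS)
    return [key for key in required if not settings.get(key, "").strip()]
```

An empty result means every field the sketch treats as required has a value. The Validate button in the UI remains the authoritative check of the connection.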