Updated on Sep 5, 2024
Note: Azure Data Lake is currently supported only as a Destination. This guide doesn’t cover the DataPrep setup for Azure Data Lake.
Azure Data Lake is a scalable cloud storage solution that enables the efficient storage, processing, and analysis of large amounts of structured and unstructured data. It supports big data analytics by integrating with Azure's analytics services, offering seamless data ingestion, high-throughput processing, and advanced security features.
Follow our setup guide to connect Azure Data Lake to Improvado.
On the Azure Data Lake connection page, fill in the following fields:
{%dropdown-body name="account-url"%}
The Account URL must satisfy the following regular expression: ```https://[a-z0-9]*.blob.core.windows.net$```.
The ```[a-z0-9]``` part of the Account URL (the storage account name) must be between 3 and 24 characters long.
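If you want to check a value before pasting it into the form, the two rules above can be sketched as a small Python check. This is a minimal illustration, not Improvado's own validation: the helper name `is_valid_account_url` is hypothetical, and the pattern below folds the length rule into the expression and escapes the dots, which is a slightly stricter reading than the pattern as written.

```python
import re

# Assumption: the documented pattern and the 3-24 length rule are combined
# into one expression, with the dots escaped so they only match a literal ".".
ACCOUNT_URL_RE = re.compile(r"https://([a-z0-9]{3,24})\.blob\.core\.windows\.net")


def is_valid_account_url(url: str) -> bool:
    """Hypothetical helper: True if the URL fits the documented shape."""
    # fullmatch rejects anything before "https://" or after ".net".
    return ACCOUNT_URL_RE.fullmatch(url) is not None


if __name__ == "__main__":
    for candidate in (
        "https://mystorage01.blob.core.windows.net",   # lowercase, 11 characters
        "https://ab.blob.core.windows.net",            # account part too short
        "https://MyStorage.blob.core.windows.net",     # uppercase not allowed
    ):
        print(candidate, is_valid_account_url(candidate))
```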
{%dropdown-end%}
{%dropdown-body name="sas-token"%}
Learn how to create a SAS Token with this guide.
{%dropdown-end%}
{%dropdown-body name="file-system-name"%}
The File System Name must be between 3 and 63 characters long and must satisfy the following regular expression: ```r'^(?!.*--.*)[a-z0-9][a-z0-9]*[a-z0-9]$'```
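The same kind of pre-check can be sketched for the File System Name. The example below applies the regular expression exactly as it is given above, together with the length limits. The helper name `is_valid_file_system_name` is hypothetical and this is not Improvado's own validation code.

```python
import re

# Pattern copied from the requirement above, unchanged.
FILE_SYSTEM_NAME_RE = re.compile(r"^(?!.*--.*)[a-z0-9][a-z0-9]*[a-z0-9]$")


def is_valid_file_system_name(name: str) -> bool:
    """Hypothetical helper: check the 3-63 length rule and the pattern."""
    if not 3 <= len(name) <= 63:
        return False
    return FILE_SYSTEM_NAME_RE.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ("reports2024", "ab", "Reports2024"):
        print(candidate, is_valid_file_system_name(candidate))
```

Note that, as written, the pattern accepts only lowercase letters and digits, so a name such as ```my-data``` is rejected by this check.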
{%dropdown-end%}
{%dropdown-body name="encryption-type"%}
Possible options:
{%dropdown-end%}
{%dropdown-body name="encryption-key"%}
If you have selected the Default Cloud Storage encryption type, you will not be able to edit this field.
Otherwise, enter either your AES-256 key encoded in standard Base64 or the resource name of the Cloud KMS key used to encrypt the blob’s contents. For more info, see Azure Data Lake encryption docs.
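If you need to produce a Base64-encoded AES-256 key, one way to do it is sketched below with Python's standard library. An AES-256 key is 32 random bytes, so its standard Base64 form is 44 characters long. Both function names are hypothetical, and how you store and rotate the key is up to your own security policy.

```python
import base64
import secrets


def generate_aes256_key_b64() -> str:
    """Hypothetical helper: return a new random AES-256 key in standard Base64."""
    raw_key = secrets.token_bytes(32)  # 32 bytes = 256 bits
    return base64.b64encode(raw_key).decode("ascii")


def is_base64_aes256_key(value: str) -> bool:
    """Hypothetical helper: True if value decodes to exactly 32 bytes."""
    try:
        raw = base64.b64decode(value, validate=True)
    except ValueError:  # binascii.Error is a subclass of ValueError
        return False
    return len(raw) == 32


if __name__ == "__main__":
    key = generate_aes256_key_b64()
    print(len(key), is_base64_aes256_key(key))
```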
{%dropdown-end%}
{%dropdown-body name="folder"%}
Possible parameters:
```/data_source/data_table_title/report_type/YYYY/MM/DD/timestamp```
If you include ```/YYYY/MM/DD``` in the folder path, data is written to a separate folder for each day. New records do not delete previous ones, even when the data contains no date.
The maximum length is 254 characters.
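To picture how a date-based folder setting separates daily loads, here is a rough sketch in Python. It is only an illustration: the function `render_folder` is hypothetical, the way Improvado actually resolves the placeholders is not described in this guide, and applying the 254-character limit to the value as entered is an assumption.

```python
from datetime import date

MAX_FOLDER_LENGTH = 254  # limit stated in this guide


def render_folder(template: str, day: date) -> str:
    """Hypothetical helper: fill YYYY, MM and DD path segments for one day."""
    if len(template) > MAX_FOLDER_LENGTH:
        raise ValueError("folder setting is longer than 254 characters")
    values = {
        "YYYY": f"{day.year:04d}",
        "MM": f"{day.month:02d}",
        "DD": f"{day.day:02d}",
    }
    # Only whole path segments are replaced; other segments are kept as typed.
    return "/".join(values.get(segment, segment) for segment in template.split("/"))


if __name__ == "__main__":
    template = "/reports/YYYY/MM/DD"
    print(render_folder(template, date(2024, 9, 5)))
    print(render_folder(template, date(2024, 9, 6)))
```

In this sketch, loads from two consecutive days end up in two different folders, which is why a later load leaves the earlier one in place.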
{%dropdown-end%}
{%dropdown-body name="file-format"%}
Possible formats:
{%dropdown-end%}
{%dropdown-body name="filename"%}
Possible parameters:
```filename-YYYY-MM-DD```
Note: you cannot use ```DD``` when partitioning by month.
You can also use “_” instead of “-”, or omit the separator entirely, for example:
{%dropdown-end%}
{%dropdown-body name="separator"%}
Possible delimiters that can separate data in your file:
{%dropdown-end%}
{%dropdown-body name="partition-by"%}
Possible ways of splitting data:
{%dropdown-end%}
The Improvado team is always happy to help with any other questions you might have. Send us an email.
Contact your Customer Success Manager or raise a request in Improvado Service Desk.