Documentation

Blend Data

Updated on

Nov 20, 2024

The Blend Data operator is a powerful tool in Self-Serve Data Transformation, allowing you to merge and unify data from multiple sources into a single, cohesive dataset.

How Does It Work?

  • Define the Output Schema
    Blend Data begins with defining the output schema, which determines the structure of your resulting dataset.
    In most cases, you’ll start with a pre-configured Recipe, where most fields and mappings are already prepared.
  • Add Sources
    Select the data sources you want to include in the blend.
    These can be tables from your database, datasets from external platforms, or transformation results from previous steps.
  • Map Fields
    Match fields from each source to the defined schema. This step can be simplified using AI Autofill, which analyzes your data and automatically maps the fields based on output schema.
  • Apply Custom Formulas
    Enhance the blending process by applying Custom Formulas to your fields: use formulas to calculate new fields, transform data by specific conditions. For detailed guidance on creating and applying formulas, refer to the Custom Formulas Documentation.
  • Run and Save
    Once your schema, sources, and mappings are configured, click «Run and Save» to execute the blending operation. The result is a unified dataset, structured according to your schema and incorporating all applied formulas and mappings.

Use Case

Blending Facebook, Bing, and Google Ads Data

To illustrate, let’s walk through a real-world example of blending data from multiple advertising platforms: Facebook Ads, Bing Ads, and Google Ads.

  • Goal: Create a unified dataset with standardized metrics and dimensions.
  • Steps:
  1. Define the Output Schema: Start with a pre-defined Cross-Channel Recipe, which includes the appropriate output schema.
  2. Add New Sources: The default recipe starts with Google and Facebook data sources. We just need to add one more, Bing Ads.
  3. Map Fields: Use AI Autofill to automatically map fields from the sources to the schema.
  4. Run and Save: Execute the blending operation to generate the final dataset.

The result is a unified, standardized dataset that integrates fields from Facebook, Bing, and Google Ads. This dataset is now ready for cross-channel analysis, offering insights into performance metrics like total spend, impressions, and clicks across platforms.

Conclusion

The Blend Data operator is an intuitive yet powerful tool for unifying datasets from multiple sources. By leveraging recipe, AI-powered field mapping, and custom formulas, it simplifies the process of creating actionable datasets.

Schema information

Setup guide

Settings

No items found.

Troubleshooting

Troubleshooting guides

Check out troubleshooting guides for
Blend Data
here:

Limits

Frequently asked questions

No items found.
☶ On this page
Description
Related articles
No items found.
No items found.

Questions?

Improvado team is always happy to help with any other questions you might have! Send us an email.

Contact your Customer Success Manager or raise a request in Improvado Service Desk.