Data Reconciliation: A Complete Guide for Marketing Teams in 2026

Last updated on May 15, 2026

VP of Products, Improvado

Marketing teams today pull data from dozens of platforms — Google Ads, Meta, LinkedIn, Salesforce, HubSpot, and more. Each system reports metrics differently. Each uses its own naming conventions. Each updates on its own schedule.

Without a reconciliation process, your dashboards show conflicting numbers. Finance questions your budget reports. Sales disputes lead attribution. You spend hours each week hunting for discrepancies instead of analyzing performance.

This is the problem data reconciliation solves. Done right, it transforms fragmented, contradictory data into a single source of truth — one your entire organization can trust.

This guide walks you through the entire reconciliation process: what it is, why it matters, how to do it step-by-step, common mistakes to avoid, and the tools that make it faster. By the end, you'll know exactly how to build confidence in your marketing data.

Key Takeaways

✓ Data reconciliation is the process of comparing data from multiple sources to identify and resolve discrepancies, ensuring accuracy and consistency across systems.

✓ Manual reconciliation methods — spreadsheets, VLOOKUP formulas, pivot tables — break down as data volume grows, typically requiring 20–40 hours per week for teams managing more than five platforms.

✓ Automated reconciliation tools reduce validation time by up to 95%, catching errors in real-time before they reach dashboards or executive reports.

✓ The reconciliation process follows five stages: data collection, standardization, comparison, investigation, and resolution — each stage requires different skills and tooling.

✓ Common reconciliation errors stem from timezone mismatches, currency conversion inconsistencies, attribution window differences, and incomplete historical data during connector migrations.

✓ Best-in-class reconciliation workflows combine automated validation rules with human oversight for edge cases, balancing speed with accuracy.

What Is Data Reconciliation?

Data reconciliation is the process of verifying that data from different sources matches and, when it doesn't, identifying and fixing the cause of discrepancies. In marketing operations, this means comparing the numbers you see in your dashboards against the raw data in each advertising platform, CRM, or analytics tool.

The goal is simple: ensure every stakeholder — marketing, sales, finance, executives — looks at the same numbers and reaches the same conclusions. When reconciliation works, your CFO sees the same ad spend total that Google Ads reports. Your sales team sees the same lead count that HubSpot logs. Your CEO sees ROI calculations built on data everyone trusts.

Data reconciliation in marketing refers to the systematic comparison of data across platforms, databases, and reports to identify mismatches, validate accuracy, and establish a unified view of performance metrics. It's the quality control layer that sits between raw data ingestion and business decision-making.

Reconciliation differs from data integration. Integration moves data from point A to point B. Reconciliation verifies that what arrived at point B matches what left point A — and investigates when it doesn't.

Why Data Reconciliation Matters for Marketing Teams

Marketing attribution depends on accurate data. If your Facebook Ads connector reports $50,000 in spend but your finance system shows $48,200, which number do you use to calculate ROAS? If your CRM logs 1,200 leads but your dashboard shows 1,184, where did the 16 leads go?

These discrepancies cascade. Budget forecasts miss targets. Campaign performance looks better or worse than reality. Teams make decisions based on incorrect assumptions. Trust erodes.

Reconciliation prevents three critical failures:

• Reporting errors — dashboards that contradict source systems undermine credibility with executives and cross-functional partners

• Budget waste — undetected spend discrepancies lead to overspending in some channels and missed opportunities in others

• Attribution mistakes — incomplete or mismatched data attributes conversions to the wrong campaigns, skewing optimization decisions

Organizations that reconcile data systematically report higher confidence in marketing ROI calculations, faster monthly close processes, and fewer disputes between marketing and finance teams.

Pro tip:

Teams using automated reconciliation catch budget overruns 10 days earlier on average, preventing wasted spend and giving marketing ops time to reallocate budget to high-performing channels before month-end.

See it in action →

Step 1: Collect Data from All Sources

Reconciliation begins with raw data collection. Before you can compare numbers, you need to pull data from every system that reports on the same event or metric.

For a typical marketing team, this includes:

• Ad platforms (Google Ads, Meta Ads, LinkedIn Ads, TikTok, Bing)

• Analytics tools (Google Analytics 4, Adobe Analytics, Mixpanel)

• CRM systems (Salesforce, HubSpot, Microsoft Dynamics)

• Email marketing platforms (Marketo, Mailchimp, ActiveCampaign)

• Attribution tools (Bizible, Dreamdata, HockeyStack)

• Finance/ERP systems (NetSuite, QuickBooks, SAP)

Each platform exports data differently. Google Ads provides CSV downloads or API access. Salesforce requires SOQL queries or third-party connectors. Some tools limit historical data exports to 90 days. Others change their schema without notice, breaking existing integrations.

Manual Collection Methods

Most teams start with manual exports. An analyst logs into each platform, selects the date range, downloads a CSV, and saves it to a shared folder. This works for small teams managing three to five platforms.

The process breaks down as data volume grows. Manual exports introduce four problems:

• Human error — wrong date ranges, missed platforms, inconsistent export settings

• Time cost — 30–60 minutes per export × 10 platforms = 5–10 hours per week

• Inconsistent timing — data pulled at different times shows different numbers due to attribution windows and retroactive updates

• Missing historical context — no version control, no audit trail, no way to compare today's export against last week's

Automated Collection Methods

Automated data pipelines pull data on a schedule — hourly, daily, or in real-time. APIs query each platform, extract the required fields, and load raw data into a staging area (data warehouse, lake, or integration platform).

Automation solves the consistency problem. Every platform gets queried at the same time with identical parameters. Historical data persists. Schema changes get logged. If a connector fails, the system alerts you immediately.

Modern marketing data platforms connect to hundreds of sources out of the box. Improvado, for example, supports 1,000+ data sources with pre-built connectors for every major ad platform, CRM, and analytics tool. Each connector maps the platform's raw schema to a standardized data model, preserving full granularity while making cross-platform comparison possible.

Step 2: Standardize Data Formats

Raw data from different platforms rarely matches. Google Ads labels its date field Date. Facebook calls it date_start. LinkedIn uses dateRange.start. Salesforce stores dates as CreatedDate in ISO 8601 format.

Standardization transforms these inconsistent formats into a unified schema. Every date becomes date. Every spend metric becomes spend. Every impression count becomes impressions. You apply the same logic to currencies, timezones, naming conventions, and metric definitions.

Common Standardization Challenges

Several issues complicate standardization:

Timezone mismatches — Google Ads defaults to the account timezone. Facebook Ads uses UTC. Google Analytics uses the property timezone. If your team operates across multiple regions, a single campaign can report different results depending on which platform you query.

Currency conversions — international campaigns report spend in local currency (EUR, GBP, JPY). To reconcile total spend, you need to convert everything to a base currency using exchange rates from the transaction date — not today's rate.

Attribution windows — Facebook Ads defaults to 7-day click, 1-day view attribution. Google Ads uses last-click. Your CRM attributes revenue to the first touch. These models produce different conversion counts for the same campaign.

Metric definitions — what Google Ads calls a "conversion," LinkedIn might call a "lead gen form submission." Facebook distinguishes between "purchases" and "purchase events." Salesforce tracks "opportunities" and "closed-won deals." You must map these terms to a common vocabulary.

Standardization Techniques

Teams use three methods to standardize data:

Transformation scripts — SQL queries or Python scripts that rename fields, convert data types, and apply business logic. Analysts write these scripts once and run them every time new data arrives.

ETL tool transformations — platforms like dbt, Matillion, or Fivetran Transformations let you define transformation logic in YAML or SQL, version-control it, and run it automatically as part of the pipeline.

Pre-built data models — some integration platforms ship with marketing-specific schemas that handle standardization out of the box. Improvado's Marketing Common Data Model (MCDM), for example, normalizes data from 1,000+ sources into a consistent format with 46,000+ pre-mapped metrics and dimensions, eliminating the need for custom transformation code.

Step 3: Compare Data Across Sources

Once data is standardized, you compare it. This step reveals discrepancies — places where the same metric from different sources shows different values.

The comparison process depends on what you're reconciling:

Platform-to-warehouse reconciliation — compare the numbers in your data warehouse against the numbers in the source platform's UI. Example: your Google Ads table shows $12,450 in spend for January 15. You log into Google Ads and check the same date range. Does it match?

Cross-platform reconciliation — compare related metrics across platforms. Example: Facebook Ads reports 1,200 link clicks. Google Analytics reports 1,180 sessions from Facebook. The 20-session difference might be legitimate (bots, tracking blockers) or might indicate a tracking error.

Dashboard-to-source reconciliation — compare aggregated numbers in your executive dashboard against raw source data. Example: your monthly performance dashboard shows 15,000 MQLs. Your CRM reports 14,987. The 13-lead difference needs explanation.

Setting Reconciliation Thresholds

Not every discrepancy requires investigation. Small variances are expected due to rounding, timezone cutoffs, and data processing delays.

Define thresholds based on business impact:

• Spend data — tolerate ≤0.5% variance (e.g., $50 on $10,000 spend)

• Conversion data — tolerate ≤2% variance for high-volume metrics, 0% for low-volume (every enterprise deal matters)

• Impression/click data — tolerate ≤5% variance (ad platforms often adjust these retroactively)

Anything outside these thresholds gets flagged for investigation.

Comparison Methods

Manual comparison uses spreadsheets. You export data from each source, paste it into separate sheets, and use VLOOKUP or INDEX-MATCH formulas to align rows. Then you calculate the difference and highlight cells where variance exceeds your threshold.

This approach works for one-time audits. It doesn't scale for daily or weekly reconciliation.

Automated comparison uses SQL queries or reconciliation scripts. Example query:

SELECT
  warehouse.date,
  warehouse.campaign_id,
  warehouse.spend AS warehouse_spend,
  platform.spend AS platform_spend,
  ABS(warehouse.spend - platform.spend) AS variance,
  ABS(warehouse.spend - platform.spend) / platform.spend AS variance_pct
FROM warehouse_google_ads AS warehouse
LEFT JOIN platform_google_ads AS platform
  ON warehouse.date = platform.date
  AND warehouse.campaign_id = platform.campaign_id
WHERE ABS(warehouse.spend - platform.spend) / platform.spend > 0.005
ORDER BY variance DESC;

This query identifies every campaign-date combination where warehouse data differs from platform data by more than 0.5%, sorted by largest variance first.

Signs your data reconciliation is broken

⚠️

5 signs your reconciliation process needs an upgradeMarketing teams switch to automated reconciliation when they recognize these patterns:

→Your analyst spends 15+ hours per week manually comparing platform exports in spreadsheets
→Finance disputes your ad spend reports every month because numbers don't match their billing records
→Discrepancies get discovered weeks after they occur, making root cause investigation nearly impossible
→You reconcile only aggregated totals, missing campaign-level or date-level errors that cancel out in summary views
→Schema changes from Google, Meta, or LinkedIn break your integrations without warning, and you lose days of data

Talk to an expert →

Step 4: Investigate Discrepancies

When comparison reveals a mismatch, you investigate the root cause. Discrepancies fall into three categories: legitimate differences, data pipeline errors, and source system issues.

Legitimate Differences

Some variances are expected and don't require fixes:

Attribution model differences — if Google Ads uses last-click attribution and your CRM uses first-touch, the same conversion will be credited to different campaigns. This isn't an error; it's a difference in methodology.

Timezone cutoffs — a campaign that started at 11 PM Pacific on March 1 appears in the March 2 data for UTC-based systems. Both systems are correct; they just define "day" differently.

Retroactive updates — ad platforms adjust conversion counts days or weeks after the initial report as delayed conversions come in. If you pulled data on March 5 and again on March 12, March 1–4 numbers may differ. The second pull is more accurate.

Sampling and bots — analytics platforms filter out bot traffic. Ad platforms count all clicks. A 2–5% difference between ad platform clicks and analytics sessions is typical.

Data Pipeline Errors

These discrepancies indicate problems in your integration or transformation logic:

Incomplete data pulls — the connector failed mid-query and pulled only 80% of records. The missing 20% causes a variance.

Transformation bugs — your currency conversion script used the wrong exchange rate, or your timezone adjustment added 24 hours instead of subtracting it.

Deduplication errors — your pipeline pulled the same data twice and didn't deduplicate it, inflating your metrics by 2x.

Schema drift — the source platform renamed a field from cost to spend, and your connector still queries cost, now getting null values.

Source System Issues

Occasionally, the source platform itself reports incorrect data:

Platform bugs — Google Ads API returned incorrect data for 6 hours on a specific date. This happens, though rarely.

Tracking errors — your Facebook Pixel stopped firing due to a website code change, causing underreporting.

Manual adjustments — someone applied a credit or refund in the ad platform UI, but the API didn't reflect it yet.

Investigation Workflow

Effective investigation follows a checklist:

• Check the date range — are both systems querying the same start and end dates, inclusive?

• Check the filters — did one query include filters (e.g., campaign status = active) that the other didn't?

• Check the granularity — are you comparing daily data to weekly rollups?

• Check the pipeline logs — did the connector run successfully? Any warnings or timeouts?

• Check the source platform — log in and manually verify the number in the UI

• Check recent schema changes — did the platform release an API update in the last 30 days?

Document every discrepancy you investigate. Create a log with: date discovered, metric affected, variance amount, root cause, resolution taken, date resolved. This log becomes institutional knowledge, helping future team members recognize recurring patterns.

Governed Data Pipelines That Reconcile Automatically — No Engineering Required

Improvado applies 250+ pre-built validation rules to every data pull, flagging budget overruns, schema drift, and cross-platform inconsistencies in real-time. When a connector breaks or a platform updates its API, the system alerts you immediately and suggests fixes based on historical patterns. Marketing ops teams at enterprise brands rely on Improvado to maintain data quality across hundreds of campaigns without writing SQL.

Get a demo

Step 5: Resolve and Document

Resolution depends on the root cause. If the discrepancy is a legitimate difference (attribution model, timezone), you document it and move on. If it's a pipeline error, you fix the code. If it's a source system issue, you contact support or apply a manual adjustment.

Resolution Techniques

Code fixes — update your transformation script, connector configuration, or SQL query to eliminate the error. Test the fix against historical data to ensure it resolves past discrepancies without creating new ones.

Manual adjustments — for one-time issues (e.g., a refund that the API didn't capture), apply a manual correction in your warehouse. Clearly label it as an adjustment so future analysts know it wasn't raw data.

Connector updates — if schema drift caused the issue, update your connector to query the new field name. If you're using a managed platform like Improvado, the connector updates happen automatically — the platform monitors schema changes across 1,000+ sources and adjusts mappings without manual intervention.

Acceptance — sometimes the right answer is "this difference is expected." Document why, set a threshold, and exclude it from future alerts.

Documentation Best Practices

Every resolved discrepancy should be logged in a centralized location — a wiki page, a Notion doc, or a dedicated reconciliation tracking table in your warehouse.

Include these fields:

• Date discovered

• Metric/dimension affected

• Source systems involved

• Variance amount (absolute and percentage)

• Root cause

• Resolution action

• Date resolved

• Owner (who investigated and fixed it)

This log serves three purposes: it prevents duplicate investigations, it trains new team members, and it provides evidence of data quality improvements for stakeholders.

38 hrssaved per analyst every week

Marketing operations teams using Improvado eliminate manual reconciliation work, freeing analysts to focus on campaign optimization and strategic analysis instead of spreadsheet audits.

Book a demo →

Common Mistakes to Avoid

Even experienced teams make reconciliation mistakes. Here are the most common, and how to avoid them.

Mistake 1: Reconciling Too Infrequently

Some teams reconcile once a month, during the close process. By then, discrepancies have compounded. If a connector broke on March 3 and you don't discover it until March 31, you've lost a month of accurate data.

Best practice: reconcile weekly for high-stakes metrics (spend, revenue, conversions), daily for metrics that inform optimization decisions (ROAS, CPA).

Mistake 2: Ignoring Small Variances

A $50 variance on $10,000 spend feels insignificant. But if that $50 variance occurs across 50 campaigns, it's a $2,500 discrepancy. Small errors compound.

Best practice: set thresholds, but investigate patterns. If the same campaign or platform consistently shows small variances, there's likely a systemic issue.

Mistake 3: Relying Entirely on Automation

Automated reconciliation catches 95% of issues. The other 5% require human judgment — edge cases, API bugs, business logic that machines can't infer.

Best practice: automate the comparison and alerting, but assign a human owner to review flagged discrepancies weekly.

Mistake 4: Not Version-Controlling Transformation Logic

If you fix a transformation script on March 15, does that fix apply retroactively to historical data? If not, your March 1–14 data remains incorrect. If it does, did you test the fix to ensure it doesn't break something else?

Best practice: use a tool like dbt or store transformation logic in Git. Every change gets versioned, tested, and applied consistently.

Mistake 5: Reconciling Aggregated Data Only

It's tempting to reconcile only the final dashboard numbers — total spend, total conversions. But if you reconcile at the summary level, you miss granular issues. A campaign might be over-reported while another is under-reported, and the errors cancel out in the aggregate.

Best practice: reconcile at the most granular level your systems support — campaign-date level for ad platforms, lead-date level for CRMs.

Tools That Help with Data Reconciliation

Manual reconciliation works for small teams, but it doesn't scale. As data volume grows, you need tools that automate collection, comparison, and alerting.

Spreadsheets (Google Sheets, Excel)

Spreadsheets are the default starting point. You export CSVs from each platform, paste them into tabs, and use formulas to compare.

Strengths: free, familiar, flexible

Limitations: manual, error-prone, doesn't scale beyond 5–10 data sources, no version control, no automation

Best for: teams with fewer than 3 platforms, one-time audits

ETL Platforms (Fivetran, Stitch, Airbyte)

ETL tools automate data extraction and loading. They connect to your ad platforms, CRMs, and analytics tools, pull data on a schedule, and load it into your warehouse.

Strengths: 300–400+ pre-built connectors, reliable scheduling, handles schema changes

Limitations: requires a data warehouse (Snowflake, BigQuery, Redshift), no built-in reconciliation logic — you write SQL queries to compare data yourself, limited support for marketing-specific transformations

Best for: engineering-led teams with data warehouse infrastructure already in place

Marketing Data Integration Platforms (Improvado, Supermetrics, Windsor.ai)

Marketing-focused integration platforms combine data extraction, transformation, and validation in one workflow. They're built specifically for marketing use cases.

Improvado stands out for reconciliation workflows. The platform connects to 1,000+ data sources, automatically standardizes data using the Marketing Common Data Model, and includes 250+ pre-built validation rules that flag discrepancies in real-time. When a variance exceeds your threshold, Improvado alerts the responsible analyst, logs the issue, and suggests likely root causes based on historical patterns.

Key reconciliation features:

• Pre-launch validation — checks data quality before it reaches dashboards, catching errors in staging

• Budget vs. actuals monitoring — compares planned spend against actual spend daily, alerting teams to overruns

• Cross-platform consistency checks — compares spend, clicks, and conversions across Google, Meta, LinkedIn, and other platforms to identify tracking gaps

• 2-year historical data preservation — when platforms change their schema, Improvado maintains historical mappings so you can reconcile current data against past periods without re-engineering pipelines

Implementation typically takes days, not months. The platform includes a no-code interface for marketers and full SQL access for analysts who need custom logic.

Strengths: marketing-specific, pre-built validation rules, real-time alerting, handles currency/timezone/attribution standardization automatically

Limitations: custom pricing (contact sales), not ideal for teams needing deep engineering control over transformation logic

Best for: marketing operations teams managing 10+ data sources, enterprises requiring governed data workflows

Business Intelligence Tools (Looker, Tableau, Power BI)

BI platforms visualize data and support basic reconciliation through calculated fields and data blending.

Strengths: powerful visualization, works with any data source, supports ad-hoc analysis

Limitations: requires data to already be in a warehouse, no built-in data collection, limited automation for variance detection

Best for: teams that already have clean, reconciled data in a warehouse and need to visualize it

Tool	Data Collection	Standardization	Automated Comparison	Alerting	Best For
Spreadsheets	Manual	Manual	Manual	No	Small teams, one-time audits
Improvado	Automated	Automated	Automated	Yes	Marketing ops, 10+ sources
Fivetran	Automated	Partial	Manual (SQL)	Basic	Engineering-led teams
Looker/Tableau	Requires warehouse	Requires warehouse	Manual	Basic	Visualization after reconciliation

Best Practices for Ongoing Reconciliation

Reconciliation isn't a one-time project. It's an ongoing discipline. These practices help teams maintain data quality long-term.

Establish a Weekly Reconciliation Cadence

Assign one person (or rotate the responsibility) to run reconciliation checks every Monday morning. They review the past week's data, flag discrepancies above threshold, investigate root causes, and document resolutions.

This cadence catches issues before they compound and ensures someone owns data quality.

Automate Alerting, Not Just Reporting

Don't wait for someone to check a dashboard. Configure your reconciliation system to send Slack or email alerts when variances exceed thresholds. The alert should include: affected metric, variance amount, affected time period, link to detailed logs.

Maintain a Reconciliation Runbook

Create a runbook that new team members can follow. Include: which metrics to reconcile, how often, where to find source data, thresholds for each metric, common root causes and fixes, escalation path for unsolved issues.

Update the runbook every time you encounter a new type of discrepancy.

Reconcile Before Major Decisions

Before presenting results to executives, before committing to a new budget allocation, before declaring a campaign a success or failure — reconcile the underlying data. Confirm that the numbers you're basing decisions on match the source systems.

Use Pre-Built Validation Rules

If your integration platform supports it, enable pre-built validation rules that check for common issues: negative spend values, conversion counts higher than click counts, sudden 10x spikes in impressions, currency mismatches.

Improvado's Marketing Data Governance includes 250+ such rules, automatically applied to every data pull. This catches obvious errors before they reach your warehouse.

Conclusion

Data reconciliation is the difference between data you trust and data you question. When your dashboards show conflicting numbers, decisions slow down. Teams argue over whose data is correct. Executives lose confidence in marketing's ability to measure ROI.

Reconciliation solves this. It establishes a single source of truth — one set of numbers that every stakeholder agrees on. The process requires discipline: collect data systematically, standardize formats, compare rigorously, investigate discrepancies, and document resolutions.

Manual reconciliation works for small teams, but it doesn't scale. As your marketing stack grows — more platforms, more campaigns, more data — you need automation. The right tools eliminate 95% of the manual work, catching errors in real-time and alerting you before bad data reaches decision-makers.

Start small. Pick one high-stakes metric — total ad spend, for example — and reconcile it weekly. Build the habit. Expand to more metrics as the process becomes routine. Document everything. Over time, reconciliation becomes invisible infrastructure: always running, always validating, always protecting your team from the costs of bad data.

Without systematic reconciliation, teams make budget decisions on data that's 5–15% inaccurate — wasting thousands per month on underperforming campaigns they think are winners.

Book a demo →

Frequently Asked Questions

What is the difference between data reconciliation and data validation?

Data validation checks whether individual data points meet quality rules — for example, ensuring that a date field contains a valid date, or that a spend value is positive. Data reconciliation compares data across systems to verify consistency — for example, confirming that the spend reported in your warehouse matches the spend in Google Ads. Validation happens at the record level; reconciliation happens at the aggregate or comparative level. Both are necessary. Validation catches malformed data before it enters your systems. Reconciliation catches integration errors, transformation bugs, and cross-system discrepancies after data is loaded.

How often should marketing teams reconcile data?

Reconciliation frequency depends on data volume and business impact. For high-stakes metrics like ad spend and revenue, reconcile daily or weekly. For lower-impact metrics like impressions or email open rates, monthly reconciliation may suffice. At minimum, reconcile before major reporting deadlines — month-end close, quarterly business reviews, executive presentations. Teams managing more than 10 data sources typically automate daily reconciliation checks and assign a human owner to review flagged discrepancies weekly.

What causes discrepancies between ad platforms and analytics tools?

Several factors create legitimate differences between what ad platforms report and what analytics tools show. Attribution windows differ — Facebook Ads may use 7-day click attribution while Google Analytics uses last-click. Timezone settings vary — one platform may define "day" in UTC, another in Pacific time. Analytics tools filter bot traffic; ad platforms count all clicks. Tracking pixels may fail to fire due to ad blockers, slow page loads, or implementation errors. Cookie policies and privacy settings prevent some conversions from being tracked. A 2–5% variance between ad platform clicks and analytics sessions is typical. Larger variances suggest a tracking issue that requires investigation.

Can you reconcile data without a data warehouse?

Yes, but it's harder. Without a warehouse, you reconcile by manually exporting data from each platform and comparing it in spreadsheets. This works for small teams managing 3–5 platforms. As data volume grows, manual reconciliation becomes impractical — too slow, too error-prone, too difficult to maintain historical comparisons. A data warehouse centralizes all your data in one location, making automated reconciliation possible. That said, some integration platforms handle reconciliation without requiring you to set up and manage a warehouse. Improvado, for example, performs reconciliation checks in its own infrastructure, alerting you to discrepancies before data reaches your BI tool or warehouse.

What should you do when you can't resolve a discrepancy?

Some discrepancies resist easy resolution — the root cause isn't obvious, the source platform's API documentation is vague, or the variance is small enough to ignore but large enough to notice. In these cases: document the discrepancy, including what you investigated and what you ruled out; set an acceptable variance threshold and exclude it from future alerts if it stays within that range; escalate to platform support if you suspect a bug on their end; apply a manual adjustment if the business impact justifies it, clearly labeling it as an adjustment in your records. Not every discrepancy has a perfect fix. The goal is to make informed decisions with full awareness of data limitations.

How do you reconcile data after a platform schema change?

When a platform changes its API schema — renaming fields, adding new dimensions, or restructuring how data is returned — your existing integration may break or start pulling incomplete data. To reconcile after a schema change: first, identify when the change occurred by reviewing the platform's API changelog or checking your pipeline logs for errors; second, update your connector or transformation logic to query the new schema; third, backfill historical data using the new schema if the platform allows it; fourth, reconcile the backfilled data against the old schema to confirm consistency. Some platforms provide backward compatibility for a transition period, letting you query both old and new schemas. Modern integration platforms handle schema changes automatically — Improvado, for example, monitors schema updates across 1,000+ sources and adjusts mappings without requiring manual intervention, preserving 2 years of historical data through the transition.

What reconciliation metrics should you track?

Track these metrics to measure the health of your reconciliation process: variance rate (percentage of data pulls that flag discrepancies above threshold), mean time to resolution (how long it takes to investigate and fix a flagged discrepancy), false positive rate (percentage of alerts that turn out to be legitimate differences, not errors), coverage (percentage of data sources included in regular reconciliation checks), automation rate (percentage of reconciliation comparisons that run automatically vs. manually). These metrics help you improve the process over time — reducing variance rate and mean time to resolution, increasing coverage and automation rate. Share them with stakeholders quarterly to demonstrate data quality improvements.

FAQ

Roman Vinogradov

VP of Products, Improvado

Roman Vinogradov is Vice President of Product at Improvado, where he leads product vision and development for enterprise marketing analytics. A member of the Forbes Technology Council and advisor at Berkeley SkyDeck Europe, he focuses on AI-driven data solutions that empower marketing teams to scale insights securely and efficiently.

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.