KNIME logo
Contact usDownload
Back to all templates

Matched Data

Matched data analysis is the process of identifying records that share key attributes across different datasets. This approach is commonly used to detect relationships or overlaps, such as employees appearing as vendors, by comparing fields like addresses or bank account numbers.

AuditFinanceDataviz & Data Apps
Header icon
Workflow
70%
Matched Data with KNIME

How This Workflow Works

This workflow compares two datasets—typically employee and vendor records—to identify matches based on selected fields. It guides users through data validation, field selection, matching logic, and result reporting, supporting investigations into potential conflicts or fraud.

Key Features:

  • Detects overlapping records between datasets using configurable matching criteria
  • Supports validation of numeric, character, and date fields to ensure data quality
  • Allows users to define thresholds and select fields for flexible matching
  • Generates interactive dashboards and detailed reports for review and sharing

Step-by-step:

1. VValidate and Configure Data Fields:

The workflow checks the integrity of numeric, character, and date fields in both datasets. Users can review summary statistics and address missing or invalid values, ensuring that only reliable data is used for matching.

2. Define Matching Criteria:

Users select which fields to compare—such as addresses or bank accounts—and can set thresholds for numeric or date similarities. This step tailors the matching logic to the specific risks or scenarios under investigation.

3. Identify and Analyze Matches:

The workflow applies the defined criteria to both datasets, systematically searching for records that meet the matching conditions. It highlights potential overlaps, such as employees sharing bank accounts or addresses with vendors.

4. Visualize and Share Insights:

Results are presented in interactive dashboards and detailed reports. Users can review matched records, export findings, or generate PDF reports for further analysis or communication with stakeholders.

How to Get Started