Introducing our Snowflake Data Cloud Native Application: AI-Augmented Data Quality built into SQL statements! Learn More

Dataset Matching Workflow Integration

Automate and integrate dataset or database table matching into your workflow

Overview

This guide explains how to automate the running of matching jobs for datasets or database tables, allowing you to schedule, add to business processes, match & merge multiple datasets, or integrate into ETL/ELT processes.

Automation

Schedule and integrate matching jobs into your workflows.

Multiple Data Sources

Support for various cloud and database platforms.

Single Command

Execute powerful matching capabilities with a single HTTP request.

How It Works

The matching process is initiated via an HTTP request "query string", which can be embedded into any process, batch file, scheduler, or series of commands.

Example Request (CSV source)

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=CSV&connection=https://dl.interzoid.com/csv/companies.csv&table=CSV&column=1&process=matchreport&category=company&html=true

Running with cURL

You can run the command from a Linux, Windows, or macOS command line using cURL:

Linux & Mac

$ curl 'https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=CSV&connection=https://dl.interzoid.com/csv/companies.csv&table=CSV&column=1&process=matchreport&category=company'

Windows

curl "https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=CSV&connection=https://dl.interzoid.com/csv/companies.csv&table=CSV&column=1&process=matchreport&category=company"

Redirecting Output

Output from these curl commands can be redirected to output files for further processing using the greater-than symbol in both Linux & Windows.

Linux & Mac

$ curl '[HTTP query string]' > output.csv

Windows

curl "[HTTP query string]" > output.csv

Connecting to Cloud SQL Data Tables

Examples of using the HTTP query string to match an entire database table of company names:

Snowflake Example

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=Snowflake&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company

Azure SQL Example

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=azure sql&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company

AWS RDS Example

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=aws rds postgres&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company

Google Cloud SQL Example

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=postgres&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company

PostgreSQL Example

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=postgres&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company

MySQL Example

https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=mysql&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company

Supported Data Sources

  • AWS RDS/Aurora
  • Snowflake
  • Azure SQL
  • Google Cloud SQL
  • Databricks
  • PostgreSQL
  • MySQL
  • MariaDB
  • Parquet
  • CSV
  • TSV
  • Excel

Data Matching Parameters

Required Parameters
  • function: Use 'match' for data matching.
  • process: Defines the report or action (e.g., 'matchreport', 'keysonly', 'gensql', 'createtable').
  • category: Indicates which set of algorithms to use ('company', 'individual', or 'address').