data engineering apis

Cloud Data Matching - Identify Inconsistent/Redundant Data Records

This browser-based matching tool enables you to run data quality match reports from database tables existing in the Cloud on the major Cloud Data Platforms (or any other database that you can access via the Internet). CSV files, TSV files, and Excel files are supported as well. Simply provide the connection string from your data source's database, table name, and column name(s). The Match Column Name is the column for which the selected matching algorithm will be applied to identify inconsistently represented data and duplicate data candidates. Data will be retrieved, the appropriate matching algorithm will be utilized, and the reports generated and displayed for your use and analysis. You can also generate SQL Insert scripts with similarity keys (the basis for matching) appended to apply your own matching and merging logic, or write results to new tables in the source database through the connection. This is an easy step to take in improving the usability and value of your important data assets.

First time? Check out our quick and easy Data Matching Tutorial to see how it's done.

For instructions on how to include data matching of a dataset as part of workflow, including scheduled runs, ongoing processing, or as part of a data pipeline, see the Data Matching Workflow Guide.


Run Here - Report Setup:


Login or register to obtain

Select Matching Algorithm Type:
See example reports based on matching algorithm types
Sample connection strings

Select Report/Processing Type:


* Note: Use connection type = "CSV File", connection string = "https://dl.interzoid.com/csv/companies.csv", table name = "CSV", and column name = "1" for a demo that does not use credits from your account, regardless of report type.


Additional Information:


Login to www.interzoid.com to obtain your API Key. It is how we track usage. If you do not yet have one, register for a free trial at www.interzoid.com/register-api-account.

Sample SQL Scripts in case you need sample/test data for a test drive.

How to use CSV files for data processing within Interzoid Cloud Data Connect.

API used (behind the scenes) for Company/Organization Name Matching.

API used (behind the scenes) for Individual Name Matching.

API used (behind the scenes) for Address Matching.

See Sample Match Reports


            
        Sample Matches (showing all three report types):
G.E. Gen Electric General Electric Bill Jameson William R. Jamison 500 Browne lane suite #100 500 Browne ln suite 100 500 brown lane ste 100
Similarity Key Appending (including in new table):
Apple International,k2HDzRo6pObj5PkfW5sskHTHESF7AQ apple inc.,k2HDzRo6pObj5PkfW5sskHTHESF7AQ The Apple Store,k2HDzRo6pObj5PkfW5sskHTHESF7AQ Apple Corp,k2HDzRo6pObj5PkfW5sskHTHESF7AQ Apple USA,k2HDzRo6pObj5PkfW5sskHTHESF7AQ 7-11,Rf6RCPXSmrZOp8FKRSjQuRvfzO3ef 7-eleven,Rf6RCPXSmrZOp8FKRSjQuRvfzO3ef Seven Eleven Stores,Rf6RCPXSmrZOp8FKRSjQuRvfzO3ef

Run scheduled or as part of a batch script? Embed in workflow or within a data pipeline? Execute programmatically? You can run this entire process with a single API call. Example:

            
        https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=CSV&connection=https://dl.interzoid.com/csv/companies.csv&table=CSV&column=1&process=matchreport&category=company&html=true
            


Full Dataset Processing API Documentation

Return to Platform Home


All content (c) 2019-2024 Interzoid Incorporated. Questions or assistance? Contact support@interzoid.com

201 Spear Street, Suite 1100, San Francisco, CA 94105-6164

Interested in Data Cleansing or Enhancement Services?

Start Here
Terms of Service
Privacy Policy

Use the Interzoid Cloud Connect Data Platform and Start to Supercharge your Cloud Data Now: connect.interzoid.com
API Integration Code Examples and SDKs: github.com/interzoid
Documentation and Overview: Docs site
Interzoid Product and Technology Newsletter: Subscribe
Partnership Interest? Inquire