
Jeff Butcher
- Total activity 32
- Last activity
- Member since
- Following 0 users
- Followed by 0 users
- Votes 0
- Subscriptions 18
Articles
Recent activity by Jeff Butcher Sort by recent activity-
Advanced mapping concepts
This article guides you through the process of mapping and loading data into Senzing using the csv files attached to this article. If you want to follow along, download them to a directory of you...
Exploratory Data Analysis 4 - Comparing ER results
The G2Audit utility was designed to compare Senzing ER results with ... Postulated ER results from a truth set, see How to create an entity resolution truth set ER results from other engines Prior...
Scoring Search Results
This article describes a configurable scoring algorithm that can be implemented after a search to rank the entities returned by the best matching record. At the heart of this strategy is a jso...
Generic Entity Specification - JSON & CSV Mapping
The attached documents are the latest specification for presenting entities to the Senzing engine. The generic entity specification includes the data dictionary for inputting entities, and sample ...
Exploratory Data Analysis 2 - Basic exploration
Please follow the instructions below and/or watch this Video tutorial For the next step ... If you installed directly in linux, navigate to <your project>/g2/python directory. If you installed th...
Exploratory Data Analysis 3 - Taking a snapshot
Entities are changing every time you load data. Periodically you will want to take a snapshot to calculate and review the following statistical reports ... dataSourceSummary This report shows th...
Exploratory Data Analysis 1 - Loading the truth set demo
For this demonstration we will load the truth set demo files as a set of customers and a watch list. But you could really name them anything based on your use case. For instance, an insider thre...
Exploratory Data Analysis Overview (EDA Tools)
So you've just loaded a bunch of data. How many duplicate customers do you have? Were any of them on a watch list? What is that ambiguous match? How does Senzing compare with other matching en...
AWS cloud formation / adding configuration for new data sources
Every new data source you load will require some amount of configuration even if it is only to register the data source. You will need to ssh into the senzing/sshd container to run the the G2Confi...
AWS cloud formation / diagnosing dead letter queue
Background: Lets say you have installed the the Senzing AWS cloud formation stack. You have then written a program to put your data on the SQS queue and records are hitting the dead letter queue ...