5.6.6. Pattern Extractor

<< Click to Display Table of Contents >>

Navigation:  5. Detailed description of the Actions > 5.6. Cleaning >

5.6.6. Pattern Extractor

 

Icon: ANATEL~3_img580

 
Function: patternExtractor
 

Property window:

 

ANATEL~3_img581

 

Short description:

 

Extract patterns out of phone numbers.

 

Long Description:

 

This analyzes a column containing phone numbers. It has 2 operating mode. The first operating mode is typically used to find some examples of different patterns in the phone numbers.

 

The rules to create the patterns are:
 

All the digits have been replaced with “n”.

All the letters have been replaced with “A”.

All the punctuations are kept “in place”.

 

You typically use the first operating mode this way:

 

clip0214

 

The output table contains 3 examples of phone number for each different pattern. Three examples is enough to understand if a pattern is representing some valid entries or not. Thereafter, you can take action to correct & clean the data.

 

Let’s now assume that you are interested in a specific pattern “n.nn” and you want to have all examples following this specific pattern: You’ll use:

 

clip0215