4.3. Using StarDust AFTER a predictive analysis

<< Click to Display Table of Contents >>

Navigation:  4. How to start « TIMi – StarDust module »? >

4.3. Using StarDust AFTER a predictive analysis

 

For your first lecture of this guide, I suggest you to skip this section a go directly to section 5. In this section, we will assume that you are familiar with the usage of StarDust.

 

Inside StarDust, all the algorithms (PCA, K-means & Ward) are able to take into account user-defined “weights” on the columns and on the rows.

A large weight on a row of your dataset means that this particular individual is very important. A large weight on a column of your dataset means that this column is very important. You can set the row-weights here: number1 and the column-weights here: number2.

 

STARDU~1_img106

 
 

The column-weights are designated as STARDU~1_img107 in section 5.5.1.

The sum of the row-weights inside a segment is designated as STARDU~1_img108in section 5.5.3.2.

 

There exist some optimal column-weights that allow you to visualize in an optimal way the characteristics that separate your “Target Group” from the rest of the population. When you are using these optimal column-weights, you can easily “see” and “explore” your “Target Group” in relation to the other individuals.

 

For example, for the census-income database:

 

STARDU~1_img109

 

Note that we added as illustrative variable the “taxable income amount”. number1

 

A good idea is to also add as illustrative variable the “sex”.
Thereafter, we load the dataset. We go directly to the chart representing the variable “Taxable income amount” (click The STARDU~1_img110 button) and we add the “True” modality as a new group.

 

STARDU~1_img111

 
Let’s configure the display of our Groups: click the STARDU~1_img16 button and select the “(Meta-)Group” tab:

 

STARDU~1_img112

 

 

... and we finally obtain:

 

STARDU~1_img114

 
 

Apparently, in this case, the “Target Group” is roughly divided in two parts! This is interesting... We should investigate what’s the difference between these parts.