Bivariate
From Clariopedia
Overview
The clario® node Bivariate (now available in beta) allows you to create three types of bivariate tables to examine the relationships between attributes on an input data stream:
- A two-way frequency table or cross tabulation (often abbreviated as cross tab) for string attributes
- A means table, for string vs. numeric attributes
- A correlation matrix, for numeric attributes
Examples:
- Compare the counts or percentages of males vs. females who order from a marketing campaign
- Summarize mean age of residents by state
- Look at the relationship between age and income
Usage
Input Stream
The node connector can be connected to a variety of nodes (e.g., Read, Aggregate, Append, Missing), but it requires a valid stream of data.
Configuration
The Bivariate node has only one configuration face.
Two-Way Frequency Table
Select at least two string attributes to produce this Table. To create the Two-Way Frequency Table, click the Create icon under the Two-Way Frequency Table list. Select one attribute as the Compare Attribute by dragging that attribute into the Compare Attribute box. This attribute name will be used as the Table name in results.
Select one or more attributes by dragging them to the Attribute List box. Click Save to save Table configuration or Cancel to exit without saving. To erase a previously defined table, click on the table name (the Compare Attribute) in the Two-Way Frequency Table list and press delete.
NOTE: Only string attributes are available for the Two-Way Frequency Table.
Means Table
To create the Means Table, click the Create icon under the Means Table list. Select a Compare Attribute. After a Compare Attribute is selected, the Available Attributes list is filtered by data type: if a string attribute is selected, only numeric attributes are listed in the Available Attributes box, and if a numeric attribute is selected, only string attributes are listed. Click Save to save the Table configuration or Cancel to exit without saving. To erase a previously defined table, click on the table name (the Compare Attribute) in the Means Table list and press delete.
Correlation Matrix
To create the Correlation Matrix, simply check the Create Correlation Matrix box at the bottom of the configuration face. All numeric attributes will be available in the matrix results.
NOTES:
- To efficiently find attribute names, begin typing an attribute name in the text box directly under Available Attributes. You will be directed to the attributes beginning with the letter(s) you type.
- To select multiple attributes at once, use [Ctrl]+click to add attributes one at a time, [Shift]+down arrow to extend the selection in order of appearance, or [Shift]+click on the first and last attribute to select those two and every attribute between them.
- To de-select an attribute, click on the attribute in the Attribute List box and drag and drop it into the Available Attributes box.
- Attributes in the Attribute list can be re-ordered by clicking and holding on an attribute and dragging it to the desired position within the Attribute list box.
Results
There is one results face with up to three views. Views will be populated based on the tables requested in the node configuration.
Two-Way Frequency Table
Click on the View button next to Two-Way Frequency. Select an Analysis Set (table name) from the left-hand drop-down box, and an Attribute from the right-hand drop-down box. The count in each cell will appear, along with the overall %. Export this table to a spreadsheet, if desired, by clicking on the spreadsheet icon in the upper right corner. Note that only the counts (not the % values) export to the spreadsheet.
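The cell counts and overall percentages described above can be approximated outside the node with a short sketch. This is not the node's implementation; the sample records, the attribute names, and the function name two_way_frequency are hypothetical, and the sketch assumes that "overall %" means each cell's count divided by the total number of rows.

```python
from collections import Counter

# Hypothetical sample records; attribute names are illustrative only.
records = [
    {"gender": "M", "ordered": "Yes"},
    {"gender": "F", "ordered": "Yes"},
    {"gender": "F", "ordered": "No"},
    {"gender": "M", "ordered": "No"},
    {"gender": "F", "ordered": "Yes"},
]

def two_way_frequency(rows, compare_attr, attr):
    """Return {(compare_value, attr_value): (count, overall_percent)}.

    Assumption: overall percent = cell count / total row count * 100.
    """
    counts = Counter((r[compare_attr], r[attr]) for r in rows)
    total = sum(counts.values())
    return {cell: (n, 100.0 * n / total) for cell, n in counts.items()}

table = two_way_frequency(records, "gender", "ordered")
for (compare_value, attr_value), (n, pct) in sorted(table.items()):
    print(f"{compare_value} / {attr_value}: count={n}, overall%={pct:.1f}")
```

Here "gender" plays the role of the Compare Attribute and "ordered" the role of the attribute chosen from the right-hand drop-down box.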
Means Table
Click on the View button next to Means Table. Select an Analysis Set (table name) from the left-hand drop-down box, and an Attribute from the right-hand drop-down box. The means table will appear, with the string attribute defining the rows and the numeric attribute summarized across the columns. Numeric values displayed include: N, Mean, Minimum, Maximum, Standard Deviation, and Sum. Export this table to a spreadsheet, if desired, by clicking on the spreadsheet icon in the upper right corner.
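As a rough illustration of the statistics listed above, the following sketch groups a numeric attribute by a string attribute and computes N, Mean, Minimum, Maximum, Standard Deviation, and Sum for each group. The sample data and the function name means_table are hypothetical. The page does not say whether the node reports the sample or the population standard deviation; the sketch assumes the sample version.

```python
import statistics
from collections import defaultdict

# Hypothetical sample records; attribute names are illustrative only.
records = [
    {"state": "NY", "age": 30},
    {"state": "NY", "age": 40},
    {"state": "CA", "age": 25},
    {"state": "CA", "age": 35},
    {"state": "CA", "age": 45},
]

def means_table(rows, string_attr, numeric_attr):
    """Summarize numeric_attr for each distinct value of string_attr."""
    groups = defaultdict(list)
    for r in rows:
        groups[r[string_attr]].append(r[numeric_attr])

    table = {}
    for key, values in groups.items():
        table[key] = {
            "N": len(values),
            "Mean": statistics.mean(values),
            "Minimum": min(values),
            "Maximum": max(values),
            # Assumption: sample standard deviation (n - 1 denominator),
            # which is undefined for a group with a single value.
            "Standard Deviation": (
                statistics.stdev(values) if len(values) > 1 else None
            ),
            "Sum": sum(values),
        }
    return table

summary = means_table(records, "state", "age")
for state, stats in sorted(summary.items()):
    print(state, stats)
```

Each key of the returned dictionary corresponds to one row of the means table, and each inner dictionary to the columns of that row.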
Correlation Matrix
Click on the View button next to Correlation Matrix. Select the attributes you want displayed in the matrix by dragging the attribute names from the Available Attributes list to the Selected Attributes list. To view a symmetrical matrix, where the diagonal values will all be 1, check the View Symmetrical Matrix box. If this box is not checked, Bivariate shows the correlations of the selected attributes with all numeric attributes on the incoming data stream. Click the Submit button to view the matrix.
To view the correlations between one attribute and all other attributes (for example, between Gender and all other attributes), select only Gender and do not check the View Symmetrical Matrix box. Bivariate will then show the correlations between Gender and all numeric attributes on the incoming data stream.
A slider in the lower right corner highlights cells whose absolute value exceeds the chosen threshold. The Attributes button just to the left of the slider allows you to re-select which attributes are displayed without returning to the previous step. Export this correlation matrix to a spreadsheet, if desired, by clicking on the spreadsheet icon in the upper right corner.
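The difference between the symmetrical view and the selected-versus-all view, and the effect of the highlight slider, can be sketched as follows. The page does not state which correlation coefficient the node computes; this sketch assumes the Pearson coefficient. The sample columns and the function names pearson, correlation_matrix, and highlighted are hypothetical.

```python
import math

# Hypothetical numeric attributes; names and values are illustrative only.
data = {
    "age":    [25, 35, 45, 55],
    "income": [30, 50, 70, 90],
    "visits": [8, 6, 4, 2],
}

def pearson(x, y):
    """Pearson correlation (an assumption; the node's method is not documented).

    Raises ZeroDivisionError if either column is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

def correlation_matrix(columns, selected, symmetrical):
    """Rows are the selected attributes.

    Columns are the selected attributes (symmetrical view) or every
    numeric attribute on the stream (non-symmetrical view).
    """
    col_names = list(selected) if symmetrical else list(columns)
    return {
        row: {col: pearson(columns[row], columns[col]) for col in col_names}
        for row in selected
    }

def highlighted(matrix, threshold):
    """Cells whose absolute correlation exceeds the slider threshold."""
    return [
        (row, col)
        for row, cells in matrix.items()
        for col, value in cells.items()
        if abs(value) > threshold
    ]

symmetric = correlation_matrix(data, ["age", "income"], symmetrical=True)
one_vs_all = correlation_matrix(data, ["age"], symmetrical=False)
print(symmetric)
print(one_vs_all)
print(highlighted(one_vs_all, 0.5))
```

With symmetrical=True the result is a square matrix whose diagonal is 1; with symmetrical=False and a single selected attribute, the result is one row holding that attribute's correlation with every numeric column.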
Output Stream
None.
