The Confusion Matrix & Precision-Recall Tradeoff

Question 1

How do I create a new Stats iQ variable?

You can create a new variable by clicking Create or Clean Variable in the lower-left corner of your workspace. For more information, please visit the support page on Variable Creation.

Question 2

What are the options for analyzing my data in Stats iQ?

Stats iQ provides five options for analyzing data:

Describe: Selecting a variable from the list and then clicking Describe will give you a visualization of the data contained in that variable. Use this when you would like to see how the data for a certain variable is distributed.
Relate: Selecting two variables and then clicking Relate will run a statistical analysis of the relation between the two variables. Use this when you would like to know how strongly two variables are correlated.
Pivot Table: Selecting two or more variables and clicking Pivot Table will create a table that displays the values of the variables as rows and columns. The cells can be set to display a variety of different information including column and row percentage, Sum, and Variance. Use this when you would like to compare the overlap between specific values of a set of variables.
Regression: Selecting two variables and clicking Regression will give the mathematical relationship between the variables. Use this when you would like to predict values for one variable based on the values of another.
Cluster: Selecting two to ten demographic variables and clicking Cluster will display groupings of traits most likely to occur together, thus revealing the population segments captured in your data.

Question 3

I don't know what this statistical term means. Can you tell me?

Statistical tests: ANOVA, T-test, and Chi-squared are all statistical tests that Stats iQ performs to test whether or not the relationship between two variables is significant. These tests are used to generate a P-Value.
P-Value: This value represents the probability that results at least as strong as those observed would be seen if no relationship between the variables existed. A lower P-Value means stronger evidence that the relationship is real; it does not by itself indicate how strong the relationship is.
Effect Size: The effect size is a measure of how large the relationship between two variables is. This is measured in different ways depending on the type of statistical test performed. Examples are Cohen’s d, Pearson’s r, and Cramér’s V. The larger the effect size value, the more strongly related the variables are.

For more information, visit the Statistical Test Assumptions and Technical Details support page.

Question 4

How do I filter the data that appears in Stats iQ?

You can filter the data that appears in Stats iQ on two different levels: on individual cards and on the overall workspace. You can find instructions for this on the Filtering Data page for Stats iQ.

Question 5

How do I get my new responses to show up in Stats iQ?

In Stats iQ, click the Settings button and select Import Latest Data. This will import any new responses to Stats iQ and include them in your analysis.

Question 6

How are analysis cards ordered in my Stats iQ Workspace?

Analysis cards are automatically ordered to show the most statistically significant results. You can change the order in which the variables appear in the data set by navigating to the Analysis Settings menu.

Question 7

What’s Stats iQ? / Where’s Statwing?

Stats iQ is the new name for Statwing. You can find Stats iQ by going to any project, going to Data & Analysis, and selecting Stats iQ.

Question 8

What do I do if my data isn't loading properly?

Make sure you've loaded your current dataset by clicking Import Latest Data in Stats iQ. If your data is still not loading properly, then please contact Qualtrics Technical Support.

CustomerID	Age	Gender
…	…	…
324	54	Female
325	23	Female
326	62	Male
327	15	Female
…	…	…

CustomerID	Age	Gender	Model-estimated likelihood of return
…	…	…	…
324	54	Female	34%
325	23	Female	24%
326	62	Male	65%
327	15	Female	7%
…	…	…	…

CustomerID	Age	Gender	Model-estimated likelihood of return	Model prediction (30% cutoff)
…	…	…	…	…
324	54	Female	34%	Will return
325	23	Female	24%	Won’t
326	62	Male	65%	Will return
327	15	Female	7%	Won’t
…	…	…	…	…
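
The cutoff step in the table above can be sketched in a few lines of Python. This is only an illustration, not how Stats iQ computes it: the `predict` function and the `CUTOFF` name are hypothetical, and the handling of a likelihood exactly equal to the cutoff is an assumption, since the table does not cover that case.

```python
# Rows copied from the table above: (CustomerID, estimated likelihood of return).
customers = [(324, 0.34), (325, 0.24), (326, 0.65), (327, 0.07)]

CUTOFF = 0.30  # the 30% cutoff named in the table header


def predict(likelihood, cutoff=CUTOFF):
    """Turn an estimated likelihood into a yes/no prediction.

    Assumption: a likelihood exactly at the cutoff counts as "Will return".
    """
    return "Will return" if likelihood >= cutoff else "Won't"


# Map each customer to the model's prediction at the chosen cutoff.
predictions = {cid: predict(p) for cid, p in customers}

for cid, label in predictions.items():
    print(cid, label)
```

Raising or lowering `CUTOFF` changes which customers are labeled "Will return" without changing the underlying likelihood estimates.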

CustomerID	Age	Gender	Model-estimated likelihood of return	Model prediction (30% cutoff)	Returned
1	21	Male	44%	Will return	Returned
2	34	Female	4%	Won’t	Returned
3	13	Female	65%	Will return	Didn’t
4	25	Female	27%	Won’t	Didn’t
…	…	…	…	…	…

CustomerID	Age	Gender	Model-estimated likelihood of return	Model prediction (30% cutoff)	Returned	Prediction accuracy
1	21	Male	44%	Will return	Returned	Correct
2	34	Female	4%	Won’t	Returned	Incorrect
3	13	Female	65%	Will return	Didn’t	Incorrect
4	25	Female	27%	Won’t	Didn’t	Correct
…	…	…	…	…	…	…
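
The comparison of predictions against actual outcomes can be tallied into confusion matrix counts, from which precision and recall follow. The sketch below uses only the four labeled rows shown in the table above. The function names are hypothetical, treating "returned" as the positive class is an assumption, and four rows are far too few to say anything about a real model.

```python
# Rows copied from the table above:
# (CustomerID, estimated likelihood of return, actually returned?)
rows = [(1, 0.44, True), (2, 0.04, True), (3, 0.65, False), (4, 0.27, False)]


def confusion_counts(rows, cutoff):
    """Count true/false positives and negatives at a given cutoff.

    Assumption: "returned" is the positive class, and a likelihood at or
    above the cutoff is a positive prediction.
    """
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for _, likelihood, returned in rows:
        predicted = likelihood >= cutoff
        if predicted and returned:
            counts["TP"] += 1
        elif predicted and not returned:
            counts["FP"] += 1
        elif not predicted and returned:
            counts["FN"] += 1
        else:
            counts["TN"] += 1
    return counts


def precision_recall(counts):
    """Return (precision, recall); None where the denominator is zero."""
    predicted_pos = counts["TP"] + counts["FP"]
    actual_pos = counts["TP"] + counts["FN"]
    precision = counts["TP"] / predicted_pos if predicted_pos else None
    recall = counts["TP"] / actual_pos if actual_pos else None
    return precision, recall


counts_30 = confusion_counts(rows, 0.30)
print(counts_30, precision_recall(counts_30))

# A much lower cutoff predicts "Will return" for everyone in this sample.
counts_03 = confusion_counts(rows, 0.03)
print(counts_03, precision_recall(counts_03))
```

On these four rows, dropping the cutoff from 30% to 3% catches every customer who returned (higher recall) but also flags every customer who did not, which is the tradeoff the page title refers to.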
