Dataset Types

When you upload a custom dataset, you must choose the correct dataset type. You can upload raw datasets, frequency tables, and contingency tables.

Raw dataset

A raw dataset is a list of observations that has not been analyzed or summarized.

Example: The clothing type, size, color, price, and brand for 109 clothing items sold.
Clothing type Size Color Price Brand
T-shirt Small Blue 10.50 A
Sweatshirt Large Blue 25.00 A
T-shirt Medium Green 11.75 B
Sweatpants Medium Black 26.95 A
Sweatshirt Small Red 19.99 B

Frequency table

A frequency table summarizes one categorical variable by listing the possible outcomes for the variable and the frequency of each outcome.

Example: Table summarizes "clothing type" for 109 clothing sales.

Clothing type Frequency
T-shirt 32
Sweatshirt 54
Sweatpants 23

2-Way Contingency table

A contingency table summarizes two categorical variables by listing the possible outcomes and frequency of the outcomes for each variable. One variable is in rows, and the other is in columns. The table is used to study the relationship between the two variables.

Example: Table summarizes the variables 'item sold' and 'size' for 109 clothing sales.

Small Medium Large
T-shirt 8 13 11
Sweatshirt 11 24 19
Sweatpants 5 9 9