Distributions and Calculations Flashcards
Distribution
List or function showing all the possible values/intervals of the data and how often they occur
Four Main Distributions
Positive - Tail heads towards positive
Negative - tail heads towards negative
Binomial - two distinct peaks
Symmetric - tails are equal
Leptokurtic (L)
Long tails relative to normal distribution
Platykurtic
Short tails relative to normal distribution
Class (Class Intervals)
Division of data into Bin Widths
Class Frequency
Number of observations within a class of data
Bin Width Rules
Surges = 1 + Log2(N)
Rice = 2 x Cubed Root N
N = Total number of observations
Balance Scale
Point at which distribution is even
Center Distribution Using Smallest Absolute Deviation
Absolute Deviation = Sum of the difference in values from deviation
Center Distribution Using Smallest Absolute Deviation = smallest number for which the sum of absolute deviation is smallest
*Use median (midpoint)
Center Distribution Using Smallest Squared Deviation
Squared Deviation = difference in values from deviation, squared
Center Distribution Using Smallest Squared Deviation = smallest number for which the sum of squared deviation is smallest
Three Ways to Define a Central Tendency
- Point at which distribution is balanced
- Number which minimizes the sum of the absolute differences
- Number which minimizes the sum of the squared differences
Mode
Most frequently occurring value
Median
Midpoint of a distribution ranked from lowest to highest
Median - medians are in the middle - middle number
*Use the median to calculate absolute deviations
Mean
Total Value / Total Numbers = Mean (average)
- The mean is where we hope a distribution is balanced b/c it is the average
- Use the mean to calculate squared deviations
Range
Highest Value - Lowest Value