USP493 Session 3 Notes

  1. Graphic presentation of data
    1. For nominal and ordinal data you can do a pie chart or a bar chart
    2. For grouped interval data you can construct a histogram
    3. For grouped or ungrouped interval data you can construct line graphs
  2. Measures of Central Tendency
    1. Mode – can be used for nominal, ordinal and interval data the most often selected value (Note the mode is the value of the variable not the frequency) can have a bimodal distribution when more than two modes
    2. Median – can be used for ordinal and interval data the “middle” value – half the sample is higher and half lower
      1. line up all the cases from the smallest to the largest
      2. find the middle position (N+1)/2
        1. if odd number of cases, it will be the value of that case
        2. if even number of cases, it will be the average of the values of the cases on either side of it. For ordinal data, simply state it is between the two values.
        3. Locating a median if you have a cumulative frequency distribution
    3. The mean – can be used for interval data
      1. calculating a mean from a frequency distribution
        1. multiple the number of cases which take on each value by the value, then add together and divide by the number of cases as usual
    4. Relationship between mode, median and mean
    5. Which one should you use?
    6. The shape of a distribution
      1. Symmetrical – mean median and more are the same
      2. Positive skew – there are some extreme high values
      3. Negative skew – there are some extreme low values
  3. Measures of Variability
    1. Index of Qualitative Variability – used for nominal data
    2. The range – used for interval data – simply the largest value minus the lowest value. The interquartile range, the difference between the cutpoint where 25 percent of the cases have a larger values, and the cutpoint where 25 percent of the cases have a lower value. The boxplot shows the range, the interquartile range and the median
    3. The variance and standard deviation, a measure of the amount scores cluster at the mean or spread out from the mean. Used for interval data. The variance is the average squared deviation of scores from the mean. The standard deviation is calculated from the variance. You simply take the square root of the variance.