Summarizing Data Using Frequency Distributions

Summarizing Data Using Frequency Distributions

Frequency Distribution 

A frequency distribution refers to the presentation of statistical data in a tabular format to simplify data analysis. In a frequency distribution, data is subdivided into groups or intervals.

The standard procedure for constructing a frequency distribution involves the following steps:

  1. Determine the number of classes one wishes to have. 5 – 20 is always a good number.
  2. Determine the interval size. To do this, the range and the number of classes should guide an analyst. The range is the difference between the smallest and the largest observations. In the case of a fractional result, an analyst should take the next higher whole number as the size of the interval.
  3. Determine the starting point. It could be the lower limit of the lowest observation or a convenient value just below the lower limit.
  4. Add the class interval to the starting point to get the second lower limit. This process should be repeated.
  5. List the lower limits in a vertical column alongside the upper-class limits.
  6. Complete the table by counting the number of observations that fall under each class.

Points to note:

  • Classes should be mutually exclusive and have similar widths.
  • The last class includes the maximum value.
  • All class intervals should be tabulated even if they have zero observations.
  • The sum of the frequencies should be equal to the number of observations.
  • Tally bars offer a convenient tool for the visual presentation of the number of observations in each.

Example: Frequency Distribution Table

You have been given the following data showing the percentage returns that certain classes of investment offer in a year. Use the data to construct a frequency distribution table.

$$ \begin{array}{c|c|c|c|c} \text{-10%} & \text{2%} & \text{32%} & \text{-28%} & \text{25%} \\ \hline \text{-25.60%} & \text{4%} & \text{11%} & \text{-14%} & \text{15%} \\ \hline \text{23%} & \text{13%} & \text{6%} & \text{-2.70%} & \text{8%} \\ \hline \text{12%} & \text{28%} & \text{17.50%} & \text{5.80%} & \text{20%} \\ \hline \text{4.60%} & \text{17%} & \text{-3.90%} & \text{22.40%} & \text{15%} \\ \end{array} $$

Solution

  1. We have 25 observations in total. First of all, we will sort the data in ascending order.
  2. After this, we will calculate the range of the data, where Range = Maximum value − Minimum value = 32% – (-28%) = 60%.
  3. We will then determine an interval width of 10%.
  4. The lowest return intervals will be -30% ≤ Rt < -20%, while the highest one will be 30% ≤ Rt < 40%.
  5. Then, we will count the observations in each interval.

$$ \begin{array}{c|c|c} \textbf{Interval} & \textbf{Tally} & \textbf{Frequency} \\ \hline -30\% \leq R_t < -20\% & \text{II} & \text{2} \\ -20\% \leq R_t < -10\% & \text{I} & \text{1} \\ -10\% \leq R_t < 0\% & \text{III} & \text{3} \\ 0\% \leq R_t <10\% & \text{IIIIII} & \text{6} \\ 10\% \leq R_t < 20\% & \text{IIIIIII} & \text{7} \\ 20\% \leq R_t < 30\% & \text{IIIII} & \text{5} \\ 30\% \leq R_t < 40\% & \text{I} & \text{1} \\ \textbf{Total} & \text{} & \textbf{25} \\ \end{array} $$

Absolute versus Relative Frequency

Absolute frequency is the actual number of observations in a given interval.

Relative frequency refers to the percentage of observations falling within a given class. It is calculated by dividing the absolute frequency of each return interval by the total number of observations. Using our earlier example when we introduced the frequency distribution table, we could come up with the relative frequency for each interval using the formula below:

$$\text{Relative Frequency}=\frac{\text{Absolute frequency}}{\text{Total frequency}}$$

Where \(\text{Total frequency}\) is the total number of observations.

Example: Relative Frequency

$$
\begin{array}{c|c|c|c}
\textbf { Interval } & \textbf { Tally } & \textbf { Frequency } & \textbf { Relative Frequency } \\
\hline-30 \% \leq \mathrm{R}_{\mathrm{t}} \leq-20 \% & \text { II } & 2 & \frac{2}{25}=8 \% \\
-20 \% \leq \mathrm{R}_{\mathrm{t}} \leq-10 \% & \text { I } & 1 & \frac{1}{25}=4 \% \\
-10 \% \leq \mathrm{R}_{t} \leq 0 \% & \text { III } & 3 & \frac{3}{25}=12 \% \\
0 \% \leq \mathrm{R}_{t} \leq 10 \% & \text { IIIII } & 6 & \frac{6}{25}=24 \% \\
10 \% \leq \mathrm{R}_{t} \leq 20 \% & \text { IIIIII } & 7 & \frac{7}{25}=28 \% \\
20 \% \leq \mathrm{R}_{t} \leq 30 \% & \text { IIII } & 5 & \frac{5}{25}=20 \% \\
30 \% \leq \mathrm{R}_{t} \leq 40 \% & \text { I } & 1 &\frac{1}{25}=4 \% \\
\text { Total } & & 25 & \frac{25}{25}=100 \%
\end{array}
$$

In the above table, the absolute frequency of the 1st interval is 2. Similarly, the relative frequency of the 1st interval is 8%. The same applies to other intervals.

Cumulative absolute frequency is the sum of the absolute frequencies, including the given interval.

Cumulative relative frequency similarly sums up the relative frequencies up to and including the given relative frequency.

Example: Cumulative Frequencies

$$
\begin{array}{c|c|c|c|c|c}
\textbf { Interval } & \textbf { Tally } & \textbf { Frequency } & \textbf { Relative Frequency } & \begin{array}{c}
\textbf { Cumulative Absolute } \\
\textbf { Frequency }
\end{array} & \begin{array}{c}
\textbf { Cumulative Relative } \\
\textbf { Frequency }
\end{array} \\
\hline-30 \% \leq R \leq-20 \% & \text { II } & 2 & \frac{2 }{25}=8 \% & 2 & 8 \% \\
-20 \% \leq R_{4} \leq-10 \% & \text { I } & 1 & \frac{1 }{25}=4 \% & 3 & 12 \% \\
-10 \% \leq R_{1} \leq 0 \% & \text { III } & 3 & \frac{3}{ 25}=12 \% & 6 & 24 \% \\
0 \% \leq R_{1} \leq 10 \% & \text { IIIII } & 6 & \frac{6}{25}=24 \% & 12 & 48 \% \\
10 \% \leq R \leq 20 \% & \text { IIIII } & 7 & \frac{7}{25}=28 \% & 19 & 76 \% \\
20 \% \leq R \leq 30 \% & \text { IIII } & 5 & \frac{5}{25}=8 \% & 24 & 96 \% \\
30 \% \leq R \leq 40 \% & \text { I } & 1 & \frac{1}{25}=4 \% & 25 & 100 \% \\
\text { Total } & & 25 & \frac{25}{25}=100 \% & &
\end{array}
$$

In the above table, cumulative absolute frequency is the sum of the absolute frequencies up to and including the given interval. The cumulative relative frequency similarly sums up the relative frequencies up to and including the given relative frequency.

Question 1

The class frequency divided by the total number of observations is most likely called:

  1. Relative frequency.
  2. Percentage frequency.
  3. Cumulative relative frequency.

Solution

The correct answer is A.

Relative frequency refers to the percentage of observations falling within a given class. It is calculated by dividing the absolute frequency of each return interval by the total number of observations.

C is incorrect. Cumulative relative frequency sums up the relative frequencies up to and including the given relative frequency.

B is incorrect. There is no such term as percentage frequency.

Question 2

The number of tally sheet count for each value or a group is most likely known as:

  1. Class limit.
  2. Frequency.
  3. Class width.

Solution

The correct answer is B.

The frequency of a class is the number of data entries in the class.

A is incorrect. Each class will have a “lower-class limit” and an “upper-class limit”, which are the lowest and highest numbers in each class.

C is incorrect. The “class width” is the distance between the lower limits of consecutive classes.

Shop CFA® Exam Prep

Offered by AnalystPrep

Featured Shop FRM® Exam Prep Learn with Us

    Subscribe to our newsletter and keep up with the latest and greatest tips for success
    Shop Actuarial Exams Prep Shop Graduate Admission Exam Prep


    Sergio Torrico
    Sergio Torrico
    2021-07-23
    Excelente para el FRM 2 Escribo esta revisión en español para los hispanohablantes, soy de Bolivia, y utilicé AnalystPrep para dudas y consultas sobre mi preparación para el FRM nivel 2 (lo tomé una sola vez y aprobé muy bien), siempre tuve un soporte claro, directo y rápido, el material sale rápido cuando hay cambios en el temario de GARP, y los ejercicios y exámenes son muy útiles para practicar.
    diana
    diana
    2021-07-17
    So helpful. I have been using the videos to prepare for the CFA Level II exam. The videos signpost the reading contents, explain the concepts and provide additional context for specific concepts. The fun light-hearted analogies are also a welcome break to some very dry content. I usually watch the videos before going into more in-depth reading and they are a good way to avoid being overwhelmed by the sheer volume of content when you look at the readings.
    Kriti Dhawan
    Kriti Dhawan
    2021-07-16
    A great curriculum provider. James sir explains the concept so well that rather than memorising it, you tend to intuitively understand and absorb them. Thank you ! Grateful I saw this at the right time for my CFA prep.
    nikhil kumar
    nikhil kumar
    2021-06-28
    Very well explained and gives a great insight about topics in a very short time. Glad to have found Professor Forjan's lectures.
    Marwan
    Marwan
    2021-06-22
    Great support throughout the course by the team, did not feel neglected
    Benjamin anonymous
    Benjamin anonymous
    2021-05-10
    I loved using AnalystPrep for FRM. QBank is huge, videos are great. Would recommend to a friend
    Daniel Glyn
    Daniel Glyn
    2021-03-24
    I have finished my FRM1 thanks to AnalystPrep. And now using AnalystPrep for my FRM2 preparation. Professor Forjan is brilliant. He gives such good explanations and analogies. And more than anything makes learning fun. A big thank you to Analystprep and Professor Forjan. 5 stars all the way!
    michael walshe
    michael walshe
    2021-03-18
    Professor James' videos are excellent for understanding the underlying theories behind financial engineering / financial analysis. The AnalystPrep videos were better than any of the others that I searched through on YouTube for providing a clear explanation of some concepts, such as Portfolio theory, CAPM, and Arbitrage Pricing theory. Watching these cleared up many of the unclarities I had in my head. Highly recommended.