/ factorpad.com / fin / glossary / histogram.html
An ad-free and cookie-free website.
Beginner
Histogram is a chart showing the number, or frequency, of observations within ranges of data. Ranges may also be called buckets. The graphic, depicted using a bar chart, is helpful for identifying outliers and to validate that statistical tests are appropriate for the distribution of the data. A frequency distribution presents the same data in table form.
Synonym: frequency chart
A common mistake by investment modelers is to overly rely on summary statistics. It's easy to understand why. With thousands of stocks to review time is of the essence. Analysts find it most convenient to use the beta, standard deviation and correlation figures provided by their data service provider to arrive at a conclusion. The problem is that many risk measures utilize the mean, and the mean is subject to outliers.
One way to adjust for these biases is to visualize distributions using a histogram. In general, this is a good practice when making any decision based on statistical data.
Doc: This histogram shows the
distribution of past final grades in this Portfolio Theory class.
Leo: Excuse me Professor, I have an unrelated
question. What is the last day to drop?
This video can be accessed in a new window or App , at our YouTube Channel or from below.
Histogram definition for investment modeling (3:44)
The script includes two sections where we visualize and demonstrate the creation of a histogram.
We're sitting right here in Excel and this is a snippet from our boot camp course.
First is the frequency distribution, followed by the histogram. Here we analyzed 60 monthly returns on four stocks, located on a tab called Returns and in this table we listed six ranges, or bins.
As an illustration, there were 9 returns of less than -10% for eBay and Merck had returns greater than 10%, 6 times.
The same data is presented graphically in a histogram. This visual perspective helps you spot errors and trends. It will also let you see if the distribution is in alignment with your statistical tests. See how eBay, in orange, had more observations away from the center, than Microsoft and Abbott Labs. So eBay would have higher risk if we measured it by standard deviation.
Next, let's demonstrate.
There are eight steps to generate a frequency distribution and histogram in Excel.
The first seven are covered in another video on the frequency distribution, and because they go together, I will provide a link at the end of this video.
So our starting point here is the completed frequency distribution.
Highlight the range of data, including labels, click Insert, then Column. That's it. Of course, edit this histogram as you see fit.
Let's focus on the purpose of a histogram. For many statistical tests in investment modeling, even for studies with as few as 60 observations, it is common to use the normal distribution.
And a quick look at this indicates that more observations are in the tails, or away from the center, so it may make more sense to use a t-distribution instead of a normal distribution.
Click box for answer.
True
False. A vertical bar chart is most common.
Still unclear on the histogram? Leave a question in the comments section on YouTube or check out the Quant 101 Series, specifically Stock return frequency distributions and histograms in Excel.
Our trained humans found other terms in the category statistics basics you may find helpful.
We're developing a smart crowd at the FactorPad YouTube Channel. Join us.
/ factorpad.com / fin / glossary / histogram.html
A newly-updated free resource. Connect and refer a friend today.