How Financial Institutions Differ from ...
A financial institution is an intermediary between providers and recipients of capital or... Read More
Survivorship bias occurs when a conclusion is drawn from data whose scope only captures companies that survived until the date the backtesting was done. It is worth clarifying that many practitioners fail to quantify the effects of survivorship bias in their backtesting. Several investors suggest that there is nothing wrong with backtesting strategies using the current index constituents. The problem that may arise, however, is that one cannot tell the company which will go under from that which will be amalgamated and subsequently become successful enough to be added to the index. There is yet another problem: The list of surviving firms is also thought to be biased.
When data is available from data vendors, it is strongly recommended that investors use a point-in-time approach to track all companies that have ever existed. That way, the backtesting exercise will yield realistic results.
A low-volatility anomaly is an empirical observation used to illustrate the difference between backtesting with point-in-time data and current index constituents. Low-volatility anomaly argues that low volatility stocks outperform those with high volatility.
Look-ahead bias emanates from the use of unavailable information by an investor during the historical periods over which a backtest is conducted. It is noteworthy that this is the most common mistake made when backtesting is conducted. To avert its occurrence, we use point-in-time data. There are three common forms of look-ahead data.
Data snooping, also known as p-hacking or selective reporting, involves making an inference after looking at statistical results instead of testing the inference. An analyst does this by selecting data until they obtain the desired result.
Data snooping can take the following forms:
Data snooping generates false positives. To mitigate data snooping, we often use cross-validation or higher-than-average hurdles, e.g. a high t-statistic.
Question
Cross-validation is useful in the avoidance of which one of the following problems that may affect backtesting?
- Data snooping.
- Survivorship bias.
- Look-ahead bias.
Solution
The correct answer is A.
Cross-validation is often used to mitigate data snooping problems.
B and C are incorrect. Both survivorship and look-ahead bias use point-in-time data to avoid backtesting problems.
Reading 42: Backtesting and Simulation
LOS 42 (d) Identify problems in a backtest of an investment strategy.