I replicate the five Fama-French factors on the monthly frequency. The results of my replication in terms of correlation are as follows:
- SMB: 97.19%
- HML: 95.15%
- RMW: 91.81%
- CMA: 97.30%
The "Cumulative returns" folder contains a visual comparison of the replicated factors with the original by plotting their cumulative returns from July 1963 to December 2021.
I use the CRSPSift application for Windows to extract the following data from the CRSP monthly tape:
- RET: total return
- PRC: end-of-period price
- SHROUT: number of shares outstanding
- EXCHCD: exchange code
- SHRCD: security share code
- PERMNO: security identifier
- PERMCO: company identifier
I extract everything for the period Jan 1926 to Dec 2021 and then I subset it.
I extract all data necessary to construct Book-to-Market, Operating Profitability and Investability variable as described in the paper from Compustat. The only filter I use is that items must be reported in USD. Then I proceed to define the aforementioned variables and merge them with market equity data from CRSP.
I subset the data to include only common ordinary shares (SHRCD = 10, 11) that trade in NYSE, AMEX and NASDAQ (EXCHCD = 1, 2, 3) after June 1963. Then I proceed to construct the factors using the methods of PortSort. Check https://github.com/ioannisrpt/portsort for more details.