This report is an extension of the midterm project submitted for STATS 531 in Winter 2021.
I included the Decomposition of Yield as Trend, Noise and Cycle in section 5.3 which should have been included in the midterm project. This can be reviewed along with other complementary approaches to this research.
US Treasury Bond is a common investment tool for many institutional investors, including insurance companies. Specifically, insurance companies are required to invest in long-term bonds to match the asset duration with the liability duration and enhance the RBC (Risk-Based Capital) ratio. As a person who is devoting the career to the insurance industry, I have a particular interest in the trend, cyclic pattern, and the association with other factors of the U.S. Treasury long-term bond.
The data is downloaded from the U.S. Treasury Website. Please note that it may take a minute to download the data. If it takes too long, please pause and rerun the code.
In this analysis, I choose to analyze the 10-year data among different long-term bonds because it has the most data points. The 30-year yield data was not available for a period between 2002 and 2006 because 30 year securities were not being offered during that period. The 20-year yield is not available until October 1993 because 20-year securities were not offered by then.
I choose to transform the daily data into monthly data for the analysis which has just right sample size, \(n=375\). The daily data has \(n=7801\) samples, which are too large, and the annual data has \(n=19\), which are too small.
We first checked the overall picture of the monthly yields using the basic plot.
Next, we checked the difference of the yields, which can be used for the stochastic volatility models.
In our efforts to find the best model to fit this data, we first fitted the GARCH (Generalized Auto-Regressive Conditional Heteroskedasticity) model.
The 3-parameter GARCH(1,1) model has a maximized log likelihood of -33.89. We could simulate from a fitted GARCH model, but we choose not to do. It is because GARCH is a black-box model.
Instead of GARCH model, we tried a POMP (Partially Observed Markov Processes) model that allows us to try diverse hypotheses about how volatility behaves.
We first checked that we could indeed filter and re-estimate parameters successfully for the simulated data. We set up code to switch between different levels of computational intensity.
## se
## -539.67113182 0.03425802
In a second, using 36 cores of CPU on Great Lakes, we obtained a log likelihood estimate of -539.67 with a Monte Carlo standard error of 0.034.
Next, we tried out iterated filtering on the U.S. 10-year bond yield data. We used the IF2 algorithm of Ionides et al. (2015), implemented by mif2.
In this section, the investigation took 3 minutes on Great Lakes. The repeated stochastic maximization also shows us the geometry of the likelihood surface in a neighborhood of this point estimate:
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## -32.27 -27.83 -26.10 -27.07 -25.93 -25.75
In this section, the investigation took 10 minutes on Great Lakes. The best likelihood found was -25.71.
Plotting these diverse parameter estimates can help to give a feel for the global geometry of the likelihood surface.
We conducted an extended analysis for extracting the cyclic pattern for the data. Beginning with the Loess smoothing, we decomposed the Trend, Noise and Cycle for the data.
It is alleged that the 10-year bond yield has an association with CPI. We tried to check this. We used the data for CPI (Customer Price Index) growth rate compared to the previous period. The data is from Federal Reserve Bank of St. Louis.
We first used the Hodrick-Prescott filter to define the HP-detrended yield and CPI.
We computed the p-value from a likelihood ratio test based on the detrended data.
## [1] 0.4824025
The p-value is much larger than 0.05, so we could not have a clear statistical evidence for a positive association between detrended yield and detrended CPI.
To ensure that there is no association between two data, we drew a squared coherence plot. As shown below, we couldn’t see a significant peak for the plot.
We compared the GARCH and the POMP models. The POMP model performed better in terms of the maximum likelihood estimate. The GARCH model for the dataset showed a maximized likelihood of -33.89 with 3 fitted parameters. The POMP model has a maximized log likelihood of -25.71 with 6 fitted parameters. Due to the better performance and possibility of interpretation, we would prefer using POMP model.
As opposed to the common knowledge that there should be an association between the US 10-year bond yield rate and the CPI, we could not find a clear association. Naturally, we could not use the CPI for predicting the yield. It may be possible that other factors such as PMI (Purchasing Managers’ Index) or unemployment rates are associate with the yields.
[1] 10-year U.S. Treasury Bond Yield data
[2] Parsing 10-year federal note yield from the website
https://stackoverflow.com/questions/37952589/parsing-10-year-federal-note-yield-from-the-website
[3] Customer Price Index Data: Total All Items for the United States
https://fred.stlouisfed.org/tags/series?t=cpi%3Bmonthly
[4] Introduction to Risk-Based Capital (RBC) by National Association of Insurance Commissioners
https://content.naic.org/cipr_topics/topic_riskbased_capital.htm
[5] Prof. Edward Ionides’ Lecture Slides for STATS 531 (Winter 2021)