Strengths: Many techniques from the class are tried out in a clearly presented analysis.
Points for consideration:
The sample ACF is hard to interpret when the data are not well modeled by a stationary process. Maybe subtract off an estimated trend first?
Sec 3. Improperly described and conducted hypothesis test - the choice of 0.1 as a significance level is unusual, and if you reject the null then you reject ARIMA(0,1,1) rather than supporting it.
Plotting data against fitted values can be a weak test of model specification. Even plots that look like a close fit may not actually support that. For example, just predicting time n by the value at time \(n-1\) will often look like a good fit when plotted in thie way—your model should do better than a simple and obvious forecast if you want to show it is doing a good job.
What role does auto.arima()
play in your logical reasoning? You don’t say what it does, or how its output should be compared to your other lines of reasoning.
Inconsistent values in the AIC table (not mathematically possible under accurate likelihood maximization and evaluation) should be noted and discussed.
References should be cited where used in the text, to make clear which parts correspond to which reference.
A linear trend is fitted, but maybe a nonlinear trend would be appropriate?
Having gone to the trouble of identifing a trend model, why do differencing rather than detrending?
Error: “the absolute value of root is less than 1 and within the circle, so the ARIMA(0,1,1) is invertable”. You have plotted the inverse root, which is within the circle, so the root itself is outside, which is needed for invertibility.
Some unprocessed R output is presented in the report.
Saying “Autocorrelation by year” makes it seem that the unit of time is year, when it is months in your sample ACF plot.
The lowest AIC is ARMA(2,1). This is incorrectly reported as ARMA(1,2) and the analysis continues to investigate ARMA(1,2) instead.
Inconsistent use of \(\Delta\) and \(\triangledown\) to denote difference.