Activity 6 - State-space models
========================================================
This activity will explore the state-space framework for modeling time-series and spatial data sets. Chapter 8 provides a more in-depth description of the state-space model, but in a nutshell it is based on separating the process model, which describes how the system evolves in time or space, from the observation error model. Furthermore, the state-space model gets its name because it estimates the true values of the underlying **latent** state variables.
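Before writing any JAGS code, it can help to see these two pieces in a quick simulation (a sketch with arbitrary illustrative precisions, not part of the assignment): a latent random walk generates the true states, and noisy observations are layered on top.

```{r}
set.seed(42)
nt <- 100
tau_add.sim <- 10   ## process precision (arbitrary illustrative value)
tau_obs.sim <- 4    ## observation precision (arbitrary illustrative value)
x.sim <- numeric(nt)
for(t in 2:nt){
  x.sim[t] <- rnorm(1, x.sim[t-1], 1/sqrt(tau_add.sim))  ## process model step
}
y.sim <- rnorm(nt, x.sim, 1/sqrt(tau_obs.sim))           ## observation layer
plot(y.sim, pch="+", ylab="value"); lines(x.sim, lwd=2)  ## points = data, line = latent state
```

The state-space model's job is to run this logic in reverse: given only `y.sim`, recover `x.sim` and the two precisions.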
For this activity we will write all the code, process all the data, and visualize all the outputs in R, but the core of the Bayesian computation will be handled by JAGS (Just Another Gibbs Sampler, http://mcmc-jags.sourceforge.net). Therefore, before we get started you will want to download both the JAGS software and the rjags library, which allows R to call JAGS. We're also going to install our `ecoforecastR` package, which has some helper functions we will use.
```{r}
#install.packages("daymetr")
library(rjags)
library(rnoaa)
library(daymetr)
devtools::install_github("EcoForecast/ecoforecastR")
```
Next we'll want to grab the data we want to analyze. For this example we'll use the Google Flu Trends data for the state of Massachusetts, which we saw how to pull directly off the web in Activity 3.
```{r}
gflu = read.csv("http://www.google.org/flutrends/about/data/flu/us/data.txt",skip=11)
time = as.Date(gflu$Date)
y = gflu$Massachusetts
plot(time,y,type='l',ylab="Flu Index",lwd=2,log='y')
```
Next we'll want to define the JAGS code, which we'll do by writing the code as a string in R. The code itself has three components, the data model, the process model, and the priors. The data model relates the observed data, y, at any time point to the latent variable, x. For this example we'll assume that the observation model just consists of Gaussian observation error. The process model relates the state of the system at one point in time to the state one time step ahead. In this case we'll start with the simplest possible process model, a random walk, which just consists of Gaussian process error centered around the current value of the system.
$$X_{t+1} \sim N(X_{t},\tau_{add})$$
Finally, for the priors we need to define priors for the initial condition, the process error, and the observation error.
```{r}
RandomWalk = "
model{
#### Data Model
for(t in 1:n){
y[t] ~ dnorm(x[t],tau_obs)
}
#### Process Model
for(t in 2:n){
x[t]~dnorm(x[t-1],tau_add)
}
#### Priors
x[1] ~ dnorm(x_ic,tau_ic)
tau_obs ~ dgamma(a_obs,r_obs)
tau_add ~ dgamma(a_add,r_add)
}
"
```
Next we need to define the data and priors as a list. For this analysis we'll work with the log of the Google flu index since the zero-bound on the index and the magnitudes of the changes appear much closer to a log-normal distribution than to a normal.
```{r}
data <- list(y=log(y),n=length(y),x_ic=log(1000),tau_ic=100,a_obs=1,r_obs=1,a_add=1,r_add=1)
```
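As a quick sanity check on the log-normal intuition (simulated data here, not the flu index itself): a zero-bounded, right-skewed series becomes far more symmetric after a log transform.

```{r}
set.seed(99)
sim <- exp(rnorm(5000, 3, 1))                      ## log-normal draws: positive, right-skewed
skew <- function(v) mean(((v - mean(v))/sd(v))^3)  ## sample skewness
c(raw = skew(sim), logged = skew(log(sim)))        ## logged version is near symmetric
```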
Next we need to define the initial state of the model's parameters for each chain in the MCMC. The overall initialization is stored as a list the same length as the number of chains, where each chain is passed a list of the initial values for each parameter. Unlike the definition of the priors, which had to be done independent of the data, the initialization of the MCMC is allowed (and even encouraged) to use the data. However, each chain should be started from different initial conditions. We handle this below by basing the initial conditions for each chain off of a different random sample of the original data.
```{r}
nchain = 3
init <- list()
for(i in 1:nchain){
y.samp = sample(y,length(y),replace=TRUE)
init[[i]] <- list(tau_add=1/var(diff(log(y.samp))),tau_obs=5/var(log(y.samp)))
}
```
Now that we've defined the model, the data, and the initialization, we need to send all this info to JAGS, which will return the JAGS model object.
```{r}
j.model <- jags.model (file = textConnection(RandomWalk),
data = data,
inits = init,
n.chains = 3)
```
Next, given the defined JAGS model, we'll want to take a few samples from the MCMC chain and assess when the model has converged. To take samples from the MCMC object we'll need to tell JAGS what variables to track and how many samples to take.
```{r, fig.asp = 1.0}
## burn-in
jags.out <- coda.samples (model = j.model,
variable.names = c("tau_add","tau_obs"),
n.iter = 1000)
plot(jags.out)
```
Here we see that the model converges rapidly. Since rjags returns the samples as a CODA object, we can use any of the diagnostics in the R *coda* library to test for convergence, summarize the output, or visualize the chains.
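For example, the potential scale reduction factor reported by `coda::gelman.diag` can be sketched by hand in base R (the chains below are simulated stand-ins, not JAGS output):

```{r}
set.seed(1)
chains <- list(rnorm(1000), rnorm(1000))   ## two simulated, well-mixed "chains"
n.samp <- length(chains[[1]])
W <- mean(sapply(chains, var))             ## mean within-chain variance
B <- n.samp * var(sapply(chains, mean))    ## between-chain variance
psrf <- sqrt(((n.samp - 1)/n.samp * W + B/n.samp)/W)
psrf  ## values near 1 indicate convergence; > ~1.05 suggests running longer
```

In practice you would just call `gelman.diag(jags.out)` on the CODA object rather than computing this yourself.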
Now that the model has converged we'll want to take a much larger sample from the MCMC and include the full vector of X's in the output.
```{r}
jags.out <- coda.samples (model = j.model,
variable.names = c("x","tau_add","tau_obs"),
n.iter = 10000)
```
Given the full joint posterior samples, we're next going to visualize the output by just looking at the 95% credible interval of the time series of X's and compare that to the observed Y's. To do so we'll convert the coda output into a matrix and then calculate the quantiles. Looking at colnames(out) will show you that the first two columns are `tau_add` and `tau_obs`, so we calculate the CI starting from the 3rd column. We also transform the samples back from the log domain to the linear domain.
```{r}
time.rng = c(1,length(time)) ## adjust to zoom in and out
out <- as.matrix(jags.out)
x.cols <- grep("^x",colnames(out)) ## grab all columns that start with the letter x
ci <- apply(exp(out[,x.cols]),2,quantile,c(0.025,0.5,0.975)) ## model was fit on log scale
plot(time,ci[2,],type='n',ylim=range(y,na.rm=TRUE),ylab="Flu Index",log='y',xlim=time[time.rng])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng) < 100){
axis.Date(1, at=seq(time[time.rng[1]],time[time.rng[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci[1,],ci[3,],col=ecoforecastR::col.alpha("lightBlue",0.75))
points(time,y,pch="+",cex=0.5)
```
Next, let's look at the posterior distributions for `tau_add` and `tau_obs`, which we'll convert from precisions back into standard deviations.
```{r}
hist(1/sqrt(out[,1]),main=colnames(out)[1])
hist(1/sqrt(out[,2]),main=colnames(out)[2])
```
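A quick check of that precision-to-SD conversion (using an arbitrary precision, not one of the fitted values): draws generated with SD $1/\sqrt{\tau}$ should have variance near $1/\tau$.

```{r}
set.seed(123)
tau.demo <- 25                            ## arbitrary precision for illustration
x.demo <- rnorm(1e5, 0, 1/sqrt(tau.demo)) ## SD = 1/sqrt(tau) = 0.2
c(expected.var = 1/tau.demo, observed.var = var(x.demo))
```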
We'll also want to look at the joint distribution of the two parameters to check whether the two parameters strongly covary.
```{r, fig.asp = 1.0}
plot(out[,1],out[,2],pch=".",xlab=colnames(out)[1],ylab=colnames(out)[2])
cor(out[,1:2])
```
Assignment:
-----------
To look at how observation frequency affects data assimilation, convert 3 out of every 4 observations to NA (i.e. treat the data as approximately monthly) and refit the model.
### P.1. Reduce data, refit model.
```{r}
#subset the data so it is monthly (keep 1 of every 4 observations)
y.monthly = rep(NA,length(y))
keep = seq(4,length(y),by=4)
y.monthly[keep] = y[keep]
```
```{r}
RandomWalk.monthly = "
model{
#### Data Model
for(t in 1:n){
y.monthly[t] ~ dnorm(x[t],tau_obs)
}
#### Process Model
for(t in 2:n){
x[t]~dnorm(x[t-1],tau_add)
}
#### Priors
x[1] ~ dnorm(x_ic,tau_ic)
tau_obs ~ dgamma(a_obs,r_obs)
tau_add ~ dgamma(a_add,r_add)
}
"
```
```{r}
#define data and priors as a list
data.monthly <- list(y.monthly=log(y.monthly),n=length(y.monthly),x_ic=log(1000),tau_ic=100,a_obs=1,r_obs=1,a_add=1,r_add=1)
```
```{r}
#initialization of MCMC
nchain = 3
init.monthly <- list()
for(i in 1:nchain){
y.samp.monthly = sample(y.monthly,length(y.monthly),replace=TRUE)
init.monthly[[i]] <- list(tau_add=1/var(diff(log(y.samp.monthly))),tau_obs=5/var(log(y.samp.monthly)))
}
```
```{r}
#JAGS model
j.model.monthly <- jags.model (file = textConnection(RandomWalk.monthly),
data = data.monthly,
inits = init.monthly,
n.chains = 3)
```
```{r, fig.asp = 1.0}
#assess convergence: these plots show that the model does converge, but the posteriors are not as tight as in the weekly-data model
## burn-in
jags.out.monthly <- coda.samples (model = j.model.monthly,
variable.names = c("tau_add","tau_obs"),
n.iter = 1000)
plot(jags.out.monthly)
#now take a larger sample since the chains have converged
jags.out.monthly <- coda.samples (model = j.model.monthly,
variable.names = c("x","tau_add","tau_obs"),
n.iter = 10000)
```
### P.2. Generate a time-series plot for the CI of x that includes the observations (as above). Use a different color and symbol to differentiate observations that were included in the model versus those that were converted to NA's.
```{r}
#keep only the observations that were converted to NA in the monthly subset
y.na = y
y.na[seq(4,length(y),by=4)] = NA  ## blank out the values the model did see
```
```{r}
time.rng = c(1,length(time)) ## adjust to zoom in and out
out.monthly <- as.matrix(jags.out.monthly)
x.cols.monthly <- grep("^x",colnames(out.monthly)) ## grab all columns that start with the letter x
ci.monthly <- apply(exp(out.monthly[,x.cols.monthly]),2,quantile,c(0.025,0.5,0.975)) ## model was fit on log scale
plot(time,ci.monthly[2,],type='n',ylim=range(y.monthly,na.rm=TRUE),ylab="Flu Index",log='y',xlim=time[time.rng])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng) < 100){
axis.Date(1, at=seq(time[time.rng[1]],time[time.rng[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci.monthly[1,],ci.monthly[3,],col=ecoforecastR::col.alpha("lightBlue",0.75))
points(time,y.na, pch = 16,cex = 0.5, col = "red")
points(time,y.monthly,pch="+",cex=1)
legend("top", c("NA Values", "Monthly Values"), pch = c(16, 3), col = c("red", "black"), cex = 0.7, horiz = T, bty = "n")
```
### P.3. Compare the CI between the two runs.
```{r}
#comparing the confidence intervals for the weekly run vs the monthly run shows how much tighter and more precise the weekly model was. Though the two follow the same path, the monthly model has a much wider confidence interval.
plot(time,ci.monthly[2,],type='n',ylim=range(y.monthly,na.rm=TRUE),ylab="Flu Index",log='y',xlim=time[time.rng])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng) < 100){
axis.Date(1, at=seq(time[time.rng[1]],time[time.rng[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci.monthly[1,],ci.monthly[3,],col=ecoforecastR::col.alpha("lightBlue",0.75))
ecoforecastR::ciEnvelope(time,ci[1,],ci[3,],col=ecoforecastR::col.alpha("darkgreen",0.75))
legend("top", bty = "n", c("Original CI", "Monthly CI"), col = c("dark green", "light blue"), horiz = T, pch = 16)
```
### P.4. Generate a predicted (median) vs observed plot for the data points that were removed
```{r}
plot(ci.monthly[3,], y.na, pch = 21, bg = "light blue", col = "black",
main = "Predicted vs Observed", ylab = "Observed Values", xlab = "Predicted Values (Median)")
abline(0,1, lty = 3)
```
### P.5. Comment on the accuracy and precision of the state estimates.
> Looking at the CI comparison and the predicted vs observed plots, the state estimates were accurate but not very precise. The plot above shows that the predicted vs observed values generally follow the 1:1 line, but not perfectly. The CI comparison supports this: both intervals follow the same path (accuracy), but the monthly model has much wider intervals, indicating less precise estimates.
### P.6. How does the reduction in data volume affect the parameter estimates (taus)?
```{r}
hist(1/sqrt(out.monthly[,1]),main=colnames(out.monthly)[1])
hist(1/sqrt(out.monthly[,2]),main=colnames(out.monthly)[2])
```
```{r, fig.asp = 1.0}
plot(out.monthly[,1],out.monthly[,2],pch=".",xlab=colnames(out.monthly)[1],ylab=colnames(out.monthly)[2])
cor(out.monthly[,1:2])
```
> The reduction in data seems to shift the tau posteriors to the left a bit. This was also observed in the predicted vs observed plot, where the values clustered toward the lower left corner of the plot. The correlation between the two taus also changed sign: in the original full model it was positive (~0.07), but in the reduced model it is negative (~-0.05).
Extra Credit (Part 1):
----------------------
Return to the original data and instead of removing 3/4 of the data remove the last 40 observations (convert to NA) and refit the model to make a forecast for this period
### Refit model
```{r}
#remove the last 40 observations (convert to NA), and keep just those removed values separately for later
y.last40 <- y
y.last40na <- y
y.last40[(length(y)-39):length(y)] <- NA   ## forecast period: last 40 obs hidden from the model
y.last40na[1:(length(y)-40)] <- NA         ## only the held-out observations remain
```
```{r}
RandomWalk.last40 = "
model{
#### Data Model
for(t in 1:n){
y.last40[t] ~ dnorm(x[t],tau_obs)
}
#### Process Model
for(t in 2:n){
x[t]~dnorm(x[t-1],tau_add)
}
#### Priors
x[1] ~ dnorm(x_ic,tau_ic)
tau_obs ~ dgamma(a_obs,r_obs)
tau_add ~ dgamma(a_add,r_add)
}
"
```
```{r}
#define data and priors as a list
data.last40 <- list(y.last40=log(y.last40),n=length(y.last40),x_ic=log(1000),tau_ic=100,a_obs=1,r_obs=1,a_add=1,r_add=1)
```
```{r}
#initialization of MCMC
nchain = 3
init.last40 <- list()
for(i in 1:nchain){
y.samp.last40 = sample(y.last40,length(y.last40),replace=TRUE)
init.last40[[i]] <- list(tau_add=1/var(diff(log(y.samp.last40))),tau_obs=5/var(log(y.samp.last40)))
}
```
```{r}
#JAGS model
j.model.last40 <- jags.model (file = textConnection(RandomWalk.last40),
data = data.last40,
inits = init.last40,
n.chains = 3)
```
```{r, fig.asp = 1.0}
#assess convergence: these plots show that the model does converge, but the posteriors are not as tight as in the full weekly-data model
## burn-in
jags.out.last40 <- coda.samples (model = j.model.last40,
variable.names = c("tau_add","tau_obs"),
n.iter = 1000)
plot(jags.out.last40)
#now take a larger sample since the chains have converged
jags.out.last40 <- coda.samples (model = j.model.last40,
variable.names = c("x","tau_add","tau_obs"),
n.iter = 10000)
```
### Generate a time-series plot for the CI of x that includes the observations (as above but zoom the plot on the last ~80 observations). Use a different color and symbol to differentiate observations that were included in the model versus those that were converted to NA's.
```{r}
time.rng.last40 = c(540,length(time)) ## adjust to zoom in and out
out.last40 <- as.matrix(jags.out.last40)
x.cols.last40 <- grep("^x",colnames(out.last40)) ## grab all columns that start with the letter x
ci.last40 <- apply(exp(out.last40[,x.cols.last40]),2,quantile,c(0.025,0.5,0.975)) ## model was fit on log scale
plot(time,ci.last40[2,],type='n',ylim=range(y.last40,na.rm=TRUE), ylab="Flu Index",log='y',xlim=time[time.rng.last40])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng.last40) < 100){
axis.Date(1, at=seq(time[time.rng.last40[1]],time[time.rng.last40[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci.last40[1,],ci.last40[3,],col=ecoforecastR::col.alpha("lightBlue",0.75))
points(time,y.last40na, pch = 16,cex = 0.5, col = "red")
points(time,y.last40,pch="+",cex=1)
legend("top", c("NA Values", "Original Values"), pch = c(16, 3), col = c("red", "black"), cex = 0.7, horiz = T, bty = "n")
```
### Comment on how well the random walk model performed (both accuracy and precision) and how it might be modified to improve both these criteria.
```{r}
plot(ci.last40[3,], y.last40na, pch = 21, bg = "light blue", col = "black",
main = "Predicted vs Observed", ylab = "Observed Values", xlab = "Predicted Values (Median)")
abline(0,1, lty = 3)
```
> The random walk model did not perform very well at all. It was neither accurate nor precise, which is clear from the predicted vs observed plot, as well as from how the CI completely blows up over the last 40 values. The model has essentially no information to constrain these values. This could be improved by adding more information about how the system evolves, which would require something more complex than a random walk; the model needs structure that lets it predict forward, not just fill gaps in the data.
# Dynamic Linear Models
The random walk model can easily be generalized to more sophisticated models describing the dynamics of the system. One simple but useful extension is the class of dynamic linear models (DLMs) -- linear models where the future state depends on the current state and other covariates, $z_t$
$$X_{t+1} \sim N(\beta_0 + \beta_1 z_t + \beta_{IC} X_{t}, \tau_{add})$$
where $\beta_0$ is the intercept, $\beta_1$ is the slope of the covariate effect, and $\beta_{IC}$ is the slope of the initial condition effect. Rather than implement this model in JAGS directly, we're going to rely on the ecoforecastR package, which accepts an `lm`-like syntax for specifying covariates (with the notable exception that the response variable, which is our latent X, is not specified explicitly). Here we're going to use the Daymet product to get daily weather estimates, and then use daily minimum temperature (Tmin) as the covariate in our influenza model.
```{r}
## grab weather data
df <- daymetr::download_daymet(site = "Boston",
lat = 42.36,
lon = -71.06,
start = 2003,
end = 2016,
internal = TRUE)$data
df$date <- as.Date(paste(df$year,df$yday,sep = "-"),"%Y-%j")
data$Tmin = df$tmin..deg.c.[match(time,df$date)]
## fit the model
ef.out <- ecoforecastR::fit_dlm(model=list(obs="y",fixed="~ Tmin"),data)
names(ef.out)
```
The package returns a list with four elements. `params` and `predict` are both the same mcmc.list objects we get back from JAGS, only split between the parameters and the latent state variables, respectively, to make it easier to perform diagnostics and visualizations:
```{r, fig.asp = 1.0}
## parameter diagnostics
params <- window(ef.out$params,start=1000) ## remove burn-in
plot(params)
summary(params)
cor(as.matrix(params))
pairs(as.matrix(params))
## confidence interval
out.ef <- as.matrix(ef.out$predict)
ci.ef <- apply(exp(out.ef),2,quantile,c(0.025,0.5,0.975))
plot(time,ci.ef[2,],type='n',ylim=range(y,na.rm=TRUE),ylab="Flu Index",log='y',xlim=time[time.rng])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng) < 100){
axis.Date(1, at=seq(time[time.rng[1]],time[time.rng[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci.ef[1,],ci.ef[3,],col=ecoforecastR::col.alpha("lightBlue",0.75))
points(time,y,pch="+",cex=0.5)
```
The JAGS model that was fit 'under the hood' is returned as `model` which we can view as:
```{r, echo=FALSE}
strsplit(ef.out$model,"\n",fixed = TRUE)[[1]]
```
This code illustrates a few things:
* The "Priors" section is identical to our earlier random walk model
* The "Random Effects" section, which is currently commented out, illustrates that the `ecoforecastR::fit_dlm` function supports random effects, which can be turned on via the `model$random` argument
* The "Fixed Effects" section contains additional priors for our fixed effects as well as priors on the means (mu) and precisions (tau) of the covariates.
* The "Data Model" section is the same as in our random walk except for the addition of code for the means of the covariates. This code is here as a very simple missing data model -- any time the covariate is observed it is used to estimate the mean and precision, but any time the covariate is missing (NA) it is imputed.
* The "Process Model" is very similar to the random walk, except now the expected value (mu) is calculated according to the linear model described earlier
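The gist of that simple missing data model can be sketched in base R (a toy covariate vector, not the actual JAGS code): the observed values inform a mean and SD, and missing values are drawn from that distribution rather than dropped.

```{r}
set.seed(7)
z <- c(3.1, NA, 2.8, 3.4, NA, 3.0)   ## toy covariate with gaps
z.imputed <- z
## impute the missing values with draws informed by the observed values
z.imputed[is.na(z)] <- rnorm(sum(is.na(z)), mean(z, na.rm=TRUE), sd(z, na.rm=TRUE))
z.imputed                            ## no NAs remain
```

In the JAGS version this happens automatically within the MCMC, so the imputed values also carry their full posterior uncertainty.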
Finally, the returned object also includes the `data` that was used to fit the model.
Assignment:
-----------
### Compare the process and observation error estimates and model CI between this fit and the original random walk model. How much has the residual variance been reduced by?
```{r}
plot(time,ci[2,],type='n',ylim=range(y,na.rm=TRUE),ylab="Flu Index",log='y',xlim=time[time.rng])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng) < 100){
axis.Date(1, at=seq(time[time.rng[1]],time[time.rng[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci[1,],ci[3,],col=ecoforecastR::col.alpha("darkgreen",0.75))
ecoforecastR::ciEnvelope(time,ci.ef[1,],ci.ef[3,],col=ecoforecastR::col.alpha("lightBlue",0.6))
legend("top", bty = "n", c("RW CI", "EF CI"), col = c("dark green", "light blue"), horiz = T, pch = 16)
```
>The EF model clearly has a much tighter CI than the RW (random walk) model, indicating that adding the covariate produced more accurate and more precise estimates of the data.
### Because a state-space model returns X's that are close to the Y's, metrics such as R2 and RMSE aren't great metrics of model performance. Besides looking at the taus, how else could we judge which model is doing better (in a way that avoids/penalizes overfitting)?
> A state-space model's performance can be evaluated further by withholding some data points throughout the time series and checking how well the model predicts them (out-of-sample validation), which penalizes overfitting.
### Explain and discuss the parameter estimates (betas) from the linear model (what do they mean both biologically and in terms of the predictability of the system) and their correlations
>The beta estimates from this linear model describe the relationship between the covariate (minimum temperature) and the dependent variable (flu index). betaTmin has a value of ~-0.002, implying a negative relationship between flu and minimum temperature. This is expected: as temperature decreases, flu increases. The beta intercept value of ~0.32 simply tells us where the flu index sits when temperature is zero. The beta coefficient on the initial condition (~0.95) tells us there is a strong positive dependence of each state on the previous one. I would say betaTmin and betaIC are the most important coefficients here, especially biologically, because we know there is some (but small) influence of low temperatures on the flu, and an even stronger dependence on the previous state of the system.
Extra Credit (Part 2):
----------------------
### Repeat the process of forecasting the last 40 observations (convert to NA), this time using the DLM with temperature as a covariate
```{r}
data.last40$Tmin = df$tmin..deg.c.[match(time,df$date)]
## fit the model
ef.out.last40 <- ecoforecastR::fit_dlm(model=list(obs="y.last40",fixed="~ Tmin"),data.last40)
names(ef.out.last40)
```
### Generate a time-series plot for the CI that includes the observations and both the random walk and DLM models (Hint: think about the order you plot in so you can see both models; also consider including transparency [alpha] in the CI color)
```{r, fig.asp = 1.0}
## parameter diagnostics
params.last40 <- window(ef.out.last40$params,start=1000) ## remove burn-in
## confidence interval
out.ef.last40 <- as.matrix(ef.out.last40$predict)
ci.ef.last40 <- apply(exp(out.ef.last40),2,quantile,c(0.025,0.5,0.975))
plot(time, ci.ef.last40[2,],type='n',ylim=range(y.last40,na.rm=TRUE),ylab="Flu Index",log='y',xlim=time[time.rng.last40])
## adjust x-axis label to be monthly if zoomed
if(diff(time.rng.last40) < 100){
axis.Date(1, at=seq(time[time.rng.last40[1]],time[time.rng.last40[2]],by='month'), format = "%Y-%m")
}
ecoforecastR::ciEnvelope(time,ci.last40[1,],ci.last40[3,],col=ecoforecastR::col.alpha("darkgreen", 0.7))
ecoforecastR::ciEnvelope(time,ci.ef.last40[1,],ci.ef.last40[3,],col=ecoforecastR::col.alpha("lightBlue", 0.6))
points(time,y.last40na, pch = 16,cex = 0.5, col = "red")
points(time,y.last40,pch="+",cex=0.5)
legend("top", c("NA Values", "Original Values", "RW CI", "EF CI"), pch = c(16, 3, 15,15), col = c("red", "black","dark green", "light blue"), horiz = T, bty = "n")
```
### Comment on how well the DLM model performed (both accuracy and precision) relative to the random walk and the true observations. How could the model be further improved?
> The DLM model was more accurate than the RW model, but it wasn't very precise. The wide CI got significantly smaller around the data points, but it still doesn't follow the trend completely like it should. One striking thing about the DLM model is that it did not capture the peak in the data, while the RW model (mostly) did. I'm not sure whether it is better to have a model with less variance that misses a few points, or a model with a very wide CI that captures them all.
> The DLM model was more accurate than the RW model, but it wasnt very precise. The wide CI got significantly smaller around the data points, but its still not following the trend completely like it should be. One startling thing about the DLS model is that it did not capture that peak of data, but the RW model did (mostly). I'm not sure if it would be better to have a model with less variance that missed a few points, or a model with a very wide CI that caputured it all.