A Simple Example: Multilevel Manifest AR(1) Model

library(mlts)

One of the simplest models we can fit with mlts is a multilevel first order autoregressive model with only one observed variable. We start by specifying the model with mlts_model(). The argument q controls the number of time-series constructs. For this simple model, the following call is sufficient:

ar1_model <- mlts_model(q = 1)

We can check the parameters present in the model by just calling the object:

ar1_model
#>        Model   Level             Type                   Param
#> 1 Structural  Within     Fixed effect                    mu_1
#> 2 Structural  Within     Fixed effect               phi(1)_11
#> 3 Structural  Within     Fixed effect             ln.sigma2_1
#> 4 Structural Between Random effect SD              sigma_mu_1
#> 5 Structural Between Random effect SD         sigma_phi(1)_11
#> 6 Structural Between Random effect SD       sigma_ln.sigma2_1
#> 7 Structural Between   RE correlation        r_mu_1.phi(1)_11
#> 8 Structural Between   RE correlation      r_mu_1.ln.sigma2_1
#> 9 Structural Between   RE correlation r_phi(1)_11.ln.sigma2_1
#>               Param_Label isRandom prior_type prior_location prior_scale
#> 1                   Trait        1     normal              0        10.0
#> 2                 Dynamic        1     normal              0         2.0
#> 3 Log Innovation Variance        1     normal              0        10.0
#> 4                   Trait        0     cauchy              0         2.5
#> 5                 Dynamic        0     cauchy              0         2.5
#> 6 Log Innovation Variance        0     cauchy              0         2.5
#> 7                  RE Cor        0        LKJ              1          NA
#> 8                  RE Cor        0        LKJ              1          NA
#> 9                  RE Cor        0        LKJ              1          NA

When mlts_model() sets up this model, all model parameters are free (i.e., not constrained) by default. On the within level, there are three fixed effects: the grand mean (mu_1) of the outcome, the autoregressive effect (phi(1)_11), and the natural log of the innovation variance (ln.sigma2_1). The (1) in brackets for the autoregressive effect parameter indicates a lag of first order, and the _11 subscript denotes that the first construct is as well predicted by the first construct. Note that for the innovation variance, the natural log is used to prevent the variance from dropping below zero. For each of these effects, random effects are also estimated on the between level, which are drawn from a multivariate normal distribution with zero mean. Standard deviations of random effects are indicated with a sigma_ prefix, and random effect correlations are indicated with an r_ prefix.

A TeX formula for the above model can be obtained by calling the mlts_model_formula() function on the model object. By default, the function produces an RMarkdown file and renders it to a pdf file using knitr. However, the TeX file can also be kept by calling keep_tex = TRUE within the function call.

mlts_model_formula(ar1_model)
\[\text{Decomposition.}\]
\[ \begin{gathered} \begin{bmatrix} y_{1, it} \\ \end{bmatrix} = \begin{bmatrix} \mu_{1,i} \\ \end{bmatrix} + \begin{bmatrix} y_{1, it}^w \\ \end{bmatrix} \end{gathered} \]
\[\text{Within-level model.}\]
\[ \begin{gathered} \begin{bmatrix} y_{1, it}^w \\ \end{bmatrix} = \begin{bmatrix} \phi_{(1)11,i} \\ \end{bmatrix} \begin{bmatrix} y_{1,i(t - 1)}^w \\ \end{bmatrix} + \begin{bmatrix} \zeta_{1, it} \\ \end{bmatrix} ,\\ \text{with}~\zeta_{1,it} \sim \mathit{N}(0, \sigma^2_{\zeta_{1},i}) \end{gathered} \]
\[\text{Between-level model.}\]

\[ \begin{gathered} \begin{bmatrix} \mu_{1,i}\\ \phi_{(1)11,i}\\ \ln(\sigma^2_{\zeta_{1},i})\\ \end{bmatrix} = \begin{bmatrix} \gamma_{0,\mu_{1}}\\ \gamma_{0,\phi_{(1)11}}\\ \gamma_{0,\ln(\sigma^2_{\zeta_{1}})}\\ \end{bmatrix} + \begin{bmatrix} \upsilon_{\mu_{1},i}\\ \upsilon_{\phi_{(1)11},i}\\ \upsilon_{\ln(\sigma^2_{\zeta_{1}}),i}\\ \end{bmatrix} ,\\ \text{with}~ \upsilon_{i} \sim \mathit{MVN}(\mathbf{0}, \mathbf{\Omega}) \end{gathered} \]

Furthermore, a path model can also be produced with the function mlts_model_paths(). Again, the function produces an RMarkdown and pdf file by default. A png file for each level can also be produced by calling add_png = TRUE and the TeX code for the path model can be kept with keep_tex = TRUE.

mlts_model_paths(ar1_model)

By path model convention, rectangles represent observed variables and circles represent latent (i.e., unobserved) variables. A single-headed arrow represents a directed path (i.e., a regression), a double-headed arrow represents (co-)variation, and a dot on a path indicates that this parameter can vary across between-level units. To fit the above model, we pass it together with the data set to mlts_fit(). The data set for this example is an artificial data set simulated from an autoregressive model:

head(ar1_data)
#>   ID time    Y1
#> 1  1    1  1.16
#> 2  1    2 -0.29
#> 3  1    3  0.40
#> 4  1    4 -0.18
#> 5  1    5 -0.66
#> 6  1    6  0.42

We need to specify the variable in data that contains the time-series process in the ts argument and the variable that contains the unit identifier in the id argument. With the argument tinterval, the time interval for approximation of a continuous time process can be specified Asparouhov et al. (2018). We don’t specify it here, but see the Vignette Approximation of a Continuous Time Model for more details.

ar1_fit <- mlts_fit(
  model = ar1_model,
  data = ar1_data,
  id = "ID",
  ts = "Y1",
  iter = 4000
)

The model summary() shows general information about the model and data:

summary(ar1_fit)
#> Call:
#> mlts_model(q = 1, max_lag = 1)
#> Time series variables as indicated by parameter subscripts: 
#>    1 --> Y1
#> Data: 2500 observations in 50 IDs
#> Model convergence criteria: 
#>   Maximum Potential Scale Reduction Factor (PSR; Rhat): 1.009 (should be < 1.01)
#>   Minimum Bulk ESS: 575 (should be > 200, 100 per chain) 
#>   Minimum Tail ESS: 566 (should be > 200, 100 per chain) 
#>   Number of divergent transitions: 0 (should be 0) 
#> 
#> Fixed Effects:
#>              Post. Mean Post. Median Post. SD   2.5%  97.5% Rhat Bulk_ESS
#>         mu_1      0.731        0.731    0.090  0.552  0.910    1     5357
#>    phi(1)_11      0.275        0.276    0.029  0.218  0.331    1     2433
#>  ln.sigma2_1     -0.304       -0.304    0.043 -0.392 -0.217    1     2964
#>  Tail_ESS
#>      2695
#>      3051
#>      3013
#> 
#> Random Effects SDs:
#>              Post. Mean Post. Median Post. SD  2.5% 97.5%  Rhat Bulk_ESS
#>         mu_1      0.600        0.595    0.068 0.481 0.749 1.000     5042
#>    phi(1)_11      0.134        0.134    0.032 0.069 0.198 1.009      582
#>  ln.sigma2_1      0.231        0.229    0.045 0.147 0.323 1.002     1266
#>  Tail_ESS
#>      3082
#>       566
#>      1376
#> 
#> Random Effects Correlations:
#>                        Post. Mean Post. Median Post. SD   2.5% 97.5%  Rhat
#>         mu_1.phi(1)_11      0.017        0.018    0.213 -0.396 0.435 1.001
#>       mu_1.ln.sigma2_1      0.217        0.224    0.187 -0.159 0.569 1.000
#>  phi(1)_11.ln.sigma2_1      0.114        0.118    0.260 -0.391 0.607 1.000
#>  Bulk_ESS Tail_ESS
#>      2116     2477
#>      2398     2726
#>      1056     1736
#> 
#> Samples were drawn using NUTS on Thu Apr 25 12:39:28 2024.
#> For each parameter, Bulk_ESS and Tail_ESS are measures of effective
#> sample size, and Rhat is the potential scale reduction factor
#> on split chains (at convergence, Rhat = 1).

The line Time series variables as indicated by parameter subscripts: 1 --> Y1 shows that model parameters indexed by a 1 refer to the variable Y1 in the data set. The Model convergence criteria provide an overview across convergence diagnostics for all model parameters (i.e., also parameters which are not printed in the summary() by default). For the simple AR1-model, all parameters converged well after 4,000 iterations.

The section Fixed Effects provides information about the fixed effects in the model, i.e., \(\gamma_{0, \mu_1}\), \(\gamma_{0, \phi_{(1)11}}\), and \(\gamma_{0,\ln(\sigma^2_{\zeta_{1}})}\) in the above formula. For example, the posterior mean of the autoregressive effect parameter phi(1)_11 is estimated at .275 with 95%-credible interval [.218, .331]. The log variance of the innovations \(\zeta_{1t}\) is estimated at -.304. The estimate needs to be exponentiated to be on the original scale: exp(-.304) = 0.738.

The section Random Effects SDs shows standard deviations of the random effects \(\upsilon_{\mu_{1},i}\), \(\upsilon_{\phi_{(1)11},i}\), and \(\upsilon_{\ln(\sigma^2_{\zeta_{1}}),i}\). The section Random Effects Correlations shows correlations between random effects. For example, while random effects of the person mean \(\upsilon_{\mu_{1},i}\) and the autoregressive effect \(\upsilon_{\phi_{(1)11},i}\) display nearly no correlation, 0.017, there is a positive correlation between the person mean and log innovation variance, 0.217. This indicates that individuals with a higher person mean in the variable Y1 also tend to have a higher innovation variance.

References

Asparouhov, T., Hamaker, E. L., & Muthén, B. (2018). Dynamic Structural Equation Models. Structural Equation Modeling: A Multidisciplinary Journal, 25(3), 359–388. https://doi.org/10.1080/10705511.2017.1406803