Skip to contents

Model

The measurement model is given by 𝐲i,t=π›Ž+πš²π›ˆi,t+𝛆i,t,with𝛆i,tβˆΌπ’©(𝟎,𝚯)\begin{equation} \mathbf{y}_{i, t} = \boldsymbol{\nu} + \boldsymbol{\Lambda} \boldsymbol{\eta}_{i, t} + \boldsymbol{\varepsilon}_{i, t}, \quad \mathrm{with} \quad \boldsymbol{\varepsilon}_{i, t} \sim \mathcal{N} \left( \mathbf{0}, \boldsymbol{\Theta} \right) \end{equation} where 𝐲i,t\mathbf{y}_{i, t}, π›ˆi,t\boldsymbol{\eta}_{i, t}, and 𝛆i,t\boldsymbol{\varepsilon}_{i, t} are random variables and π›Ž\boldsymbol{\nu}, 𝚲\boldsymbol{\Lambda}, and 𝚯\boldsymbol{\Theta} are model parameters. 𝐲i,t\mathbf{y}_{i, t} represents a vector of observed random variables, π›ˆi,t\boldsymbol{\eta}_{i, t} a vector of latent random variables, and 𝛆i,t\boldsymbol{\varepsilon}_{i, t} a vector of random measurement errors, at time tt and individual ii. π›Ž\boldsymbol{\nu} denotes a vector of intercepts, 𝚲\boldsymbol{\Lambda} a matrix of factor loadings, and 𝚯\boldsymbol{\Theta} the covariance matrix of 𝛆\boldsymbol{\varepsilon}.

An alternative representation of the measurement error is given by 𝛆i,t=𝚯12𝐳i,t,with𝐳i,tβˆΌπ’©(𝟎,𝐈)\begin{equation} \boldsymbol{\varepsilon}_{i, t} = \boldsymbol{\Theta}^{\frac{1}{2}} \mathbf{z}_{i, t}, \quad \mathrm{with} \quad \mathbf{z}_{i, t} \sim \mathcal{N} \left( \mathbf{0}, \mathbf{I} \right) \end{equation} where 𝐳i,t\mathbf{z}_{i, t} is a vector of independent standard normal random variables and (𝚯12)(𝚯12)β€²=𝚯\left( \boldsymbol{\Theta}^{\frac{1}{2}} \right) \left( \boldsymbol{\Theta}^{\frac{1}{2}} \right)^{\prime} = \boldsymbol{\Theta} .

The dynamic structure is given by dπ›ˆi,t=𝚽(π›ˆi,tβˆ’π›)dt+𝚺12d𝐖i,t\begin{equation} \mathrm{d} \boldsymbol{\eta}_{i, t} = \boldsymbol{\Phi} \left( \boldsymbol{\eta}_{i, t} - \boldsymbol{\mu} \right) \mathrm{d}t + \boldsymbol{\Sigma}^{\frac{1}{2}} \mathrm{d} \mathbf{W}_{i, t} \end{equation} where 𝛍\boldsymbol{\mu} is the long-term mean or equilibrium level, 𝚽\boldsymbol{\Phi} is the rate of mean reversion, determining how quickly the variable returns to its mean, 𝚺\boldsymbol{\Sigma} is the matrix of volatility or randomness in the process, and d𝐖\mathrm{d}\boldsymbol{W} is a Wiener process or Brownian motion, which represents random fluctuations.

Data Generation

Notation

Let t=1000t = 1000 be the number of time points and n=1000n = 1000 be the number of individuals.

Let the measurement model intecept vector π›Ž\boldsymbol{\nu} be given by

π›Ž=(000).\begin{equation} \boldsymbol{\nu} = \left( \begin{array}{c} 0 \\ 0 \\ 0 \\ \end{array} \right) . \end{equation}

Let the factor loadings matrix 𝚲\boldsymbol{\Lambda} be given by

𝚲=(100010001).\begin{equation} \boldsymbol{\Lambda} = \left( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ \end{array} \right) . \end{equation}

Let the measurement error covariance matrix 𝚯\boldsymbol{\Theta} be given by

𝚯=(0.20000.20000.2).\begin{equation} \boldsymbol{\Theta} = \left( \begin{array}{ccc} 0.2 & 0 & 0 \\ 0 & 0.2 & 0 \\ 0 & 0 & 0.2 \\ \end{array} \right) . \end{equation}

Let the initial condition π›ˆ0\boldsymbol{\eta}_{0} be given by

π›ˆ0βˆΌπ’©(π›π›ˆβˆ£0,πšΊπ›ˆβˆ£0)\begin{equation} \boldsymbol{\eta}_{0} \sim \mathcal{N} \left( \boldsymbol{\mu}_{\boldsymbol{\eta} \mid 0}, \boldsymbol{\Sigma}_{\boldsymbol{\eta} \mid 0} \right) \end{equation}

π›π›ˆβˆ£0=(000)\begin{equation} \boldsymbol{\mu}_{\boldsymbol{\eta} \mid 0} = \left( \begin{array}{c} 0 \\ 0 \\ 0 \\ \end{array} \right) \end{equation}

πšΊπ›ˆβˆ£0=(0.34251480.32960230.03438170.32960230.56646250.25459550.03438170.25459550.2999909).\begin{equation} \boldsymbol{\Sigma}_{\boldsymbol{\eta} \mid 0} = \left( \begin{array}{ccc} 0.3425148 & 0.3296023 & 0.0343817 \\ 0.3296023 & 0.5664625 & 0.2545955 \\ 0.0343817 & 0.2545955 & 0.2999909 \\ \end{array} \right) . \end{equation}

Let the long-term mean vector 𝛍\boldsymbol{\mu} be given by

𝛍=(000).\begin{equation} \boldsymbol{\mu} = \left( \begin{array}{c} 0 \\ 0 \\ 0 \\ \end{array} \right) . \end{equation}

Let the rate of mean reversion matrix 𝚽\boldsymbol{\Phi} be given by

𝚽=(βˆ’0.357000.771βˆ’0.5110βˆ’0.450.729βˆ’0.693).\begin{equation} \boldsymbol{\Phi} = \left( \begin{array}{ccc} -0.357 & 0 & 0 \\ 0.771 & -0.511 & 0 \\ -0.45 & 0.729 & -0.693 \\ \end{array} \right) . \end{equation}

Let the dynamic process noise covariance matrix 𝚺\boldsymbol{\Sigma} be given by

𝚺=(0.24455560.0220159βˆ’0.05004760.02201590.0706780.0153946βˆ’0.05004760.01539460.0755306).\begin{equation} \boldsymbol{\Sigma} = \left( \begin{array}{ccc} 0.2445556 & 0.0220159 & -0.0500476 \\ 0.0220159 & 0.070678 & 0.0153946 \\ -0.0500476 & 0.0153946 & 0.0755306 \\ \end{array} \right) . \end{equation}

Let Ξ”t=0.1\Delta t = 0.1.

R Function Arguments

n
#> [1] 1000
time
#> [1] 1000
delta_t
#> [1] 0.1
mu0
#> [1] 0 0 0
sigma0
#>           [,1]      [,2]      [,3]
#> [1,] 0.3425148 0.3296023 0.0343817
#> [2,] 0.3296023 0.5664625 0.2545955
#> [3,] 0.0343817 0.2545955 0.2999909
sigma0_l # sigma0_l <- t(chol(sigma0))
#>            [,1]      [,2]      [,3]
#> [1,] 0.58524763 0.0000000 0.0000000
#> [2,] 0.56318429 0.4992855 0.0000000
#> [3,] 0.05874726 0.4436540 0.3157701
mu
#> [1] 0 0 0
phi
#>        [,1]   [,2]   [,3]
#> [1,] -0.357  0.000  0.000
#> [2,]  0.771 -0.511  0.000
#> [3,] -0.450  0.729 -0.693
sigma
#>             [,1]       [,2]        [,3]
#> [1,]  0.24455556 0.02201587 -0.05004762
#> [2,]  0.02201587 0.07067800  0.01539456
#> [3,] -0.05004762 0.01539456  0.07553061
sigma_l # sigma_l <- t(chol(sigma))
#>             [,1]      [,2]     [,3]
#> [1,]  0.49452559 0.0000000 0.000000
#> [2,]  0.04451917 0.2620993 0.000000
#> [3,] -0.10120330 0.0759256 0.243975
nu
#> [1] 0 0 0
lambda
#>      [,1] [,2] [,3]
#> [1,]    1    0    0
#> [2,]    0    1    0
#> [3,]    0    0    1
theta
#>      [,1] [,2] [,3]
#> [1,]  0.2  0.0  0.0
#> [2,]  0.0  0.2  0.0
#> [3,]  0.0  0.0  0.2
theta_l # theta_l <- t(chol(theta))
#>           [,1]      [,2]      [,3]
#> [1,] 0.4472136 0.0000000 0.0000000
#> [2,] 0.0000000 0.4472136 0.0000000
#> [3,] 0.0000000 0.0000000 0.4472136

Visualizing the Dynamics Without Measurement Error and Process Noise (n = 5 with Different Initial Condition)

Using the SimSSMOUFixed Function from the simStateSpace Package to Simulate Data

library(simStateSpace)
sim <- SimSSMOUFixed(
  n = n,
  time = time,
  delta_t = delta_t,
  mu0 = mu0,
  sigma0_l = sigma0_l,
  mu = mu,
  phi = phi,
  sigma_l = sigma_l,
  nu = nu,
  lambda = lambda,
  theta_l = theta_l,
  type = 0
)
data <- as.data.frame(sim)
head(data)
#>   id time          y1         y2         y3
#> 1  1  0.0  0.45047365 -1.2185550  0.6708638
#> 2  1  0.1 -0.84190461  0.1242594  0.1813493
#> 3  1  0.2  0.47289651 -0.2398710  0.5995107
#> 4  1  0.3  0.04089817 -0.6547109  0.4674929
#> 5  1  0.4 -1.37222946 -0.2010512 -0.3341885
#> 6  1  0.5 -0.62410106  0.5262251 -0.1892906
summary(data)
#>        id              time             y1                  y2           
#>  Min.   :   1.0   Min.   : 0.00   Min.   :-3.597165   Min.   :-4.091221  
#>  1st Qu.: 250.8   1st Qu.:24.98   1st Qu.:-0.487677   1st Qu.:-0.579974  
#>  Median : 500.5   Median :49.95   Median : 0.005842   Median : 0.009244  
#>  Mean   : 500.5   Mean   :49.95   Mean   : 0.005679   Mean   : 0.007098  
#>  3rd Qu.: 750.2   3rd Qu.:74.92   3rd Qu.: 0.500799   3rd Qu.: 0.595597  
#>  Max.   :1000.0   Max.   :99.90   Max.   : 3.447253   Max.   : 4.187849  
#>        y3           
#>  Min.   :-3.668061  
#>  1st Qu.:-0.474746  
#>  Median : 0.001988  
#>  Mean   : 0.001420  
#>  3rd Qu.: 0.477942  
#>  Max.   : 3.358401
plot(sim)

Model Fitting

Prepare Data

dynr_data <- dynr::dynr.data(
  dataframe = data,
  id = "id",
  time = "time",
  observed = c("y1", "y2", "y3")
)

Prepare Initial Condition

dynr_initial <- dynr::prep.initial(
  values.inistate = mu0,
  params.inistate = c("mu0_1_1", "mu0_2_1", "mu0_3_1"),
  values.inicov = sigma0,
  params.inicov = matrix(
    data = c(
      "sigma0_1_1", "sigma0_2_1", "sigma0_3_1",
      "sigma0_2_1", "sigma0_2_2", "sigma0_3_2",
      "sigma0_3_1", "sigma0_3_2", "sigma0_3_3"
    ),
    nrow = 3
  )
)

Prepare Measurement Model

dynr_measurement <- dynr::prep.measurement(
  values.load = diag(3),
  params.load = matrix(data = "fixed", nrow = 3, ncol = 3),
  state.names = c("eta_1", "eta_2", "eta_3"),
  obs.names = c("y1", "y2", "y3")
)

Prepare Dynamic Process

dynr_dynamics <- dynr::prep.formulaDynamics(
  formula = list(  
    eta_1 ~ (phi_1_1 * (eta_1 - mu_1_1)) + (phi_1_2 * (eta_2 - mu_2_1)) + (phi_1_3 * (eta_3 - mu_3_1)),
    eta_2 ~ (phi_2_1 * (eta_1 - mu_1_1)) + (phi_2_2 * (eta_2 - mu_2_1)) + (phi_2_3 * (eta_3 - mu_3_1)),
    eta_3 ~ (phi_3_1 * (eta_1 - mu_1_1)) + (phi_3_2 * (eta_2 - mu_2_1)) + (phi_3_3 * (eta_3 - mu_3_1))
  ),
  startval = c(
    mu_1_1 = mu[1], mu_2_1 = mu[2], mu_3_1 = mu[3],
    phi_1_1 = phi[1, 1], phi_1_2 = phi[1, 2], phi_1_3 = phi[1, 3],
    phi_2_1 = phi[2, 1], phi_2_2 = phi[2, 2], phi_2_3 = phi[2, 3],
    phi_3_1 = phi[3, 1], phi_3_2 = phi[3, 2], phi_3_3 = phi[3, 3]
  ),
  isContinuousTime = TRUE
)

Prepare Process Noise

dynr_noise <- dynr::prep.noise(
  values.latent = sigma,
  params.latent = matrix(
    data = c(
      "sigma_1_1", "sigma_2_1", "sigma_3_1",
      "sigma_2_1", "sigma_2_2", "sigma_3_2",
      "sigma_3_1", "sigma_3_2", "sigma_3_3"
    ),
    nrow = 3
  ),
  values.observed = theta,
  params.observed = matrix(
    data = c(
      "theta_1_1", "fixed", "fixed",
      "fixed", "theta_2_2", "fixed",
      "fixed", "fixed", "theta_3_3"
    ),
    nrow = 3
  )
)

Prepare the Model

model <- dynr::dynr.model(
  data = dynr_data,
  initial = dynr_initial,
  measurement = dynr_measurement,
  dynamics = dynr_dynamics,
  noise = dynr_noise,
  outfile = "ou.c"
)

Add lower and upper bounds to aid in the optimization.

model$lb[
  c(
    "phi_1_1",
    "phi_1_2",
    "phi_1_3",
    "phi_2_1",
    "phi_2_2",
    "phi_2_3",
    "phi_3_1",
    "phi_3_2",
    "phi_3_3"
  )
] <- -1.5
model$ub[
  c(
    "phi_1_1",
    "phi_1_2",
    "phi_1_3",
    "phi_2_1",
    "phi_2_2",
    "phi_2_3",
    "phi_3_1",
    "phi_3_2",
    "phi_3_3"
  )
] <- +1.5

Fit the Model

results <- dynr::dynr.cook(
  model,
  debug_flag = TRUE,
  verbose = FALSE
)
#> [1] "Get ready!!!!"
#> using C compiler: β€˜gcc (Ubuntu 13.3.0-6ubuntu2~24.04) 13.3.0’
#> Optimization function called.
#> Starting Hessian calculation ...
#> Finished Hessian calculation.
#> Original exit flag:  3 
#> Modified exit flag:  3 
#> Optimization terminated successfully: ftol_rel or ftol_abs was reached. 
#> Original fitted parameters:  0.004772387 0.006387041 0.001928491 -0.3583514 
#> -0.001773536 -0.0009242187 0.7717298 -0.5106306 -0.0009425439 -0.4504035 
#> 0.7272245 -0.6928862 -1.400091 0.09056937 -0.1990023 -2.655458 0.2825195 
#> -2.817898 -1.612043 -1.611921 -1.609778 -0.01930384 -0.001695005 0.002906709 
#> -1.098928 0.8660663 0.01750772 -1.302699 0.9128824 -2.284439 
#> 
#> Transformed fitted parameters:  0.004772387 0.006387041 0.001928491 -0.3583514 
#> -0.001773536 -0.0009242187 0.7717298 -0.5106306 -0.0009425439 -0.4504035 
#> 0.7272245 -0.6928862 0.2465745 0.0223321 -0.04906889 0.07228922 0.01540755 
#> 0.07510468 0.1994797 0.199504 0.199932 -0.01930384 -0.001695005 0.002906709 
#> 0.3332281 0.2885976 0.005834063 0.521742 0.2531716 0.3284367 
#> 
#> Doing end processing
#> Successful trial
#> Total Time: 20.74132 
#> Backend Time: 20.74114

Summary

summary(results)
#> Coefficients:
#>              Estimate Std. Error  t value   ci.lower   ci.upper Pr(>|t|)    
#> mu_1_1      0.0047724  0.0047757    0.999 -0.0045879  0.0141327    0.159    
#> mu_2_1      0.0063870  0.0077251    0.827 -0.0087539  0.0215280    0.204    
#> mu_3_1      0.0019285  0.0049565    0.389 -0.0077861  0.0116431    0.349    
#> phi_1_1    -0.3583514  0.0185105  -19.359 -0.3946314 -0.3220715   <2e-16 ***
#> phi_1_2    -0.0017735  0.0159471   -0.111 -0.0330293  0.0294823    0.456    
#> phi_1_3    -0.0009242  0.0114279   -0.081 -0.0233226  0.0214741    0.468    
#> phi_2_1     0.7717298  0.0094082   82.028  0.7532901  0.7901694   <2e-16 ***
#> phi_2_2    -0.5106306  0.0084223  -60.628 -0.5271381 -0.4941231   <2e-16 ***
#> phi_2_3    -0.0009425  0.0062797   -0.150 -0.0132505  0.0113655    0.440    
#> phi_3_1    -0.4504035  0.0081993  -54.932 -0.4664739 -0.4343332   <2e-16 ***
#> phi_3_2     0.7272245  0.0072519  100.280  0.7130110  0.7414380   <2e-16 ***
#> phi_3_3    -0.6928862  0.0055482 -124.886 -0.7037604 -0.6820119   <2e-16 ***
#> sigma_1_1   0.2465745  0.0031079   79.337  0.2404830  0.2526659   <2e-16 ***
#> sigma_2_1   0.0223321  0.0009036   24.714  0.0205610  0.0241032   <2e-16 ***
#> sigma_3_1  -0.0490689  0.0009402  -52.192 -0.0509116 -0.0472262   <2e-16 ***
#> sigma_2_2   0.0722892  0.0006781  106.606  0.0709602  0.0736183   <2e-16 ***
#> sigma_3_2   0.0154076  0.0004469   34.474  0.0145316  0.0162835   <2e-16 ***
#> sigma_3_3   0.0751047  0.0006930  108.377  0.0737464  0.0764629   <2e-16 ***
#> theta_1_1   0.1994797  0.0004123  483.818  0.1986716  0.2002878   <2e-16 ***
#> theta_2_2   0.1995040  0.0003204  622.761  0.1988761  0.2001319   <2e-16 ***
#> theta_3_3   0.1999320  0.0003213  622.319  0.1993023  0.2005617   <2e-16 ***
#> mu0_1_1    -0.0193038  0.0215547   -0.896 -0.0615503  0.0229426    0.185    
#> mu0_2_1    -0.0016950  0.0369205   -0.046 -0.0740579  0.0706679    0.482    
#> mu0_3_1     0.0029067  0.0345815    0.084 -0.0648718  0.0706852    0.467    
#> sigma0_1_1  0.3332281  0.0254503   13.093  0.2833463  0.3831098   <2e-16 ***
#> sigma0_2_1  0.2885976  0.0321793    8.968  0.2255274  0.3516678   <2e-16 ***
#> sigma0_3_1  0.0058341  0.0218162    0.267 -0.0369249  0.0485930    0.395    
#> sigma0_2_2  0.5217420  0.0553415    9.428  0.4132746  0.6302093   <2e-16 ***
#> sigma0_3_2  0.2531716  0.0311815    8.119  0.1920571  0.3142862   <2e-16 ***
#> sigma0_3_3  0.3284367  0.0303100   10.836  0.2690302  0.3878433   <2e-16 ***
#> ---
#> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#> 
#> -2 log-likelihood value at convergence = 4293422.48
#> AIC = 4293482.48
#> BIC = 4293836.94
#> [1] -0.019303838 -0.001695005  0.002906709

Parameter Estimates

mu_hat
#> [1] 0.004772387 0.006387041 0.001928491
phi_hat
#>            [,1]         [,2]          [,3]
#> [1,] -0.3583514 -0.001773536 -0.0009242187
#> [2,]  0.7717298 -0.510630602 -0.0009425439
#> [3,] -0.4504035  0.727224464 -0.6928861600
sigma_hat
#>             [,1]       [,2]        [,3]
#> [1,]  0.24657448 0.02233210 -0.04906889
#> [2,]  0.02233210 0.07228922  0.01540755
#> [3,] -0.04906889 0.01540755  0.07510468
theta_hat
#>           [,1]     [,2]     [,3]
#> [1,] 0.1994797 0.000000 0.000000
#> [2,] 0.0000000 0.199504 0.000000
#> [3,] 0.0000000 0.000000 0.199932
mu0_hat
#> [1] -0.019303838 -0.001695005  0.002906709
sigma0_hat
#>             [,1]      [,2]        [,3]
#> [1,] 0.333228051 0.2885976 0.005834063
#> [2,] 0.288597597 0.5217420 0.253171644
#> [3,] 0.005834063 0.2531716 0.328436733
beta_var1_hat <- expm::expm(phi_hat)
beta_var1_hat
#>            [,1]         [,2]          [,3]
#> [1,]  0.6984495 -0.001349417 -0.0005482773
#> [2,]  0.5002334  0.599439403 -0.0007292751
#> [3,] -0.1003474  0.399072761  0.5000156072

References

Chow, S.-M., Ho, M. R., Hamaker, E. L., & Dolan, C. V. (2010). Equivalence and differences between structural equation modeling and state-space modeling techniques. Structural Equation Modeling: A Multidisciplinary Journal, 17(2), 303–332. https://doi.org/10.1080/10705511003661553
Chow, S.-M., Losardo, D., Park, J., & Molenaar, P. C. M. (2023). Continuous-time dynamic models: Connections to structural equation models and other discrete-time models. In R. H. Hoyle (Ed.), Handbook of structural equation modeling (2nd ed.). The Guilford Press.
Deboeck, P. R., & Preacher, K. J. (2015). No need to be discrete: A method for continuous time mediation analysis. Structural Equation Modeling: A Multidisciplinary Journal, 23(1), 61–75. https://doi.org/10.1080/10705511.2014.973960
Oravecz, Z., Tuerlinckx, F., & Vandekerckhove, J. (2011). A hierarchical latent stochastic differential equation model for affective dynamics. Psychological Methods, 16(4), 468–490. https://doi.org/10.1037/a0024375
Ou, L., Hunter, M. D., & Chow, S.-M. (2019). What’s for dynr: A package for linear and nonlinear dynamic modeling in R. The R Journal, 11(1), 91. https://doi.org/10.32614/rj-2019-012
R Core Team. (2024). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Uhlenbeck, G. E., & Ornstein, L. S. (1930). On the theory of the brownian motion. Physical Review, 36(5), 823–841. https://doi.org/10.1103/physrev.36.823