Estimated-Network-Models

In this project, we consider three primary application settings for estimated networks: correlation networks, feature interaction networks, and multivariate Hawkes processes.

If simulation data are used, each setting contains two parts:

Simulation data generation
Parameter prediction

Otherwise, we only need parameter prediction.

1. Correlation Networks

1.1 Simulated Data Generation

$$\begin{aligned} &\qquad z_k \\\ &\qquad \downarrow \\\ z_j &\rightarrow \theta_{jk} \rightarrow x \end{aligned}$$

$d$-dimensional latent positions are sampled according to a multivariate normal distribution:

$$z_i \sim \mathcal{N}(0, I\sigma_z^2), \text{ } i \in \{0, 1, ..., p\}$$

The covariance matrix is sampled according to an inverse-Wishart distribution with scale matrix equal to $ZZ^T$ and $\nu$ degrees of freedom:

$$\text{Cov} \sim \mathcal{W}^{-1}(ZZ^T, \nu)$$

The covariance matrix, which is guaranteed to be symmetric positive definite by the properties of the inverse-Wishart distribution, is parameterized by lower triangular matrix $\Theta$ such that $\text{Cov} = \Theta\Theta^T$.

1.2 Parameter Estimation

Parameter estimation is performed by maximizing the log-likelihood of the data and model parameters given the generative process described in section 1.1. Thus, the following loss function is minimized with respect to latent positions $Z$ and covariance parameters $\Theta$:

$$\text{Loss} = -l(Z) - l(\Theta|Z) - l(X|\Theta)$$

$$l(Z) = \sum_{i=1}^p\ln((2\pi)^{-k/2}\det(I\sigma_z^2)^{-1/2}\exp(\frac{-1}{2}z_i^T(I\sigma_z^2)^{-1}z_i))$$

$$l(\Theta|Z) = \ln(\frac{|ZZ^T|^{\nu/2}}{2^{\nu p/2}\Gamma_p(\frac{\nu}{2})}|\Theta\Theta^T|^{-(\nu+p+1)/2}e^{-\frac{1}{2}\text{tr}(ZZ^T(\Theta\Theta^T)^{-1})})$$

$$l(X|\Theta) = \sum_{i=1}^p\ln((2\pi)^{-p/2}\det(\Theta\Theta^T)^{-1/2}\exp(\frac{-1}{2}x_i^T(\Theta\Theta^T)^{-1}x_i))$$

1.3 Evaluation

Simulated Data: For simulated data, we can directly compare the estimated covariance values to the true values using the Frobenius norm of the difference between the covariance matrices:

$$\|\Theta - \hat{\Theta}\|_F^2$$

Real Data: For real data, we do not known the true covariance values, so we instead evaluate the log-likelihood of a held out set of data on the fitted covariance model:

$$l(X_{\text{test}}|\Theta) = \sum_{x_i \in X_{\text{test}}}\ln((2\pi)^{-p/2}\det(\Theta\Theta^T)^{-1/2}\exp(\frac{-1}{2}x_i^T(\Theta\Theta^T)^{-1}x_i))$$

2. Feature Interaction Networks

2.1 Simulation data generation

$$\begin{aligned} &\qquad z_k \\\ &\qquad \downarrow \\\ z_j &\rightarrow \theta_{jk} \rightarrow y \leftarrow x \\\ &\qquad\qquad\uparrow \\\ &\qquad\qquad\beta \end{aligned}$$

$$ X \in \mathbb{R}^{n \times p} $$

$$ y \in \mathbb{R}^{n} $$

$$ Z \in \mathbb{R}^{p \times d} $$

$$ \Theta \in \mathbb{R}^{p \times p} $$

$$ \beta \in \mathbb{R}^{p} $$

$$ \tilde{\beta} \in \mathbb{R}^{p+1} $$

$$x_i \sim \mathcal{N}(0, \Sigma_x)$$

$$\Sigma_x = \sigma_x^2 \left(\rho^{|j-k|}\right)_{j,k=1,\dots,p}$$

$$\theta_{jk} \sim \mathcal{N}\left(\alpha - \|z_j - z_k\|_2^2,\; \sigma_\theta^2\right)$$

$$z_j \sim \mathcal{N}(0, \sigma_z^2 I_d)$$

$$y_i \sim \mathcal{N}( \beta_0 + \beta^\top x_i + \sum_{j \lt k}\theta_{jk}x_{ij}x_{ik}, \; \sigma_y^2 )$$

2.2 Parameter prediction

The model likelihood is

$$p(y, \beta, \Theta, Z \mid x) = p(y \mid x, \beta, \Theta)\, p(\Theta \mid Z)\, p(\beta)\, p(Z)$$

with

$$p(y \mid x, \Theta, \beta) = \prod_{i=1}^n \mathcal{N}( y_i \mid \beta_0 + \beta^\top x_i + \sum_{j \lt k}\theta_{jk}x_{ij}x_{ik}, \; \sigma_y^2 )$$

$$p(\Theta \mid Z) = \prod_{j \lt k} \mathcal{N}( \theta_{jk} \mid \alpha - \|z_j - z_k\|_2^2, \; \sigma_\theta^2 )$$

$$p(\beta) = \prod_{j=1}^p \mathcal{N}(\beta_j \mid 0,\sigma_\beta^2)$$

$$p(Z) = \prod_{j=1}^p \prod_{l=1}^d \mathcal{N}(z_{jl} \mid 0, \sigma_z^2)$$

The full negative log-likelihood is

$$\mathrm{NLL} = -\log p(y, x, \Theta, Z) = -\log p(y, \beta, \Theta, Z \mid x) -\log p(x) -\log p(\Theta \mid Z) -\log p(Z)$$

In our current implementation, we do not explicitly include the priors $p(\beta)$ and $p(z)$ in the optimization objective. This is because priors have large variances with ero mean priors

Therefore, the training loss is

$$\mathcal{L} = -\log p(y \mid x, \Theta) -\log p(\Theta \mid Z)$$

2.3 Evaluation

Simulation data: compare the estimated interaction matrix with the true interaction matrix

$$ |\Theta - \hat{\Theta}|_F^2 $$

Real data: compare the prediction with the observed response

$$ |y - \hat{y}|_2^2 $$

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
__pycache__		__pycache__
covariance_data		covariance_data
covariance_plots		covariance_plots
results		results
MHP_ADM4_trials.py		MHP_ADM4_trials.py
MHP_chicago_homicides.py		MHP_chicago_homicides.py
MHP_model_misspecification.py		MHP_model_misspecification.py
MHP_model_misspecification_data_to_graph.py		MHP_model_misspecification_data_to_graph.py
MHP_parameter_count.py		MHP_parameter_count.py
MHP_parameter_count_data_to_graph.py		MHP_parameter_count_data_to_graph.py
MultivariateHawkesProcess.ipynb		MultivariateHawkesProcess.ipynb
MultivariateHawkesProcess.py		MultivariateHawkesProcess.py
README.md		README.md
covariance_model.py		covariance_model.py
covariance_real_data_experiment.py		covariance_real_data_experiment.py
covariance_synthetic_experiment.py		covariance_synthetic_experiment.py
feature_interaction.py		feature_interaction.py
feature_interaction_experiment_realdata.py		feature_interaction_experiment_realdata.py
feature_interaction_experiment_simu.py		feature_interaction_experiment_simu.py
feature_interaction_experiment_simu_sbm.py		feature_interaction_experiment_simu_sbm.py
feature_interaction_simu.py		feature_interaction_simu.py
homicides_in_chicago.dta		homicides_in_chicago.dta
plot_results.py		plot_results.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Estimated-Network-Models

1. Correlation Networks

1.1 Simulated Data Generation

1.2 Parameter Estimation

1.3 Evaluation

2. Feature Interaction Networks

2.1 Simulation data generation

2.2 Parameter prediction

2.3 Evaluation

3. Multivariate Hawkes Processes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Estimated-Network-Models

1. Correlation Networks

1.1 Simulated Data Generation

1.2 Parameter Estimation

1.3 Evaluation

2. Feature Interaction Networks

2.1 Simulation data generation

2.2 Parameter prediction

2.3 Evaluation

3. Multivariate Hawkes Processes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages