vignettes/NAME-OF-VIGNETTE.Rmd
NAME-OF-VIGNETTE.Rmd
DRDNetPro working flow
DRDNetPro is a bio-protocol for recovering Disease Risk-associated pseudo-Dynamic Networks (DRDNet) from steady-state gene expression data (as opposed to long-run temporal expression data that is very hard to collect in practice). It incorporates risk prediction model of having certain disease, a varying coefficient model, multiple ordinary differential equations, and group lasso estimation to learn a series of networks. This page summarizes the three main procedures to implement network learning.
To begin with, the user need to install our packages
if (!require("devtools")) {
install.packages("devtools")
}
devtools::install_github("chencxxy28/DRDNetPro")
To run the following tutorial, the user need to install the packages listed below
library(DRDNetPro)
library(pROC)
library(ranger) #optional
library(xgboost) #optional
library(dplyr) #optional
library(np)
library(splines2)
library(grpreg)
library(Matrix)
library(igraph)
library(ggplot2)
The overall goal for this tutorial is to provide detailed codes to learn and visualize the hypertension risk-associated pseudo-dynamic gene-gene networks in the tissue of blood vessel, which will be illustrated in details in the following three tutorials. These networks include the gene-gene interactions for the baseline demographic profile and interactions caused by the effect from a binary covariate, such as smoking. The illustrating data is from GTEx consisting of gene expression from blood vessel and de-identified phenotypical information from donors. The user is highly recommended to go through the following tutorials one by one.
The first step is to use the phenotypical data (pheno_did) to impute the risk of having hypertension (named as agent in general). Note that the raw phenotype data is confidential and protected. The illustrating data used in the tutorial is subset of the raw data and also de-identified. This section includes the content of preparing the data, building the prediction model for the agent, and evaluating the model for the predicted agent. The imputed agent will be then used to construct disease risk-associated pseudo-dynamic networks. Please find the details in Tutorial 1
Based on the imputed agent, the next step is to learn the network for gene expression data (data vessel). This section is the main section in DRDNetPro, which includes the content of preparing gene expression data, training the varying coefficient model for gene expressions, generating the base matrices for network learning, and training the network model. Please find the details in Tutorial 2
This section includes the content of extracting the results from the trained network model, visualizing the pseudo-dynamic network for a given agent value, and visualizing the interaction behaviors from genes to a targetd gene over the spectrum of the agent (risk of having hypertension). Please find the details in Tutorial 3