APLMforPool

This project fits an additive partially linear model (APLM) to pooled biomonitoring data. A paper corresponding to this project is currently under review at Computational Statistics & Data Analysis.

The required source files are src.R and PLM.cpp. Download both to your R (or RStudio) working directory.
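Before sourcing anything, it can help to confirm that both files are actually in the working directory. The snippet below is a minimal sketch that only uses base R; the variable names `required_files` and `missing_files` are illustrative and not part of this repository.

```r
# Files this README says are needed in the working directory
required_files <- c("src.R", "PLM.cpp")

# Keep only the names that cannot be found in the current working directory
missing_files <- required_files[!file.exists(required_files)]

if (length(missing_files) > 0) {
  message("Not found in ", getwd(), ": ", paste(missing_files, collapse = ", "))
} else {
  message("Both source files found.")
}
```

If a file is reported as missing, either copy it into the directory printed in the message or change the working directory with `setwd()`.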

A simple example

The example below is also included in full in 'Example.R' in this repository. It analyzes the PBDE-47 concentrations collected by NHANES during 2015-2016. You can download the dataset from the official NHANES website; a cleaned version is also provided as 'PBDE47-2015-2016.csv' in this repository.

Step 1. Clear the workspace, install (if not already installed) and load the required R packages, and source the code in this repository

packages <- c("maxLik","Rcpp","RcppArmadillo")
install.packages(setdiff(packages, rownames(installed.packages())))  

rm(list=ls(all=TRUE))
library(maxLik)
library(Rcpp)
library(RcppArmadillo)
source('src.R')

Step 2. Input data

input_data=read.csv('PBDE47-2015-2016.csv')

The dataset should be in the following form (see 'DatasetScreenshot.PNG' in this repository), where the first column, Z, contains the pooled concentration levels.
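A quick check of the loaded data can catch a misnamed column before the model is fitted. The sketch below is a hypothetical helper (`check_pooled_data` is not part of src.R) that looks for the column names used later in this example and counts the individuals in each pool. It assumes one row per individual, with the pooled value Z repeated within a pool; the toy data frame is made up purely so the sketch runs on its own.

```r
# Hypothetical helper: confirm the columns used in this example are present
# and return the number of rows (individuals) in each pool.
check_pooled_data <- function(df,
                              required = c("Z", "BMI", "age", "race",
                                           "gender", "Weight", "PoolID")) {
  missing_cols <- setdiff(required, names(df))
  if (length(missing_cols) > 0) {
    stop("Missing column(s): ", paste(missing_cols, collapse = ", "))
  }
  if (any(is.na(df[required]))) {
    warning("Missing values found in the required columns")
  }
  table(df$PoolID)
}

# Made-up toy data: two pools with 3 and 2 individuals (assumed layout)
toy <- data.frame(
  Z      = c(10.2, 10.2, 10.2, 7.5, 7.5),
  BMI    = c(22, 27, 31, 24, 29),
  age    = c(20, 20, 20, 45, 45),
  race   = c(1, 1, 1, 2, 2),
  gender = c(1, 1, 1, 2, 2),
  Weight = c(1.1, 0.9, 1.3, 1.0, 0.8),
  PoolID = c(1, 1, 1, 2, 2)
)
pool_sizes <- check_pooled_data(toy)
print(pool_sizes)
```

With the real data, the same call would be `check_pooled_data(input_data)`.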

Step 3. Set the input arguments

kernel_type=2 #kernel_type=1 Gaussian kernel; 2 Epanechnikov kernel
Z=input_data$Z # pooled concentration
U=cbind(log(input_data$BMI),input_data$age) #covariates in nonlinear component
                                            #In this example, U1 is log(BMI), U2 is age
X=cbind(1,input_data$race,input_data$gender) #covariates in linear component; the first column is a vector of 1's for intercept
                                             #In this example, X1 is race with 1 white 2 non-white
                                             #                 X2 is gender with 1 male 2 female   
original.Weight=input_data$Weight #sampling weight of each individual
PoolID=input_data$PoolID #Pool ID
U.ps=c('nonhomo','homo') #pooling structure corresponding to U
                         #In this example, BMI is non-homogeneous pooling and age is homogeneous pooling
max.iter=10 #maximum number of iterations
len=100 #number of grid points per covariate used for prediction
U.pred=cbind((seq(min(U[,1]),max(U[,1]),length=len)),(seq(min(U[,2]),max(U[,2]),length=len))) # U for prediction
X.pred=matrix(c(1,rep(0,ncol(X)-1)),nrow=len,ncol=ncol(X),byrow = TRUE) # X for prediction
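The arguments above have to line up with each other: in this example every vector comes from the same data frame, U has one pooling label per column, and X starts with a column of 1's. The sketch below checks those relationships. It is a hypothetical helper (`check_aplm_inputs` is not part of src.R), and it assumes that `fit.aplm` expects what the example shows, including that 'homo' and 'nonhomo' are the only pooling labels; those two values are simply the ones that appear in this README.

```r
# Hypothetical helper: check that the inputs are mutually consistent,
# mirroring the layout used in this example.
check_aplm_inputs <- function(Z, U, X, original.Weight, PoolID, U.ps) {
  n <- length(Z)
  stopifnot(
    is.matrix(U), is.matrix(X),
    nrow(U) == n, nrow(X) == n,                      # one row per observation
    length(original.Weight) == n, length(PoolID) == n,
    length(U.ps) == ncol(U),                         # one pooling label per column of U
    all(U.ps %in% c("homo", "nonhomo")),             # assumed set of labels
    all(X[, 1] == 1)                                 # intercept column
  )
  invisible(TRUE)
}

# Made-up toy inputs so the sketch runs on its own
Z.toy      <- c(10.2, 10.2, 7.5, 7.5)
U.toy      <- cbind(log(c(22, 27, 24, 29)), c(20, 20, 45, 45))
X.toy      <- cbind(1, c(1, 1, 2, 2), c(1, 2, 1, 2))
Weight.toy <- c(1.1, 0.9, 1.0, 0.8)
PoolID.toy <- c(1, 1, 2, 2)
ok <- check_aplm_inputs(Z.toy, U.toy, X.toy, Weight.toy, PoolID.toy,
                        c("nonhomo", "homo"))
```

With the real inputs, the call would be `check_aplm_inputs(Z, U, X, original.Weight, PoolID, U.ps)`; it stops with an error naming the first condition that fails.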

Step 4. Fit the model

#The output includes:
# beta:   final estimate of beta
# f:      an N by q matrix whose columns 1 to q are the estimates of f1 to fq at each U
# f.pred: a len by q matrix whose columns 1 to q are the estimates of f1 to fq at the corresponding U.pred
# record.h: the bandwidths selected at each iteration
res=fit.aplm(Z,U,X,original.Weight,PoolID,U.ps,X.pred=X.pred,U.pred=U.pred,kernel_type=kernel_type,max.iter=max.iter,plot=TRUE,plot.pred=TRUE)

The program prints the selected bandwidth at each iteration. If plot=TRUE and plot.pred=TRUE are specified, it also plots the fitted curves at each iteration and the predicted curves at the end of the run (see 'IterationOutputConsole.PNG' and 'IterationPlot.PNG' in this repository).

Step 5. Check the outputs

The results are stored in the 'res' object created in the previous step. The following code prints the estimate of beta and draws more polished plots of the nonlinear curves:

res$beta
plot(U.pred[,1],res$f.pred[1:len,1],col="black",ylab='',type='l',xlab='log(BMI)',cex.lab=1.5,cex.axis=1.1,lwd=2)
title(ylab=expression(hat('f')[1]*'(log(BMI))'), line=2, cex.lab=1.5)
plot(U.pred[,2],res$f.pred[1:len,2],col="black",ylab='',type='l',xlab='Age',cex.lab=1.5,cex.axis=1.1,lwd=2)
title(ylab=expression(hat('f')[2]*'(age)'), line=2, cex.lab=1.5)

The output is shown in 'betaestimation.PNG' and 'finalplot.PNG' in this repository.
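The predicted curves can also be kept as a table, for example to redraw them later without refitting. The sketch below is one way to do that, assuming f.pred has the len by q layout described in Step 4. `curves_to_table` is a hypothetical helper, and `res_demo` is a made-up stand-in for the fitted object so that the sketch runs on its own; in practice, pass `U.pred` and `res$f.pred` from the steps above.

```r
# Made-up stand-ins for U.pred and res (not real estimates)
len <- 100
U.demo <- cbind(seq(2.7, 4.2, length = len), seq(12, 80, length = len))
res_demo <- list(f.pred = cbind(sin(U.demo[, 1]), cos(U.demo[, 2] / 20)))

# Hypothetical helper: stack the q predicted curves into one long table
curves_to_table <- function(U.pred, f.pred, labels) {
  stopifnot(ncol(U.pred) == ncol(f.pred),
            nrow(U.pred) == nrow(f.pred),
            length(labels) == ncol(U.pred))
  do.call(rbind, lapply(seq_len(ncol(U.pred)), function(j) {
    data.frame(component = labels[j], u = U.pred[, j], f_hat = f.pred[, j])
  }))
}

tab <- curves_to_table(U.demo, res_demo$f.pred, c("log(BMI)", "age"))

# Write to a temporary file here; choose a permanent path in real use
out_file <- file.path(tempdir(), "aplm_curves.csv")
write.csv(tab, out_file, row.names = FALSE)
```

Each row of the table holds one grid point of one curve, so the two components can be filtered by the `component` column when plotting.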
