EBookClubs

Read Books & Download eBooks Full Online

EBookClubs

Read Books & Download eBooks Full Online

Book Efficient Nonparametric and Semiparametric Regression Methods with Application in Case Control Studies

Download or read book Efficient Nonparametric and Semiparametric Regression Methods with Application in Case Control Studies written by Shahina Rahman and published by . This book was released on 2015 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt: Regression Analysis is one of the most important tools of statistics which is widely used in other scientific fields for projection and modeling of association between two variables. Nowadays with modern computing techniques and super high performance devices, regression analysis on multiple dimensions has become an important issue. Our task is to address the issue of modeling with no assumption on the mean and the variance structure and further with no assumption on the error distribution. In other words, we focus on developing robust semiparametric and nonparamteric regression problems. In modern genetic epidemiological association studies, it is often important to investigate the relationships among the potential covariates related to disease in case-control data, a study known as "Secondary Analysis". First we focus to model the association between the potential covariates in univariate dimension nonparametrically. Then we focus to model the association in mulivariate set up by assuming a convenient and popular multivariate semiparametric model, known as Single-Index Model. The secondary analysis of case-control studies is particularly challenging due to multiple reasons (a) the case-control sample is not a random sample, (b) the logistic intercept is practically not identifiable and (c) misspecification of error distribution leads to inconsistent results. For rare disease, controls (individual free of disease) are typically used for valid estimation. However, numerous publication are done to utilize the entire case-control sample (including the diseased individual) to increase the efficiency. Previous work in this context has either specified a fully parametric distribution for regression errors or specified a homoscedastic distribution for the regression errors or have assumed parametric forms on the regression mean. In the first chapter we focus on to predict an univariate covariate Y by another potential univariate covariate X neither by any parametric form on the mean function nor by any distributional assumption on error, hence addressing potential heteroscedasticity, a problem which has not been studied before. We develop a tilted Kernel based estimator which is a first attempt to model the mean function nonparametrically in secondary analysis. In the following chapters, we focus on i.i.d samples to model both the mean and variance function for predicting Y by multiple covariates X without assuming any form on the regression mean. In particular we model Y by a single-index model m(X^T [Lowercase theta symbol]), where [Lowercase theta symbol] is a single-index vector and m is unspecified. We also model the variance function by another flexible single index model. We develop a practical and readily applicable Bayesian methodology based on penalized spline and Markov Chain Monte Carlo (MCMC) both in i.i.d set up and in case-control set up. For efficient estimation, we model the error distribution by a Dirichlet process mixture models of Normals (DPMM). In numerical examples, we illustrate the finite sample performance of the posterior estimates for both i.i.d and for case-control set up. For single-index set up, in i.i.d case only one existing work based on local linear kernel method addresses modeling of the variance function. We found that our method based on DPMM vastly outperforms the other existing method in terms of mean square efficiency and computation stability. We develop the single-index modeling in secondary analysis to introduce flexible mean and variance function modeling in case-control studies, a problem which has not been studies before. We showed that our method is almost 2 times efficient than using only controls, which is typically used for many cases. We use the real data example from NIH-AARP study on breast cancer, from Colon Cancer Study on red meat consumption and from National Morbidity Air Pollution Study to illustrate the computational efficiency and stability of our methods. The electronic version of this dissertation is accessible from http://hdl.handle.net/1969.1/155719

Book Semiparametric Regression

Download or read book Semiparametric Regression written by David Ruppert and published by Cambridge University Press. This book was released on 2003-07-14 with total page 408 pages. Available in PDF, EPUB and Kindle. Book excerpt: Even experts on semiparametric regression should find something new here.

Book Nonparametric and Semiparametric Regression  Missing Data  and Related Algorithms

Download or read book Nonparametric and Semiparametric Regression Missing Data and Related Algorithms written by Mingyu Li and published by . This book was released on 2010 with total page 59 pages. Available in PDF, EPUB and Kindle. Book excerpt: This dissertation consists of two chapters: Chapter 1 develops nonparametric and semiparametric regression methodologies which relate the group testing responses to the individual covariates information. In this chapter, we extend the parametric regression model of Xie (2001) for binary group testing data to the nonparametric and semiparametric models. We fit nonparametric and semiparametric models and obtain estimators of the parameters by maximizing penalized likelihood function. For implementation, we apply EM algorithm considering the individual responses as complete data and the group testing responses as observed data. Simulation studies are performed to illustrate the methodologies and to evaluate the finite sample performance of our methods. In general, group testing involves a large number of subjects, hence, the computational aspect is also discussed. The results show that our estimation methods perform well for estimating both the individual probability of positive outcome and the prevalence rate in the population. Chapter 2 studies a partially linear regression model with missing response variable and develops semiparametric efficient inference for the parametric component of the model. The missingness considered here includes a broad range of missing patterns. For the estimation method, we use the concept of least favorable curve, least favorable direction and the generalized profile likelihood in Severini and Wong (1992). Asymptotic distributions for the estimators of the parametric components are obtained. It is shown that the estimators are asymptotically normally distributed under some conditions. Furthermore, we prove that the asymptotic covariance of the estimators achieves the semiparametric lower bound under the regularity conditions and additional conditions given in the appendix. We also propose an algorithm which runs iteratively between fitting parametric components and fitting nonparametric components while holding the other fixed. EM algorithms are used in estimating the parametric components by a semiparametric estimating equation and in estimating the nonparametric components by smoothing methods. It is proved that the estimators from this iterative algorithm equal to the conditional expectations (conditioned on observed data) of the semiparametric efficient estimators from complete data. The methodology is illustrated and evaluated by numerical examples.

Book The Work of Raymond J  Carroll

Download or read book The Work of Raymond J Carroll written by Marie Davidian and published by Springer. This book was released on 2014-06-06 with total page 599 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume contains Raymond J. Carroll's research and commentary on its impact by leading statisticians. Each of the seven main parts focuses on a key research area: Measurement Error, Transformation and Weighting, Epidemiology, Nonparametric and Semiparametric Regression for Independent Data, Nonparametric and Semiparametric Regression for Dependent Data, Robustness, and other work. The seven subject areas reviewed in this book were chosen by Ray himself, as were the articles representing each area. The commentaries not only review Ray’s work, but are also filled with history and anecdotes. Raymond J. Carroll’s impact on statistics and numerous other fields of science is far-reaching. His vast catalog of work spans from fundamental contributions to statistical theory to innovative methodological development and new insights in disciplinary science. From the outset of his career, rather than taking the “safe” route of pursuing incremental advances, Ray has focused on tackling the most important challenges. In doing so, it is fair to say that he has defined a host of statistics areas, including weighting and transformation in regression, measurement error modeling, quantitative methods for nutritional epidemiology and non- and semiparametric regression.

Book Nonparametric Regression Methods for Longitudinal Data Analysis

Download or read book Nonparametric Regression Methods for Longitudinal Data Analysis written by Hulin Wu and published by John Wiley & Sons. This book was released on 2006-05-12 with total page 401 pages. Available in PDF, EPUB and Kindle. Book excerpt: Incorporates mixed-effects modeling techniques for more powerful and efficient methods This book presents current and effective nonparametric regression techniques for longitudinal data analysis and systematically investigates the incorporation of mixed-effects modeling techniques into various nonparametric regression models. The authors emphasize modeling ideas and inference methodologies, although some theoretical results for the justification of the proposed methods are presented. With its logical structure and organization, beginning with basic principles, the text develops the foundation needed to master advanced principles and applications. Following a brief overview, data examples from biomedical research studies are presented and point to the need for nonparametric regression analysis approaches. Next, the authors review mixed-effects models and nonparametric regression models, which are the two key building blocks of the proposed modeling techniques. The core section of the book consists of four chapters dedicated to the major nonparametric regression methods: local polynomial, regression spline, smoothing spline, and penalized spline. The next two chapters extend these modeling techniques to semiparametric and time varying coefficient models for longitudinal data analysis. The final chapter examines discrete longitudinal data modeling and analysis. Each chapter concludes with a summary that highlights key points and also provides bibliographic notes that point to additional sources for further study. Examples of data analysis from biomedical research are used to illustrate the methodologies contained throughout the book. Technical proofs are presented in separate appendices. With its focus on solving problems, this is an excellent textbook for upper-level undergraduate and graduate courses in longitudinal data analysis. It is also recommended as a reference for biostatisticians and other theoretical and applied research statisticians with an interest in longitudinal data analysis. Not only do readers gain an understanding of the principles of various nonparametric regression methods, but they also gain a practical understanding of how to use the methods to tackle real-world problems.

Book Nonparametric and Semiparametric Models

Download or read book Nonparametric and Semiparametric Models written by Wolfgang Karl Härdle and published by Springer Science & Business Media. This book was released on 2012-08-27 with total page 317 pages. Available in PDF, EPUB and Kindle. Book excerpt: The statistical and mathematical principles of smoothing with a focus on applicable techniques are presented in this book. It naturally splits into two parts: The first part is intended for undergraduate students majoring in mathematics, statistics, econometrics or biometrics whereas the second part is intended to be used by master and PhD students or researchers. The material is easy to accomplish since the e-book character of the text gives a maximum of flexibility in learning (and teaching) intensity.

Book Semiparametric Odds Ratio Model and Its Applications

Download or read book Semiparametric Odds Ratio Model and Its Applications written by Hua Yun Chen and published by CRC Press. This book was released on 2021-12-20 with total page 296 pages. Available in PDF, EPUB and Kindle. Book excerpt: Beginning with familiar models and moving onto advanced semiparametric modelling tools Semiparametric Odds Ratio Model and its Applications introduces readers to a new range of flexible statistical models and provides guidance on their application using real data examples. This books range of real-world examples and exploration of common statistical problems makes it an invaluable reference for research professionals and graduate students of biostatistics, statistics, and other quantitative fields. Key Features: Introduces flexible statistical models that have yet to systematically introduced in course materials. Discusses applications of the proposed modelling framework in several important statistical problems, ranging from biased sampling designs and missing data, graphical models, survival analysis, Gibbs sampler and model compatibility, and density estimation. Includes real data examples to demonstrate the use of the proposed models, and estimation and inference tools.

Book Efficient Inference in General Semiparametric Regression Models

Download or read book Efficient Inference in General Semiparametric Regression Models written by Arnab Maity and published by . This book was released on 2010 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt: Semiparametric regression has become very popular in the field of Statistics over the years. While on one hand more and more sophisticated models are being developed, on the other hand the resulting theory and estimation process has become more and more involved. The main problems that are addressed in this work are related to efficient inferential procedures in general semiparametric regression problems. We first discuss efficient estimation of population-level summaries in general semiparametric regression models. Here our focus is on estimating general population-level quantities that combine the parametric and nonparametric parts of the model (e.g., population mean, probabilities, etc.). We place this problem in a general context, provide a general kernel-based methodology, and derive the asymptotic distributions of estimates of these population-level quantities, showing that in many cases the estimates are semiparametric efficient. Next, motivated from the problem of testing for genetic effects on complex traits in the presence of gene-environment interaction, we consider developing score test in general semiparametric regression problems that involves Tukey style 1 d.f form of interaction between parametrically and non-parametrically modeled covariates. We develop adjusted score statistics which are unbiased and asymptotically efficient and can be performed using standard bandwidth selection methods. In addition, to over come the difficulty of solving functional equations, we give easy interpretations of the target functions, which in turn allow us to develop estimation procedures that can be easily implemented using standard computational methods. Finally, we take up the important problem of estimation in a general semiparametric regression model when covariates are measured with an additive measurement error structure having normally distributed measurement errors. In contrast to methods that require solving integral equation of dimension the size of the covariate measured with error, we propose methodology based on Monte Carlo corrected scores to estimate the model components and investigate the asymptotic behavior of the estimates. For each of the problems, we present simulation studies to observe the performance of the proposed inferential procedures. In addition, we apply our proposed methodology to analyze nontrivial real life data sets and present the results.

Book Handbook of Statistical Methods for Case Control Studies

Download or read book Handbook of Statistical Methods for Case Control Studies written by Ørnulf Borgan and published by CRC Press. This book was released on 2018-06-27 with total page 536 pages. Available in PDF, EPUB and Kindle. Book excerpt: Handbook of Statistical Methods for Case-Control Studies is written by leading researchers in the field. It provides an in-depth treatment of up-to-date and currently developing statistical methods for the design and analysis of case-control studies, as well as a review of classical principles and methods. The handbook is designed to serve as a reference text for biostatisticians and quantitatively-oriented epidemiologists who are working on the design and analysis of case-control studies or on related statistical methods research. Though not specifically intended as a textbook, it may also be used as a backup reference text for graduate level courses. Book Sections Classical designs and causal inference, measurement error, power, and small-sample inference Designs that use full-cohort information Time-to-event data Genetic epidemiology About the Editors Ørnulf Borgan is Professor of Statistics, University of Oslo. His book with Andersen, Gill and Keiding on counting processes in survival analysis is a world classic. Norman E. Breslow was, at the time of his death, Professor Emeritus in Biostatistics, University of Washington. For decades, his book with Nick Day has been the authoritative text on case-control methodology. Nilanjan Chatterjee is Bloomberg Distinguished Professor, Johns Hopkins University. He leads a broad research program in statistical methods for modern large scale biomedical studies. Mitchell H. Gail is a Senior Investigator at the National Cancer Institute. His research includes modeling absolute risk of disease, intervention trials, and statistical methods for epidemiology. Alastair Scott was, at the time of his death, Professor Emeritus of Statistics, University of Auckland. He was a major contributor to using survey sampling methods for analyzing case-control data. Chris J. Wild is Professor of Statistics, University of Auckland. His research includes nonlinear regression and methods for fitting models to response-selective data.

Book Regression

Download or read book Regression written by L. Fahrmeir and published by . This book was released on 2021 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt: Now in its second edition, this textbook provides an applied and unified introduction to parametric, nonparametric and semiparametric regression that closes the gap between theory and application. The most important models and methods in regression are presented on a solid formal basis, and their appropriate application is shown through numerous examples and case studies. The most important definitions and statements are concisely summarized in boxes, and the underlying data sets and code are available online on the books dedicated website. Availability of (user-friendly) software has been a major criterion for the methods selected and presented. The chapters address the classical linear model and its extensions, generalized linear models, categorical regression models, mixed models, nonparametric regression, structured additive regression, quantile regression and distributional regression models. Two appendices describe the required matrix algebra, as well as elements of probability calculus and statistical inference. In this substantially revised and updated new edition the overview on regression models has been extended, and now includes the relation between regression models and machine learning, additional details on statistical inference in structured additive regression models have been added and a completely reworked chapter augments the presentation of quantile regression with a comprehensive introduction to distributional regression models. Regularization approaches are now more extensively discussed in most chapters of the book. The book primarily targets an audience that includes students, teachers and practitioners in social, economic, and life sciences, as well as students and teachers in statistics programs, and mathematicians and computer scientists with interests in statistical modeling and data analysis. It is written at an intermediate mathematical level and assumes only knowledge of basic probability, calculus, matrix algebra and statistics.

Book Semiparametric Regression Models for Between  and Within subject Attributes

Download or read book Semiparametric Regression Models for Between and Within subject Attributes written by Jinyuan Liu and published by . This book was released on 2022 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Breakthroughs such as high-throughput sequencing are generating flourishing high-dimensional data that provoke challenges in both statistical analyses and interpretations. Since directly modeling such data often suffers from multiple testing and low power, an emerging alternative is to first reduce the dimension at the outset, by comparing two subjects' genome sequences using dissimilarity metrics, yielding "between-subject attributes." In the first half of this talk, I will extend the classical generalized linear models (GLM) to establish a new regression paradigm for between-subject attributes, using a class of semiparametric functional response models (FRM). Despite its growing applications, the efficiency of estimators for the FRM has not yet been carefully studied. This is of fundamental importance for semiparametric models due to the efficiency loss at the price of minimum model assumptions. For the next half of the talk, we leverage the Hilbert-Space-based semiparametric efficiency theory to show that estimators from a class of U-statistics-based generalized estimating equation (UGEE) achieve the semiparametric efficiency bound. Thus, like GEE for semiparametric GLM, UGEE estimators also harmonize efficiency and robustness, propelling growing applications in biomedical, psychosocial, and related research.

Book Secondary Analysis of Case control Studies in Genomic Contexts

Download or read book Secondary Analysis of Case control Studies in Genomic Contexts written by Jiawei Wei and published by . This book was released on 2011 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt: This dissertation consists of five independent projects. In each project, a novel statistical method was developed to address a practical problem encountered in genomic contexts. For example, we considered testing for constant nonparametric effects in a general semiparametric regression model in genetic epidemiology; analyzed the relationship between covariates in the secondary analysis of case-control data; performed model selection in joint modeling of paired functional data; and assessed the prediction ability of genes in gene expression data generated by the CodeLink System from GE. In the first project in Chapter II we considered the problem of testing for constant nonparametric effects in a general semiparametric regression model when there is the potential for interaction between the parametrically and nonparametrically modeled variables. We derived a generalized likelihood ratio test for this hypothesis, showed how to implement it, and gave evidence that it can improve statistical power when compared to standard partially linear models. The second project in Chapter III addressed the issue of score testing for the independence of X and Y in the second analysis of case-control data. The semiparametric efficient approaches can be used to construct semiparametric score tests, but they suffer from a lack of robustness to the assumed model for Y given X. We showed how to adjust the semiparametric score test to make its level/Type I error correct even if the assumed model for Y given X is incorrect, and thus the test is robust. The third project in Chapter IV took up the issue of estimation of a regression function when Y given X follows a homoscedastic regression model. We showed how to estimate the regression parameters in a rare disease case even if the assumed model for Y given X is incorrect, and thus the estimates are model-robust. In the fourth project in Chapter V we developed novel AIC and BIC-type methods for estimating the smoothing parameters in a joint model of paired, hierarchical sparse functional data, and showed in our numerical work that they are many times faster than 10-fold crossvalidation while at the same time giving results that are remarkably close to the crossvalidated estimates. In the fifth project in Chapter VI we introduced a practical permutation test that uses cross-validated genetic predictors to determine if the list of genes in question has "good" prediction ability. It avoids overfitting by using cross-validation to derive the genetic predictor and determines if the count of genes that give "good" prediction could have been obtained by chance. This test was then used to explore gene expression of colonic tissue and exfoliated colonocytes in the fecal stream to discover similarities between the two.

Book Advances in Semi parametric Regression Modeling

Download or read book Advances in Semi parametric Regression Modeling written by Hwang-Dae Kim and published by . This book was released on 2010 with total page 163 pages. Available in PDF, EPUB and Kindle. Book excerpt: Highly nonlinear relationships between a response variable and a set of regressor variables can be difficult to model across the entire regressor spaces. Parametric regression models can be biased by missing important anomalies associated with the nonlinear structure. On the other hand, nonparametric regression models may result in more variable estimates, especially for small sample sizes. Semi-parametric regression modeling produces a hybrid fit combining assets of both the parametric and nonparametric regression approaches. This dissertation will address issues and extensions of semi-parametric models for small sample design settings. A detailed case study motivating the utility of semi-parametric regression within aeronautics is provided. Critical to the success of semi-parametric regression is an effective method of choosing optimal smoothing parameters. Weighted cross-validation has been shown to be most effective in choosing optimal smoothing parameters for semi-parametric models. We demonstrate that the type of weighted cross-validation that is most appropriate largely depends upon the magnitude of the underlying variation relative to roughness/bumpiness of the true mean surface. Two data sets from the literature serve as the basis for extensive simulation studies which demonstrate the utility of our recommendations. Two other weighted cross-validation methods are developed for semi-parametric regression and guidance is offered as to the appropriateness of their use.

Book Nonparametric and Semiparametric Regression with Missing Data

Download or read book Nonparametric and Semiparametric Regression with Missing Data written by Lu Wang and published by . This book was released on 2008 with total page 280 pages. Available in PDF, EPUB and Kindle. Book excerpt: In this dissertation, we consider nonparametric and semiparametric regression for both independent and longitudinal data with missing at random (MAR). The thesis consists of three chapters. In chapter 1, we focus on nonparametric regression of a scalar outcome on a covariate when the outcome is MAR. We show that the usual nonparametric kernel regression estimation based only on complete cases is generally inconsistent. We propose inverse probability weighted (IPW) kernel estimating equations (KEEs) and a class of augmented IPW (AIPW) KEEs. Both approaches do not require specification of a parametric model for the error distribution. We show that the IPW kernel estimator is consistent when the probability that a sampling unit is observed, i.e., the selection probability, is known by design or is estimated using a correctly specified model. We further show that the AIPW kernel estimator is double-robust in the sense that it is consistent if either the model for the selection probability or the model for the conditional mean of the outcome given covariates and auxiliary variables is correctly specified, not necessarily both. We argue that adequate augmentation terms in the AIPW KEEs help increase the efficiency of the estimator. We study the asymptotic properties of the proposed IPW and AIPW kernel estimators, perform simulations to evaluate their finite sample performance, and apply to the analysis of the AIDS Costs and Services Utilization Survey data.

Book Exploring Research Frontiers in Contemporary Statistics and Econometrics

Download or read book Exploring Research Frontiers in Contemporary Statistics and Econometrics written by Ingrid Van Keilegom and published by Springer Science & Business Media. This book was released on 2011-09-15 with total page 276 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book collects contributions written by well-known statisticians and econometricians to acknowledge Léopold Simar’s far-reaching scientific impact on Statistics and Econometrics throughout his career. The papers contained herein were presented at a conference in Louvain-la-Neuve in May 2009 in honor of his retirement. The contributions cover a broad variety of issues surrounding frontier estimation, which Léopold Simar has contributed much to over the past two decades, as well as related issues such as semiparametric regression and models for censored data. This book collects contributions written by well-known statisticians and econometricians to acknowledge Léopold Simar’s far-reaching scientific impact on Statistics and Econometrics throughout his career. The papers contained herein were presented at a conference in Louvain-la-Neuve in May 2009 in honor of his retirement. The contributions cover a broad variety of issues surrounding frontier estimation, which Léopold Simar has contributed much to over the past two decades, as well as related issues such as semiparametric regression and models for censored data.

Book Mixed Effects Models for Complex Data

Download or read book Mixed Effects Models for Complex Data written by Lang Wu and published by CRC Press. This book was released on 2009-11-11 with total page 431 pages. Available in PDF, EPUB and Kindle. Book excerpt: Although standard mixed effects models are useful in a range of studies, other approaches must often be used in correlation with them when studying complex or incomplete data. Mixed Effects Models for Complex Data discusses commonly used mixed effects models and presents appropriate approaches to address dropouts, missing data, measurement errors, censoring, and outliers. For each class of mixed effects model, the author reviews the corresponding class of regression model for cross-sectional data. An overview of general models and methods, along with motivating examples After presenting real data examples and outlining general approaches to the analysis of longitudinal/clustered data and incomplete data, the book introduces linear mixed effects (LME) models, generalized linear mixed models (GLMMs), nonlinear mixed effects (NLME) models, and semiparametric and nonparametric mixed effects models. It also includes general approaches for the analysis of complex data with missing values, measurement errors, censoring, and outliers. Self-contained coverage of specific topics Subsequent chapters delve more deeply into missing data problems, covariate measurement errors, and censored responses in mixed effects models. Focusing on incomplete data, the book also covers survival and frailty models, joint models of survival and longitudinal data, robust methods for mixed effects models, marginal generalized estimating equation (GEE) models for longitudinal or clustered data, and Bayesian methods for mixed effects models. Background material In the appendix, the author provides background information, such as likelihood theory, the Gibbs sampler, rejection and importance sampling methods, numerical integration methods, optimization methods, bootstrap, and matrix algebra. Failure to properly address missing data, measurement errors, and other issues in statistical analyses can lead to severely biased or misleading results. This book explores the biases that arise when naïve methods are used and shows which approaches should be used to achieve accurate results in longitudinal data analysis.

Book Three Essays on Nonparametric and Semiparametric Regression Models

Download or read book Three Essays on Nonparametric and Semiparametric Regression Models written by Feng Yao and published by . This book was released on 2004 with total page 111 pages. Available in PDF, EPUB and Kindle. Book excerpt: