Prelims

The Econometrics of Complex Survey Data

ISBN: 978-1-78756-726-9, eISBN: 978-1-78756-725-2

ISSN: 0731-9053

Publication date: 10 April 2019

Citation

(2019), "Prelims", The Econometrics of Complex Survey Data (Advances in Econometrics, Vol. 39), Emerald Publishing Limited, Leeds, pp. i-xi. https://doi.org/10.1108/S0731-905320190000039005

Publisher

:

Emerald Publishing Limited

Copyright © 2019 Emerald Publishing Limited


Half Title Page

THE ECONOMETRICS OF COMPLEX SURVEY DATA: THEORY AND APPLICATIONS

Series Page

ADVANCES IN ECONOMETRICS

Series editors: Thomas B. Fomby, R. Carter Hill, Ivan Jeliazkov, Juan Carlos Escanciano, Eric Hillebrand, Daniel L. Millimet, Rodney Strachan, David T. Jacho-Chávez, Alicia Rambaldi

Previous Volumes

Volume 21: Modelling and Evaluating Treatment Effects in Econometrics – Edited by Daniel L. Millimet, Jeffrey A. Smith and Edward Vytlacil
Volume 22: Econometrics and Risk Management – Edited by Jean-Pierre Fouque, Thomas B. Fomby and Knut Solna
Volume 23: Bayesian Econometrics – Edited by Siddhartha Chib, Gary Koop, Bill Griffiths and Dek Terrell
Volume 24: Measurement Error: Consequences, Applications and Solutions – Edited by Jane Binner, David Edgerton and Thomas Elger
Volume 25: Nonparametric Econometric Methods – Edited by Qi Li and Jeffrey S. Racine
Volume 26: Maximum Simulated Likelihood Methods and Applications – Edited by R. Carter Hill and William Greene
Volume 27A: Missing Data Methods: Cross-Sectional Methods and Applications – Edited by David M. Drukker
Volume 27B: Missing Data Methods: Time-Series Methods and Applications – Edited by David M. Drukker
Volume 28: DSGE Models in Macroeconomics: Estimation, Evaluation and New Developments – Edited by Nathan Balke, Fabio Canova, Fabio Milani and Mark Wynne
Volume 29: Essays in Honor of Jerry Hausman – Edited by Badi H. Baltagi, Whitney Newey, Hal White and R. Carter Hill
Volume 30: 30th Anniversary Edition – Edited by Dek Terrell and Daniel Millmet
Volume 31: Structural Econometric Models – Edited by Eugene Choo and Matthew Shum
Volume 32: VAR Models in Macroeconomics – New Developments and Applications: Essays in Honor of Christopher A. Sims – Edited by Thomas B. Fomby, Lutz Kilian and Anthony Murphy
Volume 33: Essays in Honor of Peter C. B. Phillips – Edited by Thomas B. Fomby, Yoosoon Chang and Joon Y. Park
Volume 34: Bayesian Model Comparison – Edited by Ivan Jeliazkov and Dale J. Poirier
Volume 35: Dynamic Factor Models – Edited by Eric Hillebrand and Siem Jan Koopman
Volume 36: Essays in Honor of Aman Ullah – Edited by Gloria Gonzalez-Rivera, R. Carter Hill and Tae-Hwy Lee
Volume 37: Spatial Econometrics – Edited by Badi H. Baltagi, James P. LeSage, and R. Kelley Pace
Volume 38: Regression Discontinuity Designs: Theory and Applications – Edited by Matias D. Cattaneo and Juan Carlos Escanciano

Title Page

ADVANCES IN ECONOMETRICS VOLUME 39

THE ECONOMETRICS OF COMPLEX SURVEY DATA: THEORY AND APPLICATIONS

EDITED BY

KIM P. HUYNH

Bank of Canada, Canada

DAVID T. JACHO-CHÁVEZ

Emory University, USA

GAUTAM TRIPATHI

University of Luxembourg, Luxembourg

United Kingdom – North America – Japan – India – Malaysia – China

Copyright Page

Emerald Publishing Limited

Howard House, Wagon Lane, Bingley BD16 1WA, UK

First edition 2019

Copyright © 2019 Emerald Publishing Limited

Reprints and permissions service

Contact:

No part of this book may be reproduced, stored in a retrieval system, transmitted in any form or by any means electronic, mechanical, photocopying, recording or otherwise without either the prior written permission of the publisher or a licence permitting restricted copying issued in the UK by The Copyright Licensing Agency and in the USA by The Copyright Clearance Center. Any opinions expressed in the chapters are those of the authors. Whilst Emerald makes every effort to ensure the quality and accuracy of its content, Emerald makes no representation implied or otherwise, as to the chapters’ suitability and application and disclaims any warranties, express or implied, to their use.

British Library Cataloguing in Publication Data

A catalogue record for this book is available from the British Library

ISBN: 978-1-78756-726-9 (Print)

E-ISBN 978-1-78756-725-2 (Online)

ISBN: 978-1-78756-727-6 (Epub)

ISSN: 0731-9053 (Series)

List of Contributors

Marco Angrisani University of Southern California, USA
Gustavo J. Canavire-Bacarreza Inter-American Development Bank, USA
Heng Chen Bank of Canada, Canada
Luc Clair University of Winnipeg, Canada and Canadian Centre for Agri-Food Research in Health and Medicine, Canada
Antonio Cosma University of Luxembourg, Luxembourg
Brian Finley University of Southern California, USA
Geoffrey R. Gerdes Federal Reserve Board of Governors, USA
Christopher S. Henry Bank of Canada, Canada School of Economics/CERDI, University of Auvergne, France
Tamás Ilyés Magyar Nemzeti Bank, Hungary
Arie Kapteyn University of Southern California, USA
Andreï V. Kostyrka University of Luxembourg, Luxembourg
Jae Kwang Kim Iowa State University, USA
Steven F. Lehrer Queen’s University, Canada
Louis-Pierre Lepage University of Michigan, USA
Xuemei Liu Federal Reserve Board of Governors, USA
Alexander L. Lundberg West Virginia University, USA
James G. MacKinnon Queen’s University, Canada
Alejandra Montoya-Agudelo Universidad EAFIT, Colombia
Matthew D. Webb Carleton University, Canada
Iraj Rahmani Nazarbayev University, Kazakhstan
Q. Rallye Shen Bank of Canada, Canada
Gautam Tripathi University of Luxembourg, Luxembourg
Jeffrey M. Wooldridge Michigan State University, USA
Shu Yang North Carolina State University, USA

Introduction

The assumption of simple random sampling is widely used in applied research in the social, behavioral and biomedical sciences, as well as in empirical public policy analysis. However, this assumption is seldom true in practice. Stratified and cluster sampling are routinely used by most statistical agencies in the world, and because of budgetary reasons, the actual sampling process may be even more complicated. Correct statistical analysis therefore requires a careful consideration of these complex survey designs when performing estimation and inference.

The papers in this volume of Advances in Econometrics were presented at the “Econometrics of Complex Survey Data: Theory and Applications” conference organized by the Bank of Canada, Ottawa, Canada, from October 19 to 20, 2017. The editors would like to acknowledge the generous financial support provided by the Bank of Canada.

Below is a brief overview of the papers accepted in this volume, grouped into the following four categories: (1) sampling design; (2) variance estimation; (3) estimation and inference and (4) business, household and crime surveys.

Sampling Design

“Can Internet Match High Quality Traditional Surveys? Comparing the Health and Retirement Study and Its Online Version” by Marco Angrisani, Brian Finley and Arie Kapteyn revisit the question of comparability of online and more traditional interview modes by studying differences across Internet-based, face-to-face and phone-based surveys. They find little evidence of mode effects when comparing various outcomes providing support for internet-based surveys.

“Effectiveness of Stratified Random Sampling for Payment Card Acceptance and Usage” by Christopher S. Henry and Tamás Ilyés uses the universe of merchant cash registers in Hungary to assess the effect of stratified random sampling on estimates of payment card acceptance and usage. It compares county, industry, and store size stratifications to mimic the usual stratification criteria for standard merchant surveys. By doing this, they can quantify the effect on estimates of card acceptance for different sample sizes.

Variance Estimation

“Wild Bootstrap Randomization Inference for Few Treated Clusters” by James G. MacKinnon and Matthew D. Webb proposes a bootstrap-based alternative to randomization inference, which mitigates problems of over- or under-rejection in t tests in pure treatment or difference-in-differences settings when the number of clusters is very small.

“Variance Estimation for Survey-weighted Data Using Bootstrap Resampling Methods: 2013 Methods-of-Payment Survey Questionnaire” by Heng Chen and Q. Rallye Shen proposes a bootstrap-resampling method to estimate variability when sampling units are selected through an approximate stratified two-stage sampling design. Their proposed method allows for randomness from both the sampling design and the raking procedure.

Estimation and Inference

“Model Selection Tests for Complex Survey Samples” by Iraj Rahmani and Jeffrey M. Wooldridge extends Vuong’s model selection test (“Likelihood Ratio Tests for Model Selection and Non-Nested Hypothesis,” Econometrica, 1989) to allow for complex survey samples. By using an M-estimation setting, their test applies to general estimation problems including linear and nonlinear least squares, Poisson regression and fractional response models. With cluster samples and panel data, they show how to combine the weighted objective function with a cluster-robust variance estimator, thereby expanding the scope of their test.

“Inference in Conditional Moment Restriction Models When There is Selection Due to Stratification” by Antonio Cosma, Andre V. Kostyrka and Gautam Tripathi shows how to use a smoothed empirical likelihood approach to conduct efficient semiparametric inference in models characterized as conditional moment equalities when data are collected by variable probability sampling.

“Nonparametric Kernel Regression Using Complex Survey Data” by Luc Clair derives the asymptotic properties of a design-based nonparametric kernel-based regression estimator under a combined inference framework involving multivariate mixed data. It also proposes a least squares cross-validation procedure for selecting the bandwidth for both continuous and discrete variables. Simulation results show that the estimator is consistent and that efficiency gains can be achieved by weighting observations by the inverse of their inclusion probabilities if the sampling is endogenous.

“Nearest Neighbor Imputation for General Parameter Estimation in Survey Sampling” by Shu Yang and Jae Kwang Kim studies the asymptotic properties of the nearest neighbor population imputation estimator of population parameters when handling item nonresponse in survey sampling. When estimating a variance, the authors propose a replication variance estimator.

Business, Household and Crime Surveys

Last but not least, “Improving Response Quality with Planned Missing Data: An Application to a Survey of Banks” by Geoffrey R. Gerdes and Xuemei Liu reports a “random blocking” approach to shortening the questionnaires for individual respondents when collecting data on noncash payments by type, cash withdrawals and deposits, and related information in a survey of a population of depository institutions in the United States. Their approach is a special case of multiple matrix sampling and an extension of a split questionnaire or planned missing value design. They find that the proposed blocking approach helped increase unit-level and item-level response for smaller institutions.

“Does Selective Crime Reporting Influence Our Ability to Detect Racial Discrimination in the NYPD’s Stop-and-frisk Program?” by Steven F. Lehrer and Louis-Pierre Lepage uses data from the New York City’s Stop-and-Frisk program to assess the presence of crime type heterogeneity in racial bias and police officer decisions of reported crime type. They find evidence that differences in biases across crime types are substantial while accounting for sample-selection which may arise from conditioning on crime type.

“Survey Evidence on Black Market Liquor in Colombia” by Gustavo J. Canavire-Bacarreza, Alexander L. Lundberg and Alejandra Montoya-Agudelo uses a unique national survey on illegal liquor commissioned by the Colombian government to estimate the determinants of the demand for smuggled and adulterated liquor. To address unit and item nonresponse, they implement a multiple imputation procedure with chained equations.

Kim P. Huynh

David T. Jacho-Chávez

Gautam Tripathi