Informatics and Applications
2015, Volume 9, Issue 3, pp 216
ANALYSIS OF SURVEY DATA CONTAINING ROUNDED CENSORING INTERVALS
 Yu. K. Belyaev
 B. Kristrom
Abstract
This paper makes a contribution towards the statistical analysis of data sets containing intervals, that naturally arises in survey contexts. The suggested approach is sufficiently general to cover most cases where interval data are used. Interval data appear in many contexts, such as in reliability studies and survival analysis, in medicine and economics, in opinion elicit surveys, etc. There are several reasons for the extensive use of interval data, perhaps, the most common being one of necessity; exact values of the underlying observations are censored. The nature of the intervals analyzed here is somewhat unusual. The selfselected intervals (SeSeI) are (freely) chosen by the subjects. A generalization of the influential approach has been suggested to the statistical analysis of general censoring introduced by B. W Turnbull. A key independence assumption in Turnbull's analysis has been explained and generalized. A sampling stopping rule based on the coverage probability has been suggested and the properties of a twostep estimator, based on the idea of asking two questions, where the second involves a way of finegraining the information, has been discussed. This paper provides several informatics methods for SeSeI, targeting the problem of partial nonparametric identification. The properties of the suggested statistical models are stated, including a recursion for easy numerical calculations. An extensive simulation study, displaying, inter alia, the usefulness of the proposed resampling methods for the situation under study, completes the paper.
[+] References (24)
 Manski, C. F 1999. Identification problems in the social sciences. Harvard University Press. 194 p.
 Morgan, M. G., and M. Henrion. 1990. Uncertainty: A guide to dealing with uncertainty in quantitative risk and policy analysis. Cambridge University Press. 325 p.
 Billard, L., and E. Diday. 2006. Symbolic data analysis: Conceptual statistics and data mining. Wiley ser. in computational statistics. Wiley. Vol. 636. 330 p.
 Manski, C. F., and F Molinari. 2010. Rounding probabilistic expectations in surveys. J. Bus. Econ. Stat. 28:219 231.
 Johansson, P.O., and B. Kristrom. 2012. The economics of evaluating water projects. Hydroelectricity versus other uses. Heidelberg: Springer. 135 p.
 Belyaev, Y., and B. Kristrom. 2010. Approach to analysis of selfselected interval data. Umea: CERE. Working Paper 2010:2. 134. Available at: http:// www.cere.se/se/forskning/publikationer/155approach toanalysisofselfselectedintervaldata.html (accessed September 8, 2015).
 Belyaev, Y., and B. Kristrom. 2012. Twostep approach to selfselected interval data in elicitation surveys. Umea: CERE. Working Paper 2012:10. 146. Available at: http://www.cere.se/se/forskning/publikationer/386 twostepapproachtoselfselectedintervaldatain elicitationsurveys.html (accessed September 8, 2015).
 Turnbull, B. W 1974. Nonparametric estimation of a survivorship function with doubly censored data. J. Am. Stat. Assoc. 69:169173.
 Turnbull, B.W. 1976. The empirical distribution function with arbitrarily grouped, censored and truncated data. J. Roy. Stat. Soc. B 38:290295.
 Manski, C. F 2003. Partial identification of probability distributions. Springer ser. in statistics. Springer. 196 p.
 Carson, R. 2012. Contingent valuation: A comprehensive bibliography and history. Edward Elgar Publishing. 464 p.
 Hakansson, C. 2008. A new valuation question  analysis of and insights from interval open ended data in contingent valuation. Environ. Resour. Econ. 39(2):175188.
 Broberg, T, and R. Brannlund. 2008. An alternative interpretation of multiple bounded WTP datacertainty dependent payment card intervals. Energy Resour. Econ. 30:555567.
 Mahieu, P., P. Riera, B. Kristroom, R. Broannlund, and M. Giergiczny. 2014. Exploring the determinants of uncertainty in contingent valuation surveys. J. Environ. Econ. Policy 3(2):186200. Available at: http://dx.doi.org/10.1080/21606544.2013.876941 (accessed September 3, 2015).
 Good, I. J. 1953. The population frequencies of species and the estimation of population parameters. Biometrika 40(34):237264.
 Belyaev, Y., and B. Kristrom. 2013. Analysis of contingent valuation data with selfselected rounded WTPintervals collected by twosteps sampling plans. 9th Tartu Conference on Multivariate Statistics and 20th IWMSProceedings. Tartu: World Scientific. 4860.
 Gomez, J., M. Calle, and R. Oller. 2004. Frequentist and Bayesian approaches for intervalcensored data. Stat. Pap. 45:139173.
 Gomez, J., M. Calle, R. Oller, and K. Langhor. 2009. Tutorial on methods for intervalcensored data and their implementations in R. Stat. Model. 9(4):259297.
 Gentleman, R., and C. J. Geyer. 1994. Maximum likelihood for interval censored data: Consistency and computation. Biometrika 81(3):618623.
 Brinkhuis, J., and V. Tihomirov. 2005. Optimization: Insights and applications. PrincetonOxford: Princeton University Press. 680 p.
 Jammalamadaka, S. R., and V. Mangalam. 2003. Nonparametric estimation for middlecensored data. J. Non parametr. Stat. 15:253265.
 Belyaev, Y. K., and L. Nilsson. 1997. Parametric maximum likelihood estimators. Department ofMathematical Statistics, Umea University. Research Report 199715. 128.
 Belyaev, Y. K. 2003. Necessary and sufficient conditions for consistency of resampling. Sweden: Centre of Biostochastics, Swedish University of Agricultural Sciences. Research Report 20031. 126. Available at: http://biostochastics.slu.se/publikationer/ dokument/Report2003_l.pdf (accessed September 3, 2015).
 Klein, J. P., and M. L. Moeschberger. 2003. Survival analysis: Techniques for censored and truncated data. New York, N.Y.: SpringerVerlag. 536 p.
[+] About this article
Title
ANALYSIS OF SURVEY DATA CONTAINING ROUNDED CENSORING INTERVALS
Journal
Informatics and Applications
2015, Volume 9, Issue 3, pp 216
Cover Date
20150230
DOI
10.14357/19922264150301
Print ISSN
19922264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
elicitation surveys; random sampling; rounding; anchoring; coverage probability; likelihood; recursion; maximization; resampling
Authors
Yu. K. Belyaev and B. Kristrom
Author Affiliations
Department of Mathematics and Mathematical Statistics, Ume a University, Ume a SE901 87, Sweden, yuri.belyaev@umu.se
Center for Environmental and Resource Economics (CERE), Swedish University of Agricultural Sciences, Ume a SE901 83, Sweden,
bengt.kristrom@umu.se
