Abstract
This paper studies properties of parameter estimators obtained by minimizing a distance between the empirical probability generating function and the probability generating function of a model for count data. Specifically, it is shown that, under certain not restrictive conditions, the resulting estimators are consistent and, suitably normalized, asymptotically normal. These properties hold even if the model is misspecified. Three applications of the obtained results are considered. First, we revisit the goodness-of-fit problem for count data and propose a weighted bootstrap estimator of the null distribution of test statistics based on the above cited distance. Second, we give a probability generating function version of the model selection test problem for separate, overlapping and nested families of distributions. Finally, we provide an application to the problem of testing for separate families of distributions. All applications are illustrated with numerical examples.

Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Baringhaus L, Henze N (1992) A goodness of fit test for the poisson distribution based on the empirical generating function. Stat Probab Lett 13(4):269–274
Burke MD (2000) Multivariate tests-of-fit and uniform confidence bands using a weighted bootstrap. Stat Probab Lett 46:13–20
Cox DR (1961) Tests of separate families of hypothesis. Proceedings of the fourth Berkeley symposium in mathematical statistics and probability. University of California Press, Berkeley, CA, pp 105–123
Cox DR (1962) Further results on tests of separate families of hypothesis. J R Stat Soc B 24:406–424
Delhing H, Mikosch T (1994) Random quadratic forms and the bootstrap for \(U\)-statistics. J Multivar Anal 51:392–413
Doray LG, Jiang SM, Luong A (2009) Some simple method of estimation for the parameters of the discrete stable distribution with the probability generating function. Commun Stat Simul Comput 38:2004–2017
Dowling MM, Nakamura M (1997) Estimating parameters for discrete distributions via the empirical probability generating function. Commun Stat Simul Comput 26:301–313
El-Shaarawi AH, Zhu R, Joe H (2011) Modelling species abundance using the Poisson–Tweedie family. Environmetrics 22:152–164
Epps TW (1995) A test of fit for lattice distributions. Commun Stat Theory Methods 24(6):1455–1479
Feuerverger A, McDunnough P (1984) On statistical transform methods and their efficiency. Can J Stat 12(4):303–317
Gómez-Deniz E, Calderín-Ojeda E (2011) The discrete Lindley distribution: properties and applications. J Stat Comput Simul 81:1405–1416
Gürtler N, Henze N (2000) Recent and classical goodness-of-fit test for the Poisson distribution. J Stat Plan Inference 90:207–225
Heathcote CR (1977) The integrated squared error estimation of parameters. Biometrika 64(2):255–264
Hildebrandt TH, Schoenberg IJ (1933) On linear functional operations and the moment problem for a finite interval in one or several dimensions. Ann Math 34:317–328
Jiménez-Gamero MD, Batsidis A, Alba-Fernández MV (2016) Fourier methods for model selection. Ann Inst Stat Math 68:105–133
Jiménez-Gamero MD, Kim HM (2015) Fast goodness-of-fit tests based on the characteristic function. Comput Stat Data Anal 89:172–191
Jiménez-Gamero MD, Pino-Mejías R, Rufián-Lizana A (2014) Minimum \(K_{\phi }\)-divergence estimators for multinomial models and applications. Comput Stat 29:363–401
Jodrá P, Jiménez-Gamero MD (2016) A note on the Log-Lindley distribution. Insur Math Econom 71:189–194
Kemp CD, Kemp AW (1988) Rapid estimation for discrete distribution. Statistician 37:213–255
Kocherlakota S, Kocherlakota K (1986) Goodness of fit test for discrete distributions. Comm Stat Theory Methods 15(3):815–829
Kojadinovic I, Yan J (2012) Goodness-of-fit testing based on a weighted bootstrap: a fast large sample alternative to the parametric bootstrap. Can J Stat 40:480–500
Kumaraswamy P (1980) Generalized probability density-function for double-bounded random-processes. J Hydrol 46:79–88
Marques M, Pérez-Abreu V (1989) Law of large numbers and central limit theorem for the empirical probability generating function of stationary random sequences and processes. Aport Mat Notas Investig 4:100–109
Meintanis S (2007) A new goodness-of-fit test for certain bivariate distributions applicable to traffic accidents. Stat Methodol 4:22–34
Meintanis S, Bassiakos Y (2005) Goodness-of-fit test for additively closed count models with an application to the generalized Hermite distribution. Sankhya 67:538–552
Meintanis S, Swanepoel J (2007) Bootstrap goodness-of-fit tests with estimated parameters based on empirical transforms. Stat Probab Lett 77:1004–1013
Nakamura M, Pérez-Abreu V (1993) Empirical probability generating function. An overview. Insur Math Econom 12:287–295
Newey WK, McFadden D (1994) Large sample estimation and hypothesis testing. In: Engle R, McFadden D (eds) Handbook of econometrics, vol IV. North-Holland, Amsterdam, pp 2112–2245
Ng CM, Ong SH, Srivastava HM (2013) Parameter estimation by Hellinger type distance for multivariate distributions based upon probability generating function. Appl Math Model 37:7374–7385
Novoa-Muñoz F, Jiménez-Gamero MD (2014) Testing for the bivariate Poisson distribution. Metrika 77:771–793
Rueda R, O’Reilly F (1999) Tests of fit for discrete distributions based on the probability generating function. Commun Stat Simul Comput 28:259–274
Rueda R, Pérez-Abreu V, O’Reilly F (1991) Goodness of fit for the Poisson distribution based on the probability generating function. Commun Stat Theory Methods 20:3093–3110
Sharifdoust M, Ng CM, Ong SH (2016) Probability generating function based Jeffrey’s divergence for statistical inference. Commun Stat Simul Comput 45(7):2445–2458
Sim SZ, Ong SH (2010) Parameter estimation for discrete distributions by generalized Hellinger-type divergence based on probability generating function. Commun Stat Simul Comput 39(2):305–314
van der Vaart AW, Wellner JA (1996) Weak convergence and empirical processes. Springer, New York
Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica 57:307–333
White H (1982a) Regularity conditions for Cox’s test of non-nested hypothesis. J Econom 19:301–315
White H (1982b) Maximum likelihood estimation of misspecified models. Econometrica 50:1–25
Wu CF (1981) Asymptotic theory of nonlinear least-squares estimation. Ann Stat 9:501–513
Zhu L (2005) Nonparametric Monte Carlo test and their applications, Lecture Notes in Statistics. Springer, New York
Acknowledgements
The authors thank the anonymous referees for their constructive comments and suggestions which helped to improve the presentation. M.D. Jiménez-Gamero acknowledges financial support from Grant MTM2014-55966-P of the Spanish Ministry of Economy and Competitiveness.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jiménez-Gamero, M.D., Batsidis, A. Minimum distance estimators for count data based on the probability generating function with applications. Metrika 80, 503–545 (2017). https://doi.org/10.1007/s00184-017-0614-3
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-017-0614-3