La distribución binomial negativa frente a la de Poisson en el análisis de fenómenos recurrentes

Navarro, A.; Utzet, F.; Puig, P.; Caminal, J.; Martín, M.

doi:10.1016/S0213-9111(01)71599-3

Gaceta Sanitaria

ISSN: 0213-9111

Gaceta Sanitaria es la revista científica y órgano de expresión de la Sociedad Española de Salud Pública y Administración Sanitaria (SESPAS).
Gaceta Sanitaria acepta para su publicación artículos en español e inglés. Es una revista Open Access (OA); todos sus artículos son accesibles libremente sin cargo para el usuario y se distribuyen con la licencia Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0 Internacional

Convocatoria de envío de manuscritos. Monográfico sobre Reformas sanitarias en Latinoamérica y el Caribe

Convocatoria abierta y permanente para incorporar Editores/as asociados/as al Comité Editorial de Gaceta Sanitaria

Call for papers. Special Issue on Healthcare reforms in Latin America and the Caribbean

Open and permanent call to incorporate Associate Editors to the Editorial Committee of Gaceta Sanitaria

Indexada en:

Scopus, Medline, Directory of Open Access Journals (DOAJ), Social Sciences Citation Index (SSCI), Science Citation Index Expanded (SCIE), SCImago Journal Rank (SJR), SNIP

En el contexto de los fenómenos recurrentes, el análisis mediante la regresión de Poisson puede provocar sobredispersión o variancia extra-Poisson. Esto conduce a la subestimación de los errores estándares de los coeficientes, pudiendo derivar en la significación estadística de factores que realmente no estén asociados con el fenómeno. La binomial negativa puede captar parte de la variancia que no identifica la regresión de Poisson. Para comprobarlo se comparó ambas distribuciones sobre el número de hospitalizaciones que presentaron individuos, entre 65 y 69 años de edad, durante el año 1996. Esta comparación fue realizada en dos bases de datos agregadas distintas: por individuo y según las variables de interés.

Resultados

El ajuste mediante ambas distribuciones presenta diferencias en las dos bases de datos. Según el estudio de los residuos, en la base por individuo la binomial negativa ajusta correctamente el 67,9% de las observaciones mal ajustadas por la regresión de Poisson. Este porcentaje es del 50% en la base agregada según las variables. Además, en ambos casos, la regresión de Poisson estima significativas cuatro de las seis variables estudiadas. Para la binomial negativa son dos en la base por individuo y una en la base por variables.

Conclusiones

La existencia de sobredispersión es frecuente en fenómenos recurrentes. Cuando esto sucede, el uso de la binomial negativa es más apropiado que el de la regresión de Poisson.

Palabras clave:

Binomial negativa

Sobredispersión

Extra-Poisson

Episodios recurrentes

Poisson

Summary

Objective

The aim is to unfold the difficulties likely to arise in risk calculations through aggregated database when the studied phenomenon is recurrent and to display the negative binomial distribution as a valid and simple alternative to analyse this kind of phenomenon.

Methods

When the studied phenomenon is recurrent, the analysis by means of the Poisson regression can provoke overdispersion or extra-poisson variance, what leads to underestimating the standard errors in coefficients and may divert into the statistical significance of factors which as a matter of fact are not associated with the phenomenon beforehand. The negative binomial can grasp part of the variance which the Poisson is unable to identify. In order to check this out, the fit of both distributions were compared, based on the number of hospitalizations of individuals aged between 65 and 69, during 1996. This comparison was carried out by means of two different aggregated databases: by individuals and by variables.

Results

There were differences in the fitted models by means of both distributions in both databases. By the analysis of the residuals, when using the base by individuals, the negative binomial fits correctly 67.9% of the observations badly fitted by the Poisson. Using the aggregated variables database, the percentage is 50%. In both cases, Poisson estimates four out of the six studied variables as significant. As to the negative binomial, there are two significant based on individuals and one in the variable database.

Conclusion

The existence of overdispersion is frequent in recurrent-type phenomena. When this occurs, the negative binomial distribution is more appropiate than the Poisson.

Key words:

Negative binomial

Overdispersion

Extra-Poisson

Recurrent events

Poisson

El Texto completo está disponible en PDF

Biblografía

[1.]

R.G. Cumming, J.L. Kelsey, M.C. Nevitt.

Methodologic issues in the study of frequent and recurrent health problems: falls in the elderly.

Ann Epidemiol, 1 (1990), pp. 49-56

Medline

[2.]

J.K. Lindsey.

Counts and times to events.

Statist Med, 17 (1998), pp. 1745-1751

[3.]

J.K. Lindsey.

Models for repeated measuraments.

[4.]

J.F. Lawless.

Negative binomial and mixed Poisson regression.

Can J Stat, 15 (1987), pp. 209-225

[5.]

S.P. Miaou.

The relationship between truck accidents and geometric design of road sections: Poisson versus negative binomial regressions.

Accid Anal Prev, 26 (1994), pp. 471-482

Medline

[6.]

W.N. Venables, B.D. Ripley.

Modern applied statistics with S-Plus.

2.ª,

[7.]

R.J. Glynn, T.A. Stukel, S.M. Sharp, T.A. Bubolz, J.L. Freeman, E.S. Fisher.

Estimating the variance of standarized rates of recurrent events, with application to hospitalizations among the elderly in New England.

Am J Epidemiol, 7 (1993), pp. 776-786

[8.]

J.K. Lindsey.

Introductory statistics. A modelling approach.

[9.]

P. McCullagh, J.A. Nelder.

Generalized linear models.

2.ª,

[10.]

S-PLUS 4.5 Professional edition for Windows.

[11.]

J. Caminal.

Las hospitalizaciones por Ambulatory Care Sensitive Consitions: Un indicador de la capacidad de resolución de la atención primaria de salud [Tesis doctoral].

[12.]

Web del Institut d'Estadística de Catalunya. Disponible en: http://www.idescat.es/

[13.]

D.A. Pierce, D.W. Schafer.

Residuals in Generalized Linear Models.

J Am Stat Assoc, 81 (1986), pp. 977-986

[14.]

N.E. Breslow.

Extra-Poisson variation in log-linear models.

Appl Statist, 33 (1984), pp. 38-44

[15.]

D. Clayton.