Poisson Regression Models for Count Data: Use in the Number of Deaths in the Santo Angelo (Brazil)


 Deaths, Poisson regression models, Overdispersion

How to Cite

Suzana Russo, Diego Flender, & Gabriel Francisco da Silva. (2021). Poisson Regression Models for Count Data: Use in the Number of Deaths in the Santo Angelo (Brazil). Journal of Basic & Applied Sciences, 8(2), 266–269. https://doi.org/10.6000/1927-5129.2012.08.02.01


When speaking about data, presuppose its good quality otherwise the accuracy of information would be affected, which would lead to false interpretations. In Health Statistics data is obtained through surveys presented in its simplest expression, taking advantage of existing records; making an inquiry or by means of experiments. The rational organization of the data allows characterizing the priority issues and thus establishing health programs. To analyze the mortality data it is necessary to consider the mortality rate of certain age groups, so that we can find data which shows the prevalence of major groups of deaths. The analysis of data is followed by subsequent formulation of the Poisson regression models, where each group in question by age group is represented by a number of counting time. The Poisson regression model is a specific type of Generalized Linear Models (GLM) and non-linear. As [1], its main features are: a) to provide, in general, a satisfactory description of experimental data whose variance is proportional to the mean. b) It can be deduced theoretically from the first principles with a minimum of restrictions c) If events occur independently and randomly in time with constant average rate of occurrence, the model determines the number of time specified. At the end of this study, it could be seen through the analysis of the data that the age group from 70 to 79 years old sustains the highest incidence of deaths with 21.1%. Then comes the range of 60 to 69 years old with the morality rate of 20%. This was recorded for the time worked in January 2000 to December 2004. The death rate was 52.27and variance was equal to 102.43 in the city of Santo Angelo (Brazil). It was further found that the data analyzed over dispersion variance greater than average. AS a result it was necessary to remove the over dispersion to find the appropriate template. With the pattern found, some short-term forecasts were made.



Cordeiro GM. Modelos Lineares Generalizados. São Paulo, Campinas UNICAMP/UFPE: 1986.

Nelder JA, Wedderburn R, W., M. Generalized linear models. J Royal Statist Soc v. 1972; 135: pp. 370-384. http://dx.doi.org/10.2307/2344614

Dobson AJ. An introduction to generalized linear models. 2 ed. Chapman & Hall/CRC Press pp. 225.2002.

Cordeiro GM. Introdução à Teoria de verossimilhança. Livro Texto do 10º Simpósio Nacional de Probabilidade e Estatística. UFRJ/ABE. Rio de Janeiro 1992.

Ferrari SLP, David JSE, André PA, Pereira LAA. Use of overdispersed regression models in analyzing the association between air pollution and human health. Relatório Técnico, RTMAE-2002-10, IME-USP 2002.

Rippon P, Rayner J. Assessing Poisson and Logistic Regression Models Using Smooth Tests. Research Online 2011: pp. 1-4.

Schafer JL. Analyses of incomplete multivariate data. London Chapman & Hall 1997.

Mccullagh P, Nelder JA. Generalized Linear Models. Third Edition. New York: Chapman and Hall/CRC. Reprint 1989.

Piegorsch WW. An introduction to binary response regression and associated trend analyses. J Qualit Technol 1998; 30(3): 269-81.

Wang P, Puterman ML, Cockburn LEN. Mixed Poisson Regression Models With Covariate Dependent Rates. Biometrics 1996; 52: pp. 381-400. http://dx.doi.org/10.2307/2532881

Breslow NE. Extra-Poisson variation in log-linear models. Appl Statist 1984; 33: 38-44. http://dx.doi.org/10.2307/2347661

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright (c) 2021 Suzana Russo, Diego Flender , Gabriel Francisco da Silva