Departmental Papers (CBE)

Document Type

Journal Article

Date of this Version

March 2007


Accident databases (NRC, RMP, and others) contain records of incidents (e.g., releases and spills) that have occurred in the USA chemical plants during recent years. For various chemical industries, [Kleindorfer, P. R., Belke, J. C., Elliott, M. R., Lee, K., Lowe, R. A., & Feldman, H. I. (2003). Accident epidemiology and the US chemical industry: Accident history and worst-case data from RMP*Info. Risk Analysis, 23(5), 865–881.] summarize the accident frequencies and severities in the RMP*Info database. Also, [Anand, S., Keren, N., Tretter, M. J., Wang, Y., O’Connor, T. M., & Mannan, M. S. (2006). Harnessing data mining to explore incident databases, the Journal of Hazardous Material, 130, 33–41.] use data mining to analyze the NRC database for Harris County, Texas.

Classical statistical approaches are ineffective for low frequency, high consequence events because of their rarity. Given this information limitation, this paper uses Bayesian theory to forecast incident frequencies, their relevant causes, equipment involved, and their consequences, in specific chemical plants. Systematic analyses of the databases also help to avoid future accidents, thereby reducing the risk.

More specifically, this paper presents dynamic analyses of incidents in the NRC database. The NRC database is exploited to model the rate of occurrence of incidents in various chemical and petrochemical companies using Bayesian theory. Probability density distributions are formulated for their causes (e.g., equipment failures, operator errors, etc.), and associated equipment items utilized within a particular industry. Bayesian techniques provide posterior estimates of the cause and equipment-failure probabilities. Cross-validation techniques are used for checking the modeling, validation, and prediction accuracies. Differences in the plant- and chemical-specific predictions with the overall predictions are demonstrated. Furthermore, extreme value theory is used for consequence modeling of rare events by formulating distributions for events over a threshold value. Finally, the fast-Fourier transform is used to estimate the capital at risk within an industry utilizing the frequency and loss-severity distributions.


Postprint version. Published in Journal of Loss Prevention in the Process Industries, Volume 20, Issue 2, March 2007, pages 113-127. Publisher URL:


risk, frequency modeling, consequence modeling, abnormal events, chemical plants



Date Posted: 01 August 2007

This document has been peer reviewed.