(126c) Extracting Causal Relations from Incident Reports: A Natural Language Processing and Topic Modeling Approach | AIChE

(126c) Extracting Causal Relations from Incident Reports: A Natural Language Processing and Topic Modeling Approach

Authors 

Liu, G. - Presenter, Mary Kay O'Connor Process Safety Center
Quddus, N. - Presenter, Texas A&M University
Boyd, M. - Presenter, Texas A&M University
Extracting causal relations from incident reports: a natural language processing and topic modeling approach

Guanyang Liu, Mason Boyd, and Noor Quddus

Mary Kay O’Connor Process Safety Center, Texas A&M University System, College Station, TX 77840, USA

Abstract

Lessons learned from past incidents are essential to enhancing process safety of chemical industry and should be considered as knowledge legacy that evolves over time for corporate and government. Although a wealth of empirical knowledge has been accumulated from public incident databases and incident investigation reports, learnings are still limited due to the high expense of manual content analysis and lack of methodology to gain insights from past incidents.

Recently there are a few attempts that develop methods to enable automated content analysis of incident reports by natural language processing (NLP) techniques, but with a manual list of key words still needed, the methods are not intelligent or automated enough to extract information that is outside the pre-defined vocabulary. In this work, topic modeling, an advanced NLP technique for text mining, is employed to identify causal relations from incident reports based on unsupervised learning algorithms. The topic model is capable of generating an exhaustive list of the incident causes described in the reports and indicating the potential of identifying root causes with more comprehensive training text data applied in the future work.

Keywords: incident data, natural language processing, topic modeling