(130c) Opportunities for Text Mining in Service of Chemical Engineering
AIChE Spring Meeting and Global Congress on Process Safety
2019
2019 Spring Meeting and 15th Global Congress on Process Safety
Industry 4.0 Topical Conference
Emerging Technologies in Data Analytics I
Tuesday, April 2, 2019 - 4:45pm to 5:15pm
Text mining is a field that brings together an understanding of natural language as an unstructured form of data and machine learning as a suite of modeling tools to layer structure over it. Using machine learning paradigms like probabilistic graphical models and deep learning, it is possible to architect modeling tools that are able to extract latent structure found in text. By identifying and extracting the latent structure found in text, it is possible to transform this unstructured data into structured data, at which point it is possible to apply a plethora of modeling tools to identify trends over time and even plausibly causal relations between events and states. A major goal of research in the area of text mining is to achieve robustness in the face of noisy data. With the big data revolution, new storehouses of vast amounts of textual have become available. While this data is far less well-structured than forms of language that were the target of work in natural language processing decades ago, now research on text mining applied to even the noisiest of this data (for example, from sources such as Twitter or Reddit) are commonplace.
This talk will offer a brief overview of state-of-the-art methodologies for applied machine learning, with pointers to resources and further instruction.