(114a) The Data Analytics Triangle
AIChE Spring Meeting and Global Congress on Process Safety
2017
2017 Spring Meeting and 13th Global Congress on Process Safety
3rd Big Data Analytics
Invited Tutorial Session - Approaches in Big Data Analytics I
Tuesday, March 28, 2017 - 1:30pm to 2:15pm
When analyzing real-world data sets, there is no one-size-fits-all algorithm. This talk discusses the importance of and methods for interrogating a dataset to understand its properties and better select appropriate data analytics methods. The presentation uses the framework of the âData Analytics Triangle,â which organizes datasets by associating each of the vertices of the triangle with one of three characteristics: nonlinearity, correlation, and dynamics. The edges of the triangle represent datasets that have two of these characteristics, whereas the center represents datasets that have all three characteristics. We then associate each of these parts of the triangle with different data analytics methods, so that the data integration and data analytics triangle guide the user to the best data analytics methods to apply for a particular data set. The final algorithm to apply then depends both and the properties of the dataset and the goal of the data-based model (e.g., prediction, classification).