(107c) Regression Strategies for Large Data Sets

Conference

AIChE Spring Meeting and Global Congress on Process Safety

Year

2017

Proceeding

2017 Spring Meeting and 13th Global Congress on Process Safety

Group

3rd Big Data Analytics

Session

Big Data Analytics and Smart Manufacturing I

Time

Tuesday, March 28, 2017 - 2:30pm to 3:00pm

Authors

Cross, J. III - Presenter, MathWorks

ABSTRACT for AIChE 2017 Spring Meeting

Regression Strategies for Large Data Sets

James C Cross III

The MathWorks, Inc.

james.cross@mathworks.com

617-605-5818

Plant operators and engineers are increasingly using
historian data to gain insight into the relationships among process
parameters and product attributes. A universal goal is to determine
the operating conditions that produce the highest product output and/or quality
per unit cost. Models provide insight, but seldom account for the myriad physical
nuances, some of which exert significant influence.

The quantity of data that must be analyzed in pursuit of the
sought inferences is invariably large, consisting of hundreds or thousands of
quantities sampled at many millions of points in time. It is seldom possible
to hold data sets of this size in the memory of the available computing resources.
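
To make the scale concrete, the short sketch below estimates the memory needed to hold such a data set as a dense array. The specific counts (1,000 quantities, 10 million time points) and the 8-byte double-precision storage are illustrative assumptions chosen to match the ranges described above; they are not figures from the paper.

```python
def dense_size_gb(n_rows: int, n_cols: int, bytes_per_value: int = 8) -> float:
    """Return the size in gigabytes (10^9 bytes) of a dense n_rows x n_cols array."""
    return n_rows * n_cols * bytes_per_value / 1e9


# Hypothetical historian extract: 1,000 quantities at 10 million time points,
# stored as 8-byte floating point values (all three numbers are assumptions).
size_gb = dense_size_gb(n_rows=10_000_000, n_cols=1_000)
print(f"{size_gb:.0f} GB")  # prints "80 GB"
```

Under these assumptions the raw array alone is about 80 GB, before any intermediate copies an analysis would create.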

A number of frameworks for partitioning large data sets have
been developed and popularized, though adoption by industrial companies has
been notably slower than by IT-centric enterprises. This paper endeavors to
demystify the processing of out-of-memory data for an engineering audience.

Regression, which is fundamental to data analysis, serves as the
motivating application for these techniques. The paper begins with the
computation of simple statistics. Subsequently, some strategies for matrix
manipulations (relevant to both direct and iterative methods) are discussed.
Finally, an illustrative example of a large data set regression is presented.
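
The abstract does not give the paper's algorithms, so the following is only a minimal sketch of one common way to fit a linear regression without loading all rows at once: accumulate the small matrices X^T X and X^T y chunk by chunk, then solve the normal equations. The function names, the synthetic data generator, and the chunk sizes are hypothetical and serve only to make the example runnable; in practice the chunks would come from a file or historian reader.

```python
import numpy as np


def chunked_least_squares(chunks, n_features):
    """Fit y ~ b0 + X @ b by accumulating X^T X and X^T y over chunks.

    `chunks` is any iterable yielding (X, y) pairs that each fit in memory.
    Only a (p x p) matrix and a length-p vector are kept between chunks.
    """
    p = n_features + 1                      # +1 for the intercept column
    xtx = np.zeros((p, p))
    xty = np.zeros(p)
    n_rows = 0
    for X, y in chunks:
        A = np.column_stack([np.ones(len(X)), X])  # prepend a column of ones
        xtx += A.T @ A
        xty += A.T @ y
        n_rows += len(y)
    beta = np.linalg.solve(xtx, xty)        # solve the normal equations
    return beta, n_rows


def synthetic_chunks(n_chunks, rows_per_chunk, true_beta, seed=0):
    """Hypothetical stand-in for a chunked reader: yields noisy linear data."""
    rng = np.random.default_rng(seed)
    for _ in range(n_chunks):
        X = rng.normal(size=(rows_per_chunk, len(true_beta) - 1))
        noise = 0.01 * rng.normal(size=rows_per_chunk)
        y = true_beta[0] + X @ true_beta[1:] + noise
        yield X, y


true_beta = np.array([2.0, 0.5, -1.5, 3.0])  # intercept, then three slopes
beta, n_rows = chunked_least_squares(
    synthetic_chunks(n_chunks=20, rows_per_chunk=5_000, true_beta=true_beta),
    n_features=3,
)
print(n_rows, np.round(beta, 2))
```

Because the first column of each chunk is all ones, the accumulated X^T X also carries the row count and the per-column sums, so simple statistics such as column means come from the same pass. Forming X^T X squares the condition number of the problem, so for poorly conditioned data a chunked QR factorization is generally the safer direct method.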

Topics

Computing and Systems Engineering

Plant Operations

Process Design & Development
