Many chemical engineering applications face the challenge of building accurate and interpretable models from sparse, noisy, and costly data. This need arises especially in early-stage process design or when experimenting with novel unit equipment, where data acquisition is limited and prior knowledge is incomplete. Classical regression methods often fail to provide physically meaningful models that generalize beyond specific datasets or operating conditions. To address this issue, a novel workflow for physics-informed symbolic regression is proposed, with the objective to integrate physical knowledge directly into the learning process, ensuring consistency with first-principles models while retaining flexibility for data-driven discovery. The framework enables model discovery that is both interpretable and accurate, even in the presence of noisy and incomplete information. Two application areas are explored. First, the algorithms reconstruct governing equations, algebraic or differential, purely from data, making them useful where no prior models exist. Second, hybrid workflows embed mechanistic models into the regression process, supporting either the discovery of missing phenomena or refinement of existing equations through physically meaningful corrections. A key feature is the ability to interpret and correct plant-model mismatch, transforming error correction into a tool for scientific discovery. This approach transforms error correction into an opportunity to expand engineering knowledge, allowing the model to evolve in a principled way that enhances understanding of the system rather than just improving predictive performance. The resulting models bridge data-driven and first-principles sources of knowledge, yielding equations that are both accurate and physically sound.

Many chemical engineering applications face the challenge of building accurate and interpretable models from sparse, noisy, and costly data. This need arises especially in early-stage process design or when experimenting with novel unit equipment, where data acquisition is limited and prior knowledge is incomplete. Classical regression methods often fail to provide physically meaningful models that generalize beyond specific datasets or operating conditions. To address this issue, a novel workflow for physics-informed symbolic regression is proposed, with the objective to integrate physical knowledge directly into the learning process, ensuring consistency with first-principles models while retaining flexibility for data-driven discovery. The framework enables model discovery that is both interpretable and accurate, even in the presence of noisy and incomplete information. Two application areas are explored. First, the algorithms reconstruct governing equations, algebraic or differential, purely from data, making them useful where no prior models exist. Second, hybrid workflows embed mechanistic models into the regression process, supporting either the discovery of missing phenomena or refinement of existing equations through physically meaningful corrections. A key feature is the ability to interpret and correct plant-model mismatch, transforming error correction into a tool for scientific discovery. This approach transforms error correction into an opportunity to expand engineering knowledge, allowing the model to evolve in a principled way that enhances understanding of the system rather than just improving predictive performance. The resulting models bridge data-driven and first-principles sources of knowledge, yielding equations that are both accurate and physically sound.

AI-driven methodology to develop (hybrid) steady-state and dynamic process models

ROSSI, LORENZO
2024/2025

Abstract

Many chemical engineering applications face the challenge of building accurate and interpretable models from sparse, noisy, and costly data. This need arises especially in early-stage process design or when experimenting with novel unit equipment, where data acquisition is limited and prior knowledge is incomplete. Classical regression methods often fail to provide physically meaningful models that generalize beyond specific datasets or operating conditions. To address this issue, a novel workflow for physics-informed symbolic regression is proposed, with the objective to integrate physical knowledge directly into the learning process, ensuring consistency with first-principles models while retaining flexibility for data-driven discovery. The framework enables model discovery that is both interpretable and accurate, even in the presence of noisy and incomplete information. Two application areas are explored. First, the algorithms reconstruct governing equations, algebraic or differential, purely from data, making them useful where no prior models exist. Second, hybrid workflows embed mechanistic models into the regression process, supporting either the discovery of missing phenomena or refinement of existing equations through physically meaningful corrections. A key feature is the ability to interpret and correct plant-model mismatch, transforming error correction into a tool for scientific discovery. This approach transforms error correction into an opportunity to expand engineering knowledge, allowing the model to evolve in a principled way that enhances understanding of the system rather than just improving predictive performance. The resulting models bridge data-driven and first-principles sources of knowledge, yielding equations that are both accurate and physically sound.
2024
AI-driven methodology to develop (hybrid) steady-state and dynamic process models
Many chemical engineering applications face the challenge of building accurate and interpretable models from sparse, noisy, and costly data. This need arises especially in early-stage process design or when experimenting with novel unit equipment, where data acquisition is limited and prior knowledge is incomplete. Classical regression methods often fail to provide physically meaningful models that generalize beyond specific datasets or operating conditions. To address this issue, a novel workflow for physics-informed symbolic regression is proposed, with the objective to integrate physical knowledge directly into the learning process, ensuring consistency with first-principles models while retaining flexibility for data-driven discovery. The framework enables model discovery that is both interpretable and accurate, even in the presence of noisy and incomplete information. Two application areas are explored. First, the algorithms reconstruct governing equations, algebraic or differential, purely from data, making them useful where no prior models exist. Second, hybrid workflows embed mechanistic models into the regression process, supporting either the discovery of missing phenomena or refinement of existing equations through physically meaningful corrections. A key feature is the ability to interpret and correct plant-model mismatch, transforming error correction into a tool for scientific discovery. This approach transforms error correction into an opportunity to expand engineering knowledge, allowing the model to evolve in a principled way that enhances understanding of the system rather than just improving predictive performance. The resulting models bridge data-driven and first-principles sources of knowledge, yielding equations that are both accurate and physically sound.
Symbolic regression
Model discovery
Hybrid modeling
Model mismatch
Equations learning
File in questo prodotto:
File Dimensione Formato  
Rossi_Lorenzo.pdf

Accesso riservato

Dimensione 4.56 MB
Formato Adobe PDF
4.56 MB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/87614