Information value (VoI) analysis is a key component for decision-making supported by quantitative simulations. A key obstacle to the full utilization of VoI is computational efficiency. This thesis examines the use of machine learning approaches to estimating VoI for realistic simulators. We compare the smoothing approaches already introduced, and propose two novelties. First, an approach based on the nearest neighbors. We prove a central limit theorem and then we discuss the automatic selection of the number of neighbors through a LassoLars weighting approach. We also propose a modification of a previously introduced algorithm by using MARS regression. We compare the resulting estimators through a wide range of numerical experiments. We then adapt the algorithms for the estimation of a new quantity, the information density. Experiments show that the algorithms can be successfully modified and one obtains consistent indications about the regional importance of variables, both individually and in groups.

Machine Learning Approaches for Computing Information Value and Information Density

BUDZINSKI, MARIUSZ
2023

Abstract

Information value (VoI) analysis is a key component for decision-making supported by quantitative simulations. A key obstacle to the full utilization of VoI is computational efficiency. This thesis examines the use of machine learning approaches to estimating VoI for realistic simulators. We compare the smoothing approaches already introduced, and propose two novelties. First, an approach based on the nearest neighbors. We prove a central limit theorem and then we discuss the automatic selection of the number of neighbors through a LassoLars weighting approach. We also propose a modification of a previously introduced algorithm by using MARS regression. We compare the resulting estimators through a wide range of numerical experiments. We then adapt the algorithms for the estimation of a new quantity, the information density. Experiments show that the algorithms can be successfully modified and one obtains consistent indications about the regional importance of variables, both individually and in groups.
23-gen-2023
Inglese
34
2021/2022
STATISTICS
Settore SECS-S/01 - Statistica
BORGONOVO, EMANUELE
File in questo prodotto:
File Dimensione Formato  
Thesis_Budzinski_Mariusz.pdf

accesso aperto

Descrizione: Thesis_Budzinski_Mariusz
Tipologia: Tesi di dottorato
Dimensione 3.45 MB
Formato Adobe PDF
3.45 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11565/4058416
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact