Deep learning via message passing algorithms based on belief propagation
Lucibello, Carlo; Pittorino, Fabrizio; Perugini, Gabriele; Zecchina, Riccardo
2022
Abstract
Message-passing algorithms based on the belief propagation (BP) equations constitute a well-known distributed computational scheme. They yield exact marginals on tree-like graphical models and have also proven effective in many problems defined on loopy graphs, from inference to optimization, from signal processing to clustering. BP-based schemes are fundamentally different from stochastic gradient descent (SGD), on which the current success of deep networks is based. In this paper, we present and adapt to mini-batch training on GPUs a family of BP-based message-passing algorithms with a reinforcement term that biases distributions towards locally entropic solutions. These algorithms can train multi-layer neural networks with performance comparable to SGD heuristics in a diverse set of experiments on natural datasets, including multi-class image classification and continual learning, while yielding improved performance on sparse networks. Furthermore, they allow one to make approximate Bayesian predictions that have higher accuracy than point-wise ones.
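To make the reinforcement mechanism mentioned in the abstract concrete, the sketch below shows a generic reinforced-BP outer loop for binary weights, together with the distinction between point-wise and approximate Bayesian predictions for a single layer. This is a minimal sketch under stated assumptions, not the paper's implementation: the inner routine `bp_sweep`, the annealing rate `rho`, and the helper names are illustrative placeholders.

```python
import numpy as np

def reinforced_bp(bp_sweep, n_weights, max_iter=500, rho=1e-3):
    """Generic reinforced-BP outer loop for +/-1 weights (sketch).

    `bp_sweep(h_ext)` is assumed to run one pass of the plain BP
    updates under external fields `h_ext` and return the weight
    magnetizations m in (-1, 1). Each weight is fed back its own
    marginal as an extra prior of growing strength gamma, which
    progressively polarizes the distribution towards a single
    configuration.
    """
    h_ext = np.zeros(n_weights)     # accumulated reinforcement fields
    gamma = 0.0                     # reinforcement strength, annealed to 1
    m = np.zeros(n_weights)
    for _ in range(max_iter):
        m = bp_sweep(h_ext)         # one sweep of the plain BP equations
        gamma = 1.0 - (1.0 - gamma) * (1.0 - rho)
        # feed the current marginal back as an external field,
        # i.e. an extra prior proportional to (marginal)**gamma
        h_ext = gamma * np.arctanh(np.clip(m, -1 + 1e-12, 1 - 1e-12))
        if np.abs(m).min() > 0.99:  # marginals essentially polarized
            break
    return m

def predict_pointwise(m, x):
    # decision of the single configuration obtained by polarizing the marginals
    return np.sign(x @ np.sign(m))

def predict_bayes(m, x):
    # approximate Bayesian decision for one +/-1-weight layer: average
    # the pre-activation over the factorized posterior, E[w_i] = m_i
    return np.sign(x @ m)
```

With `gamma` fixed at 0 the loop reduces to plain BP; annealing it towards 1 turns the marginals into a point estimate. The `predict_bayes` variant illustrates the approximate Bayesian predictions mentioned in the abstract, which average over the weight marginals instead of committing to a single sign configuration.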
File | Description | Type | License | Size | Format
---|---|---|---|---|---
Lucibello_2022_Mach._Learn.__Sci._Technol._3_035005.pdf (open access) | article | Publisher's PDF (publisher's layout) | Creative Commons | 2.54 MB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.