GLL: A Differentiable Graph Learning Layer for Neural Networks

Name: GLL: A Differentiable Graph Learning Layer for Neural Networks
Keywords: Machine Learning, FOS: Computer and information sciences, I.2.6; I.2.10; I.4.0, 68T05, 68T07, 35R02, Machine Learning (stat.ML), Machine Learning (cs.LG)

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2024Embargo end date: 01 Jan 2024Publisher:arXivFunded by:NSF | Graduate Research Fellows..., NSF | NRT-HDR: Modeling and Und..., NSF | ATD: Active Learning Acti... +1 projects

Authors: Brown, Jason; Chen, Bohan; Hardiman-Mostow, Harris; Calder, Jeff; Bertozzi, Andrea L.;

doi: 10.48550/arxiv.2412.08016

arXiv: 2412.08016

GLL: A Differentiable Graph Learning Layer for Neural Networks

- Summary
- Subjects
- Related research
  (3)
- Metrics

tips_and_updates
Recommended

Abstract

Standard deep learning architectures used for classification generate label predictions with a projection head and softmax activation function. Although successful, these methods fail to leverage the relational information between samples for generating label predictions. In recent works, graph-based learning techniques, namely Laplace learning, have been heuristically combined with neural networks for both supervised and semi-supervised learning (SSL) tasks. However, prior works approximate the gradient of the loss function with respect to the graph learning algorithm or decouple the processes; end-to-end integration with neural networks is not achieved. In this work, we derive backpropagation equations, via the adjoint method, for inclusion of a general family of graph learning layers into a neural network. The resulting method, distinct from graph neural networks, allows us to precisely integrate similarity graph construction and graph Laplacian-based label propagation into a neural network layer, replacing a projection head and softmax activation function for general classification task. Our experimental results demonstrate smooth label transitions across data, improved generalization and robustness to adversarial attacks, and improved training dynamics compared to a standard softmax-based approach.

58 pages, 12 figures. Preprint. Submitted to the Journal of Machine Learning Research. v2: several new experiments, improved exposition

Related Organizations

University of California, Los Angeles
United States

Keywords

Machine Learning, FOS: Computer and information sciences, I.2.6; I.2.10; I.4.0, 68T05, 68T07, 35R02, Machine Learning (stat.ML), Machine Learning (cs.LG)

3 Research products, page 1 of 1

annoy software on GitHub
IsRelatedTo
SupContrast software on GitHub
IsRelatedTo
GraphLearningLayer software on GitHub
IsRelatedTo

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

GLL: A Differentiable Graph Learning Layer for Neural Networks

GLL: A Differentiable Graph Learning Layer for Neural Networks

3 Research products, page 1 of 1

annoy software on GitHub

SupContrast software on GitHub

GraphLearningLayer software on GitHub