GLL: A Differentiable Graph Learning Layer for Neural Networks
arXiv: 2412.08016
GLL: A Differentiable Graph Learning Layer for Neural Networks
Standard deep learning architectures used for classification generate label predictions with a projection head and softmax activation function. Although successful, these methods fail to leverage the relational information between samples for generating label predictions. In recent works, graph-based learning techniques, namely Laplace learning, have been heuristically combined with neural networks for both supervised and semi-supervised learning (SSL) tasks. However, prior works approximate the gradient of the loss function with respect to the graph learning algorithm or decouple the processes; end-to-end integration with neural networks is not achieved. In this work, we derive backpropagation equations, via the adjoint method, for inclusion of a general family of graph learning layers into a neural network. The resulting method, distinct from graph neural networks, allows us to precisely integrate similarity graph construction and graph Laplacian-based label propagation into a neural network layer, replacing a projection head and softmax activation function for general classification task. Our experimental results demonstrate smooth label transitions across data, improved generalization and robustness to adversarial attacks, and improved training dynamics compared to a standard softmax-based approach.
58 pages, 12 figures. Preprint. Submitted to the Journal of Machine Learning Research. v2: several new experiments, improved exposition
- University of California, Los Angeles United States
Machine Learning, FOS: Computer and information sciences, I.2.6; I.2.10; I.4.0, 68T05, 68T07, 35R02, Machine Learning (stat.ML), Machine Learning (cs.LG)
Machine Learning, FOS: Computer and information sciences, I.2.6; I.2.10; I.4.0, 68T05, 68T07, 35R02, Machine Learning (stat.ML), Machine Learning (cs.LG)
3 Research products, page 1 of 1
- IsRelatedTo
- IsRelatedTo
- IsRelatedTo
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).0 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Average influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average
