Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Abstract
Orthogonal Residual Updates enhance feature learning and training stability by decomposing module outputs relative to the residual stream and adding only the component orthogonal to it, so that modules contribute primarily novel features.
Residual connections are pivotal for deep neural networks, enabling greater depth by mitigating vanishing gradients. However, in standard residual updates, the module's output is directly added to the input stream. This can lead to updates that predominantly reinforce or modulate the existing stream direction, potentially underutilizing the module's capacity for learning entirely novel features. In this work, we introduce Orthogonal Residual Update: we decompose the module's output relative to the input stream and add only the component orthogonal to this stream. This design aims to guide modules to contribute primarily new representational directions, fostering richer feature learning while promoting more efficient training. We demonstrate that our orthogonal update strategy improves generalization accuracy and training stability across diverse architectures (ResNetV2, Vision Transformers) and datasets (CIFARs, TinyImageNet, ImageNet-1k), achieving, for instance, a +4.3%p top-1 accuracy gain for ViT-B on ImageNet-1k.
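Concretely, the standard update x + f(x) is replaced with x + f_⊥(x), where f_⊥(x) is the component of the module output f(x) orthogonal to the stream x. Below is a minimal PyTorch sketch of that decomposition, assuming the projection is taken along the feature (last) dimension per sample or token; the function name and the epsilon constant are illustrative, not the authors' implementation.

```python
import torch

def orthogonal_residual_update(x: torch.Tensor, fx: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """Add only the component of the module output fx that is orthogonal to the
    residual stream x. Sketch of the idea from the abstract, not the reference code."""
    # Projection coefficient <fx, x> / <x, x>, computed along the feature (last) dimension.
    coeff = (fx * x).sum(dim=-1, keepdim=True) / (x * x).sum(dim=-1, keepdim=True).clamp_min(eps)
    fx_parallel = coeff * x            # component aligned with the existing stream direction
    fx_orthogonal = fx - fx_parallel   # novel directions contributed by the module
    return x + fx_orthogonal
```

In a standard residual block, this would replace `x = x + module(x)` with `x = orthogonal_residual_update(x, module(x))`.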
Community
Are we using residual connections efficiently in Deep Learning? We propose "Orthogonal Residual Updates": decomposing a module's output relative to the input stream and adding only the component orthogonal to it. This fosters richer feature learning and more efficient training.
Code: https://github.com/BootsofLagrangian/ortho-residual
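For intuition, here is a hedged sketch of how the update could slot into a pre-norm Transformer-style block; the block structure and hyperparameters are assumptions for illustration, and the repository above contains the actual implementation.

```python
import torch
import torch.nn as nn

class OrthoResidualMLPBlock(nn.Module):
    """Illustrative pre-norm MLP block with an orthogonal residual update.

    Hypothetical example, not the authors' module; see the linked repository.
    """
    def __init__(self, dim: int, hidden: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.norm = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        fx = self.mlp(self.norm(x))
        # Standard residual would be: return x + fx
        # Orthogonal residual keeps only the part of fx orthogonal to the stream x.
        coeff = (fx * x).sum(dim=-1, keepdim=True) / (x * x).sum(dim=-1, keepdim=True).clamp_min(self.eps)
        return x + (fx - coeff * x)
```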
This is an automated message from Librarian Bot. The following similar papers were recommended by the Semantic Scholar API:
- Deep residual learning with product units (2025)
- VeLU: Variance-enhanced Learning Unit for Deep Neural Networks (2025)
- How to Train Your Metamorphic Deep Neural Network (2025)
- Memory-Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation (2025)
- CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers (2025)
- Stochastic Variational Propagation: Local, Scalable and Efficient Alternative to Backpropagation (2025)
- Understanding Differential Transformer Unchains Pretrained Self-Attentions (2025)