docs(gradmax): rewrite GradMax algorithm page by maxencelebaron · Pull Request #16 · growingnet/growing_wiki

maxencelebaron · 2026-05-26T15:53:48Z

This PR adds the documentation page for the GradMax algorithm.

Content:

Introduction: GradMax focuses solely on the "how to grow" question, initializing new neurons by maximizing gradient norms rather than greedily minimizing the loss
Theory: general optimization problem
Experiments: two result tables on CIFAR-10/100 and ImageNet comparing GradMax against random initialization, Firefly, and baselines

Files changed:

docs/algorithms/gradmax.rst: new algorithm page
docs/_static/gradmax.png: figure illustrating neuron addition in GradMax

TheoRudkiewicz · 2026-05-26T21:21:19Z

Thank you for this PR. Here are a few comments:

you should consider that people already red the introduction and in particular https://growingnet.github.io/growing_wiki/overview/neuron_addition_problem.html. As a consequence it would be better if you use coherent notations (Psi / Omega)
Even if it's not the goal of GradMax it is important to report how they solve When, Where and How many to be able to compare with other methods.
If you could give at least one-line explanation about "Random" (I think the paper is unclear, hopefully the code is clearer). In particular, if it is a Kaiming style init, what is the fan-in size considered ?

TheoRudkiewicz · 2026-05-26T21:22:33Z

+
+The solution to this maximization problem is found in closed-form by setting the columns of :math:`W_{\ell+1}^{\mathrm{new}}` as the top-:math:`k` left-singular vectors of the matrix
+


I think this is false (it's a misstake of the paper). It should add the hypotestis of orthogonality of the different component.

Thanks for the review! If I understand correctly, without an orthogonality constraint on the columns of $W_{\ell+1}^{\text{new}}$, the SVD solution (top-k left singular vectors) is not the actual optimum. Is that what you mean?

TheoRudkiewicz · 2026-05-26T21:23:28Z

+
+- Optimizer: SGD with momentum 0.9, weight decay :math:`0.2`, base learning rate :math:`\eta_0 = 0.1` for Wide-ResNet an  with cosine decay and :math:`\eta_0 = 0.05` for VGG
+


I a am bit skeptical about the weight decay value.

Yes, you're right. That's a mistake. The correct value is 2e-4.

…ine description

docs(gradmax): rewrite GradMax algorithm page

c2f21de

maxencelebaron requested a review from TheoRudkiewicz May 26, 2026 15:54

TheoRudkiewicz reviewed May 26, 2026

View reviewed changes

docs(gradmax): add when/where implementation details and random basel…

5872334

…ine description

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(gradmax): rewrite GradMax algorithm page#16

docs(gradmax): rewrite GradMax algorithm page#16
maxencelebaron wants to merge 2 commits into
growingnet:mainfrom
maxencelebaron:maxence/gradmax-doc

maxencelebaron commented May 26, 2026

Uh oh!

TheoRudkiewicz commented May 26, 2026

Uh oh!

TheoRudkiewicz May 26, 2026

Uh oh!

maxencelebaron May 27, 2026 •

edited

Loading

Uh oh!

TheoRudkiewicz May 26, 2026

Uh oh!

maxencelebaron May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		The solution to this maximization problem is found in closed-form by setting the columns of :math:`W_{\ell+1}^{\mathrm{new}}` as the top-:math:`k` left-singular vectors of the matrix


		- Optimizer: SGD with momentum 0.9, weight decay :math:`0.2`, base learning rate :math:`\eta_0 = 0.1` for Wide-ResNet an with cosine decay and :math:`\eta_0 = 0.05` for VGG

Conversation

maxencelebaron commented May 26, 2026

Uh oh!

TheoRudkiewicz commented May 26, 2026

Uh oh!

TheoRudkiewicz May 26, 2026

Choose a reason for hiding this comment

Uh oh!

maxencelebaron May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

TheoRudkiewicz May 26, 2026

Choose a reason for hiding this comment

Uh oh!

maxencelebaron May 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

maxencelebaron May 27, 2026 •

edited

Loading