
add cosine restart learning rate #2953

Closed

hellozhaoming wants to merge 3 commits into deepmodeling:master from hellozhaoming:master


Conversation

@hellozhaoming

No description provided.

Add cosine restart learning rate
Signed-off-by: hellozhaoming <747247642@qq.com>
@codecov

codecov bot commented Oct 27, 2023

Codecov Report

❌ Patch coverage is 41.79104% with 39 lines in your changes missing coverage. Please review.
✅ Project coverage is 75.07%. Comparing base (2fe6927) to head (05052c1).
⚠️ Report is 1227 commits behind head on master.

Files with missing lines        Patch %   Lines
deepmd/utils/learning_rate.py   22.72%    34 Missing ⚠️
deepmd/train/trainer.py         54.54%    5 Missing ⚠️
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##           master    #2953      +/-   ##
==========================================
- Coverage   75.36%   75.07%   -0.30%     
==========================================
  Files         245      220      -25     
  Lines       24648    20297    -4351     
  Branches     1582      903     -679     
==========================================
- Hits        18577    15238    -3339     
+ Misses       5140     4526     -614     
+ Partials      931      533     -398     
```

☔ View full report in Codecov by Sentry.

"""Get the start lr."""
return self.start_lr_

def value(self, step: int) -> float:
Collaborator

You may not need to implement the `value` method if you do not print the learning-rate information at the beginning of training:
https://github.com/hellozhaoming/deepmd-kit/blob/05052c195308f61b63ce2bab130ce0e8cba60604/deepmd/train/trainer.py#L566

@wanghan-iapcm changed the base branch from master to devel on October 27, 2023, 13:00
@njzjz (Member) left a comment

Please run pre-commit to format and lint the code: https://docs.deepmodeling.com/projects/deepmd/en/master/development/coding-conventions.html#run-scripts-to-check-the-code. Alternatively, you can submit from a non-protected branch and pre-commit.ci will do it for you.

Unit tests should be added for the two new learning rate classes.
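For illustration, a minimal endpoint test could look like the sketch below; it checks the closed-form cosine-decay formula quoted later in this thread rather than the PR's actual classes, so the class name, constants, and tolerances here are all assumptions:

```python
import math
import unittest


class TestCosineLearningRate(unittest.TestCase):
    """Hypothetical endpoint checks for a cosine-decay schedule."""

    def test_endpoints(self):
        start_lr, alpha, decay_steps = 1.0e-3, 0.01, 1000

        def lr(step: int) -> float:
            # Cosine-decay formula from the tf.train.cosine_decay docstring
            # quoted further down in this review.
            step = min(step, decay_steps)
            cosine_decay = 0.5 * (1.0 + math.cos(math.pi * step / decay_steps))
            return start_lr * ((1.0 - alpha) * cosine_decay + alpha)

        # The schedule starts at start_lr and bottoms out at alpha * start_lr.
        self.assertAlmostEqual(lr(0), start_lr, places=12)
        self.assertAlmostEqual(lr(decay_steps), alpha * start_lr, places=12)


if __name__ == "__main__":
    unittest.main()
```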

"softplus": tf.nn.softplus,
"sigmoid": tf.sigmoid,
"tanh": tf.nn.tanh,
"swish": tf.nn.swish,
Member

It seems that `swish` has been renamed to `silu`: tensorflow/tensorflow#41066
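If older TensorFlow builds still need to work, a guarded lookup is one option; `ACTIVATION_FN` is a hypothetical name for the mapping shown above (`tf.nn.silu` exists since TF 2.4):

```python
import tensorflow as tf

# "swish" was renamed to "silu" upstream (tensorflow/tensorflow#41066);
# fall back to tf.nn.swish on TensorFlow builds that predate tf.nn.silu.
ACTIVATION_FN = {
    "softplus": tf.nn.softplus,
    "sigmoid": tf.sigmoid,
    "tanh": tf.nn.tanh,
    "silu": getattr(tf.nn, "silu", tf.nn.swish),
}
```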

```python
    )
else:
    for fitting_key in self.fitting:
        if self.lr_type == "exp":
```
Member

It's not good behavior to switch on the learning-rate type in the Trainer. Instead, implement the method `LearningRate.log_start` (`LearningRate` should be an abstract base class inherited by all learning rate classes) and call `self.lr.log_start(self.sess)` here.
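A rough sketch of that shape, assuming TF-style sessions as in the surrounding code; the method set beyond `log_start` is an assumption, not the repository's actual interface:

```python
import logging
from abc import ABC, abstractmethod

log = logging.getLogger(__name__)


class LearningRate(ABC):
    """Sketch of a common base class for learning-rate schedules."""

    @abstractmethod
    def build(self, global_step, stop_step=None):
        """Build and return the TF tensor holding the current learning rate."""

    @abstractmethod
    def start_lr(self) -> float:
        """Return the learning rate at step 0."""

    def log_start(self, sess) -> None:
        """Log schedule information at the start of training.

        Schedules whose log message needs the session (e.g. to evaluate
        the current decay state) can override this default.
        """
        log.info("start training at lr %.2e", self.start_lr())
```

The Trainer can then call `self.lr.log_start(self.sess)` without branching on the schedule type.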

```diff
-[Argument("exp", dict, learning_rate_exp())],
+[Argument("exp", dict, learning_rate_exp()),
+ Argument("cos", dict, learning_rate_cos()),
+ Argument("cosrestart", dict, learning_rate_cosrestarts())],
```
Member

You may need to add some documentation to the variants (`doc="xxx"`). Otherwise, no one knows what they are.
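For example (the `doc` texts below are placeholders; `Argument` is `dargs.Argument`, which already accepts a `doc` keyword):

```python
from dargs import Argument

# learning_rate_exp / learning_rate_cos / learning_rate_cosrestarts are the
# sub-field generators from this diff; only the doc strings are new here.
variants = [
    Argument("exp", dict, learning_rate_exp(),
             doc="Exponentially decayed learning rate."),
    Argument("cos", dict, learning_rate_cos(),
             doc="Cosine-decayed learning rate."),
    Argument("cosrestart", dict, learning_rate_cosrestarts(),
             doc="Cosine-decayed learning rate with warm restarts."),
]
```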

Comment on lines +113 to +118
```python
global_step = min(global_step, decay_steps)
cosine_decay = 0.5 * (1 + cos(pi * global_step / decay_steps))
decayed = (1 - alpha) * cosine_decay + alpha
decayed_learning_rate = learning_rate * decayed
```
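For reference, the restart variant periodically resets this decay. Below is a pure-Python sketch following the semantics of `tf.train.cosine_decay_restarts` (an illustration using TF's parameter names, not the code from this PR):

```python
import math


def cosine_decay_restarts(learning_rate, global_step, first_decay_steps,
                          t_mul=2.0, m_mul=1.0, alpha=0.0):
    """Cosine decay with warm restarts (SGDR), as plain Python."""
    # Fraction of the first decay period completed so far.
    completed = global_step / first_decay_steps
    if t_mul == 1.0:
        # All restart periods have equal length.
        i_restart = math.floor(completed)
        completed -= i_restart
    else:
        # Period i has length first_decay_steps * t_mul**i, so invert the
        # geometric series to find the current period and position in it.
        i_restart = math.floor(
            math.log(1.0 - completed * (1.0 - t_mul)) / math.log(t_mul)
        )
        sum_r = (1.0 - t_mul ** i_restart) / (1.0 - t_mul)
        completed = (completed - sum_r) / t_mul ** i_restart
    # Each restart can begin from a lower peak (m_mul < 1).
    m_fac = m_mul ** i_restart
    cosine_decayed = 0.5 * m_fac * (1.0 + math.cos(math.pi * completed))
    decayed = (1.0 - alpha) * cosine_decayed + alpha
    return learning_rate * decayed
```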
Member


The function returns the cosine decayed learning rate while taking into account
possible warm restarts.
```
Member

This line should be removed.

@njzjz (Member) commented Mar 2, 2026

The feature has been implemented by #5142 and #5154.

@njzjz closed this Mar 2, 2026
