This repository was archived by the owner on Mar 21, 2024. It is now read-only.
  
  
  - 
                Notifications
    You must be signed in to change notification settings 
- Fork 147
Fix recovery of SSL training, scale SSL training to multiple nodes #565
          
     Closed
      
      
    
  
     Closed
                    Changes from all commits
      Commits
    
    
            Show all changes
          
          
            54 commits
          
        
        Select commit
          Hold shift + click to select a range
      
      a708061
              
                first version
              
              
                ant0nsc 6c65c6a
              
                Merge remote-tracking branch 'origin/main' into antonsc/recovery
              
              
                ant0nsc d1b86f1
              
                docu update
              
              
                ant0nsc be27a1b
              
                docu update
              
              
                ant0nsc b4b9d74
              
                tests working
              
              
                ant0nsc 8927c20
              
                unit test
              
              
                ant0nsc a291d09
              
                cleanup
              
              
                ant0nsc ca8bd69
              
                tests for nih
              
              
                ant0nsc 608a404
              
                adding more tests
              
              
                ant0nsc a6da745
              
                index error
              
              
                ant0nsc b2f96f1
              
                diagnostics
              
              
                ant0nsc 22d3f08
              
                rank_zero for epoch
              
              
                ant0nsc 0b0621b
              
                sync_dist for loss
              
              
                ant0nsc eb266d8
              
                diagnostics
              
              
                ant0nsc bb36f59
              
                Merge remote-tracking branch 'origin/main' into antonsc/recovery
              
              
                ant0nsc 79e3dbe
              
                disabling logger
              
              
                ant0nsc 1e33c5c
              
                remove storing logger
              
              
                ant0nsc cfa0dd6
              
                diagnostics in AML logger
              
              
                ant0nsc 7fb62ad
              
                change logged "epoch"
              
              
                ant0nsc c1ff09e
              
                improved logger
              
              
                ant0nsc 2cdcd47
              
                byol on_step=True
              
              
                ant0nsc aa7e306
              
                flags for speedup
              
              
                ant0nsc 8bd7b46
              
                cleaned up logging
              
              
                ant0nsc 1375753
              
                more logging cleanup
              
              
                ant0nsc e68e469
              
                remove diagnostics
              
              
                ant0nsc a0bc050
              
                flake and mypy
              
              
                ant0nsc 0adae84
              
                fix logging int problem
              
              
                ant0nsc a5e5e6b
              
                fix logging issue
              
              
                ant0nsc 7d11086
              
                improve logging
              
              
                ant0nsc 77c2ab2
              
                log_on_epoch function
              
              
                ant0nsc 1ff69e6
              
                tests passing
              
              
                ant0nsc c3272f4
              
                tests
              
              
                ant0nsc 4ece208
              
                checkpoint fix
              
              
                ant0nsc db61fc0
              
                typo
              
              
                ant0nsc 40fe258
              
                logging learning rates
              
              
                ant0nsc e03f0a9
              
                sync across GPU in linear layer
              
              
                ant0nsc 0ed3214
              
                Revert "sync across GPU in linear layer"
              
              
                ant0nsc 3d5e178
              
                helpers to log learning rates
              
              
                ant0nsc e8cd6a1
              
                find unused
              
              
                ant0nsc 13b7884
              
                tests for logging LR
              
              
                ant0nsc 86ae2e4
              
                change semantics of batch size
              
              
                ant0nsc 92cfccb
              
                manual optimization with single GPU loss function
              
              
                ant0nsc 00443af
              
                fixing LR update for manual optimization
              
              
                ant0nsc 8ae2746
              
                remove single GPU loss
              
              
                ant0nsc aca6a26
              
                changelog
              
              
                ant0nsc a4ed77d
              
                cleanup callback
              
              
                ant0nsc ced9fd1
              
                test fixes
              
              
                ant0nsc 1f3893c
              
                flake
              
              
                ant0nsc c7b6e11
              
                mypy
              
              
                ant0nsc 56ca8da
              
                test fixes
              
              
                ant0nsc 0d23c83
              
                fix batch sizes
              
              
                ant0nsc 3a7e9b9
              
                reduce tolerance
              
              
                ant0nsc ec75319
              
                batch size back to 75
              
              
                ant0nsc 3fe959b
              
                sync_dist=False
              
              
                ant0nsc File filter
Filter by extension
Conversations
          Failed to load comments.   
        
        
          
      Loading
        
  Jump to
        
          Jump to file
        
      
      
          Failed to load files.   
        
        
          
      Loading
        
  Diff view
Diff view
There are no files selected for viewing
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
      
      Oops, something went wrong.
        
    
  
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it would be good to attach ssl training run results for future reference - before and after this manual optimisation change. (both for SimCLR and BYOL)