Lightning-AI
diff --git a/‎CHANGELOG.md‎
Lines changed: 88 additions & 4 deletions b/‎CHANGELOG.md‎
Lines changed: 88 additions & 4 deletions
diff --git a/‎README.md‎
Lines changed: 9 additions & 9 deletions b/‎README.md‎
Lines changed: 9 additions & 9 deletions
diff --git a/‎_notebooks‎ b/‎_notebooks‎
diff --git a/‎docs/source/clouds/cloud_training.rst‎
Lines changed: 30 additions & 24 deletions b/‎docs/source/clouds/cloud_training.rst‎
Lines changed: 30 additions & 24 deletions
diff --git a/‎docs/source/common/optimizers.rst‎
Lines changed: 15 additions & 0 deletions b/‎docs/source/common/optimizers.rst‎
Lines changed: 15 additions & 0 deletions
diff --git a/‎docs/source/common/test_set.rst‎
Lines changed: 3 additions & 6 deletions b/‎docs/source/common/test_set.rst‎
Lines changed: 3 additions & 6 deletions
diff --git a/‎docs/source/common/trainer.rst‎
Lines changed: 2 additions & 3 deletions b/‎docs/source/common/trainer.rst‎
Lines changed: 2 additions & 3 deletions
diff --git a/‎docs/source/conf.py‎
Lines changed: 5 additions & 1 deletion b/‎docs/source/conf.py‎
Lines changed: 5 additions & 1 deletion
diff --git a/‎docs/source/index.rst‎
Lines changed: 1 addition & 0 deletions b/‎docs/source/index.rst‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎pytorch_lightning/__about__.py‎
Lines changed: 1 addition & 1 deletion b/‎pytorch_lightning/__about__.py‎
Lines changed: 1 addition & 1 deletion
@@ -5,7 +5,91 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).
 
 
-## [1.4.0] - 2021-MM-DD
+## [unReleased] - 2021-MM-DD
+
+### Added
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+### Changed
+
+- Replace `iteration_count` and other index attributes in the loops with progress dataclasses ([#8477](https://github.com/PyTorchLightning/pytorch-lightning/pull/8477))
+
+
+- Load ckpt path when model provided in validate/test/predict ([#8352](https://github.com/PyTorchLightning/pytorch-lightning/pull/8352)))
+
+
+
+-
+
+
+-
+
+
+-
+
+### Deprecated
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+### Removed
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+### Fixed
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+
+-
+
+
+## [1.4.0] - 2021-07-27
 
 ### Added
 
@@ -153,7 +237,6 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).
 - Moved `DeviceDtypeModuleMixin` and `HyperparametersMixin` mixin to `core` ([#8396](https://github.com/PyTorchLightning/pytorch-lightning/pull/8396))
 - Return the `default_root_dir` as the `log_dir` when the logger is a `LoggerCollection` ([#8187](https://github.com/PyTorchLightning/pytorch-lightning/pull/8187))
 
-
 ### Deprecated
 
 - Deprecated `LightningModule.loaded_optimizer_states_dict` ([#8229](https://github.com/PyTorchLightning/pytorch-lightning/pull/8229))
@@ -173,7 +256,7 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).
 - Deprecated the `Trainer.disable_validation` property in favor of `not Trainer.enable_validation` ([#8291](https://github.com/PyTorchLightning/pytorch-lightning/pull/8291))
 - Deprecated `mode` parameter in `ModelSummary` in favor of `max_depth` ([#8062](https://github.com/PyTorchLightning/pytorch-lightning/pull/8062))
 - Deprecated `reload_dataloaders_every_epoch` argument of `Trainer` in favor of `reload_dataloaders_every_n_epochs` ([#5043](https://github.com/PyTorchLightning/pytorch-lightning/pull/5043))
-
+- Deprecated `distributed_backend` argument for `Trainer` ([#8575](https://github.com/PyTorchLightning/pytorch-lightning/pull/8575))
 
 ### Removed
 
@@ -191,7 +274,6 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).
 - Removed DeepSpeed FP16 Exception as FP32 is now supported ([#8462](https://github.com/PyTorchLightning/pytorch-lightning/pull/8462))
 - Removed environment variable `PL_EXP_VERSION` from DDP subprocesses ([7403](https://github.com/PyTorchLightning/pytorch-lightning/pull/7403))
 
-
 ### Fixed
 
 - Fixed the `GPUStatsMonitor` callbacks to use the correct GPU IDs if `CUDA_VISIBLE_DEVICES` set ([#8260](https://github.com/PyTorchLightning/pytorch-lightning/pull/8260))
@@ -233,6 +315,8 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).
 - Fixed `accumulate_grad_batches` not been recomputed during model reload ([#5334](https://github.com/PyTorchLightning/pytorch-lightning/pull/5334))
 - Fixed a `TypeError` when wrapping optimizers in the `HorovodPlugin` and running `Trainer.test` ([#7840](https://github.com/PyTorchLightning/pytorch-lightning/pull/7840))
 - Fixed `BackboneFinetuning` restoration ([#8501](https://github.com/PyTorchLightning/pytorch-lightning/pull/8501))
+- Fixed `lr_scheduler` with metric (e.g. `torch.optim.lr_scheduler.ReduceLROnPlateau`) when using `automatic_optimization = False` ([#7643](https://github.com/PyTorchLightning/pytorch-lightning/pull/7643))
+- Fixed `DeepSpeed` breaking with no schedulers ([#8580](https://github.com/PyTorchLightning/pytorch-lightning/pull/8580))
 
 
 ## [1.3.8] - 2021-07-01
 
@@ -118,22 +118,22 @@ pip install pytorch-lightning
   conda install pytorch-lightning -c conda-forge
   ```
 
-  #### Install stable 1.3.x
+  #### Install stable 1.4.x
 
-  the actual status of 1.3 [stable] is following:
+  the actual status of 1.4 [stable] is following:
 
-  ![CI base testing](https://github.com/PyTorchLightning/pytorch-lightning/workflows/CI%20base%20testing/badge.svg?branch=release%2F1.3.x&event=push)
-  ![CI complete testing](https://github.com/PyTorchLightning/pytorch-lightning/workflows/CI%20complete%20testing/badge.svg?branch=release%2F1.3.x&event=push)
-  ![PyTorch & Conda](https://github.com/PyTorchLightning/pytorch-lightning/workflows/PyTorch%20&%20Conda/badge.svg?branch=release%2F1.3.x&event=push)
-  ![TPU tests](https://github.com/PyTorchLightning/pytorch-lightning/workflows/TPU%20tests/badge.svg?branch=release%2F1.3.x&event=push)
-  ![Docs check](https://github.com/PyTorchLightning/pytorch-lightning/workflows/Docs%20check/badge.svg?branch=release%2F1.3.x&event=push)
+  ![CI base testing](https://github.com/PyTorchLightning/pytorch-lightning/workflows/CI%20base%20testing/badge.svg?branch=release%2F1.4.x&event=push)
+  ![CI complete testing](https://github.com/PyTorchLightning/pytorch-lightning/workflows/CI%20complete%20testing/badge.svg?branch=release%2F1.4.x&event=push)
+  ![PyTorch & Conda](https://github.com/PyTorchLightning/pytorch-lightning/workflows/PyTorch%20&%20Conda/badge.svg?branch=release%2F1.4.x&event=push)
+  ![TPU tests](https://github.com/PyTorchLightning/pytorch-lightning/workflows/TPU%20tests/badge.svg?branch=release%2F1.4.x&event=push)
+  ![Docs check](https://github.com/PyTorchLightning/pytorch-lightning/workflows/Docs%20check/badge.svg?branch=release%2F1.4.x&event=push)
 
   Install future release from the source
   ```bash
-  pip install git+https://github.com/PytorchLightning/pytorch-lightning.git@release/1.3.x --upgrade
+  pip install git+https://github.com/PytorchLightning/pytorch-lightning.git@release/1.4.x --upgrade
   ```
 
-  #### Install bleeding-edge - future 1.4
+  #### Install bleeding-edge - future 1.5
 
   Install nightly from the source (no guarantees)
   ```bash
 
@@ -4,39 +4,45 @@
 Cloud Training
 ##############
 
-Lightning has a native solution for training on AWS/GCP at scale.
-Go to `grid.ai <https://www.grid.ai/>`_ to create an account.
+Lightning makes it easy to scale your training, without the boilerplate.
+If you want to train your models on the cloud, without dealing with engineering infrastructure and servers, you can try `Grid.ai <https://www.grid.ai/>`_.
 
-We've designed Grid to work seamlessly with Lightning, without needing to make ANY code changes.
+Developed by the creators of `PyTorch Lightning <https://www.pytorchlightning.ai/>`_, Grid is a platform that allows you to:
 
-To use Grid, replace ``python`` in your regular command:
 
-.. code-block:: bash
+- **Scale your models to multi-GPU and multiple nodes** instantly with interactive sessions
+- **Run Hyperparameter Sweeps on 100s of GPUs** in one command
+- **Upload huge datasets** for availability at scale
+- **Iterate faster and cheaper**, you only pay for what you need
+
+
+****************
+Training on Grid
+****************
+
+.. raw:: html
 
-    python my_model.py --learning_rate 1e-6 --layers 2 --gpus 4
+    <video width="50%" max-width="400px" controls
+    poster="https://grid-docs.s3.us-east-2.amazonaws.com/grid.png"
+    src="https://pl-bolts-doc-images.s3.us-east-2.amazonaws.com/pl_docs/grid.mp4"></video>
 
-To use the ``grid run`` command:
+|
+
+You can launch any Lightning model on Grid using the Grid `CLI <https://pypi.org/project/lightning-grid/>`_:
 
 .. code-block:: bash
 
-    grid run --gpus 4 my_model.py --learning_rate 'uniform(1e-6, 1e-1, 20)' --layers '[2, 4, 8, 16]'
+    grid run --instance_type v100 --gpus 4 my_model.py --gpus 4 --learning_rate 'uniform(1e-6, 1e-1, 20)' --layers '[2, 4, 8, 16]'
+
+You can also start runs or interactive sessions from the `Grid platform <https://platform.grid.ai>`_, where you can upload datasets, view artifacts, view the logs, the cost, log into tensorboard, and so much more.
+
 
-The above command will launch (20 * 4) experiments, each running on 4 GPUs (320 GPUs!) - by making ZERO changes to
-your code.
+**********
+Learn More
+**********
 
-The ``uniform`` command is part of our new expressive syntax which lets you construct hyperparameter combinations
-using over 20+ distributions, lists, etc. Of course, you can also configure all of this using yamls which
-can be dynamically assembled at runtime.
+`Sign up for Grid <http://platform.grid.ai>`_ and receive free credits to get you started!
 
-***************
-Grid Highlights
-***************
+`Grid in 3 minutes <https://docs.grid.ai/#introduction>`_
 
-* Run any public or private repository with Grid, or use an interactive session.
-* Grid allocates all the machines and GPUs you need on demand, so you only pay for what you need when you need it.
-* Grid handles all the other parts of developing and training at scale: artifacts, logs, metrics, etc.
-* Grid works with the experiment manager of your choice, no code changes needed.
-* Use Grid Datastores- high-performance, low-latency, versioned datasets.
-* Attach Datastores to a Run so you don't have to keep downloading datasets
-* Use Grid Sessions for fast prototyping on a cloud machine of your choice
-* For more information check the `grid documentation <https://docs.grid.ai/>`_
+`Grid.ai Terms of Service <https://www.grid.ai/terms-of-service/>`_
@@ -234,6 +234,21 @@ If you want to call ``lr_scheduler.step()`` every ``n`` steps/epochs, do the fol
         if self.trainer.is_last_batch and (self.trainer.current_epoch + 1) % n == 0:
             sch.step()
 
+If you want to call schedulers that require a metric value after each epoch, consider doing the following:
+
+.. testcode::
+
+    def __init__(self):
+        super().__init__()
+        self.automatic_optimization = False
+
+    def training_epoch_end(self, outputs):
+        sch = self.lr_schedulers()
+
+        # If the selected scheduler is a ReduceLROnPlateau scheduler.
+        if isinstance(sch, torch.optim.lr_scheduler.ReduceLROnPlateau):
+            sch.step(self.trainer.callback_metrics["loss"])
+
 -----
 
 Use closure for LBFGS-like optimizers
 
@@ -20,15 +20,12 @@ To run the test set after training completes, use this method.
     trainer.fit(model)
 
     # (1) load the best checkpoint automatically (lightning tracks this for you)
-    trainer.test()
+    trainer.test(ckpt_path='best')
 
-    # (2) don't load a checkpoint, instead use the model with the latest weights
-    trainer.test(ckpt_path=None)
-
-    # (3) test using a specific checkpoint
+    # (2) test using a specific checkpoint
     trainer.test(ckpt_path="/path/to/my_checkpoint.ckpt")
 
-    # (4) test with an explicit model (will use this model and not load a checkpoint)
+    # (3) test with an explicit model (will use this model and not load a checkpoint)
     trainer.test(model)
 
 ----------
 
@@ -335,7 +335,7 @@ auto_scale_batch_size
 Automatically tries to find the largest batch size that fits into memory,
 before any training.
 
-.. code-block::
+.. code-block:: python
 
     # default used by the Trainer (no scaling of batch size)
     trainer = Trainer(auto_scale_batch_size=None)
@@ -1353,7 +1353,6 @@ By setting to False, you have to add your own distributed sampler:
 
 .. code-block:: python
 
-
     # in your LightningModule or LightningDataModule
     def train_dataloader(self):
         # default used by the Trainer
@@ -1575,7 +1574,7 @@ Can specify as float or int.
     trainer = Trainer(val_check_interval=1000)
 
 
-.. code-block::
+.. code-block:: python
 
     # Here is the computation to estimate the total number of batches seen within an epoch.
 
 
@@ -142,7 +142,11 @@ def _transform_changelog(path_in: str, path_out: str) -> None:
 # List of patterns, relative to source directory, that match files and
 # directories to ignore when looking for source files.
 # This pattern also affects html_static_path and html_extra_path.
-exclude_patterns = [f"{FOLDER_GENERATED}/PULL_REQUEST_TEMPLATE.md", "notebooks/course_UvA-DL/*", "notebooks/template*"]
+exclude_patterns = [
+    f"{FOLDER_GENERATED}/PULL_REQUEST_TEMPLATE.md",
+    "notebooks/course_UvA-DL/*",
+    "notebooks/sample-template*",
+]
 
 # The name of the Pygments (syntax highlighting) style to use.
 pygments_style = None
 
@@ -62,6 +62,7 @@ PyTorch Lightning Documentation
    notebooks/lightning_examples/datamodules.ipynb
    notebooks/lightning_examples/cifar10-baseline.ipynb
    notebooks/lightning_examples/basic-gan.ipynb
+   notebooks/lightning_examples/mnist-tpu-training.ipynb
    notebooks/lightning_examples/text-transformers.ipynb
    notebooks/lightning_examples/reinforce-learning-DQN.ipynb
    notebooks/lightning_examples/augmentation_kornia.ipynb
 
@@ -1,7 +1,7 @@
 import time
 
 _this_year = time.strftime("%Y")
-__version__ = "1.4.0rc2"
+__version__ = "1.5.0dev"
 __author__ = "William Falcon et al."
 __author_email__ = "[email protected]"
 __license__ = "Apache-2.0"