| Pytorch | NLP | BERT | WIKI-103 | Next sentence prediction, Masked language modelling, Question/Answering | ✅ | ✅ |[BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding](https://arxiv.org/abs/1810.04805v2)|
## Instructions summary
### Employing automatic loss scaling (ALS) for half precision training

ALS is a feature in the Poplar SDK which brings stability to training large models in half precision, especially when gradient accumulation and reduction across replicas also happen in half precision.

NB. This feature expects the `poptorch` training option `accumulationAndReplicationReductionType` to be set to `poptorch.ReductionType.Mean`, and accumulation by the optimizer to be done in half precision (using `accum_type=torch.float16` when instantiating the optimizer); otherwise it may lead to unexpected behaviour.
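
The sketch below illustrates how these options might be wired together. It is a minimal, hypothetical example rather than this repository's actual training script: the toy model, accumulation factor, learning rate, and the `setAutomaticLossScaling` call are assumptions that should be checked against the PopTorch documentation for your Poplar SDK version.

```python
# Minimal sketch: half-precision training with ALS and mean gradient reduction.
# The model, accumulation factor and learning rate are placeholders; verify the
# setAutomaticLossScaling option against your PopTorch/Poplar SDK version.
import torch
import poptorch

opts = poptorch.Options()
opts.Training.gradientAccumulation(16)  # assumed accumulation factor
# Required by the note above: reduce accumulated/replicated gradients by mean.
opts.Training.accumulationAndReplicationReductionType(poptorch.ReductionType.Mean)
# Assumed name of the ALS switch; check availability in your SDK release.
opts.Training.setAutomaticLossScaling(True)

# Toy stand-in for the real model; when wrapped with poptorch.trainingModel,
# the forward pass is expected to compute and return the loss.
class ToyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(16, 2)

    def forward(self, x, labels):
        logits = self.linear(x)
        return logits, torch.nn.functional.cross_entropy(logits, labels)

model = ToyModel().half()

# Accumulate optimizer state in half precision, as the note above requires.
optimizer = poptorch.optim.AdamW(
    model.parameters(),
    lr=1e-4,  # assumed learning rate
    accum_type=torch.float16,
)

training_model = poptorch.trainingModel(model, options=opts, optimizer=optimizer)
```

The resulting `training_model` would then be called with half-precision inputs inside the usual training loop.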