@@ -6,9 +6,60 @@ New features, bug fixes, and improvements are regularly made to the SageMaker
66distributed model parallel library.
77
88
9- SageMaker Distributed Model Parallel 1.11 .0 Release Notes
9+ SageMaker Distributed Model Parallel 1.13 .0 Release Notes
1010=========================================================
1111
12+ *Date: Dec. 15. 2022 *
13+
14+ **New Features **
15+
16+ * Sharded data parallelism now supports a new backend for collectives called *SMDDP Collectives *.
17+ For supported scenarios, SMDDP Collectives are on by default for the AllGather operation.
18+ For more information, see
19+ `Sharded data parallelism with SMDDP Collectives
20+ <https://docs.aws.amazon.com/sagemaker/latest/dg/model-parallel-extended-features-pytorch-sharded-data-parallelism.html#model-parallel-extended-features-pytorch-sharded-data-parallelism-smddp-collectives> `_
21+ in the *Amazon SageMaker Developer Guide *.
22+ * Introduced FlashAttention for DistributedTransformer to improve memory usage and computational
23+ performance of models such as GPT2, GPTNeo, GPTJ, GPTNeoX, BERT, and RoBERTa.
24+
25+ **Bug Fixes **
26+
27+ * Fixed initialization of ``lm_head `` in DistributedTransformer to use a provided range
28+ for initialization, when weights are not tied with the embeddings.
29+
30+ **Improvements **
31+
32+ * When a module has no parameters, we have introduced an optimization to execute
33+ such a module on the same rank as its parent during pipeline parallelism.
34+
35+ **Migration to AWS Deep Learning Containers **
36+
37+ This version passed benchmark testing and is migrated to the following AWS Deep Learning Containers (DLC):
38+
39+ - SageMaker training container for PyTorch v1.12.1
40+
41+ .. code ::
42+
43+ 763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.12.1-gpu-py38-cu113-ubuntu20.04-sagemaker
44+
45+
46+ Binary file of this version of the library for `custom container
47+ <https://docs.aws.amazon.com/sagemaker/latest/dg/model-parallel-sm-sdk.html#model-parallel-bring-your-own-container> `_ users:
48+
49+ - For PyTorch 1.12.0
50+
51+ .. code ::
52+
53+ https://sagemaker-distributed-model-parallel.s3.us-west-2.amazonaws.com/pytorch-1.12.1/build-artifacts/2022-12-08-21-34/smdistributed_modelparallel-1.13.0-cp38-cp38-linux_x86_64.whl
54+
55+ ----
56+
57+ Release History
58+ ===============
59+
60+ SageMaker Distributed Model Parallel 1.11.0 Release Notes
61+ ---------------------------------------------------------
62+
1263*Date: August. 17. 2022 *
1364
1465**New Features **
@@ -41,12 +92,7 @@ Binary file of this version of the library for `custom container
4192
4293 .. code ::
4394
44- https://sagemaker-distributed-model-parallel.s3.us-west-2.amazonaws.com/pytorch-1.12.0/build-artifacts/2022-08-12-16-58/smdistributed_modelparallel-1.11.0-cp38-cp38-linux_x86_64.whl
45-
46- ----
47-
48- Release History
49- ===============
95+ https://sagemaker-distribu
5096
5197 SageMaker Distributed Model Parallel 1.10.1 Release Notes
5298---------------------------------------------------------
0 commit comments