diff --git a/docs/_static/mxnet-theme/index.html b/docs/_static/mxnet-theme/index.html
index 005bc88f255d..b644423e9ed1 100644
--- a/docs/_static/mxnet-theme/index.html
+++ b/docs/_static/mxnet-theme/index.html
@@ -9,7 +9,7 @@
Install
diff --git a/docs/faq/multi_devices.md b/docs/faq/multi_devices.md
index 3bb00105a622..a43879cb5233 100644
--- a/docs/faq/multi_devices.md
+++ b/docs/faq/multi_devices.md
@@ -83,5 +83,5 @@ Note that this option may result in higher GPU memory usage.
When using a large number of GPUs, e.g. >=4, we suggest using `device` for better performance.
## Distributed training with multiple devices across machines
-Refer [Distributed training](https://mxnet.incubator.apache.org/versions/master/how_to/distributed_training.html)
+Refer [Distributed training](https://mxnet.incubator.apache.org/versions/master/faq/distributed_training.html)
for information on how distributed training works and how to use it.
diff --git a/docs/tutorials/python/types_of_data_augmentation.md b/docs/tutorials/python/types_of_data_augmentation.md
index 4cd1ad7bd05e..4ec461d68ec0 100644
--- a/docs/tutorials/python/types_of_data_augmentation.md
+++ b/docs/tutorials/python/types_of_data_augmentation.md
@@ -326,7 +326,7 @@ plot_mx_array(aug_image)
![png](https://raw.githubusercontent.com/dmlc/web-data/master/mxnet/doc/tutorials/data_aug/outputs/types_of/output_53_0.png)
-And lastly, you can use [`mxnet.image.RandomOrderAug`](https://mxnet.incubator.apache.org/api/python/image.html#mxnet.image.RandomOrderAug) to apply multiple augmenters to an image, in a random order.
+And lastly, you can use [`mxnet.image.RandomOrderAug`](https://mxnet.incubator.apache.org/api/python/image/image.html#mxnet.image.RandomOrderAug) to apply multiple augmenters to an image, in a random order.
```python
diff --git a/python/mxnet/gluon/block.py b/python/mxnet/gluon/block.py
index a737817928a9..abc474850f24 100644
--- a/python/mxnet/gluon/block.py
+++ b/python/mxnet/gluon/block.py
@@ -156,7 +156,7 @@ def forward(self, x):
Prefix acts like a name space. All children blocks created in parent block's
:py:meth:`name_scope` will have parent block's prefix in their name.
Please refer to
- `naming tutorial `_
+ `naming tutorial `_
for more info on prefix and naming.
params : ParameterDict or None
:py:class:`ParameterDict` for sharing weights with the new :py:class:`Block`. For example,
@@ -249,7 +249,7 @@ def name_scope(self):
self.dense = nn.Dense(20)
Please refer to
- `naming tutorial `_
+ `naming tutorial `_
for more info on prefix and naming.
"""
return self._scope