
Commit 04fe8c4

Merge pull request #1191 from savitamittal1/patch-4
Update README.md
2 parents 4980184 + d22e76d commit 04fe8c4

1 file changed: +19 -5 lines changed

how-to-use-azureml/azure-databricks/automl/README.md

Lines changed: 19 additions & 5 deletions
@@ -1,9 +1,21 @@
1-
# Adding an init script to an Azure Databricks cluster
1+
# Automated ML introduction
2+
Automated machine learning (automated ML) builds high-quality machine learning models for you by automating model and hyperparameter selection. Bring a labelled dataset that you want to build a model for, and automated ML will give you a high-quality machine learning model that you can use for predictions.
23

3-
The [azureml-cluster-init.sh](./azureml-cluster-init.sh) script configures the environment to
4-
1. Install the latest AutoML library
54

6-
To create the Azure Databricks cluster-scoped init script
5+
If you are new to data science, automated ML helps you get started by simplifying machine learning model building. It spares you from performing model selection and hyperparameter selection yourself and, in one step, creates a high-quality trained model for you to use.
6+
7+
If you are an experienced data scientist, automated ML helps increase your productivity by intelligently performing model and hyperparameter selection for your training. It generates high-quality models much more quickly than manually specifying several combinations of parameters and running training jobs. Automated ML also provides visibility into, and access to, all the training jobs and the performance characteristics of the models, so you can tune the pipeline further if you wish.
8+
9+
# Install Instructions using Azure Databricks
10+
11+
#### For Databricks non-ML runtime 7.1 (Scala 2.12, Spark 3.0.0) and up, install the Automated Machine Learning SDK by adding and running the following command as the first cell of your notebook. This installs the AutoML dependencies specific to your notebook.
12+
13+
%pip install --upgrade --force-reinstall -r https://aka.ms/automl_linux_requirements.txt
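
After the install cell has run, one way to confirm that the SDK landed in the notebook's environment is to check whether its package can be located. The sketch below uses only the Python standard library. The module name `azureml.train.automl` is an assumption about what the requirements file installs, and `is_importable` is a hypothetical helper, not part of the SDK.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if `module_name` can be located in the current environment."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except (ImportError, ValueError):
        # A missing parent package raises ModuleNotFoundError (an ImportError).
        return False


# Assumed module name; adjust to whatever the requirements file provides.
AUTOML_MODULE = "azureml.train.automl"

if is_importable(AUTOML_MODULE):
    print(f"{AUTOML_MODULE} is available")
else:
    print(f"{AUTOML_MODULE} not found; re-run the %pip install cell")
```

Running this in the cell after the `%pip` command gives a quick signal before any AutoML code is executed.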
14+
15+
16+
#### For Databricks non-ML runtime 7.0 and lower, install the Automated Machine Learning SDK using an init script, as shown below, before running the notebook.
17+
18+
**Create the Azure Databricks cluster-scoped init script 'azureml-cluster-init.sh' as below**
719

820
1. Create the base directory you want to store the init script in if it does not exist.
921
```
@@ -15,7 +27,7 @@ To create the Azure Databricks cluster-scoped init script
1527
dbutils.fs.put("/databricks/init/azureml-cluster-init.sh","""
1628
#!/bin/bash
1729
set -ex
18-
/databricks/python/bin/pip install -r https://aka.ms/automl_linux_requirements.txt
30+
/databricks/python/bin/pip install --upgrade --force-reinstall -r https://aka.ms/automl_linux_requirements.txt
1931
""", True)
2032
```
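
The script body passed to `dbutils.fs.put` above can also be assembled from a small helper, which makes the pip flags easy to inspect before anything is written to DBFS. This is a minimal sketch: `build_init_script` and `REQUIREMENTS_URL` are hypothetical names introduced for illustration, and the `dbutils` call is left commented out because `dbutils` is only predefined inside a Databricks notebook.

```python
# Hypothetical helper: builds the init script body shown above as a string.
REQUIREMENTS_URL = "https://aka.ms/automl_linux_requirements.txt"


def build_init_script(requirements_url: str = REQUIREMENTS_URL) -> str:
    """Return the init script text, with the shebang on the first line."""
    lines = [
        "#!/bin/bash",
        "set -ex",
        "/databricks/python/bin/pip install --upgrade --force-reinstall "
        f"-r {requirements_url}",
    ]
    return "\n".join(lines) + "\n"


script = build_init_script()
print(script)

# On a Databricks cluster the script could then be written with the same
# call the README uses:
# dbutils.fs.put("/databricks/init/azureml-cluster-init.sh", script, True)
```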
2133
@@ -24,6 +36,8 @@ To create the Azure Databricks cluster-scoped init script
2436
display(dbutils.fs.ls("dbfs:/databricks/init/azureml-cluster-init.sh"))
2537
```
2638
39+
**Install libraries to the cluster using the init script 'azureml-cluster-init.sh' created in the previous step**
40+
2741
1. Configure the cluster to run the script.
2842
* Using the cluster configuration page
2943
1. On the cluster configuration page, click the Advanced Options toggle.
