I am trying to submit an experiment in Azure Machine Learning service locally on an Azure VM, using a ScriptRunConfig object in my workspace ws, as in:
from azureml.core import ScriptRunConfig
from azureml.core.runconfig import RunConfiguration
from azureml.core import Experiment
experiment = Experiment(ws, name='test')
run_local = RunConfiguration()
script_params = {
    '--data-folder': './data',
    '--training-data': 'train.csv'
}

src = ScriptRunConfig(source_directory='./source_dir',
                      script='train.py',
                      run_config=run_local,
                      arguments=script_params)
run = experiment.submit(src)
However, this fails with
ExperimentExecutionException: { "error_details": { "correlation": { "operation": "bb12f5b8bd78084b9b34f088a1d77224", "request": "iGfp+sjC34Q=" }, "error": { "code": "UserError", "message": "Failed to deserialize run definition"
Worse, if I set my data folder to use a datastore (which I will likely need to do):
script_params = {
    '--data-folder': ds.path('mydatastoredir').as_mount(),
    '--training-data': 'train.csv'
}
the error is
UserErrorException: Dictionary with non-native python type values are not supported in runconfigs.
{'--data-folder': $AZUREML_DATAREFERENCE_d93269a580ec4ecf97be428cd2fe79, '--training-data': 'train.csv'}
I don't quite understand how I should pass my script_params parameters to my train.py (unfortunately the documentation of ScriptRunConfig doesn't go into much detail on this).
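For context, train.py reads these two arguments with standard argparse parsing, along these lines (a simplified sketch of the actual script):

import argparse
import os
import pandas as pd

# Simplified sketch: parse the two arguments passed via script_params
parser = argparse.ArgumentParser()
parser.add_argument('--data-folder', type=str, dest='data_folder')
parser.add_argument('--training-data', type=str, dest='training_data')
args = parser.parse_args()

# Load the training data from the (local or mounted) data folder
train_df = pd.read_csv(os.path.join(args.data_folder, args.training_data))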
Does anybody know how to properly create src in these two cases?
In the end I abandoned ScriptRunConfig and used Estimator as follows to pass script_params (after having provisioned a compute target):
estimator = Estimator(source_directory='./mysourcedir',
                      script_params=script_params,
                      compute_target='cluster',
                      entry_script='train.py',
                      conda_packages=["pandas"],
                      pip_packages=["git+https://github.com/..."],
                      use_docker=True,
                      custom_docker_image='<mydockeraccount>/<mydockerimage>')
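Here Estimator comes from azureml.train.estimator, and script_params is the same dictionary as in the question, including the datastore mount, which Estimator accepts as a dictionary value (unlike ScriptRunConfig):

from azureml.train.estimator import Estimator

# Same dictionary as in the question; the mount is resolved at run time on the compute target
script_params = {
    '--data-folder': ds.path('mydatastoredir').as_mount(),
    '--training-data': 'train.csv'
}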
This also allowed me to install my pip_packages dependencies by pointing custom_docker_image to a Docker image I pushed to https://hub.docker.com/, created from a Dockerfile like:
FROM continuumio/miniconda
RUN apt-get update
RUN apt-get install git gcc g++ -y
(it worked!)
The correct way of passing arguments to ScriptRunConfig and RunConfiguration is as a list of strings, according to https://learn.microsoft.com/nb-no/python/api/azureml-core/azureml.core.runconfiguration?view=azure-ml-py.
The modified, working code would be as follows:
from azureml.core import ScriptRunConfig
from azureml.core.runconfig import RunConfiguration
from azureml.core import Experiment
experiment = Experiment(ws, name='test')
run_local = RunConfiguration()
script_params = [
    '--data-folder',
    './data',
    '--training-data',
    'train.csv'
]

src = ScriptRunConfig(source_directory='./source_dir',
                      script='train.py',
                      run_config=run_local,
                      arguments=script_params)
run = experiment.submit(src)
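For the datastore case, one approach (a sketch I have not verified end to end) is to register the DataReference on the RunConfiguration via data_references and pass its string form in the arguments list; the string expands to the mounted path when the run executes:

# Sketch for the datastore case: register the data reference on the run config
ds_ref = ds.path('mydatastoredir').as_mount()
run_local.data_references = {ds_ref.data_reference_name: ds_ref.to_config()}

# Pass the reference as a plain string argument; it resolves to the mount path at run time
script_params = [
    '--data-folder', str(ds_ref),
    '--training-data', 'train.csv'
]

src = ScriptRunConfig(source_directory='./source_dir',
                      script='train.py',
                      run_config=run_local,
                      arguments=script_params)
run = experiment.submit(src)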