I am trying to get MLflow running against another machine on a local network, and I would like to ask for some help because I don't know what to do now.
I have an MLflow tracking server running on a server. It runs under my user and was started like this:
mlflow server --host 0.0.0.0 --port 9999 --default-artifact-root sftp://<MYUSERNAME>@<SERVER>:<PATH/TO/DIRECTORY/WHICH/EXISTS>
My program, which should log all the data to the MLflow server, looks like this:
from mlflow import log_metric, log_param, log_artifact, set_tracking_uri

if __name__ == "__main__":
    remote_server_uri = '<SERVER>'  # this value has been replaced
    set_tracking_uri(remote_server_uri)

    # Log a parameter (key-value pair)
    log_param("param1", 5)

    # Log a metric; metrics can be updated throughout the run
    log_metric("foo", 1)
    log_metric("foo", 2)
    log_metric("foo", 3)

    # Log an artifact (output file)
    with open("output.txt", "w") as f:
        f.write("Hello world!")
    log_artifact("output.txt")
The parameters and metrics get transferred to the server, but the artifacts do not. Why is that?
Note on the SFTP part: I can log in via SFTP, and the pysftp package is installed.
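In case it helps with debugging, here is a quick stdlib-only check (the user, host, and path below are placeholders, not my real values) that the sftp:// artifact URI is at least well-formed:

```python
from urllib.parse import urlparse

def split_sftp_uri(uri):
    """Break an sftp:// URI into the pieces an SFTP artifact store
    needs: user, host, and remote path."""
    parsed = urlparse(uri)
    if parsed.scheme != "sftp":
        raise ValueError("expected an sftp:// URI, got %r" % uri)
    return parsed.username, parsed.hostname, parsed.path

if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(split_sftp_uri("sftp://alice@myserver/data/mlruns"))
    # → ('alice', 'myserver', '/data/mlruns')
```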
I guess your problem is that you also need to create the experiment using the SFTP remote storage:
mlflow.create_experiment("my_experiment", artifact_location=sftp_uri)
This fixed it for me.
I don't know if I will get an answer to my problem, but I did solve it this way:
On the server I created the directory /var/mlruns and pass it to mlflow via --backend-store-uri file:///var/mlruns. Then I mount this directory on my local machine under the same path, e.g. via sshfs.
I don't like this solution, but it solved the problem well enough for now.
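For reference, the workaround above amounts to something like the following (username, server, and port are placeholders; this is a sketch of the procedure, not a script to run as-is):

```shell
# On the server: create the store and point mlflow at it.
mkdir -p /var/mlruns
mlflow server --host 0.0.0.0 --port 9999 --backend-store-uri file:///var/mlruns

# On the client: mount the same directory at the same path, so the
# file:// artifact locations recorded by the server resolve locally too.
mkdir -p /var/mlruns
sshfs <MYUSERNAME>@<SERVER>:/var/mlruns /var/mlruns
```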