I'm trying to access the Tensorboard for the tensorflow_resnet_cifar10_with_tensorboard
example, but not sure what the url should be, the help text gives 2 options:
You can access TensorBoard locally at http://localhost:6006 or using your SageMaker notebook instance proxy/6006/(TensorBoard will not work if forget to put the slash, '/', in end of the url). If TensorBoard started on a different port, adjust these URLs to match.
When it says access locally, does that mean the local container Sagemaker creates in AWS? If so, how do I get there?
Or if I use run_tensorboard_locally=False
, what should the proxy url be?
This is where an Amazon SageMaker endpoint steps in – an Amazon SageMaker endpoint is a fully managed service that allows you to make real-time inferences via a REST API.
Domain: An Amazon SageMaker Domain consists of an associated Amazon Elastic File System (Amazon EFS) volume; a list of authorized users; and a variety of security, application, policy, and Amazon Virtual Private Cloud (Amazon VPC) configurations. An AWS account is limited to one Domain per Region.
Here is my solution:
If URL of my sagemaker notebook instance is:
https://myinstance.notebook.us-east-1.sagemaker.aws/notebooks/image_classify.ipynb
And URL of accessing TensorBoard will be:
https://myinstance.notebook.us-east-1.sagemaker.aws/proxy/6006/
You can access TensorBoard on your notebook using the link "proxy/6006".
If you set run_tensorboard_locally=False then it won't start TensorBoard.
If the URL you clicked gives you the error "[Errno 111] Connection refused" then it seems that training has already stopped. According to https://github.com/aws/sagemaker-python-sdk it "terminates TensorBoard when the execution ends" so it seems you have to access it during the training step only.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With