I used the following command on Ubuntu to log in over SSH:
ssh user@hostname_or_IP
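When the login hangs or is refused, it can help to first check whether TCP port 22 is reachable at all from the client. The following is a minimal sketch, assuming bash and the coreutils `timeout` command are installed; `check_ssh_port` is a hypothetical helper name used here for illustration, not part of any Azure or Databricks tooling.

```shell
#!/usr/bin/env bash
# check_ssh_port is a hypothetical helper, not part of any Azure or Databricks tooling.
# Usage: check_ssh_port HOST [PORT] [TIMEOUT_SECONDS]
check_ssh_port() {
  local host="$1" port="${2:-22}" wait="${3:-5}"
  # bash's /dev/tcp pseudo-device attempts a plain TCP connection.
  # timeout aborts the attempt if a firewall silently drops the packets.
  if timeout "$wait" bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$host" "$port" 2>/dev/null; then
    echo "port $port on $host is reachable"
  else
    echo "port $port on $host is NOT reachable"
    return 1
  fi
}
```

For example, `check_ssh_port hostname_or_IP 22` reports whether the port answers. If it does not, the block is at the network level rather than in the SSH credentials, and `ssh -v user@hostname_or_IP` will show the connection attempt stalling before any key exchange.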
Refer to this doc: https://docs.azuredatabricks.net/user-guide/clusters/spark-config.html#spark-config
With autoscaling local storage, Databricks monitors the amount of free disk space available on your cluster's Spark workers. If a worker begins to run too low on disk, Databricks automatically attaches a new EBS volume to the worker before it runs out of disk space.
Unfortunately, we cannot SSH to the cluster for now.
I did a test in my lab:
There used to be an SSH section in the cluster configuration, but we can no longer see it there.
Also, I found the VMs behind Databricks in a resource group. I tried to change the SSH configuration from the portal but failed. Then I found that the Databricks resource group has been locked as read-only. You cannot delete it from the portal either.
I tried to find out why we cannot SSH to the cluster behind Databricks, so I looked at the NSG rule of the VMs that belong to it:
It means that Azure Databricks allows only one source to SSH to the VMs, and that source is the Databricks control plane. We can also see this in the architecture diagram of Azure Databricks:
Azure Databricks is a very new feature in Azure. I believe it will get better in the future. You can post your idea in the Azure Feedback Forum or in this blog's comments. The Azure team will review it.
Hope this helps!