I'm writing a Spark job that needs to be runnable locally as well as on Databricks.
The code has to be slightly different in each environment (file paths), so I'm trying to find a way to detect whether the job is running on Databricks. The best way I have found so far is to look for a "dbfs" directory in the root directory and, if it's there, assume the job is running on Databricks. This doesn't feel like the right solution. Does anyone have a better idea?
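Roughly what I'm doing now (a minimal sketch of the directory check described above, not a recommendation):

import java.nio.file.{Files, Paths}

// Fragile heuristic: assume Databricks if the /dbfs mount point exists.
def looksLikeDatabricks: Boolean =
  Files.isDirectory(Paths.get("/dbfs"))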
When creating the cluster, click on Advanced Options => Enter Environment Variables. After creation: select your cluster => click on Edit => Advanced Options => Edit or Enter new Environment Variables => Confirm and Restart.
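For example, if you set a custom variable in the cluster's environment variables (the name IS_DATABRICKS=true below is hypothetical, chosen only for illustration), the job can read it at runtime:

// IS_DATABRICKS is an assumed custom variable set to "true" under the
// cluster's Advanced Options => Environment Variables.
def isDatabricksCluster: Boolean =
  sys.env.get("IS_DATABRICKS").contains("true")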
Options:
--cluster-id CLUSTER_ID  Can be found in the URL at https://<databricks-instance>/#/setting/clusters/$CLUSTER_ID/configuration.
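If you need the cluster ID from inside the job rather than from the URL, a sketch like the following may work; the conf key is an assumption here, is Databricks-specific, and is not part of open-source Spark:

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().getOrCreate()

// Assumed Databricks-specific tag; expected to be None when running locally.
val clusterId: Option[String] =
  spark.conf.getOption("spark.databricks.clusterUsageTags.clusterId")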
1. Magic command %pip: install Python packages and manage the Python environment. Databricks Runtime (DBR) and Databricks Runtime for Machine Learning (MLR) come with a set of Python and common machine learning (ML) libraries pre-installed, but the runtime may not include the specific library or version you need for the task at hand.
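For example, a notebook cell can install a missing package directly (the package name below is just an example):

%pip install openpyxl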
You can simply check for the existence of an environment variable, e.g.:

// DATABRICKS_RUNTIME_VERSION is set by the Databricks runtime itself.
def isRunningInDatabricks(): Boolean =
  sys.env.contains("DATABRICKS_RUNTIME_VERSION")
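For the original use case (environment-specific file paths), the check can then drive the path selection; the paths below are placeholders, not taken from the question:

// Illustrative placeholder paths only.
val basePath: String =
  if (isRunningInDatabricks()) "/dbfs/mnt/data"
  else "data/local"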