Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Is spark sql like case sensitive?

Looks like spark sql is case sensitive for "like" queries, right ?

spark.sql("select distinct status, length(status)  from table")

Returns

Active|6

spark.sql("select distinct status  from table where status like '%active%'")

Returns no value

spark.sql("select distinct status  from table where status like '%Active%'")

Returns

 Active
like image 449
djohon Avatar asked Jan 02 '23 13:01

djohon


1 Answers

Yes, Spark is case sensitive. Most of the RDBMSs are case sensitive by default for string comparison. If you want case-insensitive, try rlike or convert the column to upper/lower case.

scala> val df = Seq(("Active"),("Stable"),("Inactive")).toDF("status")
df: org.apache.spark.sql.DataFrame = [status: string]

scala> df.createOrReplaceTempView("tbl")

scala> df.show
+--------+
|  status|
+--------+
|  Active|
|  Stable|
|Inactive|
+--------+


scala> spark.sql(""" select status from tbl where status like '%Active%' """).show
+------+
|status|
+------+
|Active|
+------+


scala> spark.sql(""" select status from tbl where lower(status) like '%active%' """).show
+--------+
|  status|
+--------+
|  Active|
|Inactive|
+--------+


scala>
like image 50
stack0114106 Avatar answered Jan 05 '23 02:01

stack0114106