Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Can pyspark.sql.function be used in udf?
Feb 07, 2023
python
sql
apache-spark
pyspark
user-defined-functions
Is Apache Zeppelin stable enough to be used in Production
Feb 06, 2023
apache-spark
production
amazon-emr
apache-zeppelin
Scala Spark : Difference in the results returned by df.stat.sampleBy()
Feb 07, 2023
scala
apache-spark
Scala-Spark(version1.5.2) Dataframes split error
Feb 07, 2023
scala
apache-spark
spark-dataframe
How to retrieve yarn's logs programmatically using java
Feb 06, 2023
java
hadoop
apache-spark
hadoop-yarn
How to filter Spark dataframe by array column containing any of the values of some other dataframe/set
Feb 07, 2023
apache-spark
apache-spark-sql
how can I keep partition'number not change when I use window.partitionBy() function with spark/scala?
Feb 06, 2023
scala
apache-spark
apache-spark-sql
Access to WrappedArray elements
Feb 05, 2023
python
scala
apache-spark
pyspark
What is the main cause of "self-suppression not permitted" in Spark?
Feb 06, 2023
apache-spark
hdfs
Is garbage collection time part of execution time of a task in apache spark?
Feb 01, 2023
apache-spark
How should I write unit tests in Spark, for a basic data frame creation example?
Jan 31, 2023
scala
unit-testing
apache-spark
intellij-idea
Spark Dataframe Group by having New Indicator Column
Feb 01, 2023
scala
apache-spark
dataframe
apache-spark-sql
Spark dataframe: Pivot and Group based on columns
Jan 31, 2023
scala
hadoop
apache-spark
spark-dataframe
PySpark: How to check if a column contains a number using isnan [duplicate]
Jan 31, 2023
apache-spark
pyspark
Update Spark Dataframe's window function row_number column for Delta Data
Feb 01, 2023
scala
apache-spark
apache-spark-sql
Spark Scala : Getting Cumulative Sum (Running Total) Using Analytical Functions
Feb 06, 2023
sql
scala
apache-spark
apache-spark-sql
window-functions
How to drop all columns with null values in a PySpark DataFrame?
Feb 06, 2023
python
apache-spark
pyspark
apache-spark-sql
Spark2 Can't write dataframe to parquet hive table : HiveFileFormat`. It doesn't match the specified format `ParquetFileFormat`
Feb 06, 2023
apache-spark
hive
parquet
apache-spark-2.0
Rename nested struct columns in a Spark DataFrame [duplicate]
Feb 06, 2023
scala
apache-spark
dataframe
column-alias
Which method is better to check if a dataframe is empty ? `df.limit(1).count == 0` or `df.isEmpty`?
Feb 06, 2023
scala
apache-spark
apache-spark-sql
« Newer Entries
Older Entries »