New posts in parquet
Spark job with large text file in gzip format
Mar 14, 2023
hadoop
apache-spark
amazon-s3
apache-spark-sql
parquet
Read parquet files from HDFS using PyArrow
Mar 09, 2023
hdfs
parquet
pyarrow
Creating Hive table on top of multiple parquet files in s3
Mar 09, 2023
hadoop
apache-spark
hive
amazon-emr
parquet
How to save spark dataframe to parquet without using INT96 format for timestamp columns?
Mar 04, 2023
apache-spark
avro
parquet
Refresh metadata for Dataframe while reading parquet file
Mar 05, 2023
apache-spark
apache-spark-sql
parquet
apache-spark-dataset
UPSERT in parquet Pyspark
Mar 05, 2023
amazon-s3
pyspark
etl
parquet
How to load parquet file into Snowflake database?
Jan 03, 2023
database
parquet
snowflake-cloud-data-platform
Spark: Avro vs Parquet performance
Jan 04, 2023
apache-spark
avro
parquet
AWS Athena: HIVE_BAD_DATA ERROR: Field type DOUBLE in parquet is incompatible with type defined in table schema
Jan 02, 2023
hive
parquet
amazon-athena
pyarrow
How to change the location of _spark_metadata directory?
Dec 25, 2022
apache-spark
amazon-s3
parquet
spark-structured-streaming
Spark SQL - loading csv/psv files with some malformed records
Dec 09, 2022
csv
apache-spark
apache-spark-sql
parquet
Unable to get parquet-tools working from the command-line
Dec 01, 2022
parquet
Spark2 Can't write dataframe to parquet hive table: `HiveFileFormat`. It doesn't match the specified format `ParquetFileFormat`
Feb 06, 2023
apache-spark
hive
parquet
apache-spark-2.0
Read parquet data from ByteArrayOutputStream instead of file
Feb 05, 2023
java
parquet
bytearrayoutputstream