New posts in parquet
Why does Zeppelin fail with "mismatched input ';' expecting <EOF>" in %spark.sql paragraph?
Feb 28, 2026
apache-spark
apache-spark-sql
parquet
apache-zeppelin
Writing a dask dataframe to parquet: 'TypeError'
Feb 27, 2026
python
dask
parquet
How to read parquet files from Azure Blobs into Pandas DataFrame?
Feb 23, 2026
azure
azure-blob-storage
parquet
How to write and read dataframe to parquet where column contains list of dicts
Feb 23, 2026
python
pandas
parquet
pyarrow
Azure Data Factory pipeline into compressed Parquet file: “java.lang.OutOfMemoryError: Java heap space”
Feb 20, 2026
azure
azure-blob-storage
azure-pipelines
parquet
azure-data-factory
Firehose JSON -> S3 Parquet -> ETL Spark, error: Unable to infer schema for Parquet
Feb 19, 2026
apache-spark
pyspark
parquet
amazon-kinesis
aws-glue
dask dataframe read parquet schema difference
Feb 18, 2026
python
dataframe
parquet
dask
Worker Behavior with two (or more) dataframes having the same key
Feb 17, 2026
apache-spark
pyspark
apache-spark-sql
partitioning
parquet
Create a Parquet-backed Hive table by using a schema file
Feb 17, 2026
hadoop
hive
schema
avro
parquet
How to perform parallel computation on Spark Dataframe by row?
Feb 15, 2026
python-3.x
pyspark
apache-spark-sql
parquet
pyarrow
Preserve parquet file names in PySpark
Feb 13, 2026
apache-spark
pyspark
apache-spark-sql
databricks
parquet
File compression formats and container file formats
Feb 14, 2026
hadoop
mapreduce
hadoop2
avro
parquet
How to catch exceptions.NoFilesFound error from awswrangler in Python 3
Feb 09, 2026
python
amazon-s3
exception
parquet
aws-data-wrangler
pyarrow.lib.Schema vs. pyarrow.parquet.Schema
Feb 08, 2026
python
pyspark
parquet
pyarrow
How to read from a text file (String type data), map it, and load the data into Parquet format (multiple columns with different data types) dynamically in Spark Scala
Feb 07, 2026
scala
apache-spark
sqoop
parquet
PyArrow: read single file from partitioned parquet dataset is unexpectedly slow
Feb 02, 2026
python
pandas
parquet
pyarrow