Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in bigdata
Algorithm for counting common group memberships with big data
Jun 09, 2022
java
sql
algorithm
postgresql
bigdata
Apache Spark - How does internal job scheduler in spark define what are users and what are pools
Nov 11, 2022
scala
hadoop
apache-spark
bigdata
job-scheduling
Can Flink be used with Kotlin?
Aug 29, 2022
scala
kotlin
apache-flink
flink-streaming
bigdata
How to rename huge amount of files in Hadoop/Spark?
Nov 13, 2022
hadoop
parallel-processing
bigdata
apache-spark
What happens if an RDD can't fit into memory in Spark? [duplicate]
Sep 02, 2021
scala
hadoop
apache-spark
bigdata
How to get the first not null value from a column of values in Big Query?
Nov 13, 2022
sql
bigdata
google-bigquery
How do Dask dataframes handle larger-than-memory datasets?
Apr 17, 2022
python
dask
bigdata
What is the difference between "predicate pushdown" and "projection pushdown"?
Aug 17, 2022
apache-spark
bigdata
parquet
Hadoop - Hive : Delete data which is older than specified no of days
Sep 23, 2022
hadoop
hive
bigdata
updating Hive external table with HDFS changes
Jan 31, 2019
hadoop
hive
bigdata
hiveql
Recreation of mapping elastic search
Sep 29, 2022
elasticsearch
logstash
kibana
bigdata
Python. Pandas. BigData. Messy TSV file. How to wrangle the data?
Jun 23, 2022
python
pandas
numpy
data-analysis
bigdata
Hbase - How to get column names in a table?
Nov 20, 2022
hadoop
hbase
bigdata
When to use dynamoDB -UseCases
May 16, 2022
nosql
bigdata
amazon-dynamodb
Understanding and building a social network algorithm
May 19, 2017
algorithm
social-networking
graph-algorithm
bigdata
Finding Minimum hamming distance of a set of strings in python
Nov 10, 2022
python
algorithm
bigdata
hamming-distance
Bigtable / HBase: Rich column family vs a single JSON Object
Feb 27, 2022
json
hbase
google-cloud-bigtable
bigdata
nosql
how to load json file greater than 10gb in pandas/python of a particular pattern
Nov 06, 2022
python
pandas
bigdata
POC for Hadoop in real time scenario
Jan 29, 2020
hadoop
real-time
bigdata
hadoop-streaming
How to install Apache Zeppelin on existing Apache Spark standalone cluster
Sep 07, 2022
amazon-web-services
apache-spark
bigdata
apache-spark-sql
apache-zeppelin
« Newer Entries
Older Entries »