Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in bigdata
How to rename huge amount of files in Hadoop/Spark?
Nov 13, 2022
hadoop
parallel-processing
bigdata
apache-spark
What happens if an RDD can't fit into memory in Spark? [duplicate]
Sep 02, 2021
scala
hadoop
apache-spark
bigdata
How to get the first not null value from a column of values in Big Query?
Nov 13, 2022
sql
bigdata
google-bigquery
How do Dask dataframes handle larger-than-memory datasets?
Apr 17, 2022
python
dask
bigdata
What is the difference between "predicate pushdown" and "projection pushdown"?
Aug 17, 2022
apache-spark
bigdata
parquet
Hadoop - Hive : Delete data which is older than specified no of days
Sep 23, 2022
hadoop
hive
bigdata
updating Hive external table with HDFS changes
Jan 31, 2019
hadoop
hive
bigdata
hiveql
Recreation of mapping elastic search
Sep 29, 2022
elasticsearch
logstash
kibana
bigdata
Python. Pandas. BigData. Messy TSV file. How to wrangle the data?
Jun 23, 2022
python
pandas
numpy
data-analysis
bigdata
Hbase - How to get column names in a table?
Nov 20, 2022
hadoop
hbase
bigdata
When to use dynamoDB -UseCases
May 16, 2022
nosql
bigdata
amazon-dynamodb
Understanding and building a social network algorithm
May 19, 2017
algorithm
social-networking
graph-algorithm
bigdata
Finding Minimum hamming distance of a set of strings in python
Nov 10, 2022
python
algorithm
bigdata
hamming-distance
Bigtable / HBase: Rich column family vs a single JSON Object
Feb 27, 2022
json
hbase
google-cloud-bigtable
bigdata
nosql
how to load json file greater than 10gb in pandas/python of a particular pattern
Nov 06, 2022
python
pandas
bigdata
POC for Hadoop in real time scenario
Jan 29, 2020
hadoop
real-time
bigdata
hadoop-streaming
How to install Apache Zeppelin on existing Apache Spark standalone cluster
Sep 07, 2022
amazon-web-services
apache-spark
bigdata
apache-spark-sql
apache-zeppelin
Skipping the first line of the .csv in Map reduce java
Jun 24, 2022
java
mapreduce
bigdata
High Level Java Optimization
Oct 14, 2022
java
algorithm
language-agnostic
distributed
bigdata
How to calculate 5^262144 in Erlang
Nov 14, 2021
math
erlang
elixir
bigdata
« Newer Entries
Older Entries »