New posts in bigdata
Pentaho Data Integration (PDI) 9.4 Marketplace missing, how to install Plugin now?
Jan 28, 2026
plugins
bigdata
pentaho
pentaho-spoon
pentaho-data-integration
What is the difference between the hive metastore in derby vs the one in hive/warehouse?
Jan 28, 2026
hadoop
hive
bigdata
How to train a Keras model with a very big dataset?
Jan 28, 2026
python
keras
bigdata
autoencoder
unsupervised-learning
Matching many files against many patterns in Java
Jan 22, 2026
java
string
algorithm
pattern-matching
bigdata
Hadoop: How to collect output of Reduce into a Java HashMap
Jan 22, 2026
hadoop
mapreduce
bigdata
similarity
cascading
Sqoop import job fails due to task timeout
Jan 20, 2026
hadoop
bigdata
sqoop
Neo4j's MERGE command on big datasets
Jan 03, 2026
merge
neo4j
bigdata
nodes
graph-databases
Data Modelling for Big Data
Jan 02, 2026
graph
hive
google-bigquery
arangodb
bigdata
Plot subplots from a very large file in gnuplot
Jan 01, 2026
plot
gnuplot
bigdata
What is the ideal format to store large results generated by R?
Dec 31, 2025
r
bigdata
mclapply
Read JSON files from a multi-line file in Spark Scala
Dec 30, 2025
json
scala
apache-spark
bigdata
Calculating unique URLs in a huge dataset (150+ billion)
Dec 23, 2025
java
bigdata
Hive - Out of Memory Exception - Java Heap Space
Dec 23, 2025
hadoop
hive
bigdata
Connect to Spark running on VM
Dec 23, 2025
apache-spark
virtualbox
bigdata
How do I take advantage of my local resources using Spark in local mode?
Dec 22, 2025
java
apache-spark
kotlin
bigdata
cluster-computing
Invalid URI for NameNode address, s3a is not of schema 'hdfs'
Dec 21, 2025
hadoop
hdfs
bigdata
ceph