Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in bigdata
Apache Spark ALS recommendations approach
Apr 03, 2020
apache-spark
machine-learning
bigdata
recommendation-engine
apache-spark-mllib
Spark 2.3 dynamic partitionBy not working on S3 AWS EMR 5.13.0
Nov 15, 2022
scala
apache-spark
amazon-s3
bigdata
amazon-emr
Akka for simulations
Nov 06, 2019
simulation
akka
bigdata
How do I submit a Spark jar to a EMR cluster?
Dec 10, 2019
amazon-web-services
mapreduce
apache-spark
bigdata
emr
R ff package ffsave 'zip' not found
Mar 06, 2022
r
bigdata
ffbase
AWS Glue convert files from JSON to Parquet with same partitions as source table
Sep 19, 2022
amazon-web-services
bigdata
aws-glue
Which data structure to store binary strings and query with hamming distane
May 02, 2018
distance
hamming-distance
bigdata
How does Cassandra store null values?
Oct 02, 2022
cassandra
bigdata
Tips for creating a very large database of hashes
Nov 06, 2022
database
hash
inverted-index
bigdata
Using Twitter Storm to process log data?
Nov 01, 2022
logging
bigdata
apache-storm
Wrapping R's plot function (or ggplot2) to prevent plotting of large data sets
Aug 17, 2022
r
plot
ggplot2
bigdata
Is it possible to run Python's scikit-learn algorithms over Hadoop? [closed]
Oct 22, 2022
python
hadoop
machine-learning
bigdata
scikit-learn
Why does the author proposed the HBase Tall-Thin schema over Short-Wide described inside?
Nov 04, 2022
java
hbase
bigdata
Handling large String lists in java
Sep 28, 2022
java
data-structures
bigdata
hashset
Numpy efficient big matrix multiplication
Jul 24, 2019
python
numpy
matrix
bigdata
pytables
Is it possible to read pdf/audio/video files(unstructured data) using Apache Spark?
May 04, 2022
hadoop
apache-spark
bigdata
Joining a large and a massive spark dataframe
Feb 15, 2022
python
apache-spark
dataframe
pyspark
bigdata
Stream processing architecture
May 17, 2022
java
bigdata
system-design
stream-processing
event-stream-processing
Generating a very large matrix of string combinations using combn() and bigmemory package
Aug 01, 2022
r
combinatorics
bigdata
doing PCA on very large data set in R
Sep 03, 2022
r
bigdata
pca
« Newer Entries
Older Entries »