Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in bigdata

R ff package ffsave 'zip' not found

r bigdata ffbase

AWS Glue convert files from JSON to Parquet with same partitions as source table

Which data structure to store binary strings and query with hamming distane

How does Cassandra store null values?

cassandra bigdata

Tips for creating a very large database of hashes

Using Twitter Storm to process log data?

Wrapping R's plot function (or ggplot2) to prevent plotting of large data sets

r plot ggplot2 bigdata

Is it possible to run Python's scikit-learn algorithms over Hadoop? [closed]

Why does the author proposed the HBase Tall-Thin schema over Short-Wide described inside?

java hbase bigdata

Handling large String lists in java

Numpy efficient big matrix multiplication

Is it possible to read pdf/audio/video files(unstructured data) using Apache Spark?

hadoop apache-spark bigdata

Joining a large and a massive spark dataframe

Stream processing architecture

Generating a very large matrix of string combinations using combn() and bigmemory package

r combinatorics bigdata

doing PCA on very large data set in R

r bigdata pca

What is the best way to load huge result set in memory?

c# ado.net bigdata datareader

NumPy: 3-byte, 6-byte types (aka uint24, uint48)

python numpy bigdata

NoSQL or RDBMS for audit data

Is there a good way to avoid memory deep copy or to reduce time spent in multiprocessing?