Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in bigdata

How does Cassandra store null values?

cassandra bigdata

Tips for creating a very large database of hashes

Using Twitter Storm to process log data?

Wrapping R's plot function (or ggplot2) to prevent plotting of large data sets

r plot ggplot2 bigdata

Is it possible to run Python's scikit-learn algorithms over Hadoop? [closed]

Why does the author proposed the HBase Tall-Thin schema over Short-Wide described inside?

java hbase bigdata

Handling large String lists in java

Numpy efficient big matrix multiplication

Is it possible to read pdf/audio/video files(unstructured data) using Apache Spark?

hadoop apache-spark bigdata

Joining a large and a massive spark dataframe

Stream processing architecture

Generating a very large matrix of string combinations using combn() and bigmemory package

r combinatorics bigdata

doing PCA on very large data set in R

r bigdata pca

What is the best way to load huge result set in memory?

c# ado.net bigdata datareader

NumPy: 3-byte, 6-byte types (aka uint24, uint48)

python numpy bigdata

NoSQL or RDBMS for audit data

Is there a good way to avoid memory deep copy or to reduce time spent in multiprocessing?

Social-networking: Hadoop, HBase, Spark over MongoDB or Postgres?

What is the difference between broadcast_address and broadcast_rpc_address in cassandra.yaml?

cassandra bigdata

Getting exception : java.lang.NoSuchMethodError: scala.reflect.api.JavaUniverse.runtimeMirror(Ljava/lang/ClassLoader;) while using data frames