Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in bigdata

How to produce massive amount of data?

java hadoop nutch bigdata

Any good tools to make 3D data visualizations for Big Data? [closed]

Calculate Euclidean distance matrix using a big.matrix object

Pig - ERROR 1045: AVG as multiple or none of them fit. Please use an explicit cast

How do I turn a JSON file into a Java 8 Object Stream?

java arrays json java-8 bigdata

How to transform a categorical variable in Spark into a set of columns coded as {0,1}?

How do I increase decimal precision in Spark?

R: Is it possible to parallelize / speed-up the reading in of a 20 million plus row CSV into R?

Can RethinkDB handle large data sets (TB+) and serve as DB for an OLAP app?

bigdata olap rethinkdb

Does a flatMap in spark cause a shuffle?

scala apache-spark bigdata

How can I add a column with a value to a new Dataset in Spark Java?

Skewed tables in Hive

hadoop hive bigdata

Is a good idea to store chat messages in a mongodb collection?

fitting a linear mixed model to a very large data set

How to efficiently store and query a billion rows of sensor data

Python Pandas: Convert 2,000,000 DataFrame rows to Binary Matrix (pd.get_dummies()) without memory error?

How Apache Apex is different from Apache Storm?