Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in bigdata

Processing each row of a large database table in Python

python bigdata psycopg2

How to compute the distance matrix in spark?

HIVE> FAILED: SemanticException Line 1:23 Invalid path

hive bigdata

Is there a faster way than fread() to read big data?

r data.table bigdata fread

How to produce massive amount of data?

java hadoop nutch bigdata

Any good tools to make 3D data visualizations for Big Data? [closed]

Calculate Euclidean distance matrix using a big.matrix object

Pig - ERROR 1045: AVG as multiple or none of them fit. Please use an explicit cast

How do I turn a JSON file into a Java 8 Object Stream?

java arrays json java-8 bigdata

How to transform a categorical variable in Spark into a set of columns coded as {0,1}?

How do I increase decimal precision in Spark?

R: Is it possible to parallelize / speed-up the reading in of a 20 million plus row CSV into R?

Can RethinkDB handle large data sets (TB+) and serve as DB for an OLAP app?

bigdata olap rethinkdb

Does a flatMap in spark cause a shuffle?

scala apache-spark bigdata

How can I add a column with a value to a new Dataset in Spark Java?

Skewed tables in Hive

hadoop hive bigdata

Is a good idea to store chat messages in a mongodb collection?