Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in bigdata
Why does Spark's OneHotEncoder drop the last category by default?
Aug 29, 2022
apache-spark
machine-learning
pyspark
one-hot-encoding
bigdata
Haskell: Can I perform several folds over the same lazy list without keeping list in memory?
Jan 11, 2022
performance
haskell
lazy-evaluation
bigdata
How to quickly export data from R to SQL Server
Sep 12, 2022
sql
sql-server
r
bigdata
Fastest way to compare row and previous row in pandas dataframe with millions of rows
Sep 12, 2022
python
performance
pandas
bigdata
cython
Python Shared Memory Dictionary for Mapping Big Data
Feb 07, 2022
python
dictionary
bigdata
python-multiprocessing
Strategies for reading in CSV files in pieces?
Oct 29, 2022
r
bigdata
Hive padding leading zeroes
Sep 12, 2022
sql
hive
bigdata
Books to start learning big data [closed]
Sep 06, 2022
hadoop
hbase
hive
pentaho
bigdata
How to copy data from one HDFS to another HDFS?
Sep 05, 2022
hadoop
hdfs
bigdata
sqoop
Best solution for finding 1 x 1 million set intersection? Redis, Mongo, other
Oct 06, 2022
mongodb
redis
bigdata
nosql
MongoDB as file storage
Sep 04, 2022
mongodb
storage
gridfs
bigdata
When do you start additional Elasticsearch nodes? [closed]
Sep 01, 2022
elasticsearch
sharding
bigdata
Determining optimal number of Spark partitions based on workers, cores and DataFrame size
Aug 31, 2022
apache-spark
spark-dataframe
distributed-computing
partitioning
bigdata
What methods can we use to reshape VERY large data sets?
Aug 31, 2022
r
performance
bigdata
reshape
Machine Learning & Big Data [closed]
Aug 31, 2022
machine-learning
bigdata
How can I tell when my dataset in R is going to be too large?
Aug 31, 2022
r
bigdata
logfile-analysis
scala.reflect.internal.MissingRequirementError: object java.lang.Object in compiler mirror not found
Mar 09, 2022
scala
apache-spark
bigdata
How to get started with Big Data Analysis [closed]
Aug 30, 2022
python
r
hadoop
bigdata
Recommended package for very large dataset processing and machine learning in R [closed]
Aug 30, 2022
r
machine-learning
signal-processing
bigdata
Is there something like Redis DB, but not limited with RAM size? [closed]
Aug 29, 2022
database
redis
nosql
bigdata
« Newer Entries
Older Entries »