I am looking for some performance review on Hadoop (300-600 boxes cluster, commodity hardware), especially on the following aspects:
This is not a specific question, maybe that is why nobody answered until now. Performance on 3-600 nodes cluster can be best analyzed with benchmarks.
However, I found some really interesting articles regarding Hadoop and its implementations in production:
I hope those links will get you started and give you all the info you need.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With