I've no preference as regards OS; any tool will be fine so long as it allows me to measure cache performance on Core 2 and i7 architectures.
This team have a product http://www.jclarity.com/ which is new but use the CPU performance counters to extract cache hit/miss information and other low level stuff.
It seems to me that the only tool that does this at present is Intel VTune Performance Analyzer. It's not horrifically expensive, but it doesn't come cheap, either.
This article by Intel outlines processes to improve Java app performance using this tool.
I'm not sure if that would be enough for you, but I used several times JVisualVM (included in the JDKs, at least the Sun one) as a profiling tool and I was pretty happy with it.
To find it, go to: JAVA_HOME/bin/jvisualvm(.exe)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With