When I ask customers how they deal with performance problems in MapReduce, they often tell me, beyond generic Hadoop tuning they don't bother. In my case adding more hardware would not have helped at all! Nevertheless I was able to speed up my pig script from over 15 hours to 70 minutes without adding any hardware, without expert help and within minutes of starting my investigation.

>>> Read the full blog by Michael Kopp!


Not Logged In? Customers and AJAX Edition Users Login with your Community Account