Implementing a parallel first-order Theorem Solver by JAVA/Scala (previously) and now C++/Charm++. Focusing on performance issue now.