Application-Level Evaluation

Put the explanation into the product and have it tested by the users. A good baseline is to compare the explanations with how a human would decide on the same cases.
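One way to make the human baseline concrete is an agreement rate: for each test case, record the factor the explanation points to and the factor a human names as decisive, then count how often they match. The sketch below is only an illustration under that assumption; the function name `agreement_rate` and the sample labels are hypothetical and do not come from any particular product or library.

```python
def agreement_rate(explanation_factors, human_factors):
    """Return the fraction of cases where the explanation and the human agree.

    Both arguments are equal-length lists with one entry per test case:
    the factor the explanation highlights and the factor the human names.
    """
    if len(explanation_factors) != len(human_factors):
        raise ValueError("both lists must cover the same cases")
    if not explanation_factors:
        raise ValueError("need at least one case")
    matches = sum(e == h for e, h in zip(explanation_factors, human_factors))
    return matches / len(explanation_factors)


# Hypothetical data: five cases from a user test (made-up labels).
human = ["income", "age", "debt", "income", "age"]
explanation = ["income", "age", "income", "income", "age"]

rate = agreement_rate(explanation, human)
print(f"Explanation agrees with the human on {rate:.0%} of cases")
```

A single number like this does not replace the user test itself; it only gives a reference point for how far the explanations sit from human reasoning on the same cases.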