Classic ML shipped with clean report cards. Accuracy, precision, recall, F1. You could argue about tradeoffs, but at least everyone agreed on what “good” meant. LLM apps don’t give you that comfort. RAG answers can be fluent and wrong. Copilots can...