upload_results=False 传递给 evaluate() / aevaluate() 来执行此操作。
这将像往常一样运行您的应用程序和评估器并返回相同的输出,但不会将任何内容记录到 LangSmith。这不仅包括实验结果,还包括应用程序和评估器跟踪。
示例
让我们看一个示例: 需要langsmith>=0.2.0。示例还使用 pandas。
| inputs.question | outputs.answer | reference.answer | feedback.is_concise | |
|---|---|---|---|---|
| 0 | What is the largest mammal? | What is the largest mammal? is a good question. I don’t know the answer. | The blue whale | False |
| 1 | What do mammals and birds have in common? | What do mammals and birds have in common? is a good question. I don’t know the answer. | They are both warm-blooded | False |