The annotated replies feature lets you provide customized, high-quality question-and-answer responses by manually editing and annotating replies.
Applicable scenarios:
The annotated replies feature essentially provides a second retrieval-augmented path: when a user question matches a saved annotation, the saved reply is returned directly, which bypasses the LLM generation phase and avoids the hallucination issues of RAG.
Enable the annotated replies switch by navigating to “Orchestrate -> Add Features”:
When enabling, you need to set the parameters for annotated replies, which include: Score Threshold and Embedding Model.
Score Threshold: This sets the similarity threshold for matching annotated replies. Only annotations with scores above this threshold will be recalled.
Embedding Model: This is used to vectorize the annotated text. Changing the model will regenerate the embeddings.
Click save and enable, and the settings will take effect immediately. The system will generate embeddings for all saved annotations using the embedding model.
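The matching behavior described above can be sketched as follows. This is a minimal illustration, not the product's implementation: `embed` is a toy bag-of-words stand-in for a real embedding model, and the names `cosine` and `match_annotation` are hypothetical. It only shows how a score threshold decides whether a saved annotation is recalled or the request falls through to the LLM.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model (assumption): bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def match_annotation(query, annotations, score_threshold):
    """Return (answer, score) for the best annotation scoring above the
    threshold, or None when nothing qualifies (the LLM would answer instead)."""
    q = embed(query)
    best = None
    for question, answer in annotations:
        score = cosine(q, embed(question))
        if score > score_threshold and (best is None or score > best[1]):
            best = (answer, score)
    return best


annotations = [("how do i reset my password", "Go to Settings > Security.")]
hit = match_annotation("How do I reset my password", annotations, 0.9)
miss = match_annotation("what is the weather", annotations, 0.9)
```

Here `hit` carries the saved answer because the question matches closely, while `miss` is `None` because no annotation clears the threshold. Raising the threshold makes matching stricter; lowering it recalls annotations for more loosely related questions.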
You can add or edit annotations directly on a model response in the debug and preview pages.
Edit the response into the high-quality reply you want, then save it.
Re-enter the same user question, and the system will use the saved annotation to reply to the user’s question directly.
Enable the annotated replies switch by navigating to “Logs & Ann. -> Annotations”:
The parameters that can be set for annotated replies include: Score Threshold and Embedding Model.
Score Threshold: This sets the similarity threshold for matching annotated replies. Only annotations with scores above this threshold will be recalled.
Embedding Model: This is used to vectorize the annotated text. Changing the model will regenerate the embeddings.
In the bulk import feature, you can download the annotation import template, edit the annotated Q&A pairs according to the template format, and then import them in bulk.
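If you keep Q&A pairs in another system, you can generate the import file with a short script. The sketch below assumes the template is a two-column CSV with `question` and `answer` headers; that layout is an assumption, so check the downloaded template and adjust the header names to match. The function name `build_annotation_csv` is hypothetical.

```python
import csv
import io


def build_annotation_csv(pairs, headers=("question", "answer")):
    """Render (question, answer) pairs as CSV text.

    The header names are an assumption; use the ones from the
    downloaded import template.
    """
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer)
    writer.writerow(headers)
    for question, answer in pairs:
        # csv.writer quotes fields containing commas, quotes, or newlines.
        writer.writerow([question.strip(), answer.strip()])
    return buffer.getvalue()


csv_text = build_annotation_csv(
    [
        ("How do I reset my password?", "Go to Settings, then Security."),
        ("What are your support hours?", "Weekdays, 9:00 to 17:00."),
    ]
)
```

Write `csv_text` to a file and upload it through the bulk import dialog.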
Through the bulk export feature, you can export all saved annotated Q&A pairs in the system at once.
In the annotation hit history feature, you can view the annotation's edit history and a record of every hit on it: the user question that triggered the hit, the answer returned, the source of the hit, the matching similarity score, the hit time, and other information. You can use this information to continuously improve your annotated content.