Evaluation metric example: RAG document relevance Workflow Solution

Workflow overview

Why this workflow matters

Supports knowledge capture and document intelligence use cases.

AI evaluation in AlekSystem This is a template for AlekSystem's evaluation feature. Evaluation is a technique for getting confidence that your AI workflow performs reliably, by running a test dataset containing different inputs through the workflow. By calculating a metric (score) for each input, you can see where the workflow is performing well and where it isn't. How it works This template shows how to calculate a workflow evaluation metric: retrieved document relevance (i.e. whether the information retrieved from a vector store is relevant to the question). The workflow takes a question and checks whether the information retrieved to answer it is relevant. To run this workflow, you need to insert documents into a vector data store, so that they can be retrieved by the agent to answer questions. You can do this by running the top part of the workflow once. The main workflow works as follows: We use an evaluation trigger to read in our dataset It is wired up in parallel with the regular trigger so that the workflow can be started from either one. More info We make sure that the agent outputs the list data from the tools that it used If we’re evaluating (i.e. the execution started from the evaluation trigger), we calculate the relevance metric using AI to compare the retrieved documents with the question We pass this information back to AlekSystem as a metric If we’re not evaluating we avoid calculating the metric, to reduce cost

Best fit

Services

Google SheetsAI AgentEmbeddings OpenAIOpenAI Chat ModelRecursive Character Text SplitterSimple Vector StoreDefault Data LoaderOpenAI

Use cases

document intelligence

Need another direction?

Continue a new search Request this workflow

Evaluation metric example: RAG document relevance Workflow Solution

Why this workflow matters

Categories

Services

Use cases

Related AlekSystem workflow ideas

Automated AI Timesheets for Consulting Teams

Executive AI Briefing and Follow-Up Assistant

Automated Multi-Channel Customer Support with Gmail, Telegram, and GPT AI Solution