Research Guides: Library Search AI Research Assistant: How does the Research Assistant work?

Retrieval Augmented Generation (RAG)

Library Search AI Research Assistant uses a Retrieval Augmented Generation (RAG) architecture as follows to provide responses:

The user's question is sent to the LLM (Large Language Model), where it is converted to a Boolean query that contains a number of variations of the query, connected with an OR. If the query is non-English, some of the variations will be in the query language, and the other variations will be in English.
- Because it relies on the language capabilities of the LLM, support for local languages may vary. Currently, it uses Open AI's GPT-4o mini, but that may change in the future.
The boolean query is sent to the Central Discovery Index (CDI) to retrieve the results. It uses the entirety of CDI metadata and abstracts with the following exceptions:
- News content (Newspaper articles, Newsletters, Text resources).
- Sources with insufficient metadata and abstracts to effectively run the tool.
- Documents marked as withdrawn or retracted; retraction notes.
- Any collections from the following content providers: APA, DataCite, Elsevier, JSTOR, Kogan Page, Conde Nast.
- Any content published by the providers above coming via aggregator collections.
The top results (currently up to 30) are re-ranked using embeddings to optimize the result based on the query match.
The top five results are sent to the LLM with the instructions to create the overview with inline references, based on the abstracts.
The overview and sources are returned to the user in the response.