Benchmarking LLMs for Trustworthy Multimedia Retrieval in Computational Biology Using Structured Zotero Graphs