2026-05-24•By Mohamed Mohamoud

Why Museum Search Is Broken — And What Semantic Search Fixes

Museum search requires you to already know what you're looking for. Semantic search inverts that — you describe a memory, a mood, a visual impression, and the collection finds you.

Why Museum Search Is Broken — And What Semantic Search Fixes

Try this on the National Gallery of Art's own website. You remember a painting — warm candlelight, a solitary figure, probably Dutch, something melancholy about the light. You type "warm candlelight Dutch melancholy" into the search box.

You get nothing useful. Or you get results sorted by accession number. Or the system asks you to refine your medium, date range, and nationality before it will try.

The NGA's search is not unusual. Almost every major museum search works the same way: it matches against catalog fields. Title. Artist. Medium. Accession number. Date range. Classification. These fields are useful if you already know what you're looking for. They are nearly useless if you don't.

The Fundamental Problem

Museum catalog search is built around the assumption that search is a lookup. You have a target — a specific Rembrandt, a particular genre — and you use the interface to retrieve it. The system is optimized for provenance research, acquisition tracking, and curatorial retrieval. It was never designed for discovery.

The consequence is a hard prerequisite: to find a work, you must already know something precise about it. Artist name. A word from the title. A classification term the cataloger happened to use. If your mental model of the work doesn't match the catalog vocabulary, you're out.

flowchart LR
    A["What you remember\n'warm light, woman reading, Dutch'"]
    B["Keyword search"]
    C["Catalog fields\ntitle · artist · medium · date"]
    D["No match"]

    A --> B
    B --> C
    C --> D

    classDef memory fill:#f5f0e8,stroke:#a86845,color:#2c2926;
    classDef system fill:#ebe5d9,stroke:#6f685f,color:#2c2926;
    classDef fields fill:#fcf9f2,stroke:#d1c7b7,color:#2c2926;
    classDef result fill:#fffaf0,stroke:#a86845,color:#2c2926;
    class A memory;
    class B system;
    class C fields;
    class D result;

The painting you're thinking of is almost certainly in the collection. The collection knows everything about it. But the interface can't translate between how you remember a painting and how a cataloger described it in 1968.

What Semantic Search Changes

Retrievals is built around a different model. Instead of matching your query against catalog fields, it embeds your query and every artwork in the collection into the same mathematical space — a space where meaning, not vocabulary, determines proximity.

"Warm candlelight, solitary figure, Dutch" lands near a cluster of works that share those visual and emotional properties. Not because those words appear in their catalog records, but because the model that embedded them understood that those descriptions and those images occupy the same region of semantic space.

flowchart LR
    A["What you remember\n'warm light, woman reading, Dutch'"]
    B["Qwen3-VL\nembedding model"]
    C[("Shared\nsemantic space")]
    D["Nearest artworks\nby visual meaning"]
    E["Vermeer · de Hooch\nRembrandt · Leyster"]

    A --> B
    B --> C
    C --> D
    D --> E

    classDef memory fill:#f5f0e8,stroke:#a86845,color:#2c2926;
    classDef model fill:#ebe5d9,stroke:#6f685f,color:#2c2926;
    classDef space fill:#fcf9f2,stroke:#2c2926,color:#2c2926,stroke-width:2px;
    classDef result fill:#fffaf0,stroke:#a86845,color:#2c2926;
    class A memory;
    class B model;
    class C space;
    class D,E result;

The query doesn't need to match any field. It doesn't need to be a correct art-historical term. It just needs to describe, in whatever language comes naturally, what you remember or what you want to feel.

The Difference in Practice

Here's a direct comparison. The same intent, two systems.

Query	NGA keyword search	Retrievals
"warm candlelight solitary figure"	0 results	Rembrandt, de La Tour, Honthorst
"impressionist water reflections peaceful"	Requires medium filter + date range	Monet, Pissarro, Sisley — top results
"woman reading letter Dutch interior"	Matches if "letter" is in the title	Vermeer's Woman Reading a Letter, de Hooch's interiors
"dark stormy landscape dramatic sky"	Requires knowing "Romanticism" as a term	Turner, Rubens, Cole — by visual content
"melancholy figure in blue light"	No catalog field for mood	Works where that emotional register is visible

The last row matters most. Catalog search has no field for mood. No field for the quality of light. No field for how a painting makes you feel when you stand in front of it. Semantic search makes those things searchable because it operates on the visual and semantic content of the work itself, not on what a cataloger wrote about it.

What This Doesn't Fix

Semantic search changes what you can ask. It doesn't change what's in the collection.

The NGA's open-access dataset covers 68,816 objects — a substantial fraction of their holdings, with strong representation in Dutch Golden Age painting, American art, French Impressionism, and works on paper. But works still under copyright, three-dimensional objects, and some media types are absent. If the painting you're thinking of isn't in the dataset, the search won't find it.

The system also works best when you describe visual qualities — light, composition, color, mood, subject matter. It handles that register far better than highly specific iconographic terminology ("annunciation with lily" will work; obscure hagiographic references may not surface what you want if the catalog record doesn't support the visual signal).

Within those limits, it handles the cases that keyword search fails on entirely: the half-remembered image, the feeling you want to recreate, the style you can describe but not name.

The Underlying Shift

The difference between keyword search and semantic search is a difference in what the system assumes about the user. Keyword search assumes you know the vocabulary of the domain. Semantic search assumes you know what you want to find.

For a museum collection, that shift matters. The people most likely to benefit from a 68,000-object archive are not the scholars who already know the catalog. They're the people who remember standing in front of a painting as a child and wanting to find it again. The people who know the feeling a work produced but not the name of the artist who made it.

Retrievals is built for that second group.

Search the collection →

2026-05-24•By Mohamed Mohamoud

Why Museum Search Is Broken — And What Semantic Search Fixes

Museum search requires you to already know what you're looking for. Semantic search inverts that — you describe a memory, a mood, a visual impression, and the collection finds you.

Why Museum Search Is Broken — And What Semantic Search Fixes

You get nothing useful. Or you get results sorted by accession number. Or the system asks you to refine your medium, date range, and nationality before it will try.

The Fundamental Problem

flowchart LR
    A["What you remember\n'warm light, woman reading, Dutch'"]
    B["Keyword search"]
    C["Catalog fields\ntitle · artist · medium · date"]
    D["No match"]

    A --> B
    B --> C
    C --> D

    classDef memory fill:#f5f0e8,stroke:#a86845,color:#2c2926;
    classDef system fill:#ebe5d9,stroke:#6f685f,color:#2c2926;
    classDef fields fill:#fcf9f2,stroke:#d1c7b7,color:#2c2926;
    classDef result fill:#fffaf0,stroke:#a86845,color:#2c2926;
    class A memory;
    class B system;
    class C fields;
    class D result;

What Semantic Search Changes

flowchart LR
    A["What you remember\n'warm light, woman reading, Dutch'"]
    B["Qwen3-VL\nembedding model"]
    C[("Shared\nsemantic space")]
    D["Nearest artworks\nby visual meaning"]
    E["Vermeer · de Hooch\nRembrandt · Leyster"]

    A --> B
    B --> C
    C --> D
    D --> E

    classDef memory fill:#f5f0e8,stroke:#a86845,color:#2c2926;
    classDef model fill:#ebe5d9,stroke:#6f685f,color:#2c2926;
    classDef space fill:#fcf9f2,stroke:#2c2926,color:#2c2926,stroke-width:2px;
    classDef result fill:#fffaf0,stroke:#a86845,color:#2c2926;
    class A memory;
    class B model;
    class C space;
    class D,E result;

The Difference in Practice

Here's a direct comparison. The same intent, two systems.

Query	NGA keyword search	Retrievals
"warm candlelight solitary figure"	0 results	Rembrandt, de La Tour, Honthorst
"impressionist water reflections peaceful"	Requires medium filter + date range	Monet, Pissarro, Sisley — top results
"woman reading letter Dutch interior"	Matches if "letter" is in the title	Vermeer's Woman Reading a Letter, de Hooch's interiors
"dark stormy landscape dramatic sky"	Requires knowing "Romanticism" as a term	Turner, Rubens, Cole — by visual content
"melancholy figure in blue light"	No catalog field for mood	Works where that emotional register is visible

What This Doesn't Fix

Semantic search changes what you can ask. It doesn't change what's in the collection.

Within those limits, it handles the cases that keyword search fails on entirely: the half-remembered image, the feeling you want to recreate, the style you can describe but not name.

The Underlying Shift

Retrievals is built for that second group.

Search the collection →