mbg@portfolio:~/research-library · case study 04 · read 9 min
Quiet search --:--:--
$ cd case_04/research-library
// alpha → v1.5 · 1000+ studies indexed · xLOB in production

Dell Research Library.

Turning thousands of scattered research artifacts into a findable, reusable, company-wide asset. A custom cloud platform a small team and I architected, shipped, and iterated from alpha to v1.5.

Role
Lead & Architect
From concept through v1.5 live release
Timeline
2020 → Present
Shipping iterative releases
Team
3 + 1 external dev
UX & UXR, home-grown
My Contribution
Experience architecture
Vision, IA, platform design, delivery, system architecture (inc. IT & security design), cloud management
Research Library home page showing search, recent studies, research areas taxonomy, and people directory
FIG. 00 Research Library home, v1.5. Shipped product. Search and discovery across studies, taxonomy, and people. Names and study titles blurred.
01 Context & Problem 2020

Dell had decades of research, and no way to find any of it.

Dell generates enormous amounts of customer evidence every year: interviews, diary studies, ethnographies, surveys, heuristic audits, competitive analyses. Two decades of it was scattered across shared drives, SharePoint, personal laptops, and the memory of whichever researcher originally ran the study. The result was predictable. A PM in Round Rock would commission a $40k study to answer a question that had been answered three years earlier in Hopkinton. I didn't want another taxonomy. I wanted infrastructure.

The goal was never to archive research. The goal was to make every study that Dell had ever conducted behave like a living, queryable asset, something a designer in Taiwan or a PM in Austin could interrogate at 2am and get a useful answer from.

Internal kickoff memo · 2020
Backstory slide showing why we built in-house instead of using third-party tools
FIG. 01 The original backstory. Third-party tools weren't fit for purpose, so we built in-house.
02 Approach 3 principles

Treat research as a platform.

My technical background (a decade of cloud, virtualization, and distributed systems work before I ever managed a designer) shaped the first instinct. This wasn't a cataloging exercise. It was a software product with users, data, ingestion, retrieval, and a long roadmap.

A.01Research the researchers

Before a line of code was written, we ran a cross-LOB discovery with eighteen research consumers (designers, PMs, engineers, services leads, executives). The failures clustered around three things: taxonomy they didn't understand, file formats they couldn't preview, and findings buried in 90-minute videos with no transcript.

A.02Architect for retrieval.

That insight reshaped the system. We designed around the moment of recall, not the moment of upload. Auto-transcription on every video asset. Machine-readable tagging against a controlled vocabulary we authored with the research team.

A.03Ship small, ship real

Alpha launched internally with a single LOB and about 200 studies loaded. By v1.5 the platform was serving the full company footprint, and researchers were voluntarily reaching for it, the truest signal a tool has crossed into useful.

Feb 2018 sketchbook wireframes showing early concept
FIG. 02Feb 2018 sketchbook wireframes.
Internal pilot study validating MVP concept with 10 participants
FIG. 03Pilot study. 10 participants, 1 testing round, 50+ issues identified.
Phased delivery roadmap from Q3 2019 to Q4 2020
FIG. 04Alpha Q4 2019 · v1.0 tiered launch Q3 2020 · Auto-Transcription Q3–Q4 2020.
03 What We Built 4 capabilities

Four capabilities that turned a repository into a platform.

F.01

Auto-transcription on everything.

Every video asset ingested is transcribed, time-coded, and made full-text searchable. A PM looking for "thermal complaints" can jump to the 47-minute mark of a 2019 interview in two clicks.

F.02

Study tracking & roadmap generation.

Active research is tracked from intake to closeout, and the platform auto-generates forward-looking research roadmaps by LOB. Planning conversations start with evidence, not guesswork.

F.03

Controlled tagging & IA.

We authored a shared research taxonomy with the global team, then enforced it at ingestion. Retrieval quality is only as good as the vocabulary, so we invested there first.

F.04

Built for cross-LOB reuse.

Permissions, versioning, and citation live in the platform. A study conducted for one product unit can be cited, adapted, and re-used by another without losing provenance.

04 Outcome shipped

People used it.

The most honest measure of an internal platform is whether people reach for it when nobody's watching. By v1.5 they were. Researchers used it unprompted. PMs cited it in planning docs. Newly-hired designers used it to get up to speed on a decade of product evidence in a week. A study stopped being a one-off deliverable for a single stakeholder, and became something findable, citable, and re-usable.

v1.5live
Shipped from alpha to company-wide release
1000+
Research artifacts indexed and retrievable
xLOB
Adoption across every major Dell line of business
Shipped product home dashboard view with search, recent studies, and taxonomy
FIG. 05Shipped home dashboard. Search, recent studies, taxonomy.
Shipped product library view showing study cards with goals and metadata
FIG. 06Shipped library view. Faceted filtering, rich study cards.
Research Library roadmap view v1.5 shipped product showing studies on a Gantt-style timeline
FIG. 07Roadmap view. Every study from the library plotted on a Gantt-style timeline by status, group, and research area. Project names blurred.
05 People & Expertise finding who already knows

People-first search. For when you don't know what document you need.

As a consumer of research, you often don't know exactly what you're looking for from outside a topic. So we introduced robust searching across experts. You could figure out who had done what work before, browse what they specialize in, and reach them directly. Each employee has a profile page that surfaces their full research history, the areas they've worked across, and how to contact them.

People search page filtered by PowerEdge HW research area, showing 13 results with researcher cards displaying titles, research areas, and study counts. Names blurred.
FIG. 08People search with the same faceted filter system used for studies. Filter by research area, sort by name, contact directly. Names blurred.
Individual researcher profile page showing photo, role, bio, research areas pills, and full study history with collaborators. Collaborator names blurred.
FIG. 09Individual profile. Bio, research areas, full study history with collaborators, direct contact. Collaborator names blurred.
06 ML Foundation structured for the moment that came four years later

In 2022, we structured the corpus for ML retrieval. Two years before the LLM boom.

We worked with an AI data architect and ML expert in 2022 to structure the library's data properly ahead of time, so it could be ingested into machine learning when the moment came. That foresight has paid off in 2026: we can now model against the entire research catalog whenever it's needed. The structured-data foundation we put in place years before the AI wave is what makes that possible today.

Structured data foundation showing the exported spreadsheet view with every study tagged and schematized for ML ingestion
FIG. 10The structured-data foundation. Every study exported as tagged, schematized rows, ready for ML ingestion. Built in 2022, paying off in 2026.
07 A Dividend We Didn't Plan For retrospect

The corpus became training data.

A year or two in, something unplanned started to matter. Because every asset was transcribed, tagged against a shared taxonomy, and stored against a consistent schema, we had quietly built a clean structured corpus of Dell's research history, the exact shape modern AI systems need. A designer can now ask "have we heard this complaint before?" and get an answer grounded in a decade of real interviews. A PM can ask "what does the research say about this segment?" and get a synthesized view in seconds.

It wasn't the original goal. It's what happens when you build the data layer correctly the first time, the tools you'll want two years later find your data ready for them. It's also why the agentic AI work I'm leading inside Dell now has a running start most enterprises don't.

The Library stopped being just a research tool and became research infrastructure — and then, without anyone planning it, became the training layer for the next generation of AI-assisted research at Dell.

Observed, retroactively
Auto-transcription feature pitch deck showing machine-generated transcripts
FIG. 11Early exploration of ML-generated transcripts with confidence scores.
Shipped auto-transcription UI with searchable timeline and video
FIG. 12Shipped. Searchable video timeline, indexed quotes, auto-generated insights, the layer that quietly turned the Library into a training corpus.
→ NEXT_CASE

05/ai · NDA-protected.

Back to all projects