Data Portraits

This portrait is a sketch on the Pile. Enter a query to check if parts of your text appear in the Pile. Use a full document for best results.

Enter your own text or use a prefill button.

Matching Text

Found spans are in grey. The longest span is in blue. Hovering over a character highlights the longest span that includes that character (there may be overlapping shorter spans). Clicking shows the component substrings below.

Hashes for each of these strings appears in the sketch. Click above to select a new span.

The top 20 longest chained matches.


Matches are at least 50 characters. Resolution can be customized for specific usecases. Your name is probably too short to match, but try pasting a speaker bio to see if you are in the Pile.

By Marc Marone and Ben Van Durme. See our paper, project pitch, and other datasets back at