Unscraped Art

Why Not Use AI Generators?

Why not to ChatGPT, Midjourney, Stable Diffusion, DALL-E, Adobe Firefly – just some opinions.

"Scraped" means taken without asking. Scraping uses "Crawler Bots". A company writes programs that crawl the internet and download every image they find. Those images are stored on private company servers owned by wealthy shareholders. The shareholders are perhaps 1% of the population who are culturally homogenous.

Images are fed into a training process to connect their "styles" to linguistic terms like "oil painting" or "Van Gogh" or whatever. When you type a prompt, it generates new pixels via language. The output looks like art because it was blended from art, but it is arguably not art. The people who made the images were not asked and were not paid.

The extraction is not small. The major image generators appear to have trained on billions of images. Their output now competes directly with the people it was built from. An illustrator can spend years developing a recognizable style, only to watch a machine trained on that style sell a cheap imitation to anyone with twenty dollars and a prompt box. Fair use law was written to protect critics, teachers, scholars, and satirists. Using it to defend industrial-scale extraction for private profit is a different thing. And unlike music sampling, there is no real audit trail here. No artist can trace their work into the output, demand credit, or seek compensation. The line back to the source was cut on purpose.

Dozens of federal copyright lawsuits are currently pending in U.S. courts against the major AI companies. The landscape includes Andersen v. Stability AI (proceeding to trial September 2026), Bartz v. Anthropic (settled for $1.5 billion in 2025), New York Times v. OpenAI (ongoing discovery), Getty Images v. Stability AI, Universal Music's $3.1 billion suit against Anthropic, and Disney's suit against Midjourney over copied characters. Every image you generate with these tools carries unresolved legal provenance questions.

As of 2026, U.S. courts have split on some issues: training on copyrighted books can be fair use (Bartz ruling), but storing pirated copies is not. The Supreme Court has also confirmed that purely AI-generated works cannot be copyrighted in the U.S. (Thaler v. Perlmutter, 2026). For anyone in education, publishing, nonprofits, or institutional work, it is wiser for now to use creative-commons and public-domain images with documented provenance.

The AI image generation market sits in the tens of billions annually. Midjourney has around 100 employees and reportedly generates hundreds of millions in revenue. DALL-E is part of OpenAI, valued at $157 billion after a 2024 funding round. These companies trained on billions of images scraped from the internet.

Apple's market capitalization sits above $3 trillion. Microsoft's is similar. Nvidia crossed $3 trillion in 2024. OpenAI alone, a company that didn't exist a decade ago, was valued at $157 billion after a 2024 funding round. Analysts project the AI industry to add many trillions to global GDP by 2030.

The U.S. Bureau of Economic Analysis puts arts and cultural production — including commercial film, broadcasting, publishing, plus nonprofit museums, libraries, performing arts — at roughly $1.1 trillion a year in contribution to U.S. GDP. The nonprofit cultural sector (museums, libraries, archives, symphonies) is a smaller share of that figure. By comparison, the major AI companies' market valuations now exceed the annual GDP of every country on Earth except the largest two or three.

A handful of men now control more of the world's digital infrastructure than any government. One of them (Musk) operates roughly three-quarters of all active commercial satellites, the machinery behind GPS assistance, weather forecasting, and global communications.

The images that trained these systems came from everybody else: taxpayers, archives, working artists, and the accumulated visual record of the human species.

The data scraping was sudden, quick, global. The ownership of the output is private and concentrated inside a demographic so narrow that you are almost certainly not in it.

Yes, prompting DALL-E or Midjourney burns energy. So does driving a car, streaming video, or refrigerating strawberries in January.

People often talk about typing a prompt as if it were a singular ecological sin — but that framing ignores much larger, routine forms of damage. A single beef cheeseburger carries a much heavier material footprint than one image prompt because cows require land, feed, water, methane-producing digestion, slaughter, refrigeration, packaging, and trucking. The same goes for air travel, fast fashion, and endless consumer junk.

So be honest: AI image generation has an environmental cost, but it is usually being folded into a much bigger industrial mess. Moral panic about prompts can become a convenient way to avoid talking about capitalism, meat, energy, logistics, and scale.

Image generation runs through data centers, and data centers need power, water, land, cooling systems, and transmission infrastructure. When one town fights a facility over water use, noise, tax giveaways, or grid strain, the problem does not vanish. The company just looks for a place with weaker resistance, cheaper land, or poorer people who have less power to say no. The environmental issue is not just "AI uses electricity." The issue is where the burden gets dumped, who absorbs it, and how quickly wealthy firms can move extraction to communities with less political power.

Your phone or whatever you are reading this on is a daily, normalized object built on extraction, labor exploitation, surveillance, and energy use at massive scale. Most people will not throw it away because the device is now structurally tied to work, banking, maps, social life, medicine, and survival. The honest position is that we are all living inside systems of damage and dependency, and the real question is scale, necessity, and where the burden lands.

Input Your API Keys (Private)

Your keys are stored in your browser and pass through our server only to reach museum APIs. We do not log or store them.

Active Sources

Where to Get API Keys...

An API key is a free library card. You register, you get a code, you paste it once. That's it.

Source	How to Get the Key
Smithsonian	api.data.gov/signup
Europeana	pro.europeana.eu
Rijksmuseum	rijksmuseum.nl/rijksstudio
Harvard Art Museums	harvardartmuseums.org
DPLA	dp.la developers

The 15 Sources

Museum / Library	Key	What They Have
Metropolitan Museum of Art	None	400,000+ works
Art Institute of Chicago	None	50,000+ CC0
Cleveland Museum of Art	None	30,000+ CC0
SMK — National Gallery of Denmark	None	European & Danish art
Wellcome Collection	None	Medical & scientific imagery
Princeton Art Museum	None	Greek/Roman, pre-Columbian
Wikimedia Commons	None	100M+ files
National Archives (NARA)	None	US gov records, maps
Library of Congress	None	FSA photos, maps, prints
NASA	None	Space photography
Smithsonian Institution	Free key	21 museums
Europeana	Free key	50M+ European items
Rijksmuseum	Free key	800,000+ Dutch masters
Harvard Art Museums	Free key	250,000 objects
DPLA	Free key	Thousands of US libraries

Licenses, Privacy, and Policy

Effective April 20, 2026. Operated by Al Tarbet in Salem, Oregon. Questions: al@tarbet.design.

Unscraped Art is a search interface. It queries public APIs, returns public metadata, and displays links to publicly hosted images. It does not host, store, proxy, or redistribute any image or non-public data.

Rights and Licenses

Every image returned here is filtered at the source level: the tool only surfaces records the source institution has itself declared CC0 or public domain through its official API. A "CC0" or "Public Domain" label reflects that institution's own declaration, read directly from its API — not a third-party guess pasted on after the fact. We do not re-verify each institution's rights claim ourselves, and institutions occasionally revise them. So before using any image commercially or republishing it, confirm the rights statement at the original source record. The museum or archive is always the authoritative source for rights.

Privacy

We do not require accounts, collect names, emails, or store personal data. Your API keys are stored only in your browser (localStorage) and are sent as request headers when you search. Your keys pass through our server only to reach the museum APIs — they are not logged, stored, or retained on our side.

Ephemeral access logs may record the search topic and your IP address for operational reasons (debugging, rate-limiting). These logs are not retained long-term and are not linked to any identifying information.

Your collected images and API keys are stored in your browser's localStorage under the keys harvester_v1_keys and harvester_v1_basket. If you are on a shared or public-access computer, use the "Clear All" button in the API Keys modal and clear your browser data before leaving.

Your Rights (US and EU)

Under Oregon's OCPA, California's CCPA/CPRA, and the EU's GDPR, you have rights to access, correction, deletion, and opt-out of data sale where applicable. Unscraped Art retains nothing about you after a search completes. The museum APIs you query retain data per their own terms. For questions, email al@tarbet.design.

Code License

The Unscraped Art codebase is open source under the Apache 2.0 License. The website text is licensed CC BY 4.0. Images and metadata belong to their source institutions.

Ethics

This tool does not search community-governed repositories. Legal openness is not the same as ethical consent. If your institution wants to be removed or change how we interact with your API, contact us and we will respond within 30 days.

Breach Notification

In the event of a breach at Unscraped Art or its upstream providers (Replit, museum APIs) that affects search request data, a notice will be posted on unscraped.art within 72 hours of discovery.

Changelog

2026-04-20: Privacy claims clarified to describe how API keys pass through the server. Added effective date, your-rights section, breach notification policy, and localStorage disclosure. Credential phrasing aligned with Tarbet Design. Donate button simplified pending nonprofit determination verification.

Unscraped Art

Your Collection — 0 images

Results

Unscraped Art

Your Collection — 0 images

Results

Why Not Use AI Generators?

Unscraped, Really?

Tips for Better Searches

Core Rule

Input Your API Keys (Private)

Active Sources

The 15 Sources

Who Owns This Stuff?

Wait, You Use AI?

Licenses, Privacy, and Policy

Rights and Licenses

Privacy

Your Rights (US and EU)

Code License

Ethics

Breach Notification

Changelog

Wait a sec! Did You Input Your API Keys?

Have You Reviewed Tips for Searching?

Heads Up — This Takes a Minute