Research & Data

Do 3D Printing Sites Block AI Crawlers? 2 of 10 Do

Jun 14, 2026

The maker web is wide open to AI. Of the 10 3D Printing sites we checked, every single one returned a parseable robots.txt, and only two of them disallow an AI crawler by name. That makes 3D Printing one of the more permissive verticals in the entire snapshot.

This report reads one sealed snapshot of public robots.txt files, taken 14 June 2026 and content-addressed under sha 92ed5cd2858657d9. No site was asked, surveyed, or interviewed. We parsed only what each domain already publishes for any visitor to see.

2 of 10 3D Printing sites block at least one AI crawler.

The two gatekeepers are the hardware vendors makerbot.com and ultimaker.com. The eight sites that allow every crawler we tested are a broad mix of marketplaces, tutorials, and trade press: all3dp.com, prusa3d.com, thingiverse.com, 3dprint.com, 3dprintingindustry.com, matterhackers.com, printables.com, and formlabs.com.

A robots.txt directive is a published request that a named crawler skip some or all of a site. It is the cleanest public signal of how a brand feels about AI training and retrieval, and the maker space is signaling openness.

The distinctive thing about this slice is not the count but the kind of site doing the blocking. In most verticals, the heaviest gatekeepers are the publishers whose articles are the product. In 3D Printing, the two blockers are hardware vendors, while the libraries that host the actual community content stay open — an inversion worth keeping in mind for the rest of this report.

Which Sites Gate the Crawlers Here

3D Printing is a content economy built on sharing. STL repositories, slicer documentation, and how-to libraries exist to be found, copied, and remixed — so the model-hosting and tutorial sites leave their doors open while two hardware brands draw a line.

makerbot.com and ultimaker.com are the only two domains in this slice with an AI-crawler disallow. Both sell printers and managed software, so their published pages are marketing and product surfaces rather than community-contributed designs, which may explain the more cautious posture.

The allower list is where the category's character shows. The model libraries thingiverse.com and printables.com, the parts seller matterhackers.com, the trade outlets 3dprint.com and 3dprintingindustry.com, the guide hub all3dp.com, and the vendor sites prusa3d.com and formlabs.com all permit every crawler in our test.

It is worth noting that prusa3d.com and formlabs.com are hardware vendors too, yet they sit on the open side while makerbot.com and ultimaker.com gate. So the divide is not simply hardware versus community — within the vendor group itself, brands have landed on opposite policies. That tells you AI-access posture in this category is a per-company choice, not an industry norm, which is exactly the kind of detail a one-time read would miss and a recurring monitor would catch.

Both 3D Printing blockers — makerbot.com and ultimaker.com — sell printers and managed software.

That split matters for anyone reasoning about where their own content ends up. If you publish printable designs or build logs on these open platforms, AI systems can read them today, by the host's own published policy. The same openness powers other gear-heavy hobby verticals — see how the pattern holds in our Hunting AI-crawler report and the skateboarding sites breakdown.

What a 20% Block Rate Actually Means

A 20% block rate puts 3D Printing well below the corpus average. Across the snapshot, 231 of 743 sites with a published policy block at least one AI crawler — a 31.1% corpus rate. 3D Printing sits clearly under that line.

3D Printing sites post a 20% AI-crawler block rate.

Read carefully, this number is about disposition, not enforcement. robots.txt is advisory. A 20% rate tells you that four of every five maker sites with a policy have chosen not to name any AI crawler as unwelcome — a strong signal of how this community treats machine readers, not a guarantee of what any crawler does.

The category's full coverage is itself notable. Every 3D Printing site we checked returned a parseable robots.txt, so there are no silent abstentions to interpret here. When a vertical is both fully covered and lightly gated, the openness reads as deliberate rather than accidental.

There is a practical reason maker sites might want AI systems reading them. When someone asks an assistant how to fix stringing or which filament suits a part, the answer leans on whichever documentation and forums the model could ingest. A library that stays open is more likely to be the source those answers cite, and the citation can carry a reader back to the platform. For a sharing-first vertical, openness and discoverability point the same direction.

Where 3D Printing Sits Among Its Neighbors

To see where 3D Printing lands, here is a focused window of the block-rate ranking — the category and its nearest neighbors above and below it, every figure verbatim from the sealed set.

CategorySitesWith robots.txtBlock ≥1 crawlerBlock rate
Scuba1010330%
Legal107228.6%
Chess107228.6%
Crafts108225%
Space98225%
Podcasts1010220%
3D Printing1010220%
Finance1211218.2%
Retail1512216.7%
Education97114.3%

The neighbors tell a consistent story: 3D Printing groups with podcasts and finance, not with the heavily gated content categories. For contrast, the extremes of the snapshot sit far from this band.

CategorySitesWith robots.txtBlock ≥1 crawlerBlock rate
Gaming99888.9%
News20171482.4%
Drones10900%
Hunting1010110%

Gaming and News gate aggressively because their pages are the product. 3D Printing's pages are meant to be downloaded and remixed, which pulls it toward the open end alongside fellow hobby verticals like our fishkeeping AI-access report.

The Operator-Level Picture Across the Corpus

Block rates by category tell you who is gating; the operator leaderboard tells you which AI companies get named most across all 743 sites. Here is a focused cut of the top operators corpus-wide.

OperatorSites disallowing (all 743 sites)
Common Crawl169
Anthropic160
OpenAI150
Meta143
ByteDance142

Common Crawl leads because its archive feeds many downstream training pipelines, so blocking it is a one-line way to cover a lot of ground. Anthropic and OpenAI follow closely as the named model builders sites most often single out.

The ordering is informative on its own. A site that wants to limit AI reuse with the fewest rules tends to start with Common Crawl, because so many models train on its archive — and then add the direct retrieval agents from Anthropic and OpenAI. Meta and ByteDance round out this top cut. Reading down the list is effectively reading the order in which the broader web decided which crawlers were worth naming first, and the maker category has so far opted almost entirely out of that exercise.

Corpus-wide, 231 of 743 sites block at least one AI crawler.

Neither makerbot.com nor ultimaker.com changes this corpus-wide picture much on its own, but every site that adds a disallow line nudges these counts. That is exactly why the signal is worth watching over time rather than reading once. The maker space's openness today is a baseline, not a promise.

Methodology

The figures here come from one content-addressed snapshot of public robots.txt files, sealed 14 June 2026 under sha 92ed5cd2858657d9. We fetched each domain's robots.txt, parsed its user-agent and disallow directives, and matched them against a fixed list of known AI crawler tokens. A site counts as a blocker if it names at least one AI crawler with a disallow rule.

Every number in this report — counts, the 20% rate, the corpus totals, the leaderboard tallies — is a direct read from that sealed file; nothing is estimated, modeled, or extrapolated. We report what each site published, not what any crawler actually did, because robots.txt is a request and not an enforcement mechanism.

Coverage is the scope, too. Across the snapshot we checked 883 sites in 88 categories, of which 743 returned a parseable robots.txt and 171 also published an llms.txt file — 23% of the policied sites. For 3D Printing specifically, all 10 sites returned a parseable file, so the slice has no abstentions.

Frequently Asked Questions

Q: Which 3D Printing sites actually block an AI crawler?

A: Two: makerbot.com and ultimaker.com. Both are printer-and-software vendors. The other eight in our slice — including thingiverse.com, printables.com, all3dp.com, and matterhackers.com — allow every AI crawler we tested, by their own published robots.txt.

Q: Why is 3D Printing only at a 20% block rate when the corpus average is 31.1%?

A: The maker web runs on sharing — model libraries and tutorials exist to be found and remixed. With 2 of 10 sites gating any crawler, 3D Printing sits below the 31.1% corpus rate, near podcasts and finance and far from gaming and news.

Q: Does a disallow line in robots.txt actually stop a crawler?

A: No. robots.txt is an honor-system standard — a published request, not a wall. A well-behaved crawler reads it and complies; a misbehaving one can ignore it. The file records what a site asks for, which is the signal this report measures.

Q: Do any 3D Printing sites publish an llms.txt file?

A: This report measures robots.txt directives, so the per-category llms.txt list is not broken out here. Corpus-wide, 171 of 743 policied sites — 23% — published an llms.txt, the newer convention for stating AI-access preferences alongside the older robots.txt standard.

Key Takeaways

Only 2 of 10 3D Printing sites name an AI crawler to block.

The maker web is among the most open verticals in the snapshot. Two hardware vendors gate; the model libraries, marketplaces, and trade press do not. At 20%, 3D Printing sits below the 31.1% corpus rate, grouped with podcasts and finance rather than the heavily gated content categories.

The value in this data is not the single number — it is watching the number move. A point-in-time count is a baseline; the signal is the day a new maker site adds its first disallow line.

Put AI-Access Data to Work

A 3D-printing marketplace product manager at a site like thingiverse.com or printables.com should treat AI-access policy as a competitive surface: re-crawl your own and rival maker domains weekly, and alert the moment a peer adds an AI-crawler disallow that you do not — because if model builders index their STL libraries and not yours, you lose the answer-engine traffic. The trigger is a new disallow token; the cadence is weekly.

A competitive-intelligence analyst covering the hardware vendors should watch makerbot.com and ultimaker.com specifically, since they are today's only blockers — alerting when either tightens or relaxes its directives, a signal about how printer brands view AI training of their product content.

An AI retrieval engineer building a maker-knowledge assistant should re-validate the eight allower domains on a scheduled crawl so the corpus only ingests sources that still publish an open policy, dropping any domain the day it adds a disallow line.

US Tech Automations runs these scheduled robots.txt and llms.txt crawls, diffs each result against the prior seal, and routes change alerts into an AI-access policy dashboard. See how the monitoring is built on our agentic workflows platform.

Source: US Tech Automations Research — Closing Web edition; figures are verbatim counts from public robots.txt files sealed June 14, 2026 (snapshot sha 92ed5cd2858657d9).

Get this data as a daily feed

The numbers in this report come from a permit feed we monitor daily. Leave your email and we will follow up about a daily feed for your ZIPs and categories.

Prefer to talk first? Contact us.

Cite this report

US Tech Automations Research, 2026-06 edition. “Do 3D Printing Sites Block AI Crawlers? 2 of 10 Do.” https://ustechautomations.com/resources/blog/do-3d-printing-sites-block-ai-crawlers-2026

Sealed snapshot sha256: 92ed5cd2858657d9

Machine-readable data: CSV · JSON · All research & methodology

About the Author

Garrett Mullins
Garrett Mullins
Workflow Specialist

Helping businesses leverage automation for operational efficiency.