Do Trading-Card Sites Block AI Crawlers? 2 of 7 Do
Trading-card sites live on price discovery. Their value is the comp — the last sale, the population report, the graded-card index — and that data is exactly the kind of structured, frequently-updated record an AI assistant loves to summarize. So whether a marketplace welcomes or walls off retrieval agents is a real business decision, not an afterthought.
2 of 7 Trading-card sites block at least one AI crawler.
Of the trading-card domains we checked, seven returned a parseable robots.txt — the root-level file that tells automated agents which paths they may fetch — and two of those disallow an AI crawler. That works out to a 28.6% block rate. Every figure here is read straight from the sealed snapshot; nothing is estimated, modeled, or extrapolated.
The two blockers are comc.com and sportscardinvestor.com. The other five policied marketplaces leave the door open. Against the corpus, where 317 of 1203 sites with a policy gate at least one crawler for a 26.4% rate, trading-card sites sit just barely above the average — middle of the pack, not a fortress.
The Two Trading-Card Sites That Gate, and How Differently They Do It
What makes this category interesting is not the count but the contrast between the two blockers. They are not running the same playbook.
comc.com — the Check Out My Cards consignment marketplace — is the comprehensive gate. Its robots.txt names a long disallow group: GPTBot, ClaudeBot, Google-Extended, CCBot, Bytespider, Meta-ExternalAgent, meta-externalagent, Amazonbot, and Applebot-Extended. That is OpenAI, Anthropic, Google, Common Crawl, ByteDance, Meta, Amazon, and Apple, all named explicitly. When comc.com closes, it closes nearly the whole leaderboard at once.
sportscardinvestor.com is the surgical gate. It disallows a single agent — Amazonbot — and lets every other AI crawler through. That is a narrow, specific decision rather than a blanket wall, and it reads more like a targeted response to one operator than a stance against AI retrieval as a whole.
One trading-card blocker names eight AI agents; the other names exactly one.
The five open marketplaces are a who's-who of the hobby: tcgplayer.com, beckett.com, pwccmarketplace.com, paniniamerica.net, and upperdeck.com. None of them disallows an AI agent. A pricing marketplace or a card manufacturer runs on being found — the listings, the population data, the product pages are meant to surface when a collector asks an assistant what a card is worth.
Three more trading-card domains — psacard.com, topps.com, and cardladder.com — returned no parseable robots.txt at the seal. They are therefore silent: neither an allow nor a block, and excluded from the rate entirely. Those three timeouts are why the denominator is seven rather than the ten sites we checked. It would be wrong to read silence as a stance; it is an artifact of a busy host at one moment in time.
What This 28.6% Block Rate Actually Means
A robots.txt directive is a public request, and the trading-card read is mostly "request granted" — five of seven policied sites stay open. The honest interpretation is that, as a category, trading-card sites behave more like open marketplaces than data fortresses. The listings and comps they publish are an acquisition asset: a collector who finds a card's value through an AI answer is a collector pointed at a marketplace. The open majority here echoes the aquarium category, where not one site gates a crawler, even though a couple of trading-card platforms break the other way.
The two blockers are the instructive exceptions, and they bracket the range of motives. comc.com, a consignment platform whose inventory and pricing are the product, has a clear reason to control bulk automated harvesting of its catalog. sportscardinvestor.com, a data-and-analytics service, gates only Amazonbot — a far more selective posture. Two sites, two very different theories of what AI access costs them.
The small sample sharpens this rather than weakening it. With seven policied files, the read is really a story about ten named operators and two decisions: in trading cards, AI-access posture is set not by a broad wave of gating but by whether individual marketplaces with valuable price data choose to wall it off.
Trading-card sites post a 28.6% AI-crawler block rate.
Two trading-card domains — tcgplayer.com and paniniamerica.net — also serve an /llms.txt file, the emerging convention for telling AI agents how a site wants to be read. Both are in the open column, which fits the pattern: the marketplaces most comfortable being summarized are the ones leaning into the new standard rather than away from it.
Where Trading Cards Sit Among Similar Categories
A 28.6% block rate places Trading-card sites in the middle band of the ranking — above the open floor, well below the gated extremes. The focused window below shows Trading cards beside its nearest neighbors, verbatim from the sealed snapshot, name first and no rank column.
| Category | Sites | With robots.txt | Block at least 1 crawler | Block rate |
|---|---|---|---|---|
| Trading Cards | 10 | 7 | 2 | 28.6% |
| Legal | 10 | 7 | 2 | 28.6% |
| Pets | 10 | 7 | 2 | 28.6% |
| Real Estate | 10 | 7 | 2 | 28.6% |
| Beekeeping | 10 | 10 | 3 | 30% |
| Scuba | 10 | 10 | 3 | 30% |
| Yoga | 10 | 10 | 3 | 30% |
Trading cards share their 28.6% reading with an unremarkable band — Legal, Pets, and Real Estate all land on the same two-blocker mark, and Beekeeping, Scuba, and Yoga sit a hair above at 30%. It is a crowded part of the ranking, which is itself a sign that roughly-one-in-four is a common posture: most sites in these categories want to be readable, with a stubborn minority that does not. The extremes show what the ends look like:
| Category | Sites | With robots.txt | Block at least 1 crawler | Block rate |
|---|---|---|---|---|
| Gaming | 9 | 9 | 8 | 88.9% |
| News | 20 | 17 | 14 | 82.4% |
| Fast Food | 10 | 6 | 0 | 0% |
| Hotels | 10 | 3 | 0 | 0% |
Trading cards sit far below Gaming and News, where archives and content are the product, and a clear step above the zero-block floor that fast-food chains and hotels define with their fully open policies. The category is open by disposition, gated by a couple of exceptions.
The Bots the Whole Corpus Reaches For
Trading cards' two blockers split — one comprehensive, one single-bot — so the more useful context is which bots get gated most broadly across the edition. The cut below shows the most-disallowed bots across all 1203 sites with a robots.txt, bot name first, count next.
| Bot | Sites disallowing (of 1203) | Rate |
|---|---|---|
| CCBot | 234 | 19.5% |
| GPTBot | 210 | 17.5% |
| ClaudeBot | 207 | 17.2% |
| Bytespider | 203 | 16.9% |
| Meta-ExternalAgent | 178 | 14.8% |
CCBot, Common Crawl's agent, tops the corpus blocklist at 234 sites, with GPTBot and ClaudeBot close behind. comc.com names all five of these — and more — in its disallow group, so the comprehensive blocker is gating the highest-volume training crawlers the whole corpus gates first, just all at once. sportscardinvestor.com, by contrast, names none of the top five; it reaches for Amazonbot alone.
Corpus-wide, 317 of 1203 sites block at least one AI crawler.
How the Trading-Card Snapshot Was Sealed
These figures come from one point-in-time crawl of public robots.txt files, sealed June 19, 2026 under snapshot sha 040215878ac7b85a. For each trading-card domain we fetched robots.txt at the root, parsed its user-agent and disallow directives, and recorded whether any AI crawler token was disallowed. We report verbatim counts; nothing is estimated, modeled, or extrapolated. The three domains with no parseable file — psacard.com, topps.com, and cardladder.com — are logged as silent, neither allow nor block.
The counting rule is deliberately narrow. A block is an explicit Disallow aimed at a named AI agent — GPTBot, ClaudeBot, CCBot, and the other leaderboard tokens. A trading-card site can disallow checkout, search, or account paths without naming an AI agent, and that does not count here. Only a directive that names one moves a site into the blocker column, which is why the count is a clean two.
A note on what the snapshot deliberately does not do. It does not retry a slow host until a file appears, does not follow a redirect into a different domain's policy, and does not infer a block from a site that merely looks unfriendly to bots. Each domain is read once, at seal time, exactly as it answered — the rule that makes the result content-addressable: anyone holding sha 040215878ac7b85a can re-derive the same seven policied files and the same two blockers.
Frequently Asked Questions
Q: Which trading-card sites block AI crawlers?
A: comc.com and sportscardinvestor.com. They are the two of seven trading-card sites with a parseable robots.txt that disallow an AI crawler. comc.com does it comprehensively — naming GPTBot, ClaudeBot, Google-Extended, CCBot, Bytespider, Meta-ExternalAgent, Amazonbot, and Applebot-Extended — while sportscardinvestor.com gates only Amazonbot. Together they make up the entire 28.6% block rate.
Q: Why do most trading-card marketplaces leave AI crawlers in?
A: Discovery. tcgplayer.com, beckett.com, pwccmarketplace.com, paniniamerica.net, and upperdeck.com run on being found — their listings, comps, and product pages are meant to surface when a collector asks an assistant what a card is worth. For a marketplace whose growth depends on inbound interest, being readable extends reach rather than threatening it.
Q: Does the 28.6% rate cover all the trading-card sites you found?
A: No. It covers the seven sites that returned a parseable robots.txt. Three more — psacard.com, topps.com, and cardladder.com — produced no parseable file at the seal, so they are excluded from the rate rather than counted as an allow or a block.
Q: Does a Disallow in robots.txt actually stop an AI crawler?
A: Not by force. robots.txt is an honor-system standard: a cooperative crawler reads it and complies, but the file enforces nothing technically. comc.com signals that AI agents should stay out of its paths; each crawler decides whether to honor that request.
Put AI-Access Data to Work
For a trading-card marketplace owner or e-commerce lead — the person who decides how listings and comps appear online — this snapshot is a baseline worth watching. Most peers stay open while comc.com gates comprehensively, and the question that matters is whether your OWN robots.txt is accidentally walling off the answer engines collectors now ask first. The contrast with the gaming sites that block almost everything shows how fast a category can harden once the data becomes the product.
Set a recurring crawl that re-reads robots.txt for comc.com, tcgplayer.com, beckett.com, and your own domain weekly, and alert the moment any peer adds an AI crawler token to its disallow list. US Tech Automations runs exactly that kind of scheduled robots.txt crawl with change alerts and agentic monitoring, so a policy shift surfaces the week it lands rather than at the next annual audit.
A second fit is an AI-search or GEO analyst tracking which marketplaces remain eligible to surface in answer engines. Their job is to know, continuously, whether the pricing pages they rely on are still readable, and whether a sportscardinvestor.com-style single-bot block is a one-off or the start of a hardening trend.
The category sits beside the sneaker resale sites in the same open-but-watchful band, where one big marketplace's decision can reset the norm. US Tech Automations monitors that drift across a watchlist of domains and routes the alert when an operator flips. See how the agentic monitoring works, and you have a standing read on trading-card AI-access posture instead of a one-time count.
Corpus-wide, 330 of 1203 sites publish an llms.txt file.
Key Takeaways
Of the seven Trading-card sites with a parseable robots.txt, two block at least one AI crawler — a 28.6% rate, just above the corpus average.
The blockers split in style: comc.com gates eight named agents comprehensively, while sportscardinvestor.com disallows only Amazonbot.
The five open marketplaces — tcgplayer.com, beckett.com, pwccmarketplace.com, paniniamerica.net, and upperdeck.com — allow every crawler, and two of them also serve an llms.txt.
psacard.com, topps.com, and cardladder.com returned no parseable file at the seal and are excluded from the rate.
Corpus-wide, 317 of 1203 sites (26.4%) gate at least one crawler, so trading cards sit right around the average.
Source: US Tech Automations Research — Closing Web edition; figures are verbatim counts from public robots.txt files sealed June 19, 2026 (snapshot sha 040215878ac7b85a).
Get this data as a daily feed
The numbers in this report come from a permit feed we monitor daily. Leave your email and we will follow up about a daily feed for your ZIPs and categories.
Prefer to talk first? Contact us.
Cite this report
US Tech Automations Research, 2026-06 edition. “Do Trading-Card Sites Block AI Crawlers? 2 of 7 Do.” https://ustechautomations.com/resources/blog/do-trading-card-sites-block-ai-crawlers-2026
Sealed snapshot sha256: 040215878ac7b85a
Machine-readable data: CSV · JSON · All research & methodology
About the Author

Helping businesses leverage automation for operational efficiency.