Do Candlemaking Sites Block AI Crawlers? Zero of 8 Do
Candlemaking sites leave the lights on for AI. Of the 10 Candlemaking sites we checked, 8 returned a parseable robots.txt, and not one of those eight blocks any AI crawler. The category's block rate is a flat 0%.
That clean zero is the whole finding. Every Candlemaking site with a published policy allows all the AI crawlers we tracked — no disallow lines, no named user-agents, nothing fenced. It is one of the most open verticals in this edition.
0 of 8 Candlemaking sites block any AI crawler.
The eight fully permissive sites are candlescience.com, lonestarcandlesupply.com, naturesgardencandles.com, fillmorecontainer.com, candlewic.com, makesy.com, communitycandlesupply.com, and theflamingcandle.com. Two more — bittercreeknorth.com and aztec-candle.com — returned no robots.txt at all, which is a separate state we never read as an allow or a block.
This is sealed-snapshot research. Every figure is a verbatim count from public robots.txt files captured June 14, 2026, frozen under snapshot sha 5d5458529dab2773. We read the files and measured nothing else. The full edition spans 1197 sites across 120 categories, 993 with a parseable robots.txt; Candlemaking is one slice, and it landed at the open extreme.
For anyone skimming, the slice in a sentence: every Candlemaking site that publishes a crawl policy currently allows all AI bots, placing the category at the bottom of the block-rate ranking. A robots.txt file is the plain-text file at a domain's root listing which automated agents may fetch which paths — a voluntary standard, not a hard barrier.
What a Zero Block Rate Tells You
Zero does not mean Candlemaking is absent from the question — it means every site that bothered to publish a rule chose to let the crawlers through. None of the eight policies names an AI bot to disallow, so the category contributes exactly nothing to the web's gating totals.
It helps to remember what robots.txt can and cannot do. It is an honor-system standard: a compliant crawler reads the file and obeys, but the file enforces nothing on its own. A permissive Candlemaking policy therefore signals that owners have not asked anyone to leave, not that they are actively recruiting bots.
Every Candlemaking site we checked with a published policy allows all AI crawlers.
The economics fit. Candlemaking sites are overwhelmingly supply shops and how-to hubs — wax, wicks, fragrance oils, containers, and the tutorials that move them. Discoverability is the business model, so the default permissive file is exactly what you would expect. The same pattern appears in our report on whether embroidery sites block AI crawlers, another supply-driven craft sitting at a clean zero.
All 8 Candlemaking sites with a published robots.txt allow every AI crawler, against a 28.7% corpus block rate.
The named allowers make the read concrete. candlescience.com, candlewic.com, and lonestarcandlesupply.com are established supply retailers; makesy.com and theflamingcandle.com lean into kits and tutorials; naturesgardencandles.com, fillmorecontainer.com, and communitycandlesupply.com fill out a catalog-driven set. Every one of them earns its traffic by being found, and AI answer engines are now part of how customers find supply and how-to content — so none has a reason to fence the crawlers out. A craft with even one gatekeeper looks different, as our analysis of leathercraft sites and AI crawlers shows.
How Candlemaking Compares to the Other Categories
Candlemaking sits at the open extreme of the ranking. The focused window below places it among its nearest neighbors — the other zero-block categories and the band just above them.
| Category | Sites | With robots.txt | Block ≥1 crawler | Block rate |
|---|---|---|---|---|
| Hunting | 10 | 10 | 1 | 10% |
| Boating | 10 | 8 | 0 | 0% |
| Manufacturing | 10 | 8 | 0 | 0% |
| Prepping | 10 | 8 | 0 | 0% |
| Pickleball | 10 | 10 | 0 | 0% |
| Embroidery | 10 | 8 | 0 | 0% |
| Candlemaking | 10 | 8 | 0 | 0% |
| Geocaching | 10 | 4 | 0 | 0% |
| Tea | 10 | 10 | 0 | 0% |
| Pottery | 10 | 9 | 0 | 0% |
Candlemaking sits in a roomy 0% tie with Embroidery, Tea, Pickleball, and Pottery — a band of hobby, craft, and lifestyle verticals that have not gated at all. Even categories that gate only lightly, like Origami, stay in the open half; our breakdown of origami sites and AI crawlers shows just two of eight sites blocking there. The other end of the corpus looks nothing like it.
| Category | With robots.txt | Block ≥1 crawler | Block rate |
|---|---|---|---|
| Gaming | 9 | 8 | 88.9% |
| News | 16 | 13 | 81.3% |
| Candlemaking | 8 | 0 | 0% |
| Pickleball | 10 | 0 | 0% |
Against the 28.7% corpus block rate, Candlemaking's zero is a clear pull toward openness.
Corpus-wide, 285 of 993 sites block at least one AI crawler.
Not one of those 285 is a candle shop. In an edition where nearly three in ten sites gate at least one crawler, an entire supply-and-tutorial vertical staying fully open is a meaningful signal about where the AI-access fault lines actually run. They run through news, gaming, and large media properties — not through hobby commerce. Candlemaking's clean zero, sitting beside Embroidery, Tea, and Pottery, marks the open pole of that divide as plainly as Gaming's 88.9% marks the closed one.
Which Bots Would Get Named First
No Candlemaking site gates today, but the corpus-wide bot leaderboard shows which crawlers are most often disallowed elsewhere — the likely first names if the category ever shifts.
| Bot | Sites disallowing (across all 993 sites) | Share |
|---|---|---|
| CCBot | 211 | 21.2% |
| ClaudeBot | 188 | 18.9% |
| GPTBot | 187 | 18.8% |
| Bytespider | 183 | 18.4% |
| Meta-ExternalAgent | 162 | 16.3% |
CCBot leads at 211 sites, with ClaudeBot and GPTBot close behind. If a site like candlescience.com or makesy.com ever decided to gate, these are the user-agents most likely to land on the first disallow line — they are simply the most-targeted across the whole snapshot.
How the Snapshot Was Sealed
The procedure is plain by design, which is what lets a zero stand up. We fetched each site's public robots.txt, parsed every User-agent and Disallow line, and flagged any directive aimed at a tracked AI crawler. A site counts as a blocker only if it disallows at least one such agent on at least one path. For Candlemaking, none did: 10 sites checked, 8 with a parseable file, 0 blocking.
We then content-hashed and froze the reading, so snapshot sha 5d5458529dab2773 pins exactly this result to June 14, 2026. The zero is the literal count of AI-aimed disallow lines, which was none — not a figure derived from any subtraction. Nothing is estimated, modeled, or extrapolated.
The honest footnote is coverage: eight published policies is a thin base, so this reads the sites that publish a rule, not every candle page on the web. As with any clean zero, the worth is in watching that base for its first change.
There is a practical reading for anyone in the candle trade. The data says the whole published discovery surface for candle supplies and how-to content is open to AI answer engines today — wax, wicks, fragrance, containers, and the tutorials that sell them are all crawlable on the eight sites that publish a policy. A customer asking an assistant which wax pairs with soy or where to buy fragrance oil is being answered from pages these suppliers control.
Holding that position is valuable, and the only way to know it still holds is to watch for the first defector. A sealed zero is the most sensitive baseline for catching exactly that.
Candlemaking sites post a 0% AI-crawler block rate.
Key Takeaways
All 8 Candlemaking sites with a published robots.txt allow every AI crawler — a 0% block rate.
The permissive set includes candlescience.com, makesy.com, candlewic.com, and theflamingcandle.com.
bittercreeknorth.com and aztec-candle.com returned no robots.txt, a distinct state we do not read as intent.
Candlemaking's zero sits well under the 28.7% corpus rate, alongside Embroidery, Tea, and Pickleball.
Across all 993 sites, CCBot is the most-disallowed bot at 211 sites — a likely first target if Candlemaking gates.
Frequently Asked Questions
Q: Do any Candlemaking sites block AI crawlers?
A: No. All 8 Candlemaking sites with a published robots.txt allow every AI crawler we tracked, including candlescience.com, makesy.com, and candlewic.com. The block rate for the category is 0%.
Q: Why does Candlemaking block AI crawlers so much less than the wider web?
A: Candlemaking's 0% sits far under the 28.7% corpus rate. The vertical is mostly supply shops and tutorials that compete on being found, so a permissive robots.txt is the default. Adjacent crafts like Embroidery landed at the same clean zero.
Q: Is a missing robots.txt the same as allowing AI crawlers?
A: No. bittercreeknorth.com and aztec-candle.com returned no robots.txt at all, which is a different state from a published allow. We count those sites separately and never infer a stance from an absent file; nothing is estimated, modeled, or extrapolated.
Q: What would the first Candlemaking block look like?
A: A site such as lonestarcandlesupply.com or naturesgardencandles.com would add an AI user-agent — CCBot or GPTBot, the corpus leaders — to a disallow line. Because today's baseline is a clean zero, any single such change is immediately visible in the next sealed reading of the snapshot.
Q: How does Candlemaking compare to similar craft categories?
A: Candlemaking's 0% ties it with Embroidery, Tea, Pickleball, and Pottery at the open extreme of the ranking. It sits far below the 28.7% corpus average and worlds away from the heavy-gating end, where Gaming blocks at 88.9% and News at 81.3%. Supply-driven crafts consistently gate far less than news and gaming.
Q: Does a 0% block rate make Candlemaking content fully available to AI answers?
A: For the published policies, yes — none disallow an AI crawler, so compliant bots can fetch and surface those pages. But robots.txt is an honor-system standard, so the zero only means owners have not asked the crawlers to leave; it neither invites nor forces anything, and nothing is estimated, modeled, or extrapolated in that reading.
Put AI-Access Data to Work
The realistic first buyer is a market-research and data-licensing lead tracking how open craft corpora stay to AI retrieval. For someone licensing or modeling access across many hobby verticals, the recurring job is to re-crawl this Candlemaking set weekly and alert the instant any of the eight allowers — candlescience.com, makesy.com, theflamingcandle.com — adds a bot token to a disallow list. A clean zero is the most sensitive baseline there is: the first block in the category is a single-line change that rewrites the whole story, and that is the trigger worth routing.
The category-native second ICP is a candle-supply DTC operations lead. That buyer can monitor whether the supply catalogs their customers cross-shop — candlewic.com, communitycandlesupply.com — stay crawlable, since AI-surfaced product and how-to pages influence where makers source wax and fragrance. US Tech Automations automates that monitoring with scheduled robots.txt crawls, change alerts, and an AI-access policy dashboard. See how agentic workflows watch for drift like this.
For a craft where a single site does all the gating, compare our analysis of leathercraft sites and AI crawlers.
Source: US Tech Automations Research — Closing Web edition; figures are verbatim counts from public robots.txt files sealed June 14, 2026 (snapshot sha 5d5458529dab2773).
Get this data as a daily feed
The numbers in this report come from a permit feed we monitor daily. Leave your email and we will follow up about a daily feed for your ZIPs and categories.
Prefer to talk first? Contact us.
Cite this report
US Tech Automations Research, 2026-06 edition. “Do Candlemaking Sites Block AI Crawlers? Zero of 8 Do.” https://ustechautomations.com/resources/blog/do-candlemaking-sites-block-ai-crawlers-2026
Sealed snapshot sha256: 5d5458529dab2773
Machine-readable data: CSV · JSON · All research & methodology
About the Author

Helping businesses leverage automation for operational efficiency.