Model Context Optimization — Crawlability & Machine Readability Audit
https://www.homestoreandmore.ie
April 30, 2026
Site-Wide Crawlability Summary
Step 1 — SITE ACCESS INVENTORY: - Home (https://www.homestoreandmore.ie/): Resolves 200. Bot access: GPTBot (200), ClaudeBot (200), Googlebot (200), Perplexity (200), Applebot (200). Bingbot (403 - Blocked). SSR word count: 3143. H1 missing. - Stores (https://www.homestoreandmore.ie/stores): Resolves 200. Bingbot (403 - Blocked). SSR word count: 4680. H1 missing. - Shop By Department (https://www.homestoreandmore.ie/shop-by-department): Resolves to .../shop-by-department.html (301 redirect). Bingbot (403 - Blocked). SSR word count: 2833. H1 missing. - Specials (https://www.homestoreandmore.ie/specials): Resolves to .../specialoffers.html (301 redirect). Bingbot (403 - Blocked). SSR word count: 2880. H1 missing. - Clearance (https://www.homestoreandmore.ie/clearance-nav): Resolves to .../clearance (301 redirect). Bingbot (403 - Blocked). SSR word count: 4871. H1 found. - Bathroom Menu (https://www.homestoreandmore.ie/bathroom-menu): Resolves 200. Bingbot (403 - Blocked). SSR word count: 2884. H1 missing. - Bathroom Accessories (https://www.homestoreandmore.ie/bathroom-accessories): Resolves 200. Bingbot (403 - Blocked). SSR word count: 2869. H1 missing. - Category Pages (Bathroom Bins, Scales, Caddies, Curtains, Brushes, Seats, Cabinets, Mirrors, Mats, Towels): All resolve 200. Bingbot (403 - Blocked). SSR word counts: 3760–4276. H1 found on all PLP templates. - Product (BBQ): Resolves to .../gas-bbq/master-cook-classic-500-5-burner-gas-bbq/066033.html (301 redirect). Bingbot (403 - Blocked). SSR word count: 3336. H1 found. - Product (Knife Set): 404 error on .htm path; resolves 200 on .html path. Bingbot (403 - Blocked). SSR word count: 3232. H1 missing on the successful resolution. - Indexation: No x-robots "noindex" detected in server headers across analyzed pages. Step 2 — CROSS-PAGE ACCESS PATTERNS: - Bot Access Consistency: There is a site-wide blockade of Bingbot (403) across every single page, including the homepage and product pages. This indicates a systemic WAF or server-level rule targeting the Bingbot user-agent, despite robots.txt directives explicitly allowing it. Other major AI agents (GPTBot, ClaudeBot) are allowed. - Redirect Chain Patterns: The site relies heavily on .html extensions. Semantic URLs (e.g., /specials, /clearance-nav) consistently trigger 301 redirects to the .html or cleaned slug versions. - SSR Consistency: Content is consistently delivered via SSR with high word counts (>2500) across all templates. The site does not rely on client-side rendering for primary text content. - H1 Pattern Failure: A systemic failure exists in the semantic skeleton of high-level navigation and home templates. The Homepage, Stores, Shop By Department, Specials, Bathroom Menu, and Bathroom Accessories pages all fail to deliver an <h1> in the SSR payload. H1 coverage only begins at the specific sub-category (PLP) and product (PDP) levels. - Content Deferral: Listing pages (Clearance, Bins, Scales) exhibit extreme h1_char_offset values (440,000 to 502,000 characters). This signifies that half a megabyte of code/noise is transmitted before the primary page identifier is reached. Step 3 — EFFICIENCY & TOKEN WASTE ASSESSMENT: - Signal-to-Noise Ratio Patterns: Systemic inefficiency is present across the entire domain. The signal_to_noise_ratio ranges from 0.033 to 0.080. This indicates that 92% to 97% of the data retrieved by an AI crawler is technical noise, template bloat, or non-semantic metadata. - Data Island Prevalence: Category pages (PLPs) carry massive JSON-LD or data island payloads. Pages like "Clearance" and "Towels" contain 55 individual data islands totaling over 105 KB each. This exceeds the visible text character count (approx. 33 KB - 37 KB) by a factor of 3x. - UI Interference: landmark_interference.is_ui_element_preceding_content is true for every category and product page. Critical UI elements (headerv2, mask-overlay, search-overlay, global-banner) consistently precede the main content, forcing crawlers to process 400k+ characters of boilerplate navigation before reaching product data. - Internal Link Bloat: Every page analyzed contains an excessive internal link count, ranging from 1,132 to 1,629 links. A significant portion (approx. 1,090) are contained within the header. This creates massive token waste and dilutes the crawl priority of main content. Step 4 — INFRASTRUCTURE & DISCOVERY HEALTH: - Parent Path Stability: infrastructure.parent_path_status is consistently 200. The directory structure is stable. - Robots.txt Conflict: crawl_discovery.robots_txt shows "allow" for all bots, but network_access.bot_parity confirms Bingbot is 403 Forbidden. This discrepancy suggests an uncoordinated blocking strategy between the robots.txt file and the server/WAF layer. - Discovery Gaps: nav_visible_in_ssr is true across the site. AI crawlers can discover the site map via the massive link lists in the SSR payload, but the sheer volume of links (1,000+ per page) makes efficient discovery of new products difficult due to the "noise" of the header. - Header Consistency: Server headers (vary: accept-encoding, cache_control: no-cache) are uniform across the site, suggesting a centralized infrastructure management. Step 5 — CRITICAL CRAWLABILITY FAILURES: 1. Systematic Bingbot Blockade: Site-wide 403 Forbidden for Bingbot creates a total blind spot for any AI system or search engine utilizing Bing's crawl infrastructure (including Perplexity and various LLM browse tools). 2. Extreme Content Latency (Offset): AI crawlers must process 400KB to 500KB of metadata and navigation links before identifying the primary <h1> on listing pages. This increases compute costs for retrieval and risks context window overflow for many LLM-based agents. 3. High-Level Semantic Absence: The primary entry points (Home, Department, Specials, Bathroom Menu) contain zero <h1> tags in the SSR payload. This forces AI systems to rely on less reliable title tags or meta-descriptions to determine page intent. 4. Token Waste/Efficiency Crisis: With an SNR as low as 0.03, the site is 97% noise. The volume of header links (1090 per page) and data islands (100KB+ per page) creates a high-friction environment for machine readability. 5. Inconsistent H1 Distribution: H1s are present on PLPs but missing on the specific PDP analyzed (Berlinger Haus Knife Set), suggesting template-level inconsistencies within the product catalog.
Page Scores
Per-Page Analysis
https://www.homestoreandmore.ie/82 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/
FINAL DEST: https://www.homestoreandmore.ie/ (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 3143
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 20
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.6%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 492.2 KB
VISIBLE TEXT: 22.8 KB
DATA ISLANDS: 6 blocks (19.62 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: /
H1 TAG: MISSING
META TITLE: Home Store + More | Bedding, Curtains, Kitchen, Cookware + More - Home Store + More
META DESC: Discover Home Store + More's extensive collection of bedding, curtains, kitchen essentials, bathroom accessories, storage solutions, and home décor. Transform your living space with quality products at affordable prices. Shop by department or room today! Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1278
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a retail hub for 'Home Store + More' acting as the primary entry point. While SSR word count is healthy (3,143 words), the page is semantically crippled as it lacks an <h1> tag entirely in the SSR payload, forcing AI systems to rely on meta titles for intent. The content structure is dominated by navigation tiles and product carousels, which are accessible but buried under massive boilerplate.
Technical Access Assessment
The page exhibits a critical bot parity failure; Bingbot is met with a 403 Forbidden status while other AI agents are allowed 200 access. This confirms the site-wide WAF blockade identified in the Site Context remains active here. While robots.txt is permissive, the server-level rejection prevents indexing by any Bing-based LLM or browse tool. Infrastructure is stable (200 parent path), but the lack of an H1 prevents the crawler from establishing a content anchor.
Retrieval Efficiency Analysis
Retrieval efficiency is extremely poor with a signal-to-noise ratio of 0.046, indicating that over 95% of the 504,054 characters transmitted are technical noise or boilerplate. A massive internal link count (1,278 total) is heavily weighted in the header (1,090 links), forcing AI crawlers to process nearly 400KB of navigation data before reaching the primary content. Data islands are present but secondary to the overwhelming link bloat.
AI Retrieval Impact
The blockade of Bingbot creates a total invisibility risk for Microsoft Copilot and Perplexity. For allowed bots, the 500KB HTML payload creates high truncation risk; a RAG pipeline's chunker may exhaust its token budget on the header's 1,090 links before reaching meaningful product data. The absence of an H1 further complicates automated context window partitioning.
Recommendation
Priority 1: Synchronize WAF policies with robots.txt to allow Bingbot access, ending the 403 blockade. Priority 2: Inject a descriptive <h1> into the SSR payload to provide a semantic anchor. Priority 3: Refactor the header to externalize the 1,090 links or utilize a more efficient navigation structure to improve the signal-to-noise ratio above 0.10.
Score Justification
The page fails on machine readability due to the systemic 403 blockade of Bingbot and a total absence of a semantic H1. A signal-to-noise ratio of 0.04 means AI agents waste 95% of their compute/token budget on boilerplate navigation links before identifying page content.
https://www.homestoreandmore.ie/stores82 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/stores
FINAL DEST: https://www.homestoreandmore.ie/stores (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4680
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (8.1%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 453.4 KB
VISIBLE TEXT: 36.5 KB
DATA ISLANDS: 7 blocks (21.34 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: stores
H1 TAG: MISSING
META TITLE: Store Locator - Home Store + More
META DESC: Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1133
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page functions as a Store Locator hub. While the SSR word count is robust (4680 words), the page lacks a semantic <h1> tag in the SSR payload, mirroring the site-wide pattern of semantic failure on high-level navigation pages. The stripped text reveals a comprehensive list of store locations (Athlone, Blanchardstown, etc.), but the lack of a primary heading makes it difficult for machine agents to instantly classify the page's intent without full text parsing.
Technical Access Assessment
The page exhibits a critical bot access disparity: while GPTBot and ClaudeBot are allowed (200), Bingbot is systematically blocked (403 Forbidden), contradicting the robots.txt 'allow' directive. This suggests a server-side or WAF-level blockade targeting the Bing user agent, making the content invisible to Bing-powered AI systems like Copilot and Perplexity's Bing-based retrieval. There are no redirect hops, providing a direct but high-friction entry point.
Retrieval Efficiency Analysis
The retrieval efficiency is extremely poor with a signal-to-noise ratio of 0.0805, meaning over 92% of the 464KB HTML payload is technical noise. The primary barrier is the excessive internal link count (1133 total), of which 1090 are located in the header. This creates a massive 'token wall' that an AI crawler must navigate before reaching the actual store data. Additionally, the lack of an <h1> prevents crawlers from using standard DOM anchors to locate the start of meaningful content.
AI Retrieval Impact
There is a severe risk of content truncation or context window overflow for LLM-based retrievers. A RAG pipeline's chunker must process nearly 400,000 characters of boilerplate navigation before reaching the store list. For agents with limited token budgets, the actual store addresses and details may be discarded or truncated. The Bingbot blockade results in total invisibility for a significant portion of the AI ecosystem.
Recommendation
1. Resolve the WAF/CDN blockade of Bingbot to ensure parity across AI agents. 2. Implement a semantic <h1> tag (e.g., 'Our Store Locations') within the initial SSR payload. 3. Drastically prune the header link bloat; 1090 links per page is a critical efficiency failure. 4. Wrap store listings in a <main> element to provide a clear content anchor for machine extractors, bypassing the massive header boilerplate.
Score Justification
The score reflects the systemic blockade of Bingbot and the extreme token waste (SNR 0.08). While content is present in the SSR, the lack of an H1 and the requirement for an AI to process 400KB+ of noise to reach the text creates significant retrieval friction.
https://www.homestoreandmore.ie/shop-by-department81 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/shop-by-department
FINAL DEST: https://www.homestoreandmore.ie/shop-by-department.html (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 1
→ [301] https://www.homestoreandmore.ie/shop-by-department
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 2833
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 20
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.7%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 446.5 KB
VISIBLE TEXT: 20.8 KB
DATA ISLANDS: 6 blocks (19.62 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: shop-by-department.html
H1 TAG: MISSING
META TITLE: Shop By Department | Bedding, Curtains, Kitchen & More - Home Store + More
META DESC: Discover amazing deals across every category with our Shop by Department drop down, featuring bedding, candles, home décor and more. Explore special offers and clearance finds to refresh your home for less. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1192
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page serves as a Department Hub, designed to facilitate navigation across the site's categories. While it contains 2,833 words in the SSR payload, the unique content is thin (373 words), with the remainder composed of internal links. Consistent with the site-wide pattern for high-level navigation pages, this page lacks an <h1> tag in the SSR payload, providing no immediate semantic anchor for AI crawlers. The URL includes a .html extension, necessitating a 301 redirect from the semantic slug.
Technical Access Assessment
Access is characterized by a systemic failure in bot parity: Bingbot is 403 Forbidden, while other AI crawlers (GPTBot, ClaudeBot) are allowed. This creates a total blind spot for Bing-based systems like Copilot. The page also exhibits a site-wide redirect pattern (301 redirect to .html). Although robots.txt permits all agents, the server/WAF layer blocks Bingbot, indicating a configuration mismatch between the robots.txt directives and server-level security rules.
Retrieval Efficiency Analysis
The page suffers from extreme technical noise, with a signal-to-noise ratio (SNR) of 0.0467, meaning 95.33% of the HTML is boilerplate. The most critical gap is the total absence of an <h1> tag (h1_char_offset: -1), which forces LLMs to rely on non-semantic title tags to infer intent. Furthermore, the presence of 1,090 links in the header alone creates massive token waste for any crawler attempting to process the main content area.
AI Retrieval Impact
For LLM-based RAG pipelines and real-time browse sessions, the page is highly inefficient. The 457KB HTML size is dominated by a bloated header, consuming a significant portion of a typical context window before meaningful category links are reached. The Bingbot blockade ensures the page is excluded from the knowledge base of Microsoft-associated AI models. The lack of a semantic skeleton increases the risk of extraction errors or hallucinations regarding the page's primary purpose.
Recommendation
First, align WAF rules with robots.txt to whitelist Bingbot (User-Agent). Second, implement a semantic <h1> tag ('Shop By Department') in the SSR payload immediately following the body tag. Third, drastically reduce the SSR footprint by moving the 1,090-link header menu into a deferred script or an external JSON data island, which would improve the SNR and lower the compute cost for AI retrieval.
Score Justification
The score reflects the systematic 403 blockade of Bingbot and the total absence of a semantic H1. While the page is accessible to GPTBot and ClaudeBot, the extremely low SNR (0.04) and massive header bloat make it a high-friction target for machine extraction, consistent with the site-wide crawlability failures identified in the context.
https://www.homestoreandmore.ie/specials81 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/specials
FINAL DEST: https://www.homestoreandmore.ie/specialoffers.html (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 1
→ [301] https://www.homestoreandmore.ie/specials
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 2880
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 17
IMAGES LAZY-ONLY: 1
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.9%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 434.8 KB
VISIBLE TEXT: 21.1 KB
DATA ISLANDS: 6 blocks (19.62 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: specialoffers.html
H1 TAG: MISSING
META TITLE: Special Offers | Best Deals, Offers & More... - Home Store + More
META DESC: Discover incredible deals and exclusive promotions that transform your shopping experience. Our handpicked special offers deliver exceptional value across premium products, ensuring you never miss out on extraordinary savings. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1157
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page functions as a high-level promotional hub for 'Special Offers', resolving via a 301 redirect from /specials to /specialoffers.html. While the SSR payload contains a significant word count (2880 words), it follows the systemic site-wide failure of missing a semantic <h1> tag. The semantic skeleton is reliant entirely on the <title> and meta description, providing no structural anchor for AI content extraction.
Technical Access Assessment
Access is bifurcated: standard AI bots (GPTBot, ClaudeBot) are permitted, but Bingbot is met with a 403 Forbidden status, confirming a site-wide WAF blockade that contradicts robots.txt 'allow' directives. This creates a total blind spot for Bing-integrated AI systems (Copilot, Perplexity). The 301 redirect adds a minor latency hop, but the primary barrier is the server-level bot discrimination.
Retrieval Efficiency Analysis
Retrieval efficiency is critically low with a signal-to-noise ratio (SNR) of 0.0485, meaning 95.15% of the 445KB transfer is technical noise. The page contains 1,157 internal links, with 1,090 (94%) trapped in the header boilerplate. The h1_char_offset is non-existent (-1) because no H1 is present in the SSR, forcing crawlers to parse hundreds of thousands of characters before identifying page intent.
AI Retrieval Impact
AI agents face extreme 'Token Waste.' A RAG pipeline or LLM chunker must process nearly half a megabyte of code to extract roughly 21KB of visible text. For Bing-based agents, the page is invisible. For allowed bots, the excessive link density and lack of a semantic header risk context window truncation or 'hallucination' of page purpose due to the overwhelming navigation-to-content ratio.
Recommendation
1. Immediate WAF reconfiguration to whitelist the Bingbot user-agent. 2. DOM restructuring to include a primary <h1>Special Offers</h1> within the SSR payload. 3. Reduction of the 1,090-link header bloat by externalizing sub-navigation into a JSON endpoint or utilizing CSS/JS-based lazy loading to improve the signal-to-noise ratio from 0.04 to a target of >0.15.
Score Justification
The score is heavily penalized by the site-wide Bingbot blockade (403) and the extreme technical noise (SNR < 0.05). The absence of an <h1> tag on a primary navigation hub further degrades machine readability, as it lacks the essential semantic anchor required for reliable content chunking in AI workflows.
https://www.homestoreandmore.ie/clearance-nav75 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/clearance-nav
FINAL DEST: https://www.homestoreandmore.ie/clearance (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 1
→ [301] https://www.homestoreandmore.ie/clearance-nav
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4871
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.8%)
[FAIL] H1 Source Position (502,787 chars)
[FAIL] UI Interference
HTML SIZE: 775.5 KB
VISIBLE TEXT: 37.1 KB
DATA ISLANDS: 55 blocks (104.55 KB total, largest: 40.56 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: clearance
H1 TAG: Clearance (94 products)
META TITLE: Clearance - Home Store + More
META DESC: Bag yourself a bargain with our selection of discounted items, When It's Gone It's Gone! Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1629
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
This is a Clearance Product Listing Page (PLP) identifying 94 products. The page successfully delivers content via SSR (4,871 words) following a 301 redirect from /clearance-nav to /clearance. While the semantic skeleton exists, the content is buried beneath a massive site-wide boilerplate architecture.
Technical Access Assessment
The page exhibits a major bot parity failure: Bingbot is 403 Forbidden (likely a WAF/Server rule) despite being allowed in robots.txt. This blocks all Bing-based AI services (Copilot, Perplexity). GPTBot and ClaudeBot are permitted. A single 301 redirect hop is present, consistent with the site's directory-to-file pathing strategy.
Retrieval Efficiency Analysis
Retrieval efficiency is extremely low with a Signal-to-Noise Ratio (SNR) of 0.0478; 95% of the 794KB HTML payload is noise. The primary <h1> is deferred by 502,787 characters, requiring an AI to process over 500KB of metadata, 55 data islands (104KB), and 1,090 header links before reaching the main content identifier.
AI Retrieval Impact
High risk of content truncation in LLM-based RAG pipelines due to the extreme H1 offset. The 'token tax' for retrieving 38KB of visible text is nearly 20x the actual content size. Total invisibility to Bingbot infrastructure prevents indexing by a major subset of AI search tools. Discovery of products is inefficient as the crawler is bombarded with 1,629 links per page.
Recommendation
Priority 1: Resolve the Bingbot 403 blockade at the WAF/CDN level. Priority 2: Move the main content block and <h1> higher in the DOM to reduce the 500KB+ character offset. Priority 3: Externalize or prune the 55 JSON-LD data islands. Priority 4: Implement a 'lite' header for bots to reduce the 1,090-link boilerplate waste.
Score Justification
The score reflects the total blockade of Bingbot and the severe retrieval friction caused by a 502,787-character H1 offset and a 0.04 SNR, which together make the page nearly unreadable for context-limited AI agents.
https://www.homestoreandmore.ie/bathroom-menu82 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-menu
FINAL DEST: https://www.homestoreandmore.ie/bathroom-menu (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 2884
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 11
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.9%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 432.8 KB
VISIBLE TEXT: 21.2 KB
DATA ISLANDS: 7 blocks (20.42 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: bathroom-menu
H1 TAG: MISSING
META TITLE: Bathroom - Home Store + More
META DESC: Decorate your bathroom with our range of quality and affordable bathroom accessories; from toilet seats, toilet brushes, bathroom mirrors, cabinets and bathroom bins, we have everything you need to kit out your bathroom fully. Make life in the shower easier with our shower caddies and bathroom racks o your toiletries are always within arm's reach. Keep your bathroom organised with our toiler roll holders, tumblers, soap dispensers, towel rails and door hooks to avoid clutter. Treat yourself to some luxury with our premium Egyptian cotton towels that are sure to bring back memories of your finest hotel experience. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1146
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page functions as a departmental hub (Bathroom Menu) designed for navigation rather than direct product conversion. While it successfully delivers a substantial SSR word count of 2,884, it suffers from the site-wide semantic failure of missing a primary <h1> tag. The content consists primarily of category links (Towels, Accessories, Furniture) and SEO boilerplate text at the footer. The semantic skeleton relies entirely on the <title> and meta description, as the document body lacks a top-level heading to anchor the entity 'Bathroom'.
Technical Access Assessment
Access is bifurcated: most AI agents (GPTBot, ClaudeBot) receive a 200 OK, but Bingbot is met with a 403 Forbidden, confirming the site-wide WAF/server-level blockade noted in the Site Context. This contradicts the robots.txt 'allow' directive. The page avoids redirect chains (resolving directly), but the infrastructure delivers a massive 443KB payload for only 21KB of visible text. The internal link count (1,146) is excessive, with 1,090 links concentrated in the header, creating a massive crawl frontier that dilutes the current page's relevance.
Retrieval Efficiency Analysis
The primary retrieval barrier is the signal-to-noise ratio of 0.049, meaning 95.1% of the transmitted data is technical noise or template boilerplate. The absence of an <h1> (ssr_h1_found: false) prevents AI crawlers from definitively anchoring the page's primary topic within the first 500KB of data. While data island bloat (20.42 KB) is lower here than on product pages, the 'Landmark Interference' is high because the crawler finds no <main> or <h1> to use as a content anchor, forcing it to process 1,000+ header links before reaching departmental content.
AI Retrieval Impact
AI retrieval systems face significant 'Context Window Waste.' A RAG pipeline or LLM browser must ingest nearly half a megabyte of HTML to extract 21KB of text, significantly increasing token costs and the risk of content truncation. For systems utilizing Bing's infrastructure (Copilot, Perplexity), the page is entirely invisible due to the 403 status. The lack of an <h1> combined with the massive link-to-text ratio makes it difficult for LLMs to distinguish the 'main' content from the site-wide navigation boilerplate.
Recommendation
1. Immediate remediation of the Bingbot 403 error at the WAF level to ensure parity across AI agents. 2. Inject a semantic <h1>Bathroom</h1> within the SSR payload to provide a machine-readable content anchor. 3. Drastically reduce the header link density (currently 1,090 links) or implement a conditional 'lite' header for bot user-agents to improve the signal-to-noise ratio. 4. Wrap departmental content in a <main> landmark to assist bots in skipping boilerplate navigation.
Score Justification
The score reflects the critical 403 blockade of Bingbot and the systemic absence of a primary H1. While SSR integrity is high for word count, the 0.049 SNR and massive header bloat (1,090 links) create a high-friction environment for machine readability and token efficiency.
https://www.homestoreandmore.ie/bathroom-accessories82 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-accessories
FINAL DEST: https://www.homestoreandmore.ie/bathroom-accessories (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 2869
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 11
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.9%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 430.9 KB
VISIBLE TEXT: 21.1 KB
DATA ISLANDS: 6 blocks (19.62 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: bathroom-accessories
H1 TAG: MISSING
META TITLE: Bathroom Accessories - Home Store + More
META DESC: Browse our wide selection of bathroom accessories, including bins, scales, caddies, curtains, mats, toilet brushes, and toilet seats. Find everything you need to complete your bathroom decor and personal care routine at our online home store. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1149
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page functions as a navigation hub for the 'Bathroom Accessories' category. While the SSR payload contains 2,869 words, the stripped content is only 282 words, indicating that the vast majority of the payload is navigation and boilerplate. Critically, the page lacks an <h1> tag, using <h2> for the primary title, which follows the problematic site-wide pattern for high-level entry pages.
Technical Access Assessment
The page exhibits a systemic 403 Forbidden status for Bingbot, mirroring the site-wide WAF blockade, which prevents access for Bing-based AI agents (Copilot/Perplexity). While GPTBot and ClaudeBot have access (200), the page delivers 1,149 internal links in the SSR payload, 1,090 of which are in the header. This creates massive retrieval overhead.
Retrieval Efficiency Analysis
The Signal-to-Noise Ratio (SNR) is a dismal 0.0489, meaning approximately 95% of the data ingested by an AI is technical noise or repetitive links. The absence of an <h1> tag (offset -1) means there is no clear semantic anchor for an AI to identify the page's primary entity within the first 440KB of code.
AI Retrieval Impact
LLM-based agents will suffer from extreme token waste, consuming a significant portion of their context window on the header's 1,090 links before reaching the actual category content. This risks truncation and poor embedding quality. Furthermore, the Bingbot blockade creates a total invisibility gap for a major segment of the AI ecosystem.
Recommendation
1. Immediately whitelist the Bingbot user-agent at the WAF level to resolve the 403 blockade. 2. Promote the 'Bathroom Accessories' <h2> to an <h1> to provide a semantic anchor. 3. Implement a 'Mobile-First' or truncated SSR navigation strategy to reduce the 1,090 header links to a manageable number (e.g., top-level categories only).
Score Justification
The score is significantly impacted by the Bingbot blockade and the extreme link bloat. While the content is theoretically accessible to GPTBot, the 0.048 SNR and the lack of an H1 make efficient extraction and accurate chunking highly difficult for RAG pipelines.
https://www.homestoreandmore.ie/bathroom-bins76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-bins
FINAL DEST: https://www.homestoreandmore.ie/bathroom-bins (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4097
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (3.9%)
[FAIL] H1 Source Position (442,080 chars)
[FAIL] UI Interference
HTML SIZE: 814.5 KB
VISIBLE TEXT: 31.5 KB
DATA ISLANDS: 55 blocks (95.53 KB total, largest: 31.16 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: bathroom-bins
H1 TAG: Bathroom Bins (35 products)
META TITLE: Bathroom Bins - Home Store + More
META DESC: Discover a wide selection of bathroom bins, including swing, pedal, metal, plastic, and touch bins. Keep your bathroom organized and clutter-free with our stylish and functional bins. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1372
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for 'Bathroom Bins', correctly identified by the H1 and a robust SSR word count of 4,097. It is not an empty shell, as primary product data (titles, prices, swatches) is rendered in the initial HTTP response. The semantic slug aligns perfectly with the content entity, but the structural integrity is compromised by extreme boilerplate volume.
Technical Access Assessment
Access is bifurcated: while GPTBot, ClaudeBot, and Googlebot are granted 200 OK status, Bingbot is met with a 403 Forbidden error. This site-wide WAF blockade contradicts the robots.txt 'allow' directive, creating a 'Silent Blockade' for Bing-based AI services. The infrastructure is stable with a 200 parent path status, but the heavy reliance on 1,372 internal links (1,090 in the header alone) creates significant crawl friction.
Retrieval Efficiency Analysis
The 'Context Window Budget' is severely strained. With a signal-to-noise ratio of 0.0386, only ~3.8% of the 834KB payload is meaningful text. A massive H1 character offset of 442,080 means an AI agent must ingest nearly half a megabyte of code, navigation links, and data islands (95.53 KB) before reaching the primary page identifier. Landmark interference from 'headerv2' and 'mask-overlay' pushes the core content deep into the DOM.
AI Retrieval Impact
The risk of content truncation is high; many RAG pipelines and LLM context windows will be exhausted by boilerplate before fully processing the 35 products listed. Bing-based AI systems (Copilot, Perplexity) are completely blind to this content. Furthermore, the massive link bloat in the header (1,090 links) dilutes the crawl budget and creates a high-token-cost environment for real-time browse sessions.
Recommendation
Primary: Resolve the WAF/CDN rule triggering 403 errors for Bingbot to enable Copilot access. Secondary: Critical DOM restructuring is required to reduce the h1_char_offset; move product content above the 1,000+ link navigation block. Tertiary: Externalize or prune the 95KB of data islands and reduce the header link density to improve token efficiency and signal-to-noise ratio.
Score Justification
While SSR content is technically present and accurate, the page suffers from a 403 blockade of Bingbot and extreme token waste. The 442KB content deferral and 0.038 SNR represent significant technical barriers to efficient machine readability and RAG ingestion.
https://www.homestoreandmore.ie/bathroom-scales76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-scales
FINAL DEST: https://www.homestoreandmore.ie/bathroom-scales (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 3810
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (5.0%)
[FAIL] H1 Source Position (442,893 chars)
[FAIL] UI Interference
HTML SIZE: 615.8 KB
VISIBLE TEXT: 30.5 KB
DATA ISLANDS: 49 blocks (95.04 KB total, largest: 36.13 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: bathroom-scales
H1 TAG: Bathroom Scales (21 products)
META TITLE: Bathroom Scales - Home Store + More
META DESC: Discover the ideal bathroom scales to suit your lifestyle and wellness objectives. Our range encompasses digital and mechanical models, featuring sleek designs, reliable accuracy, and user-friendly features. Track your progress and stay motivated every day with our trusted bathroom scales. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1245
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
This page is a Product Listing Page (PLP) for 'Bathroom Scales' that delivers a substantial SSR payload of 3,810 words. Unlike the site-wide hub pages which lack H1 tags, this page successfully renders a semantic H1 ('Bathroom Scales (21 products)'). However, the primary content is a small fraction of the total delivery, with only 31,193 characters of visible text within a 630,557 character HTML document, indicating a high-density boilerplate environment.
Technical Access Assessment
Access is inconsistent across the AI bot spectrum. While GPTBot, ChatGPT-User, and ClaudeBot are granted access (200), Bingbot is systematically blocked (403 Forbidden), mirroring a site-wide WAF pattern that ignores robots.txt 'allow' directives. The infrastructure uses 'no-cache' and 'vary: accept-encoding' headers, suggesting a dynamic but unoptimized delivery layer. No redirect hops are present, allowing for direct retrieval by non-blocked agents.
Retrieval Efficiency Analysis
The retrieval efficiency is severely compromised by content deferral and noise. The H1 character offset is 442,893, meaning an AI crawler must process nearly 443 KB of technical noise and navigation links before reaching the page's primary identifier. With a signal-to-noise ratio of 0.0495, 95% of the data retrieved is non-content material. Furthermore, 49 data islands totaling 95 KB (3x the visible text size) create significant token waste for LLM-based extractors.
AI Retrieval Impact
The extreme H1 offset presents a high risk of context window truncation; many RAG pipelines and browse tools may reach their token limit or timeout before reaching the product data. The systematic 403 blockade for Bingbot results in total invisibility for Microsoft Copilot and other Bing-based discovery systems. The excessive internal link count (1,090 links in the header alone) dilutes the crawl budget and creates a high-friction environment for machine readability.
Recommendation
1. Immediately whitelist Bingbot at the WAF/CDN level to resolve the site-wide 403 blockade. 2. Implement a 'content-first' DOM structure to move the primary H1 and product grid above the global navigation boilerplate, reducing the h1_char_offset. 3. Prune the header link volume (currently 1,090 links) to reduce token waste and improve discovery efficiency. 4. Consolidate and externalize the 49 data islands to minimize the JSON-LD footprint in the primary SSR payload.
Score Justification
The page is technically accessible to GPT/Claude but is effectively 'invisible' to Bing/Copilot. The machine readability is severely hindered by an extreme H1 offset (442k characters) and a massive internal link volume (1,090 header links), resulting in 95% token waste.
https://www.homestoreandmore.ie/shower-caddies-bathroom-fittings76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/shower-caddies-bathroom-fittings
FINAL DEST: https://www.homestoreandmore.ie/shower-caddies-bathroom-fittings (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4155
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.8%)
[FAIL] H1 Source Position (451,255 chars)
[FAIL] UI Interference
HTML SIZE: 704.1 KB
VISIBLE TEXT: 34.1 KB
DATA ISLANDS: 55 blocks (102.51 KB total, largest: 36.38 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: shower-caddies-bathroom-fittings
H1 TAG: Shower Caddies & Bathroom Fittings (82 products)
META TITLE: Shower Caddies & Bathroom Fittings - Home Store + More
META DESC: Explore a wide range of high-quality shower caddies, tension rods, plug chains, toothbrush caddies, and towel rails for your bathroom. Shop now for stylish and functional bathroom fittings. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1328
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
This is a Product Listing Page (PLP) for 'Shower Caddies & Bathroom Fittings' containing 82 product entities. Unlike the site-wide entry points described in the Site Context, this page successfully delivers a semantic <h1> in the SSR payload. The SSR word count (4,155) is robust, and the hydration_existence check confirms it is not an empty shell, with product data and filter options rendered in the initial HTTP response.
Technical Access Assessment
The page exhibits the systemic site-wide 403 Forbidden blockade for Bingbot, effectively excluding it from Microsoft Copilot and Perplexity's primary search index. Other major agents (GPTBot, ChatGPT-User, ClaudeBot) have 200 OK access. Robots.txt is technically permissive, confirming an uncoordinated WAF/Server-level blockade. Infrastructure is stable (200 parent path), but the redirect hygiene is better here than on other analyzed pages as the requested URL resolves directly.
Retrieval Efficiency Analysis
The retrieval efficiency is critical-tier poor. With a signal_to_noise_ratio of 0.0484, approximately 95% of the 721,001 characters retrieved is technical noise. The primary <h1> is deferred behind an offset of 451,255 characters, meaning an AI crawler must process nearly 0.5MB of boilerplate before identifying the page subject. Additionally, 55 data islands totaling 102.51 KB (3x the visible text volume) create a significant 'token tax' for any context window.
AI Retrieval Impact
There is a high risk of context window truncation; many LLM-based RAG pipelines may reach their token limit before processing the actual product grid due to the 451k character H1 offset. The massive internal link count (1,328 total, 1,090 in the header) dilutes the priority of main content. For Bing-based agents, the page is non-existent, creating a significant discovery gap in the AI ecosystem.
Recommendation
1. Immediately whitelist the Bingbot user-agent in the WAF/CDN to resolve the 403 blockade. 2. Implement DOM restructuring to move the <main> content and <h1> above the 1,090-link header boilerplate to reduce H1 offset. 3. Externalize or compress the 102KB of JSON-LD/script data islands to improve the signal-to-noise ratio. 4. Streamline the header navigation for bot-specific requests to reduce the 1,000+ link token waste.
Score Justification
The score reflects a functional SSR implementation and semantic <h1> (improving upon site-wide entry-point patterns), heavily offset by the 403 Bingbot blockade and extreme retrieval inefficiency (0.04 SNR and 451k character content deferral).
https://www.homestoreandmore.ie/shower-curtains-safety-mats76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/shower-curtains-safety-mats
FINAL DEST: https://www.homestoreandmore.ie/shower-curtains-safety-mats (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4070
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.6%)
[FAIL] H1 Source Position (440,041 chars)
[FAIL] UI Interference
HTML SIZE: 706.2 KB
VISIBLE TEXT: 32.3 KB
DATA ISLANDS: 55 blocks (96.05 KB total, largest: 30.38 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: shower-curtains-safety-mats
H1 TAG: Shower Curtains & Safety Mats (35 products)
META TITLE: Shower Curtains & Safety Mats - Home Store + More
META DESC: Find the perfect shower curtains and safety mats for your bathroom. Explore our wide selection of styles, sizes, and colours to enhance your bathroom decor. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1316
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for 'Shower Curtains & Safety Mats'. It is not an empty shell; the SSR word count (4070) indicates a robust initial payload. The semantic skeleton aligns with the URL slug, and the <h1> tag correctly identifies the entity (35 products), providing a clear target for machine retrieval once the technical barriers are bypassed.
Technical Access Assessment
Access is bifurcated. While GPTBot, ClaudeBot, and Googlebot are granted 200 OK access, Bingbot is strictly blocked (403 Forbidden). This site-wide pattern indicates a WAF/CDN-level rule targeting Microsoft's infrastructure, effectively making the content invisible to Copilot and Perplexity sessions. The robots.txt file is misleading as it claims to allow Bingbot, creating a discovery/enforcement conflict. The infrastructure relies on a standard SSR delivery with no unnecessary redirect hops.
Retrieval Efficiency Analysis
The 'Context Window Budget' is under extreme stress. The h1_char_offset is 440,041 characters, meaning an AI agent must ingest nearly half a megabyte of boilerplate code and navigation before reaching the primary page identifier. This is exacerbated by a signal_to_noise_ratio of 0.0457, where 95% of the page is non-content metadata or UI noise. The presence of 1,090 links in the header alone creates a massive 'token wall' that precedes the product data.
AI Retrieval Impact
High risk of content truncation in RAG pipelines. Standard LLM context windows or retrieval chunkers may discard the product list if they stop processing before the 440k character mark. The site-wide Bingbot blockade prevents knowledge indexing for a significant portion of the AI ecosystem. Furthermore, the 96KB of data islands consumes a disproportionate amount of the retrieval budget relative to the 33KB of visible text.
Recommendation
1. Immediately whitelist Bingbot at the WAF/CDN level to resolve the 403 disparity. 2. Implement DOM restructuring to move the <main> content and <h1> above the 1,090-link header in the SSR payload, reducing h1_char_offset. 3. Externalize or compress the 55 data islands to improve the signal-to-noise ratio. 4. Prune the header link count to essential navigation to reduce token waste per page hit.
Score Justification
While SSR integrity is high and GPTBot access is clear, the page suffers from a site-wide Bingbot blockade and extreme content deferral (440k char offset). The 95% noise ratio makes it highly inefficient for real-time browse sessions.
https://www.homestoreandmore.ie/toilet-brushes76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/toilet-brushes
FINAL DEST: https://www.homestoreandmore.ie/toilet-brushes (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4064
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.0%)
[FAIL] H1 Source Position (439,971 chars)
[FAIL] UI Interference
HTML SIZE: 791.8 KB
VISIBLE TEXT: 31.9 KB
DATA ISLANDS: 55 blocks (95.02 KB total, largest: 30.54 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: toilet-brushes
H1 TAG: Toilet Brushes (44 products)
META TITLE: Toilet Brushes - Home Store + More
META DESC: Explore our wide selection of toilet brushes in different styles and colours to suit your bathroom decor. Shop now for high-quality, durable toilet brushes that will make your cleaning routine a breeze. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1358
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for 'Toilet Brushes' with high SSR integrity, delivering over 4,000 words in the initial payload. It is not an empty shell, and the semantic slug accurately reflects the product entity. However, the core content is buried under an immense volume of boilerplate code.
Technical Access Assessment
While GPTBot, ClaudeBot, and Googlebot are permitted, there is a hard server-level 403 Forbidden blockade against Bingbot, which is consistent with the site-wide pattern. This prevents all Microsoft-ecosystem AI agents (Copilot) from accessing the page. The technical stack is heavy, containing 1,358 internal links—1,090 of which are in the header alone.
Retrieval Efficiency Analysis
Retrieval efficiency is critically low with a signal-to-noise ratio of 0.04, meaning 96% of the data is technical noise. The primary <h1> is deferred by 439,971 characters, forced deep into the payload by UI overlays and a massive header. Furthermore, the page carries 95KB of data islands, nearly triple the volume of the actual visible text (32KB).
AI Retrieval Impact
There is a severe risk of content truncation in RAG pipelines; many LLM chunkers will exhaust their token budget on the header and JSON-LD data islands before reaching the product grid. Bing-based AI systems are completely blind to this page. The excessive internal links create a discovery bottleneck, diluting crawl priority for specific product URLs.
Recommendation
1. Resolve the WAF/CDN rule causing the 403 status for Bingbot to enable Copilot accessibility. 2. Implement DOM restructuring to move the main content block above the 1,000+ header links in the SSR output. 3. Externalize or compress the 95KB of JSON-LD data islands. 4. Reduce the header link density to improve the signal-to-noise ratio.
Score Justification
The score reflects a total blockade of Bingbot (403) and extreme content deferral (H1 offset of 440k characters). While SSR word count is high, the 0.04 signal-to-noise ratio and excessive link bloat make the page highly inefficient for machine retrieval.
https://www.homestoreandmore.ie/toilet-seats76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/toilet-seats
FINAL DEST: https://www.homestoreandmore.ie/toilet-seats (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 3760
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.8%)
[FAIL] H1 Source Position (438,819 chars)
[FAIL] UI Interference
HTML SIZE: 611.6 KB
VISIBLE TEXT: 29.2 KB
DATA ISLANDS: 45 blocks (85.81 KB total, largest: 30.86 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: toilet-seats
H1 TAG: Toilet Seats (19 products)
META TITLE: Toilet Seats - Home Store + More
META DESC: Shop our range of classic toilet seats or slow close and soft close toilet seats. Choose from a selection of vibrant colours and printed toilet seats to inject a splash of colour into your bathroom. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1253
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for 'Toilet Seats'. It is not an empty shell, as the SSR payload contains a significant word count (3,760) and a clear semantic H1 ('Toilet Seats (19 products)'). The semantic slug aligns perfectly with the content found in the stripped text, which lists specific products like the 'Sabichi Slow Close White Toilet Seat'.
Technical Access Assessment
The page exhibits the site-wide Bingbot blockade (403 Forbidden), despite robots.txt directives claiming access is allowed, indicating a server-level WAF exclusion that prevents Microsoft-based AI tools from reaching the content. While GPTBot and ClaudeBot have access (200), the infrastructure suffers from extreme internal link bloat, with 1,090 links in the header alone out of 1,253 total internal links, forcing bots to process massive amounts of boilerplate.
Retrieval Efficiency Analysis
The primary barrier to extraction is Content Deferral. The H1 is buried at a character offset of 438,819, meaning an AI must process nearly 0.44MB of technical noise, data islands, and navigation links before identifying the page's primary entity. With a signal-to-noise ratio of 0.0477, over 95% of the data retrieved is non-semantic code, and the 85.81 KB of data islands exceeds the visible text count (29,902 characters) by nearly 3x.
AI Retrieval Impact
The high H1 offset and low SNR create a high risk of context window truncation for LLM-based browse sessions and RAG pipelines. An AI agent's token budget is largely consumed by the 1,000+ link header and massive JSON-LD payloads (45 islands) before reaching product details. This results in significant compute waste and potential failure to embed the actual product catalog during high-volume crawls.
Recommendation
Priority 1: Resolve the Bingbot 403 error at the WAF level to restore access to Microsoft AI agents. Priority 2: Restructure the DOM to move the main content block (including the H1 and product grid) above the massive header navigation to reduce the H1 offset. Priority 3: Externalize or prune the 45 data islands (85KB) to improve token efficiency for machine readers.
Score Justification
The score is heavily penalized by the site-wide Bingbot blockade and extreme content deferral (438k character offset). While SSR integrity is good for some bots, the 95% noise-to-signal ratio and extreme link bloat make it highly inefficient for AI retrieval.
https://www.homestoreandmore.ie/bathroom-cabinets76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-cabinets
FINAL DEST: https://www.homestoreandmore.ie/bathroom-cabinets (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 3884
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (5.0%)
[FAIL] H1 Source Position (440,433 chars)
[FAIL] UI Interference
HTML SIZE: 610.9 KB
VISIBLE TEXT: 30.7 KB
DATA ISLANDS: 45 blocks (81.16 KB total, largest: 25.85 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: bathroom-cabinets
H1 TAG: Bathroom Furniture (19 products)
META TITLE: Bathroom Furniture - Home Store + More
META DESC: Transform your bathroom with our stylish and functional bathroom cabinets. Choose from tall units, shelving, storage organisers, and drawer cabinets to optimise space and keep everything neat and orderly. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1287
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for Bathroom Furniture/Cabinets. It exhibits strong Server-Side Rendering (SSR) integrity with a word count of 3,884 and a clearly defined semantic skeleton that aligns with the URL slug. It is not an 'empty shell,' as the product grid and filter text are present in the initial HTTP response.
Technical Access Assessment
Access is bifurcated: while GPTBot and ClaudeBot receive 200 OK, Bingbot is systematically blocked with a 403 Forbidden status, consistent with the site-wide pattern. This indicates a WAF or server-level blockade that contradicts the robots.txt 'allow' directives. No redirect chains were detected, providing a direct but high-latency path for allowed bots.
Retrieval Efficiency Analysis
Retrieval efficiency is severely compromised by a Signal-to-Noise Ratio (SNR) of only 0.05. The primary content is buried under 440,433 characters of boilerplate (H1 offset). Additionally, 81.16 KB of JSON-LD/data islands and 1,090 internal links in the header precede the main content, creating a massive 'noise' layer for machine extraction.
AI Retrieval Impact
The page carries a significant risk of truncation and context window waste. LLM-based crawlers must process nearly 0.5 MB of navigation and technical noise before reaching the first product entity. For agents with limited token limits (e.g., real-time browse sessions), the core content may be truncated or lost entirely. Bing-reliant AI systems (Copilot) have zero visibility.
Recommendation
1. Immediate WAF/CDN reconfiguration to allow Bingbot access. 2. DOM restructuring to move the main content block and H1 above the 1,000+ header links in the source code. 3. Externalize or optimize the 45 data islands to reduce the token budget consumed by technical metadata.
Score Justification
The page earns points for excellent SSR content and H1 presence, but is severely penalized for the site-wide Bingbot blockade and a catastrophic signal-to-noise ratio that forces AI to process 95% noise for 5% signal.
https://www.homestoreandmore.ie/bathroom-mirrors76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-mirrors
FINAL DEST: https://www.homestoreandmore.ie/bathroom-mirrors (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4150
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.9%)
[FAIL] H1 Source Position (451,082 chars)
[FAIL] UI Interference
HTML SIZE: 672.1 KB
VISIBLE TEXT: 32.9 KB
DATA ISLANDS: 53 blocks (97.26 KB total, largest: 34.48 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: bathroom-mirrors
H1 TAG: Bathroom Mirrors (23 products)
META TITLE: Bathroom Mirrors - Home Store + More
META DESC: Title: Shop Bathroom Mirrors - Oval, Rectangular, Round, Hollywood, LED & Cosmetic Mirrors | Online Home Store Meta Description: Explore our wide selection of high-quality bathroom mirrors. Choose from oval, rectangular, round, Hollywood, LED, and cosmetic mirrors to elevate your bathroom decor. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1318
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for 'Bathroom Mirrors' containing 23 products. It demonstrates strong SSR integrity with an SSR word count of 4,150 and a confirmed H1 tag. Unlike the high-level hub pages in the site context, this sub-category template successfully renders its primary semantic identifier ('Bathroom Mirrors (23 products)') in the initial HTML payload. The content is visible and well-structured within the SSR output, avoiding the 'empty shell' syndrome.
Technical Access Assessment
The page exhibits a critical failure in bot parity: Bingbot is 403 Forbidden at the server/WAF level despite robots.txt explicitly allowing it. This aligns with a site-wide blockade of Microsoft/Bing-based AI tools. While GPTBot, ChatGPT-User, and ClaudeBot have access (200 OK), the infrastructure suffers from massive internal link bloat (1,318 links), with 1,090 links residing in the header alone. This creates a high-friction environment for crawlers navigating via the SSR link list.
Retrieval Efficiency Analysis
Retrieval efficiency is severely compromised by a Signal-to-Noise Ratio (SNR) of 0.049, meaning 95% of the 688KB HTML is non-content noise. The H1 character offset is 451,082 characters—nearly half a megabyte of code (mostly the 1,000+ link header and 53 data islands totaling 97KB) precedes the primary page title. This 'latency' in content delivery forces AI agents to process massive amounts of boilerplate before reaching the product data.
AI Retrieval Impact
The extreme H1 offset and low SNR create a high risk of context window truncation; many LLM-based RAG pipelines or browse tools may exhaust their token budget on the global navigation links before ever reaching the product grid. Furthermore, the 403 status for Bingbot renders this category invisible to any AI system utilizing Bing's index, including Microsoft Copilot and specific Perplexity search modes.
Recommendation
1. Immediate WAF/CDN configuration change to whitelist the Bingbot user-agent. 2. Prioritize DOM restructuring to move the <h1> and primary product grid above the global navigation header in the HTML source to reduce the H1 offset from 450k to under 50k characters. 3. Externalize or optimize the 97KB of JSON-LD data islands. 4. Implement a 'mega-menu' strategy that loads internal links via client-side hydration or a secondary fetch to reduce SSR token waste.
Score Justification
The page is technically reachable for OpenAI and Anthropic bots, but the 403 Forbidden for Bingbot and the catastrophic 451k character H1 offset significantly hinder machine readability. An SNR of 0.049 indicates extreme token waste that could lead to content truncation in AI retrieval sessions.
https://www.homestoreandmore.ie/bathroom-mats76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/bathroom-mats
FINAL DEST: https://www.homestoreandmore.ie/bathroom-mats (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4276
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (4.0%)
[FAIL] H1 Source Position (451,339 chars)
[FAIL] UI Interference
HTML SIZE: 814.4 KB
VISIBLE TEXT: 32.6 KB
DATA ISLANDS: 55 blocks (100.98 KB total, largest: 36.59 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: bathroom-mats
H1 TAG: Bath Mats (31 products)
META TITLE: Bath Mats - Home Store + More
META DESC: Discover our extensive collection of premium bathroom mats, crafted from high-quality materials for lasting durability. Available in an array of vibrant colours and contemporary designs, our bathroom mats add both elegance and slip-resistant safety to your bathroom space. Transform your bathroom's look today - browse our selection! Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1388
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
This page is a Product Listing Page (PLP) for 'Bath Mats' containing 31 products. It is correctly identified as a non-empty shell with a robust SSR word count of 4,276. Unlike higher-level navigation pages on this site, this template successfully includes a primary <h1> tag within the SSR payload, making the core entity clear to crawlers once reached.
Technical Access Assessment
Access is bifurcated: while GPTBot, ClaudeBot, and Googlebot are permitted (200), Bingbot is systematically blocked (403) at the server or WAF level despite robots.txt permissions. This creates a total visibility failure for any AI services utilizing Bing infrastructure (e.g., Microsoft Copilot). The page is served directly without redirect chains, maintaining infrastructure stability.
Retrieval Efficiency Analysis
The retrieval efficiency is critically low with a signal-to-noise ratio of 0.04. The primary <h1> tag is deferred behind 451,339 characters of boilerplate code, and the page is burdened by 55 data islands totaling 101 KB. This technical noise significantly outweighs the 33 KB of visible text, forcing AI crawlers to process nearly 0.5 MB of data before reaching the main content.
AI Retrieval Impact
There is a severe risk of content truncation in RAG pipelines and LLM-based browsing sessions. The extreme H1 offset (451k characters) means the most relevant product data resides at the end of the context window for many models. Additionally, the 1,090 links in the header create significant token waste, diluting the importance of the actual product links.
Recommendation
1. Resolve the 403 Forbidden status for Bingbot at the WAF level to ensure parity across AI agents. 2. Implement DOM reordering to move the primary product content and <h1> above the 1,000+ header links in the SSR payload. 3. Externalize or compress the 100KB of JSON-LD data islands to improve the signal-to-noise ratio and reduce token consumption costs for AI retrieval.
Score Justification
The score reflects a site-wide conflict: while the page has high SSR integrity and a clear H1, it suffers from a 403 Bingbot blockade, extreme content deferral (451k offset), and an inefficient SNR of 0.04 that forces machine agents to process excessive noise.
https://www.homestoreandmore.ie/towels76 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/towels
FINAL DEST: https://www.homestoreandmore.ie/towels (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 4216
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (3.4%)
[FAIL] H1 Source Position (458,998 chars)
[FAIL] UI Interference
HTML SIZE: 957.1 KB
VISIBLE TEXT: 32.4 KB
DATA ISLANDS: 55 blocks (105.23 KB total, largest: 41.55 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner
04. Semantic Skeleton
URL SLUG: towels
H1 TAG: Bathroom Towels (96 products)
META TITLE: Bathroom Towels - Home Store + More
META DESC: Pull out all the stops to impress your guests with our range of bath towels, hand towels and facecloths. Wrap yourself in luxury with our best-selling Egyptian Cotton towels for supreme comfort. Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1477
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Listing Page (PLP) for 'Bathroom Towels' featuring 96 products. SSR integrity is high with a word count of 4,216, and the semantic skeleton matches the content. However, the document is excessively large at 980,117 characters for only 33KB of visible text.
Technical Access Assessment
The page exhibits a critical bot access disparity: while GPTBot and ClaudeBot are allowed (200), Bingbot is systematically blocked (403), confirming a site-wide WAF or server-level blockade that contradicts the robots.txt 'allow' directive. No redirect chains are present for this specific URL, but the site's overall infrastructure remains hostile to Bing-based AI services.
Retrieval Efficiency Analysis
Retrieval efficiency is severely compromised by a Signal-to-Noise Ratio (SNR) of 0.0338, meaning 96.6% of the payload is technical noise. The primary <h1> is buried at an offset of 458,998 characters, preceded by 1,090 internal header links and 105KB of JSON-LD data islands. This creates extreme content latency for machine readers.
AI Retrieval Impact
There is a high risk of context window truncation; many RAG pipelines and LLM agents may exhaust their token budget on the 460KB of boilerplate and 1,000+ header links before reaching the actual product list. The site-wide Bingbot block creates a total 'invisibility' state for any AI agents utilizing Microsoft's crawl infrastructure.
Recommendation
1. Resolve the WAF/CDN blockade for Bingbot to restore access for Copilot and Perplexity. 2. Restructure the DOM to move the <h1> and primary product grid above the 1,000+ header links to reduce the char_offset. 3. Externalize or compress the 55 JSON-LD data islands (105KB) to improve SNR. 4. Prune the global header links to reduce token waste.
Score Justification
The score reflects high SSR content availability for most bots but penalizes the page for the systemic Bingbot blockade and the extreme 458k character H1 offset, which forces AI systems to process nearly half a megabyte of noise before reaching content.
https://www.homestoreandmore.ie/all-bbqs/master-cook-classic-500-5-burner-gas-bbq/066033.html75 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/all-bbqs/master-cook-classic-500-5-burner-gas-bbq/066033.html
FINAL DEST: https://www.homestoreandmore.ie/gas-bbq/master-cook-classic-500-5-burner-gas-bbq/066033.html (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 1
→ [301] https://www.homestoreandmore.ie/all-bbqs/master-cook-classic-500-5-burner-gas-bbq/066033.html
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 3336
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 22
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (5.0%)
[FAIL] H1 Source Position (423,879 chars)
[FAIL] UI Interference
HTML SIZE: 480.6 KB
VISIBLE TEXT: 24.1 KB
DATA ISLANDS: 10 blocks (35.49 KB total, largest: 12.71 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner, pinchzoomcarouselmodal, promo_banner custom-label
04. Semantic Skeleton
URL SLUG: 066033.html
H1 TAG: Master Cook Classic 500 5 Burner Gas BBQ Master Cook Classic 500 5 Burner Gas BBQ Master Cook Search Results 0.00
META TITLE: Master Cook Classic 500 5 Burner Gas BBQ - Home Store + More
META DESC: The Master Cook Classic 500 5-Burner Gas BBQ is a top-of-the-line outdoor cooking appliance perfect for satisfying all your barbecue cravings. Boasting five sta Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1155
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a Product Detail Page (PDP) for a 'Master Cook Classic 500 5 Burner Gas BBQ'. While the SSR word count of 3,336 suggests a content-rich environment, the stripped word count of 578 reveals that the vast majority of the delivered 'text' is repetitive boilerplate or navigational noise. The H1 is present in the SSR, but it is polluted with extraneous UI strings such as 'Search Results 0.00', which can confuse entity extraction models.
Technical Access Assessment
The page exhibits a critical failure in bot parity; while GPTBot and ClaudeBot can access the page, Bingbot is met with a 403 Forbidden status, rendering the content invisible to Microsoft Copilot and AI agents utilizing Bing's infrastructure. This aligns with the site-wide blockade noted in the context. Additionally, a 301 redirect from the '/all-bbqs/' path to '/gas-bbq/' adds latency to the retrieval process, though the final destination resolves with stable infrastructure headers.
Retrieval Efficiency Analysis
The retrieval efficiency is among the lowest in the industry with a Signal-to-Noise Ratio (SNR) of 0.0502, meaning 95% of the data ingested by an AI is technical waste. The 'h1_char_offset' is catastrophic at 423,879 characters. An AI crawler must process nearly 424KB of code, including a massive header containing 1,090 internal links, before reaching the primary product identifier. This creates an extreme 'Context Window Budget' deficit.
AI Retrieval Impact
There is a severe risk of content truncation; many RAG pipelines and LLM chunkers with a 128k or 256k character limit will exhaust their context window on the header navigation and UI overlays (mask-overlay, search-overlay) before ever seeing the product description. The site-wide Bingbot block creates a total discovery gap for a significant segment of the AI ecosystem.
Recommendation
1. Resolve the WAF/CDN rule causing the 403 Forbidden for Bingbot to enable Copilot accessibility. 2. Drastically prune the header; 1,090 links per page is extreme and should be externalized or lazy-loaded to reduce SSR payload. 3. Prioritize content in the DOM; move the main product schema and H1 higher to reduce the character offset below 50,000. 4. Clean the H1 to remove non-semantic UI strings like 'Search Results'.
Score Justification
The score is heavily penalized by the site-wide Bingbot blockade and the extreme H1 character offset (423k+), which forces AI systems to process nearly half a megabyte of noise before reaching meaningful product data. The SNR of 0.05 indicates systemic token waste that threatens the integrity of any RAG-based retrieval.
https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.htm82 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.htm
FINAL DEST: https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.htm (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 2851
H1 FOUND IN SSR: NO
EMPTY SHELL: NONE
IMAGES WITH SRC: 10
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (5.0%)
[PASS] H1 Source Position (NOT FOUND)
[PASS] UI Interference
HTML SIZE: 418.2 KB
VISIBLE TEXT: 20.9 KB
DATA ISLANDS: 6 blocks (19.62 KB total, largest: 12.71 KB)
04. Semantic Skeleton
URL SLUG: 083594.htm
H1 TAG: MISSING
META TITLE: Sites-HomeStore-Site - Home Store + More
META DESC: Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1132
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
The page is a 'Soft 404' error masquerading as a successful 200 OK response. While the URL structure suggests a Berlinger Haus Knife Block Set product page, the actual SSR payload contains a 'Page or File Not Found' message. The high SSR word count (2,851) is misleading, as it consists entirely of template boilerplate and navigation links, with only 100 words of stripped content—none of which relates to the product entity.
Technical Access Assessment
The technical stack exhibits a critical failure in bot parity and status code integrity. Bingbot is explicitly blocked (403 Forbidden), creating a total blind spot for Microsoft-based AI tools, despite robots.txt claiming access. Furthermore, the server returns a 200 OK status for a page that does not exist, preventing AI crawlers from programmatically identifying this as a dead link and instead forcing them to process 428KB of noise to find an error message.
Retrieval Efficiency Analysis
There is a complete 'Entity Void' on this page. No <h1> tag exists (h1_char_offset: -1), the title tag is a generic site-wide string, and there is zero product metadata or descriptive text within the SSR payload. The 1,090 links in the header create a massive distraction, resulting in a signal-to-noise ratio of 0.0499 (95% code noise).
AI Retrieval Impact
AI retrieval is completely compromised. Any RAG system or LLM agent attempting to browse this URL will ingest an error message as 'content,' leading to hallucinations or retrieval failures. For Bing-based agents like Copilot, the page is technically unreachable. The extreme token waste (400k+ characters of boilerplate) ensures that even if content were present, it would be buried at the end of the context window.
Recommendation
1. Correct the Soft 404: Implement a 301 redirect from the .htm extension to the valid .html extension mentioned in the Site Context. 2. Unblock Bingbot at the WAF/CDN level to achieve bot parity. 3. Semantic Cleanup: Reduce the header link volume from 1,000+ to a manageable directory to improve signal-to-noise ratio and prevent context window overflow.
Score Justification
The page receives a low readability score due to the 'Soft 404' status, the systemic blockade of Bingbot, the total absence of semantic landmarks (H1), and an extreme token waste profile where 95% of the data is boilerplate noise.
https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.html?variantId=08359476 / 100
Technical Crawlability Audit Log
01. Network Access & Bot Parity
[PASS] AI Bot Access
REQUESTED: https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.html?variantId=083594
FINAL DEST: https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.html?variantId=083594 (HTTP 200)
GPTBOT STATUS: 200
CHATGPT-USER STATUS: 200
CLAUDEBOT STATUS: 200
GOOGLEBOT STATUS: 200
BINGBOT STATUS: 403
PERPLEXITYBOT STATUS: 200
APPLEBOT STATUS: 200
PATH HOPS: 0
02. Hydration & Content Existence
[PASS] SSR Content Exists
SSR WORD COUNT: 3232
H1 FOUND IN SSR: YES
EMPTY SHELL: NONE
IMAGES WITH SRC: 17
IMAGES LAZY-ONLY: 0
03. Retrieval Efficiency & Token Budget
[FAIL] Signal-to-Noise Ratio (5.1%)
[FAIL] H1 Source Position (408,825 chars)
[FAIL] UI Interference
HTML SIZE: 468.5 KB
VISIBLE TEXT: 23.9 KB
DATA ISLANDS: 9 blocks (29.98 KB total, largest: 12.71 KB)
BLOCKING ELEMENTS: headerv2 top-banner new-main-header , mask-overlay, search-overlay, global-banner, pinchzoomcarouselmodal
04. Semantic Skeleton
URL SLUG: 083594.html
H1 TAG: Berlinger Haus Smoked Wood Knife Block Set 8 Piece Berlinger Haus Smoked Wood Knife Block Set 8 Piece 083594 Berlinger Haus Search Results 0.00
META TITLE: Berlinger Haus Smoked Wood Knife Set 8 Piece - Home Store + More
META DESC: The Berlinger Haus Smoked Wood 8 Piece Knife Set is an excellent knife kit for everyone's cutlery drawer. The set includes a chef, utility, pairing, bread... Home Store + More
05. Infrastructure & Discovery
[PASS] Parent Path Stability (HTTP 200)
VARY HEADER: accept-encoding
CACHE CONTROL: no-cache, no-store, must-revalidate
X-ROBOTS-TAG: NONE
NAV VISIBLE IN SSR: YES
INTERNAL LINKS: 1155
06. Robots.txt Bot Access
GPTBOT: PASS
CHATGPT-USER: PASS
CLAUDEBOT: PASS
GOOGLEBOT: PASS
BINGBOT: PASS
PERPLEXITYBOT: PASS
APPLEBOT: PASS
Page Existence & SSR Integrity
This is a Product Detail Page (PDP) for the 'Berlinger Haus Smoked Wood Knife Block Set 8 Piece'. The content is successfully delivered via Server-Side Rendering (SSR) with a robust word count of 3,232, ensuring the core product description, features, and specifications are physically present in the initial HTML payload without requiring JavaScript execution. The semantic skeleton correctly identifies the primary entity, though the slug includes a legacy .html extension.
Technical Access Assessment
The page exhibits a critical failure in bot parity: Bingbot is served a 403 Forbidden status, creating a total blind spot for Microsoft Copilot and Perplexity. This contradicts the robots.txt 'allow' directive, indicating a misconfigured WAF or server-level blockade. While other major AI bots (GPTBot, ClaudeBot) have access, the site-wide pattern of blocking Bingbot remains a significant hurdle for multi-platform AI retrieval. Infrastructure is stable (200 OK on parent paths), but the reliance on 1,155 internal links—1,090 of which are in the header—creates massive crawl friction.
Retrieval Efficiency Analysis
The 'Context Window Budget' is severely compromised by extreme content deferral. The primary H1 tag is buried behind 408,825 characters of boilerplate code, navigation links, and UI overlays. With a Signal-to-Noise Ratio (SNR) of 0.05, 95% of the page weight is technical noise. An AI system must process nearly 400KB of data before reaching the first meaningful product identifier, which is an inefficient use of token limits for RAG-based extraction.
AI Retrieval Impact
The primary risk is Truncation and Context Waste. Many LLM-based scrapers or RAG pipelines have a pre-extraction limit; a 400k character offset risks the meaningful product data being discarded or truncated. Furthermore, the 1,000+ header links dilute the 'relevance' of the page, forcing the AI to expend compute resources filtering out irrelevant navigation to find the product 'features' and 'safety' labels hidden at the bottom of the payload.
Recommendation
First, resolve the Bingbot 403 blockade at the WAF level to ensure parity across all major AI crawlers. Second, prioritize 'Content Above Boilerplate' by restructuring the DOM to place the product description and H1 within the first 50KB of the HTML payload. Third, externalize or prune the 1,090 header links into a dynamic or deferred menu to improve the Signal-to-Noise Ratio and prevent context window overflow.
Score Justification
While the page provides excellent SSR content and a clear H1, it is penalized for the systemic blockade of Bingbot, an extremely low SNR (5%), and an excessive H1 char offset (400k+) that forces AI systems to process nearly half a megabyte of noise before identifying the primary content.
Priority Actions
Resolve Site-wide Bingbot 403 Blockade
Low
Why This Is Priority
The page exhibits a critical bot parity failure; Bingbot is met with a 403 Forbidden status while other AI agents are allowed 200 access. This confirms the site-wide WAF blockade identified in the Site Context remains active here.
Action
Priority 1: Synchronize WAF policies with robots.txt to allow Bingbot access, ending the 403 blockade.
Expected Outcome
Eliminate the total invisibility risk for Microsoft Copilot and Perplexity and ensure parity across AI agents.
Source
https://www.homestoreandmore.ie/, https://www.homestoreandmore.ie/stores, https://www.homestoreandmore.ie/shop-by-department
Refactor Bloated Global Header Navigation
High
Why This Is Priority
A massive internal link count (1,278 total) is heavily weighted in the header (1,090 links), forcing AI crawlers to process nearly 400KB of navigation data before reaching the primary content.
Action
Priority 3: Refactor the header to externalize the 1,090 links or utilize a more efficient navigation structure to improve the signal-to-noise ratio above 0.10.
Expected Outcome
Improve retrieval efficiency and prevent RAG pipeline chunkers from exhausting token budgets on boilerplate navigation.
Source
https://www.homestoreandmore.ie/, https://www.homestoreandmore.ie/bathroom-bins, https://www.homestoreandmore.ie/bathroom-accessories
Inject Semantic H1 Anchors on Hub Pages
Low
Why This Is Priority
The page is semantically crippled as it lacks an <h1> tag entirely in the SSR payload, forcing AI systems to rely on meta titles for intent.
Action
Priority 2: Inject a descriptive <h1> into the SSR payload to provide a semantic anchor.
Expected Outcome
Establish a content anchor and assist in automated context window partitioning for LLM-based retrievers.
Source
https://www.homestoreandmore.ie/, https://www.homestoreandmore.ie/stores, https://www.homestoreandmore.ie/specials
Mitigate Extreme Content Deferral (H1 Offset)
Medium
Why This Is Priority
The primary <h1> is deferred by 502,787 characters, requiring an AI to process over 500KB of metadata, 55 data islands (104KB), and 1,090 header links before reaching the main content identifier.
Action
Priority 2: Move the main content block and <h1> higher in the DOM to reduce the 500KB+ character offset.
Expected Outcome
Reduce the risk of content truncation in LLM-based RAG pipelines where actual store addresses or product details may be discarded.
Source
https://www.homestoreandmore.ie/clearance-nav, https://www.homestoreandmore.ie/bathroom-bins, https://www.homestoreandmore.ie/bathroom-scales
Optimize Signal-to-Noise Ratio (SNR)
High
Why This Is Priority
Retrieval efficiency is extremely poor with a signal-to-noise ratio of 0.046, indicating that over 95% of the 504,054 characters transmitted are technical noise or boilerplate.
Action
Drastically prune the header link bloat; 1090 links per page is a critical efficiency failure.
Expected Outcome
Lower the compute cost for AI retrieval and ensure meaningful product data is reached within standard context limits.
Source
https://www.homestoreandmore.ie/, https://www.homestoreandmore.ie/shop-by-department, https://www.homestoreandmore.ie/toilet-brushes
Externalize JSON-LD Data Island Bloat
Medium
Why This Is Priority
The page carries 95KB of data islands, nearly triple the volume of the actual visible text (32KB), creating significant token waste for LLM-based extractors.
Action
Tertiary: Externalize or prune the 95KB of data islands and reduce the header link density to improve token efficiency and signal-to-noise ratio.
Expected Outcome
Improve token efficiency and minimize the JSON-LD footprint in the primary SSR payload.
Source
https://www.homestoreandmore.ie/toilet-brushes, https://www.homestoreandmore.ie/bathroom-mirrors, https://www.homestoreandmore.ie/towels
Implement Main Landmark Elements
Low
Why This Is Priority
The crawler finds no <main> or <h1> to use as a content anchor, forcing it to process 1,000+ header links before reaching departmental content.
Action
Wrap departmental content in a <main> landmark to assist bots in skipping boilerplate navigation.
Expected Outcome
Provide a clear content anchor for machine extractors, bypassing the massive header boilerplate.
Source
https://www.homestoreandmore.ie/stores, https://www.homestoreandmore.ie/bathroom-menu
Remediate Soft 404 and Status Code Integrity
Low
Why This Is Priority
The server returns a 200 OK status for a page that does not exist, preventing AI crawlers from programmatically identifying this as a dead link and instead forcing them to process 428KB of noise to find an error message.
Action
Correct the Soft 404: Implement a 301 redirect from the .htm extension to the valid .html extension mentioned in the Site Context.
Expected Outcome
Prevent AI retrieval failures and hallucinations caused by ingesting error messages as content.
Source
https://www.homestoreandmore.ie/chopping-boards-kitchen-knives/berlinger-haus-smoked-wood-knife-block-set-8-piece/083594.htm
Sanitize Product H1 Semantic Content
Low
Why This Is Priority
The H1 is present in the SSR, but it is polluted with extraneous UI strings such as 'Search Results 0.00', which can confuse entity extraction models.
Action
Clean the H1 to remove non-semantic UI strings like 'Search Results'.
Expected Outcome
Prevent confusion for entity extraction models and ensure accurate product identification.
Source
https://www.homestoreandmore.ie/all-bbqs/master-cook-classic-500-5-burner-gas-bbq/066033.html
Standardize AI Bot-Specific Header Delivery
Medium
Why This Is Priority
The excessive internal links create a discovery bottleneck, diluting crawl priority for specific product URLs.
Action
Implement a 'lite' header for bots to reduce the 1,090-link boilerplate waste.
Expected Outcome
Reduce token waste per page hit and improve the signal-to-noise ratio for machine readers.
Source
https://www.homestoreandmore.ie/clearance-nav, https://www.homestoreandmore.ie/bathroom-bins, https://www.homestoreandmore.ie/bathroom-mats