Facility: 001526

Good Guys Storage Llc

Stale Data Warning: This facility has not been successfully scraped in 26 days (threshold: 3 days). Data may be outdated.
Facility Information active
Facility ID
001526
Name
Good Guys Storage Llc
URL
https://www.loc8nearme.com/missouri/fenton/good-guys-storage-llc/4161372/
Address
1031 Gravois Rd, Fenton, MO 63026, USA, Fenton, Missouri 63026
Platform
custom_facility_001526
Parser File
src/parsers/custom/facility_001526_parser.py
Last Scraped
2026-03-27 13:39:24.336169
Created
2026-03-23 02:35:08.816820
Updated
2026-03-27 13:39:24.336169
Parser & Healing Diagnosis needs_fix
Parser Status
⚠ Needs Fix
Status Reason
Parser returned 0 units
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_001526_parser.py)
"""Parser for Good Guys Storage Llc (loc8nearme.com directory listing)."""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility001526Parser(BaseParser):
    """Extract storage units from Good Guys Storage Llc.

    The facility URL points to a loc8nearme.com directory listing page,
    not a direct facility website.  These pages sometimes contain
    embedded unit/pricing tables; this parser looks for them and falls
    back gracefully when the page only has business-directory info.
    """

    platform = "custom_facility_001526"

    _SIZE_PRICE_RE = re.compile(
        r"(\d+)\s*['\u2019\u2032]?\s*[xX\u00d7]\s*(\d+)\s*['\u2019\u2032]?"
        r"[^\$]{0,120}"
        r"\$\s*(\d[\d,.]*)",
        re.DOTALL,
    )

    _PRICE_SIZE_RE = re.compile(
        r"\$\s*(\d[\d,.]*)"
        r".{0,120}"
        r"(\d+)\s*['\u2019\u2032]?\s*[xX\u00d7]\s*(\d+)\s*['\u2019\u2032]?",
        re.DOTALL,
    )

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(
            platform=self.platform,
            parser_name=self.__class__.__name__,
        )

        for tag in soup.find_all(["script", "style"]):
            tag.decompose()

        body_text = soup.get_text(separator="\n")

        # Detect if the facility is marked as closed
        if re.search(r"(?i)temporarily\s+closed", body_text):
            result.warnings.append("Facility is listed as TEMPORARILY CLOSED")

        # Detect directory-only pages with no pricing content
        is_directory = bool(
            re.search(r"(?i)loc8nearme\.com", html)
            or re.search(r"(?i)hours,?\s*directions,?\s*reviews", body_text)
        )

        seen: set[tuple[str, str]] = set()

        # Try size-then-price pattern
        for m in self._SIZE_PRICE_RE.finditer(body_text):
            w_str, l_str, price_str = m.group(1), m.group(2), m.group(3)
            size_text = f"{w_str}x{l_str}"
            key = (size_text, price_str)
            if key in seen:
                continue
            seen.add(key)

            unit = UnitResult()
            unit.size = size_text
            w, ln, sq = self.normalize_size(size_text)
            if w is not None:
                unit.metadata = {"width": w, "length": ln, "sqft": sq}
            unit.price = self.normalize_price(price_str)
            unit.description = m.group(0).strip()[:200]
            if unit.size or unit.price:
                result.units.append(unit)

        # Try price-then-size pattern
        if not result.units:
            for m in self._PRICE_SIZE_RE.finditer(body_text):
                price_str, w_str, l_str = m.group(1), m.group(2), m.group(3)
                size_text = f"{w_str}x{l_str}"
                key = (size_text, price_str)
                if key in seen:
                    continue
                seen.add(key)

                unit = UnitResult()
                unit.size = size_text
                w, ln, sq = self.normalize_size(size_text)
                if w is not None:
                    unit.metadata = {"width": w, "length": ln, "sqft": sq}
                unit.price = self.normalize_price(price_str)
                unit.description = m.group(0).strip()[:200]
                if unit.size or unit.price:
                    result.units.append(unit)

        if not result.units:
            if is_directory:
                result.warnings.append(
                    "Page is a loc8nearme.com directory listing with no unit pricing data"
                )
            else:
                result.warnings.append("No units found")

        return result

Scrape Runs (3)

Run #1018 Details

Status
exported
Parser Used
Facility001526Parser
Platform Detected
table_layout
Units Found
0
Stage Reached
exported
Timestamp
2026-03-23 02:38:40.704471
Timing
Stage Duration
Fetch3829ms
Detect40ms
Parse79ms
Export6ms

Snapshot: 001526_20260323T023844Z.html · Show Snapshot · Open in New Tab

No units found in this run.

All Failures for this Facility (3)

parse _WarningAsException scraper no_units_extracted warning Run #N/A | 2026-03-27 13:39:24.317788

No units extracted for 001526

Stack trace
src.reporting.failure_reporter._WarningAsException: No units extracted for 001526
parse _WarningAsException scraper no_units_extracted warning Run #N/A | 2026-03-27 13:39:21.809305

No units extracted for 001526

Stack trace
src.reporting.failure_reporter._WarningAsException: No units extracted for 001526
parse _WarningAsException scraper no_units_extracted warning Run #N/A | 2026-03-23 02:38:44.675750

No units extracted for 001526

Stack trace
src.reporting.failure_reporter._WarningAsException: No units extracted for 001526

← Back to dashboard