Facility: 009373

U-Haul Caldwell ID

Stale Data Warning: This facility has not been successfully scraped in 30 days (threshold: 3 days). Data may be outdated.
Facility Information active
Facility ID
009373
Name
U-Haul Caldwell ID
URL
https://www.uhaul.com/Locations/Self-Storage-near-Caldwell-ID-83607/1022954/
Address
N/A
Platform
custom_facility_009373
Parser File
src/parsers/custom/facility_009373_parser.py
Last Scraped
2026-03-23 03:17:03.792100
Created
2026-03-06 23:45:35.865957
Updated
2026-03-23 03:17:03.811688
Parser & Healing Diagnosis working
Parser Status
✓ Working
Status Reason
N/A
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_009373_parser.py)
"""Parser for U-Haul self-storage location (Caldwell, ID).

The U-Haul location page uses a Foundation CSS grid layout with unit
cards inside div.grid-x.grid-margin-x containers. Each card has:
  - Size in span.nowrap (e.g. "5' x 5' x 9'")
  - Price in b.text-xl (e.g. "$45.00")
  - Description in p.medium-collapse span
"""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility009373Parser(BaseParser):
    """Extract storage units from U-Haul location page.

    Units are listed in Foundation grid cards with size and per-month rate.
    """

    platform = "custom_facility_009373"

    # Matches "5' x 5' x 9'" or "10' x 10'" — captures first two dims
    _SIZE_RE = re.compile(
        r"(\d+(?:\.\d+)?)'?\s*x\s*(\d+(?:\.\d+)?)'?(?:\s*x\s*\d+(?:\.\d+)?'?)?",
        re.IGNORECASE,
    )

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        seen = set()

        for card in soup.find_all("div", class_="grid-x"):
            text = card.get_text(strip=True)
            if "Rate" not in text and "$" not in text:
                continue

            # Size from span.nowrap
            size_span = card.find("span", class_="nowrap")
            if not size_span:
                continue
            size_text = size_span.get_text(strip=True)

            # Price from bold tag inside dl
            price_tag = card.find("b", class_="text-xl")
            if not price_tag:
                continue
            price_text = price_tag.get_text(strip=True)

            # Deduplicate (page repeats cards for mobile/desktop)
            key = (size_text, price_text)
            if key in seen:
                continue
            seen.add(key)

            price = self.normalize_price(price_text)

            # Parse dimensions
            m = self._SIZE_RE.match(size_text)
            metadata: dict = {}
            if m:
                width = float(m.group(1))
                length = float(m.group(2))
                metadata = {"width": width, "length": length, "sqft": width * length}

            # Description from paragraph
            desc_p = card.find("p", class_=re.compile(r"medium-collapse"))
            description = desc_p.get_text(strip=True) if desc_p else ""

            unit = UnitResult(
                size=size_text,
                price=price,
                description=description or size_text,
                metadata=metadata,
            )
            result.units.append(unit)

        if not result.units:
            result.warnings.append("No unit grid cards found on page")

        return result

Scrape Runs (4)

Run #1451 Details

Status
exported
Parser Used
Facility009373Parser
Platform Detected
storageunitsoftware
Units Found
5
Stage Reached
exported
Timestamp
2026-03-23 03:16:57.390615
Timing
Stage Duration
Fetch6154ms
Detect147ms
Parse76ms
Export11ms

Snapshot: 009373_20260323T031703Z.html · Show Snapshot · Open in New Tab

Parsed Units (5)

5' x 5' x 9'

$45.00/mo

10' x 5' x 9'

$60.00/mo

5' x 10' x 9'

$60.00/mo

10' x 10' x 9'

$90.00/mo

10' x 15' x 0'

$75.00/mo

← Back to dashboard