Facility: 009373
U-Haul Caldwell ID
- Facility ID
- 009373
- Name
- U-Haul Caldwell ID
- URL
- https://www.uhaul.com/Locations/Self-Storage-near-Caldwell-ID-83607/1022954/
- Address
- N/A
- Platform
- custom_facility_009373
- Parser File
- src/parsers/custom/facility_009373_parser.py
- Last Scraped
- 2026-03-23 03:17:03.792100
- Created
- 2026-03-06 23:45:35.865957
- Updated
- 2026-03-23 03:17:03.811688
- Parser Status
- ✓ Working
- Status Reason
- N/A
- Last Healing Attempt
- Not attempted
Parser Source (src/parsers/custom/facility_009373_parser.py)
"""Parser for U-Haul self-storage location (Caldwell, ID).
The U-Haul location page uses a Foundation CSS grid layout with unit
cards inside div.grid-x.grid-margin-x containers. Each card has:
- Size in span.nowrap (e.g. "5' x 5' x 9'")
- Price in b.text-xl (e.g. "$45.00")
- Description in p.medium-collapse span
"""
from __future__ import annotations
import re
from bs4 import BeautifulSoup
from src.parsers.base import BaseParser, ParseResult, UnitResult
class Facility009373Parser(BaseParser):
"""Extract storage units from U-Haul location page.
Units are listed in Foundation grid cards with size and per-month rate.
"""
platform = "custom_facility_009373"
# Matches "5' x 5' x 9'" or "10' x 10'" — captures first two dims
_SIZE_RE = re.compile(
r"(\d+(?:\.\d+)?)'?\s*x\s*(\d+(?:\.\d+)?)'?(?:\s*x\s*\d+(?:\.\d+)?'?)?",
re.IGNORECASE,
)
def parse(self, html: str, url: str = "") -> ParseResult:
soup = BeautifulSoup(html, "lxml")
result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)
seen = set()
for card in soup.find_all("div", class_="grid-x"):
text = card.get_text(strip=True)
if "Rate" not in text and "$" not in text:
continue
# Size from span.nowrap
size_span = card.find("span", class_="nowrap")
if not size_span:
continue
size_text = size_span.get_text(strip=True)
# Price from bold tag inside dl
price_tag = card.find("b", class_="text-xl")
if not price_tag:
continue
price_text = price_tag.get_text(strip=True)
# Deduplicate (page repeats cards for mobile/desktop)
key = (size_text, price_text)
if key in seen:
continue
seen.add(key)
price = self.normalize_price(price_text)
# Parse dimensions
m = self._SIZE_RE.match(size_text)
metadata: dict = {}
if m:
width = float(m.group(1))
length = float(m.group(2))
metadata = {"width": width, "length": length, "sqft": width * length}
# Description from paragraph
desc_p = card.find("p", class_=re.compile(r"medium-collapse"))
description = desc_p.get_text(strip=True) if desc_p else ""
unit = UnitResult(
size=size_text,
price=price,
description=description or size_text,
metadata=metadata,
)
result.units.append(unit)
if not result.units:
result.warnings.append("No unit grid cards found on page")
return result
Scrape Runs (4)
-
exported Run #14512026-03-23 03:16:57.390615 | 5 units | Facility009373Parser | View Data →
-
exported Run #9582026-03-21 19:09:44.817953 | 5 units | Facility009373Parser | View Data →
-
exported Run #5112026-03-14 16:52:43.387009 | 5 units | Facility009373Parser | View Data →
-
exported Run #1092026-03-14 01:03:57.806442 | 5 units | Facility009373Parser | View Data →
Run #1451 Details
- Status
- exported
- Parser Used
- Facility009373Parser
- Platform Detected
- storageunitsoftware
- Units Found
- 5
- Stage Reached
- exported
- Timestamp
- 2026-03-23 03:16:57.390615
Timing
| Stage | Duration |
|---|---|
| Fetch | 6154ms |
| Detect | 147ms |
| Parse | 76ms |
| Export | 11ms |
Snapshot: 009373_20260323T031703Z.html · Show Snapshot · Open in New Tab
Parsed Units (5)
5' x 5' x 9'
$45.00/mo
10' x 5' x 9'
$60.00/mo
5' x 10' x 9'
$60.00/mo
10' x 10' x 9'
$90.00/mo
10' x 15' x 0'
$75.00/mo