Facility: 094794

B&H Mini-Storage

Stale Data Warning: This facility has not been successfully scraped in 26 days (threshold: 3 days). Data may be outdated.
Facility Information active
Facility ID
094794
Name
B&H Mini-Storage
URL
https://www.bhministorage.com/
Address
8305 Seippel Rd, Dubuque, IA 52002, USA, Dubuque, Iowa 52002
Platform
custom_facility_094794
Parser File
src/parsers/custom/facility_094794_parser.py
Last Scraped
2026-03-27 13:46:23.284134
Created
2026-03-20 23:23:10.660695
Updated
2026-03-27 13:46:23.311181
Parser & Healing Diagnosis working
Parser Status
✓ Working
Status Reason
N/A
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_094794_parser.py)
"""Parser for B&H Mini-Storage."""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility094794Parser(BaseParser):
    """Extract storage units from B&H Mini-Storage."""

    platform = "custom_facility_094794"

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        # Unit sizes are in an h4 element containing "UNIT SIZES:" followed by
        # concatenated size strings like "10' X 10'10' X 15'10' X 20'10' x 30'"
        # Use 1-2 digit constraint to split concatenated sizes.
        size_pattern = re.compile(r"(\d{1,2})['\u2019]?\s*[xX]\s*(\d{1,2})['\u2019]?(?=\d|$|\s|')")

        for h4 in soup.find_all("h4"):
            text = h4.get_text(strip=True)
            if "UNIT SIZES" in text.upper():
                # Extract all size pairs from the concatenated text
                seen_sizes = set()
                for match in size_pattern.finditer(text):
                    size_text = f"{match.group(1)}' x {match.group(2)}'"
                    if size_text not in seen_sizes:
                        seen_sizes.add(size_text)
                        unit = UnitResult()
                        unit.size = size_text
                        w, ln, sq = self.normalize_size(size_text)
                        if w is not None:
                            unit.metadata = {"width": w, "length": ln, "sqft": sq}
                        result.units.append(unit)
                break

        if not result.units:
            result.warnings.append("No units found on page")

        return result

Scrape Runs (4)

Run #1108 Details

Status
exported
Parser Used
Facility094794Parser
Platform Detected
table_layout
Units Found
4
Stage Reached
exported
Timestamp
2026-03-23 02:47:48.539300
Timing
Stage Duration
Fetch5070ms
Detect35ms
Parse22ms
Export5ms

Snapshot: 094794_20260323T024753Z.html · Show Snapshot · Open in New Tab

Parsed Units (4)

10' x 10'

No price

10' x 15'

No price

10' x 20'

No price

10' x 30'

No price

← Back to dashboard