Facility 017201 - Facility Scrapers

Stale Data Warning: This facility has not been successfully scraped in 81 days (threshold: 3 days). Data may be outdated.

Facility Information active

Facility ID: 017201
Name: Blue Mountain Mini Storage
URL: https://www.bluemountainministorage.com/

Address: N/A
Platform: custom_facility_017201
Parser File: src/parsers/custom/facility_017201_parser.py

Last Scraped: 2026-03-23 03:16:32.429981
Created: 2026-03-06 23:45:35.865957
Updated: 2026-03-23 03:16:32.437681

Parser & Healing Diagnosis working

Parser Status: ✓ Working
Status Reason: N/A

Last Healing Attempt: Not attempted

Parser Source (src/parsers/custom/facility_017201_parser.py)

"""Parser for Blue Mountain Mini Storage (Walla Walla, WA).

This site (built on Duda) lists storage unit sizes as a plain HTML
unordered list under a "Storage unit sizes:" heading. No pricing data
is published on the page — only the available size options are shown.
"""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility017201Parser(BaseParser):
    """Extract storage unit sizes from Blue Mountain Mini Storage.

    The page contains a ``<ul class="defaultList bullet">`` element whose
    ``<li>`` children each hold a dimension string in the format
    ``5' x 10'``.  No pricing is published on the page.
    """

    platform = "custom_facility_017201"

    # Matches dimension strings like "5' x 10'" or "10' x 28'"
    _SIZE_RE = re.compile(
        r"(\d+(?:\.\d+)?)['\u2019\u2032]?\s*[xX\u00d7]\s*(\d+(?:\.\d+)?)['\u2019\u2032]?",
    )

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        # Locate the "Storage unit sizes:" heading then grab the sibling <ul>
        sizes_heading = soup.find(string=re.compile(r"Storage unit sizes", re.IGNORECASE))
        unit_list = None
        if sizes_heading:
            # Walk up to the containing <p> or block element, then find next <ul>
            parent = sizes_heading.find_parent(["p", "div", "li", "span"])
            if parent:
                # Search the parent's parent for a sibling <ul>
                container = parent.parent
                if container:
                    unit_list = container.find("ul")

        # Fallback: any <ul> on the page whose <li> items look like dimensions
        if not unit_list:
            for ul in soup.find_all("ul"):
                items = ul.find_all("li")
                if items and self._SIZE_RE.search(items[0].get_text()):
                    unit_list = ul
                    break

        if not unit_list:
            result.warnings.append("No unit size list found on page")
            return result

        result.warnings.append("No pricing data available on this page — sizes only")

        for li in unit_list.find_all("li"):
            size_text = li.get_text(strip=True)
            match = self._SIZE_RE.search(size_text)
            if not match:
                continue

            width = float(match.group(1))
            length = float(match.group(2))
            sqft = width * length

            unit = UnitResult(
                size=f"{int(width)}' x {int(length)}'",
                description=size_text,
                metadata={"width": width, "length": length, "sqft": sqft},
            )
            result.units.append(unit)

        if not result.units:
            result.warnings.append("Size list found but no dimension patterns matched")

        return result

Stage	Duration
Fetch	3102ms
Detect	31ms
Parse	16ms
Export	5ms

Facility: 017201

Scrape Runs (4)

Run #1444 Details

Parsed Units (6)

5' x 10'

8' x 12'

8' x 20'

10' x 12'

10' x 20'

10' x 28'

HTML Snapshot — Run #1444