Facility: 017201
Blue Mountain Mini Storage
- Facility ID: 017201
- Name: Blue Mountain Mini Storage
- URL: https://www.bluemountainministorage.com/
- Address: N/A
- Platform: custom_facility_017201
- Parser File: src/parsers/custom/facility_017201_parser.py
- Last Scraped: 2026-03-23 03:16:32.429981
- Created: 2026-03-06 23:45:35.865957
- Updated: 2026-03-23 03:16:32.437681
- Parser Status: ✓ Working
- Status Reason: N/A
- Last Healing Attempt: Not attempted
Parser Source (src/parsers/custom/facility_017201_parser.py)
"""Parser for Blue Mountain Mini Storage (Walla Walla, WA).

This site (built on Duda) lists storage unit sizes as a plain HTML
unordered list under a "Storage unit sizes:" heading. No pricing data
is published on the page; only the available size options are shown.
"""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility017201Parser(BaseParser):
    """Extract storage unit sizes from Blue Mountain Mini Storage.

    The page contains a ``<ul class="defaultList bullet">`` element whose
    ``<li>`` children each hold a dimension string in the format
    ``5' x 10'``. No pricing is published on the page.
    """

    platform = "custom_facility_017201"

    # Matches dimension strings like "5' x 10'" or "10' x 28'"
    _SIZE_RE = re.compile(
        r"(\d+(?:\.\d+)?)['\u2019\u2032]?\s*[xX\u00d7]\s*(\d+(?:\.\d+)?)['\u2019\u2032]?",
    )

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        # Locate the "Storage unit sizes:" heading, then grab the nearby <ul>
        sizes_heading = soup.find(string=re.compile(r"Storage unit sizes", re.IGNORECASE))
        unit_list = None
        if sizes_heading:
            # Walk up to the containing <p> or other enclosing element
            parent = sizes_heading.find_parent(["p", "div", "li", "span"])
            if parent:
                # Take the first <ul> inside the element that encloses the heading block
                container = parent.parent
                if container:
                    unit_list = container.find("ul")

        # Fallback: any <ul> on the page whose first <li> looks like a dimension
        if not unit_list:
            for ul in soup.find_all("ul"):
                items = ul.find_all("li")
                if items and self._SIZE_RE.search(items[0].get_text()):
                    unit_list = ul
                    break

        if not unit_list:
            result.warnings.append("No unit size list found on page")
            return result

        result.warnings.append("No pricing data available on this page — sizes only")

        for li in unit_list.find_all("li"):
            size_text = li.get_text(strip=True)
            match = self._SIZE_RE.search(size_text)
            if not match:
                continue
            width = float(match.group(1))
            length = float(match.group(2))
            sqft = width * length
            unit = UnitResult(
                # :g drops a trailing ".0" but keeps fractional sizes such as 7.5,
                # which int() would have truncated
                size=f"{width:g}' x {length:g}'",
                description=size_text,
                metadata={"width": width, "length": length, "sqft": sqft},
            )
            result.units.append(unit)

        if not result.units:
            result.warnings.append("Size list found but no dimension patterns matched")

        return result
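
The size-matching step can be tried on its own, without the project's `BaseParser` or BeautifulSoup. The following is a minimal sketch that reuses the same regular expression; the helper name `parse_size` and the sample strings are hypothetical and are not part of the parser file.

```python
import re
from typing import Optional

# Same pattern as the parser's _SIZE_RE: a width, a separator (x, X or ×),
# then a length, each optionally followed by a straight or typographic
# foot mark.
SIZE_RE = re.compile(
    r"(\d+(?:\.\d+)?)['\u2019\u2032]?\s*[xX\u00d7]\s*(\d+(?:\.\d+)?)['\u2019\u2032]?"
)


def parse_size(text: str) -> Optional[dict]:
    """Return width, length and square footage for one list item, or None."""
    match = SIZE_RE.search(text)
    if match is None:
        return None
    width = float(match.group(1))
    length = float(match.group(2))
    return {"width": width, "length": length, "sqft": width * length}


# Hypothetical inputs: a plain size, a typographic variant, a fractional
# width, and a list item that carries no dimensions at all.
samples = ["5' x 10'", "10\u2019 \u00d7 28\u2019", "7.5'x10'", "Call for availability"]
for sample in samples:
    print(repr(sample), "->", parse_size(sample))
```

Items that do not match return `None`, which mirrors the parser skipping such `<li>` elements with `continue`.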
Scrape Runs (4)
- exported | Run #1444 | 2026-03-23 03:16:29.250696 | 6 units | Facility017201Parser
- exported | Run #951 | 2026-03-21 19:09:15.893656 | 6 units | Facility017201Parser
- exported | Run #504 | 2026-03-14 16:51:56.186610 | 6 units | Facility017201Parser
- exported | Run #102 | 2026-03-14 01:03:13.937690 | 6 units | Facility017201Parser
Run #1444 Details
- Status: exported
- Parser Used: Facility017201Parser
- Platform Detected: table_layout
- Units Found: 6
- Stage Reached: exported
- Timestamp: 2026-03-23 03:16:29.250696
Timing
| Stage | Duration |
|---|---|
| Fetch | 3102ms |
| Detect | 31ms |
| Parse | 16ms |
| Export | 5ms |
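
The stage durations above can be totalled to see where the run spent its time. This is a small sketch that assumes the four listed stages account for the whole run; the variable names are illustrative only.

```python
# Stage durations in milliseconds, copied from the Timing table for run #1444.
stage_ms = {"Fetch": 3102, "Detect": 31, "Parse": 16, "Export": 5}

total_ms = sum(stage_ms.values())
# Fraction of the total spent in each stage.
shares = {stage: ms / total_ms for stage, ms in stage_ms.items()}

for stage, ms in stage_ms.items():
    print(f"{stage:<7}{ms:>5} ms  {shares[stage]:6.1%}")
print(f"{'Total':<7}{total_ms:>5} ms")
```

Under that assumption the total is 3154 ms, with the fetch taking roughly 98% of it and parsing well under 1%.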
Snapshot: 017201_20260323T031632Z.html
Parsed Units (6)
| Size | Price |
|---|---|
| 5' x 10' | No price |
| 8' x 12' | No price |
| 8' x 20' | No price |
| 10' x 12' | No price |
| 10' x 20' | No price |
| 10' x 28' | No price |