Facility: 000340

All One Storage - Waukesha

Stale Data Warning: This facility has not been successfully scraped in 26 days (threshold: 3 days). Data may be outdated.
Facility Information active
Facility ID
000340
Name
All One Storage - Waukesha
URL
https://www.allonestorage.com/waukesha-willmont/
Address
100 Wilmont Dr, Waukesha, WI 53189, USA, Waukesha, Wisconsin 53189
Platform
custom_facility_000340
Parser File
src/parsers/custom/facility_000340_parser.py
Last Scraped
2026-03-27 13:51:11.172272
Created
2026-03-14 16:21:53.706708
Updated
2026-03-27 13:51:11.200555
Parser & Healing Diagnosis working
Parser Status
✓ Working
Status Reason
N/A
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_000340_parser.py)
"""Parser for All One Storage - Waukesha."""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility000340Parser(BaseParser):
    """Extract storage units from All One Storage - Waukesha using regex fallback."""

    platform = "custom_facility_000340"

    _UNIT_RE = re.compile(
        r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)"
        r"[^\$]{0,100}"
        r"\$(\d[\d,.]*)",
        re.DOTALL,
    )

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        # Remove script/style noise
        for tag in soup.find_all(["script", "style"]):
            tag.decompose()

        body_text = soup.get_text(separator="\n")

        seen = set()
        for m in self._UNIT_RE.finditer(body_text):
            size_text = m.group(1).strip()
            price_text = m.group(2).strip()

            key = (size_text, price_text)
            if key in seen:
                continue
            seen.add(key)

            unit = UnitResult()
            unit.size = size_text
            w, ln, sq = self.normalize_size(size_text)
            if w is not None:
                unit.metadata = {"width": w, "length": ln, "sqft": sq}
            unit.price = self.normalize_price(price_text)
            unit.description = m.group(0).strip()[:200]

            if unit.size or unit.price:
                result.units.append(unit)

        if not result.units:
            result.warnings.append("No units found via regex")

        return result

Scrape Runs (5)

Run #1801 Details

Status
exported
Parser Used
Facility000340Parser
Platform Detected
storageunitsoftware
Units Found
1
Stage Reached
exported
Timestamp
2026-03-27 13:51:07.785863
Timing
Stage Duration
Fetch3242ms
Detect9ms
Parse5ms
Export17ms

Snapshot: 000340_20260327T135111Z.html · Show Snapshot · Open in New Tab

Parsed Units (1)

10x15

$40.00/mo

← Back to dashboard