Facility: 000819

58th Avenue Mini Storage

Stale Data Warning: This facility has not been successfully scraped in 26 days (threshold: 3 days). Data may be outdated.
Facility Information active
Facility ID
000819
Name
58th Avenue Mini Storage
URL
http://www.yakimaministorageproperties.com/#!properties/ctzx
Address
101 N 58th Ave, Yakima, WA 98908, USA, Yakima, Washington 98908
Platform
custom_facility_000819
Parser File
src/parsers/custom/facility_000819_parser.py
Last Scraped
2026-03-27 13:52:53.329830
Created
2026-03-14 16:21:53.706708
Updated
2026-03-27 13:52:53.361447
Parser & Healing Diagnosis working
Parser Status
✓ Working
Status Reason
N/A
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_000819_parser.py)
"""Parser for 58th Avenue Mini Storage."""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility000819Parser(BaseParser):
    """Extract storage units from 58th Avenue Mini Storage."""

    platform = "custom_facility_000819"

    _UNIT_RE = re.compile(
        r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)"
        r"[^\$]{0,120}"
        r"\$(\d[\d,.]*)",
        re.DOTALL,
    )

    _PRICE_SIZE_RE = re.compile(
        r"\$(\d[\d,.]*)"
        r".{0,120}"
        r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)",
        re.DOTALL,
    )

    _SIZE_ONLY_RE = re.compile(
        r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)"
    )

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        for tag in soup.find_all(["script", "style"]):
            tag.decompose()

        body_text = soup.get_text(separator="\n")

        seen: set[tuple[str, str]] = set()

        # Try size-then-price pattern
        for m in self._UNIT_RE.finditer(body_text):
            size_text = m.group(1).strip()
            price_text = m.group(2).strip()
            key = (size_text, price_text)
            if key in seen:
                continue
            seen.add(key)

            unit = UnitResult()
            unit.size = size_text
            w, ln, sq = self.normalize_size(size_text)
            if w is not None:
                unit.metadata = {"width": w, "length": ln, "sqft": sq}
            unit.price = self.normalize_price(price_text)
            unit.description = m.group(0).strip()[:200]
            if unit.size or unit.price:
                result.units.append(unit)

        # Try price-then-size pattern if no results
        if not result.units:
            for m in self._PRICE_SIZE_RE.finditer(body_text):
                price_text = m.group(1).strip()
                size_text = m.group(2).strip()
                key = (size_text, price_text)
                if key in seen:
                    continue
                seen.add(key)

                unit = UnitResult()
                unit.size = size_text
                w, ln, sq = self.normalize_size(size_text)
                if w is not None:
                    unit.metadata = {"width": w, "length": ln, "sqft": sq}
                unit.price = self.normalize_price(price_text)
                unit.description = m.group(0).strip()[:200]
                if unit.size or unit.price:
                    result.units.append(unit)

        # Fallback: extract sizes without prices
        if not result.units:
            seen_sizes: set[str] = set()
            for m in self._SIZE_ONLY_RE.finditer(body_text):
                size_text = m.group(1).strip()
                if size_text in seen_sizes:
                    continue
                w, ln, sq = self.normalize_size(size_text)
                if w is None or w < 3 or ln < 3:
                    continue
                seen_sizes.add(size_text)
                unit = UnitResult()
                unit.size = size_text
                unit.metadata = {"width": w, "length": ln, "sqft": sq}
                result.units.append(unit)

        if not result.units:
            result.warnings.append("No units found via regex")

        return result

Scrape Runs (5)

Run #1180 Details

Status
exported
Parser Used
Facility000819Parser
Platform Detected
table_layout
Units Found
43
Stage Reached
exported
Timestamp
2026-03-23 02:54:08.947375
Timing
Stage Duration
Fetch3391ms
Detect60ms
Parse27ms
Export6ms

Snapshot: 000819_20260323T025412Z.html · Show Snapshot · Open in New Tab

Parsed Units (43)

5 x 5

No price

6 x 6

No price

6 x 8

No price

7 x 7

No price

7 x 8

No price

5 x 10

No price

6 x 10

No price

6 x 12

No price

7 x 12

No price

7 x 13

No price

7 x 14

No price

8 x 12

No price

9 x 12

No price

10 x 10

No price

10 x 12

No price

10 x 15

No price

10 x 21

No price

10 x 24

No price

10 x 27

No price

12 x 30

No price

12 x 27

No price

14 x 27

No price

17 x 27

No price

8 x 20

No price

30 x 29

No price

12 x 28

No price

12 x 35

No price

14 x 31

No price

15 x 31

No price

18 x 32

No price

4 x 5

No price

4 x 10

No price

5 x 7

No price

5 x 12

No price

5 x 15

No price

8 x 10

No price

12 x 12

No price

12 x 15

No price

12 x 18

No price

12 x 20

No price

12 x 40

No price

14 x 15

No price

14 x 50

No price

← Back to dashboard