Facility 004393 - Facility Scrapers

Stale Data Warning: This facility has not been successfully scraped in 76 days (threshold: 3 days). Data may be outdated.

Facility Information (active)

Facility ID: 004393
Name: Zoom Self Storage
URL: http://www.zoomselfstore.com/

Address: 116 N Main St, Oakfield, NY 14125, USA
Platform: custom_facility_004393
Parser File: src/parsers/custom/facility_004393_parser.py

Last Scraped: 2026-03-27 13:40:23.294703
Created: 2026-03-23 02:35:08.816820
Updated: 2026-03-27 13:40:23.294703
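
The stale-data warning at the top of this page compares the age of the last successful scrape against a 3-day threshold. A minimal sketch of that check follows. The function name and the render time are assumptions: the render time is not shown on the page and is chosen only so the age matches the 76 days in the warning.

```python
from datetime import datetime, timedelta

# Values copied from this page.
LAST_SCRAPED = datetime.fromisoformat("2026-03-27 13:40:23.294703")
STALE_THRESHOLD = timedelta(days=3)


def staleness(last_scraped: datetime, now: datetime) -> tuple[int, bool]:
    """Return (whole days since last scrape, whether the threshold is exceeded)."""
    age = now - last_scraped
    return age.days, age > STALE_THRESHOLD


# Hypothetical render time (assumption, inferred from the 76-day figure).
now = datetime(2026, 6, 11, 14, 0, 0)
days, is_stale = staleness(LAST_SCRAPED, now)
print(days, is_stale)
```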

Parser & Healing Diagnosis (needs_fix)

Parser Status: ⚠ Needs Fix
Status Reason: Parser returned 0 units

Last Healing Attempt: Not attempted
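
The parser's docstring says it reports the Facebook redirect so the pipeline can classify the facility as no_pricing, yet the status here is needs_fix with "Parser returned 0 units". One possible way to tell the two cases apart is sketched below. Everything in it is hypothetical: ParseResultStub stands in for the real ParseResult, the "ok" label is invented, and the rule itself is an assumption about what the pipeline could do, not what it does.

```python
from dataclasses import dataclass, field


@dataclass
class ParseResultStub:
    """Hypothetical stand-in for ParseResult (only the two fields used here)."""
    units: list = field(default_factory=list)
    warnings: list = field(default_factory=list)


def classify(result: ParseResultStub) -> str:
    """Map a parse result to a status label (assumed rule, not the real pipeline's)."""
    if result.units:
        return "ok"  # hypothetical label
    # An explicit no_pricing hint in the warnings means the parser ran as designed.
    if any("no_pricing" in w for w in result.warnings):
        return "no_pricing"
    # Zero units and no hint: the parser probably needs attention.
    return "needs_fix"


redirect = ParseResultStub(warnings=[
    "Site (zoomselfstore.com) redirects to Facebook. "
    "No pricing data available. Consider marking facility as no_pricing."
])
print(classify(redirect))
```

Matching on warning text is brittle. A dedicated field on the result would be sturdier, if the real ParseResult can carry one.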

Parser Source (src/parsers/custom/facility_004393_parser.py)

"""Parser for Zoom Self Storage (facility 004393).

Site redirects to Facebook (facebook.com/zoomselfstorage/). No structured
pricing data is available. This parser detects the Facebook redirect and
reports the situation clearly so the pipeline can classify the facility
as ``no_pricing``.
"""

from __future__ import annotations

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility004393Parser(BaseParser):
    """Zoom Self Storage -- site redirects to Facebook, no pricing available."""

    platform = "custom_facility_004393"

    def _is_facebook_page(self, soup: BeautifulSoup) -> bool:
        """Return True if the HTML belongs to a Facebook page."""
        title = soup.find("title")
        if title and "facebook" in title.get_text(strip=True).lower():
            return True
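        # Check for Facebook-specific meta / structural markers.
        # NOTE: fb:/al: meta tags and a #fb-root div can also appear on
        # third-party sites that embed Facebook widgets, so these checks
        # may over-match if the site is ever restored.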
        for meta in soup.find_all("meta", attrs={"property": True}):
            prop = (meta.get("property") or "").lower()
            if prop.startswith("fb:") or prop.startswith("al:"):
                return True
        if soup.find("div", id="facebook") or soup.find("div", id="fb-root"):
            return True
        return False

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(
            platform=self.platform,
            parser_name=self.__class__.__name__,
        )

        if self._is_facebook_page(soup):
            result.warnings.append(
                "Site (zoomselfstore.com) redirects to Facebook. "
                "No pricing data available. Consider marking facility as no_pricing."
            )
            return result

        # If the page is ever restored to a real storage site, attempt basic
        # extraction so we surface data instead of silently returning nothing.
        for tag in soup.find_all(["script", "style"]):
            tag.decompose()

        body_text = soup.get_text(separator="\n")
        seen: set[tuple[str, str]] = set()

        import re

        size_price_re = re.compile(
            r"(\d+\s*['\u2032]?\s*[xX\u00d7]\s*\d+\s*['\u2032]?)"
            r"[^\$]{0,120}"
            r"\$(\d[\d,.]*)",
            re.DOTALL,
        )

        for m in size_price_re.finditer(body_text):
            size_text = m.group(1).strip()
            price_text = m.group(2).strip()
            key = (size_text, price_text)
            if key in seen:
                continue
            seen.add(key)

            unit = UnitResult()
            unit.size = size_text
            w, ln, sq = self.normalize_size(size_text)
            if w is not None:
                unit.metadata = {"width": w, "length": ln, "sqft": sq}
            unit.price = self.normalize_price(price_text)
            unit.description = m.group(0).strip()[:200]
            if unit.size or unit.price:
                result.units.append(unit)

        if not result.units:
            result.warnings.append(
                "No storage units found. Page may not contain pricing data."
            )

        return result
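
To see what the fallback pattern in parse() extracts, the regex can be run on its own. The sketch below copies the pattern from the source above. The sample text is made up for illustration and does not come from the facility's site, and the constant name is an assumption.

```python
import re

# Same pattern as in parse(): a size such as 5x10 or 10' x 20', then up to
# 120 non-dollar characters, then a dollar amount.
SIZE_PRICE_RE = re.compile(
    r"(\d+\s*['\u2032]?\s*[xX\u00d7]\s*\d+\s*['\u2032]?)"
    r"[^\$]{0,120}"
    r"\$(\d[\d,.]*)",
    re.DOTALL,
)


def extract_pairs(text: str) -> list[tuple[str, str]]:
    """Return (size, price) strings in document order, without normalization."""
    return [
        (m.group(1).strip(), m.group(2).strip())
        for m in SIZE_PRICE_RE.finditer(text)
    ]


# Made-up sample text (assumption, not real site content).
sample = "5x10 unit, ground floor $55.00\n10' x 20' drive-up $1,200\n"
pairs = extract_pairs(sample)
print(pairs)
# → [('5x10', '55.00'), ("10' x 20'", '1,200')]
```

One limitation of this heuristic: the gap between size and price may itself contain another size, so a line such as "5x5 and 10x10 both $40" yields only the pair ('5x5', '40').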

Stage	Duration
Fetch	4015ms
Detect	22ms
Parse	8ms
Export	16ms
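
The stage durations above can be summed to see where the time goes. A small sketch, using only the four figures from the table:

```python
# Stage durations in milliseconds, copied from the table above.
stages = {"Fetch": 4015, "Detect": 22, "Parse": 8, "Export": 16}

total_ms = sum(stages.values())
slowest = max(stages, key=stages.get)
share = stages[slowest] / total_ms

print(f"total={total_ms}ms slowest={slowest} share={share:.1%}")
# → total=4061ms slowest=Fetch share=98.9%
```

Fetch accounts for nearly all of the run time, and parsing itself is negligible.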

Scrape Runs (3)

Run #1542 Details

All Failures for this Facility (3)

HTML Snapshot — Run #1542