Facility: 109299

Whitcomb Family Storage

Stale Data Warning: This facility has not been successfully scraped in 26 days (threshold: 3 days). Data may be outdated.
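The warning above compares the time since the last successful scrape with a threshold in days. A minimal sketch of such a check follows; the helper names `is_stale` and `days_since` are hypothetical, and the dashboard's actual implementation is not shown on this page.

```python
from datetime import datetime, timedelta


def is_stale(last_scraped: datetime, now: datetime, threshold_days: int = 3) -> bool:
    """Return True when the last successful scrape is older than the threshold."""
    return now - last_scraped > timedelta(days=threshold_days)


def days_since(last_scraped: datetime, now: datetime) -> int:
    """Whole days elapsed since the last successful scrape."""
    return (now - last_scraped).days
```

Passing `now` explicitly, rather than calling `datetime.now()` inside the helper, keeps the check deterministic and easy to test.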
Facility Information (active)
Facility ID
109299
Name
Whitcomb Family Storage
URL
https://whitcombfamilystorage.com/
Address
81 N Main St, Morrill, ME 04952, USA
Platform
custom_facility_109299
Parser File
src/parsers/custom/facility_109299_parser.py
Last Scraped
2026-03-27 14:08:18.610153
Created
2026-03-14 16:21:53.706708
Updated
2026-03-27 14:08:18.642708
Parser & Healing Diagnosis (working)
Parser Status
✓ Working
Status Reason
N/A
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_109299_parser.py)
"""Parser for Whitcomb Family Storage."""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility109299Parser(BaseParser):
    """Extract storage units from Whitcomb Family Storage."""

    platform = "custom_facility_109299"

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        containers = soup.select("div.c1-1.c1-2.c1-3.c1-b.c1-c.c1-d.c1-e.c1-h.c1-i.x-el.x-el-div")

        for container in containers:
            unit = UnitResult()
            text = container.get_text(separator=" ", strip=True)

            # Extract size
            size_el = container.select_one("h4.x-el.x-el-h4.c1-5z.c1-2.c1-2k.c1-2l.c1-8d.c1-5v.c1-5w.c1-2z.c1-9w.c1-2c.c1-3q.c1-6t.c1-63.c1-9y.c1-9z.c1-52.c1-6u.c1-6v.c1-6w.c1-6x")
            size_text = size_el.get_text(strip=True) if size_el else None
            if not size_text:
                # Fallback: regex on full text
                m = re.search(r"(\d+\s*['\u2032]?\s*[xX\u00d7]\s*\d+\s*['\u2032]?)", text)
                if m:
                    # The trailing \s* can capture whitespace, so strip it
                    size_text = m.group(1).strip()
            if size_text:
                unit.size = size_text
                w, ln, sq = self.normalize_size(size_text)
                if w is not None:
                    unit.metadata = {"width": w, "length": ln, "sqft": sq}

            # Extract price: prefer the first <li>, then fall back to the
            # full container text if that <li> is missing or has no price
            price_el = container.select_one("li")
            price_text = price_el.get_text(strip=True) if price_el else ""
            pm = re.search(r"\$(\d[\d,.]*)", price_text) or re.search(r"\$(\d[\d,.]*)", text)
            if pm:
                unit.price = self.normalize_price(pm.group(1))

            unit.description = text[:200]

            if unit.size or unit.price:
                result.units.append(unit)

        if not result.units:
            result.warnings.append("No units found")

        return result
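
The regex fallbacks in the parser above can be tried without `BaseParser` or BeautifulSoup. The sketch below reuses the same two patterns on plain text; the function name `extract_size_and_price` and the sample strings are assumptions made for illustration, not part of the parser.

```python
from __future__ import annotations

import re

# Same patterns as the parser's fallbacks: "5x10", "10' x 20'", "5 × 10", and "$25.00"
SIZE_RE = re.compile(r"(\d+\s*['\u2032]?\s*[xX\u00d7]\s*\d+\s*['\u2032]?)")
PRICE_RE = re.compile(r"\$(\d[\d,.]*)")


def extract_size_and_price(text: str) -> tuple[str | None, str | None]:
    """Return (size, price) found in text, or None for whichever is missing."""
    size = SIZE_RE.search(text)
    price = PRICE_RE.search(text)
    return (
        size.group(1).strip() if size else None,  # strip whitespace caught by the trailing \s*
        price.group(1) if price else None,
    )


print(extract_size_and_price("5x10 $25.00/mo"))  # ('5x10', '25.00')
```

The size pattern ends in `\s*` followed by an optional foot mark, so it can capture a trailing space; the `strip()` call removes it.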

Scrape Runs (5)

Run #1371 Details

Status
exported
Parser Used
Facility109299Parser
Platform Detected
table_layout
Units Found
3
Stage Reached
exported
Timestamp
2026-03-23 03:10:42.552944
Timing
Stage     Duration
Fetch     2171 ms
Detect    29 ms
Parse     14 ms
Export    7 ms
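
Summing the stage durations above shows where the run spent its time. This is a small sketch using this run's values; the dictionary layout and variable names are assumptions for illustration.

```python
# Stage durations for run #1371, in milliseconds
stage_ms = {"Fetch": 2171, "Detect": 29, "Parse": 14, "Export": 7}

total_ms = sum(stage_ms.values())
slowest = max(stage_ms, key=stage_ms.get)
shares = {stage: ms / total_ms for stage, ms in stage_ms.items()}

print(f"total: {total_ms} ms, slowest stage: {slowest}")  # total: 2221 ms, slowest stage: Fetch
```

Fetching accounts for nearly all of the run time; parsing and export are negligible by comparison.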

Snapshot: 109299_20260323T031044Z.html

Parsed Units (3)

Size           Price
5x10           $25.00/mo
Unknown Size   $25.00/mo
5x10           $70.00/mo
