Facility: 002967
SpareBox Storage
- Facility ID
- 002967
- Name
- SpareBox Storage
- URL
- https://www.spareboxstorage.com/storage-units/nh/rochester/110-south-main-street?utm_medium=yext&utm_source=gmb
- Address
- 110 S Main St, Rochester, NH 03867, USA, Rochester, New Hampshire 03867
- Platform
- custom_facility_002967
- Parser File
- src/parsers/custom/facility_002967_parser.py
- Last Scraped
- 2026-03-27 13:55:05.360062
- Created
- 2026-03-14 16:21:53.706708
- Updated
- 2026-03-27 13:55:05.386981
- Parser Status
- ✓ Working
- Status Reason
- N/A
- Last Healing Attempt
- Not attempted
Parser Source (src/parsers/custom/facility_002967_parser.py)
"""Parser for SpareBox Storage."""
from __future__ import annotations
import re
from bs4 import BeautifulSoup
from src.parsers.base import BaseParser, ParseResult, UnitResult
class Facility002967Parser(BaseParser):
"""Extract storage units from SpareBox Storage."""
platform = "custom_facility_002967"
_UNIT_RE = re.compile(
r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)"
r"[^\$]{0,120}"
r"\$(\d[\d,.]*)",
re.DOTALL,
)
_PRICE_SIZE_RE = re.compile(
r"\$(\d[\d,.]*)"
r".{0,120}"
r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)",
re.DOTALL,
)
_SIZE_ONLY_RE = re.compile(
r"(\d+\s*[\'\'\u2032]?\s*[xX\u00d7]\s*\d+\s*[\'\'\u2032]?)"
)
def parse(self, html: str, url: str = "") -> ParseResult:
soup = BeautifulSoup(html, "lxml")
result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)
for tag in soup.find_all(["script", "style"]):
tag.decompose()
body_text = soup.get_text(separator="\n")
seen: set[tuple[str, str]] = set()
# Try size-then-price pattern
for m in self._UNIT_RE.finditer(body_text):
size_text = m.group(1).strip()
price_text = m.group(2).strip()
key = (size_text, price_text)
if key in seen:
continue
seen.add(key)
unit = UnitResult()
unit.size = size_text
w, ln, sq = self.normalize_size(size_text)
if w is not None:
unit.metadata = {"width": w, "length": ln, "sqft": sq}
unit.price = self.normalize_price(price_text)
unit.description = m.group(0).strip()[:200]
if unit.size or unit.price:
result.units.append(unit)
# Try price-then-size pattern if no results
if not result.units:
for m in self._PRICE_SIZE_RE.finditer(body_text):
price_text = m.group(1).strip()
size_text = m.group(2).strip()
key = (size_text, price_text)
if key in seen:
continue
seen.add(key)
unit = UnitResult()
unit.size = size_text
w, ln, sq = self.normalize_size(size_text)
if w is not None:
unit.metadata = {"width": w, "length": ln, "sqft": sq}
unit.price = self.normalize_price(price_text)
unit.description = m.group(0).strip()[:200]
if unit.size or unit.price:
result.units.append(unit)
# Fallback: extract sizes without prices
if not result.units:
seen_sizes: set[str] = set()
for m in self._SIZE_ONLY_RE.finditer(body_text):
size_text = m.group(1).strip()
if size_text in seen_sizes:
continue
w, ln, sq = self.normalize_size(size_text)
if w is None or w < 3 or ln < 3:
continue
seen_sizes.add(size_text)
unit = UnitResult()
unit.size = size_text
unit.metadata = {"width": w, "length": ln, "sqft": sq}
result.units.append(unit)
if not result.units:
result.warnings.append("No units found via regex")
return result
Scrape Runs (5)
-
exported Run #18952026-03-27 13:55:00.944061 | 8 units | Facility002967Parser | View Data →
-
exported Run #18942026-03-27 13:55:00.874493 | 8 units | Facility002967Parser | View Data →
-
exported Run #12102026-03-23 02:56:20.392565 | 9 units | Facility002967Parser | View Data →
-
exported Run #7172026-03-21 18:47:46.518116 | 9 units | Facility002967Parser | View Data →
-
exported Run #2662026-03-14 16:29:39.427763 | 8 units | Facility002967Parser | View Data →
Run #717 Details
- Status
- exported
- Parser Used
- Facility002967Parser
- Platform Detected
- storageunitsoftware
- Units Found
- 9
- Stage Reached
- exported
- Timestamp
- 2026-03-21 18:47:46.518116
Timing
| Stage | Duration |
|---|---|
| Fetch | 4630ms |
| Detect | 76ms |
| Parse | 178ms |
| Export | 5ms |
Snapshot: 002967_20260321T184751Z.html · Show Snapshot · Open in New Tab
Parsed Units (9)
5' x 10'
$106.00/mo
5' x 15'
$169.00/mo
5' x 15'
$191.00/mo
10' x 10'
$104.00/mo
10' x 10'
$109.00/mo
10' x 15'
$139.00/mo
10' x 20'
$139.00/mo
10' x 25'
$179.00/mo
10' x 40'
$368.00/mo