Facility: 046851

Cherokee Storage / K&L Storage

Stale Data Warning: This facility has not been successfully scraped in 26 days (threshold: 3 days). Data may be outdated.
Facility Information active
Facility ID
046851
Name
Cherokee Storage / K&L Storage
URL
http://cherokeewaco.com/
Address
6368 N State Hwy 6, Woodway, TX 76712, USA, Woodway, Texas 76712
Platform
custom_facility_046851
Parser File
src/parsers/custom/facility_046851_parser.py
Last Scraped
2026-03-27 13:45:09.608779
Created
2026-03-20 23:23:10.660695
Updated
2026-03-27 13:45:09.643402
Parser & Healing Diagnosis working
Parser Status
✓ Working
Status Reason
N/A
Last Healing Attempt
Not attempted
Parser Source (src/parsers/custom/facility_046851_parser.py)
"""Parser for Cherokee Storage / K&L Storage."""

from __future__ import annotations

import re

from bs4 import BeautifulSoup

from src.parsers.base import BaseParser, ParseResult, UnitResult


class Facility046851Parser(BaseParser):
    """Extract storage units from Cherokee Storage / K&L Storage."""

    platform = "custom_facility_046851"

    def parse(self, html: str, url: str = "") -> ParseResult:
        soup = BeautifulSoup(html, "lxml")
        result = ParseResult(platform=self.platform, parser_name=self.__class__.__name__)

        # Units are in a tablesaw table with Type and Size columns
        # Find table by ID prefix or class
        table = soup.find("table", id=re.compile(r"^table-"))
        if not table:
            table = soup.find("table", class_=re.compile(r"pp-table-content"))

        if table:
            rows = table.find_all("tr")
            for row in rows:
                cells = row.find_all(["td", "th"])
                if len(cells) < 2:
                    continue
                # Skip header rows
                if cells[0].name == "th":
                    continue

                # In tablesaw-stack mode, data is in .tablesaw-cell-content spans
                type_el = cells[0].find("span", class_="tablesaw-cell-content")
                size_el = cells[1].find("span", class_="tablesaw-cell-content")

                type_text = type_el.get_text(strip=True) if type_el else cells[0].get_text(strip=True)
                size_text = size_el.get_text(strip=True) if size_el else cells[1].get_text(strip=True)

                # Filter out header repeats
                if not size_text or size_text.lower() == "size":
                    continue

                if re.search(r"\d", size_text):
                    unit = UnitResult()
                    unit.size = size_text
                    unit.description = type_text if type_text else None
                    w, ln, sq = self.normalize_size(size_text)
                    if w is not None:
                        unit.metadata = {"width": w, "length": ln, "sqft": sq}
                    result.units.append(unit)

        if not result.units:
            result.warnings.append("No units found on page")

        return result

Scrape Runs (4)

Run #1668 Details

Status
exported
Parser Used
Facility046851Parser
Platform Detected
table_layout
Units Found
7
Stage Reached
exported
Timestamp
2026-03-27 13:45:04.753791
Timing
Stage Duration
Fetch4749ms
Detect31ms
Parse18ms
Export22ms

Snapshot: 046851_20260327T134509Z.html · Show Snapshot · Open in New Tab

Parsed Units (7)

10 X 15

No price

12 X 30

No price

20 X 30

No price

25 X 40

No price

12 X 40

No price

10 X 12

No price

5 X 10

No price

← Back to dashboard