cupertino-docs/_archive/2026-03-13
Mihaela Mihaljevic 850b706c47 data: merge Apr 30 recrawl backup with size heuristic, archive churn
Merges /Volumes/Code2/cupertino-backup-2026-04-30-pre-recrawl/ (the
in-progress JSON-only recrawl from Claw, 345k files / 2.2 GB) into
docs/ with the rule "keep the larger version per file". Adds 84,226
new pages and replaces 36,278 with larger versions; leaves 224,653
unchanged where the existing Mar 13 file was equal-or-larger.

Archives 9 frameworks that Apple has removed or renamed in the
~6 weeks since the Mar 13 baseline, into _archive/2026-03-13/:

  Removed (now 404 on developer.apple.com):
    availability, cocoa, objective-c, performance, sensitivecontent

  Renamed/relocated (URL still resolves but under a different slug):
    passkit_apple_pay_and_wallet  →  passkit
    safariextensions              →  safariservices/safari_app_extensions
    touchcontrols                 →  touchcontroller
    carekit                       →  carekit-apple/CareKit (separate hierarchy)

Net effect: docs/ goes from 319,191 → 403,370 files (+26 %).

The 224,653 skipped files are notable — for those pages, the older
Mar 13 extractor produced richer content than the in-progress
JSON-only crawl. A WKWebView-only crawl (in flight on Claw) should
produce a fuller corpus and re-update docs/ when it completes.
2026-05-01 00:45:39 +02:00
..
availability data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
carekit data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
cocoa data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
objective-c data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
passkit_apple_pay_and_wallet data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
performance data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
safariextensions data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
sensitivecontent data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00
touchcontrols data: merge Apr 30 recrawl backup with size heuristic, archive churn 2026-05-01 00:45:39 +02:00