Commit Graph

8 Commits

Author SHA1 Message Date
15c8971fee data: cleaned corpus + recovered missing URLs + fresh Swift Evolution
Major corpus refresh accompanying the cupertino #284 cleanup work:

docs/ (412,562 files / 2.4G total):
- Net change: 71,284 files (1,017,864 insertions / 4,503,415 deletions).
- Removed: 1,166 + 39 + 3 = 1,208 poison files Apple's CDN consistently
  returns error templates for (JS-disabled fallback + Web Server Error
  + 502/403 titles).
- Added: 8,357 .json files recovered from cross-backup URL match
  (May-09 jsonpass + private mirror + Apr-30 pre-recrawl).
- Added: 103 .md files from the November 2025 AppleDocs snapshot for
  URIs not available as .json anywhere.
- Added: 580 synthesized stub .json files for URIs whose own page
  Apple's CDN won't render but whose one-line description IS embedded
  in the parent type's Topics section. Each stub carries the parent-
  extracted description as 'abstract' and source: custom.
- Verified clean across 12 poison categories (no HTTP error titles,
  no JS-disabled fallback, no Skip-Navigation broken bodies, no
  openresty/CDN markers) by a full no-heuristics scan of all
  412,458 .json + 103 .md files.

swift-evolution/ (912 files, +41 new):
- Refetched via 'cupertino fetch --type evolution' against
  github.com/swiftlang/swift-evolution.
- 3 brand-new SE-XXXX proposals + 478 refreshed.
- Preserves the historical NNNN-descriptive-name files (429 of them)
  that the cupertino-docs repo kept from the original swift-evolution
  proposals/* directory layout — both naming conventions coexist.

swift-org/ (787 files, +2 from local sync).
archive/ (419 files, +12 from local sync).
packages/, hig/ unchanged.

Reference: #290 (re-publish v1.0.x bundle), #284 (poison rows, closed),
PR #289 + PR #291 (crawler + indexer defenses, merged into cupertino
main), #293 (URI collision design issue, filed).

No version bump on cupertino. The corpus tag below marks this state.
2026-05-12 16:22:55 +02:00
1eae0594fc data: refresh docs/ + add packages/ from .cupertino-dev
Bulk update of the public corpus from /Users/mmj/.cupertino-dev/, the
local source that built v1.0.0 / v1.0.1 release database bundles.

- docs/: 1,739 new pages, 403,381 modified (schema upgrade to the
  structured-page format used by v1.0+: overview, rawMarkdown,
  sections[].items[]). Total 404,568 page JSONs (2.4 GB).
- packages/: 1,587 packages (1.5 GB), previously empty in this repo.
  Includes per-package source archives and READMEs for the curated
  Swift package set. Some packages (notably swiftlang/swift-foundation-icu)
  ship 50-100 MB ICU lookup tables — under GitHub's 100 MB limit.

No deletions: existing files not in .cupertino-dev are preserved
(e.g. swift-org/ has 81 historical pages this update doesn't touch).
2026-05-09 12:58:32 +02:00
850b706c47 data: merge Apr 30 recrawl backup with size heuristic, archive churn
Merges /Volumes/Code2/cupertino-backup-2026-04-30-pre-recrawl/ (the
in-progress JSON-only recrawl from Claw, 345k files / 2.2 GB) into
docs/ with the rule "keep the larger version per file". Adds 84,226
new pages and replaces 36,278 with larger versions; leaves 224,653
unchanged where the existing Mar 13 file was equal-or-larger.

Archives 9 frameworks that Apple has removed or renamed in the
~6 weeks since the Mar 13 baseline, into _archive/2026-03-13/:

  Removed (now 404 on developer.apple.com):
    availability, cocoa, objective-c, performance, sensitivecontent

  Renamed/relocated (URL still resolves but under a different slug):
    passkit_apple_pay_and_wallet  →  passkit
    safariextensions              →  safariservices/safari_app_extensions
    touchcontrols                 →  touchcontroller
    carekit                       →  carekit-apple/CareKit (separate hierarchy)

Net effect: docs/ goes from 319,191 → 403,370 files (+26 %).

The 224,653 skipped files are notable — for those pages, the older
Mar 13 extractor produced richer content than the in-progress
JSON-only crawl. A WKWebView-only crawl (in flight on Claw) should
produce a fuller corpus and re-update docs/ when it completes.
2026-05-01 00:45:39 +02:00
b79bbc7c71 docs: restore deleted docs and add new framework coverage (319k files)
Restores 188k docs accidentally deleted by rsync --delete, plus adds
new docs for 134 previously missing/incomplete frameworks.

Total: 319,190 doc files across 385+ frameworks.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-13 12:19:34 +01:00
a3607fe5bd docs: update Apple framework documentation (partial batch 1)
Added/updated docs for 48+ frameworks including:
- New: apple_ads (271), applenews (267), cktooljs (646), cfnetwork (483),
  browserenginekit (442), automator (171), analytics-reports (137),
  coregraphics (1103+), coremedia (1196+), corevideo (848+),
  coretext (541+), coreaudiotypes (709+), coreaudio (481+)
- Updated existing frameworks with refreshed content

Part of cupertino#159 and cupertino#160 - missing framework docs fix.
More frameworks being fetched.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-10 16:27:49 +01:00
d211dbbb43 updated with macmini crawled 2025-12-08 22:40:34 +01:00
0f827e377c freshly crawled docs 2025-12-04 00:47:29 +01:00
7e55156f34 adding main docs 2025-12-02 22:22:13 +01:00