Search strategy
Boolean search across MEDLINE, Embase, Web of Science, and Scopus, peer-reviewed using PRESS and scoped per-subspecialty with an AI/ML hedge.
Every model card is the output of a defined seven-step process — pre-registered, dual-reviewed, quality-graded, version-controlled, and publicly auditable.
One pipeline per subspecialty review · dual-reviewed at every gate.
Boolean search across MEDLINE, Embase, Web of Science, and Scopus, peer-reviewed using PRESS and scoped per-subspecialty with an AI/ML hedge.
Two reviewers screen each record independently in Covidence; discrepancies reconciled by a third reviewer.
Eligible records are read in full and assessed against pre-registered inclusion criteria.
Two reviewers extract every Tier-1 and Tier-2 schema field per study, using controlled vocabularies for filterable categorisation.
Independent dual-reviewer scoring across 24 items in 6 subdomains; total reconciled and rendered as a quality grade.
Cards are written in MDX, Zod-validated at build, dual-reviewed in pull requests, and deployed automatically on merge.
Annual full re-search per subspecialty; continuous user submissions screened within four weeks. Versions follow semver.
Tier 1 for essence Tier 2 for evidence Tier 3 for provenance.
"Plain-language summary of where the model is and isn't safe to use."
Caveats on training cohort, external validation, and subgroup reporting.
The protocol, the rubric, the reviewer roster, the inclusion log.
EyeModelHub follows the PRISMA-ScR extension to scoping reviews. Each subspecialty review is a pre-registered, dual-reviewed mapping of the literature, refreshed annually with continuous interim updates from user submissions.
Peer-reviewed primary research describing AI/ML applied to a recognised ophthalmology subspecialty. Both research prototypes and regulator-cleared products are eligible if reported in primary literature. Excluded: preprints, reviews, editorials, conference abstracts, and non-human studies.
A 3-tier schema. Tier 1 (11 fields) is always visible. Tier 2 (~36 fields) is the expanded view. Tier 3 (~10 fields) is administrative metadata. Multi-value and enum fields use controlled vocabularies; "Not reported" is a legitimate value.
A 24-item, 6-subdomain critical-appraisal tool produces a 0–100 total per study. Subdomains: clinical relevance, data quality, methodological conduct, robustness, reporting quality, and reproducibility. Bands: Low <40, Moderate 40–59, High 60–79, Very High 80+.
Annual full re-search per subspecialty on the anniversary of the search closing date. Continuous user submissions reviewed within four weeks. Card versions follow semver — bump on any content change.
Every model card has at least two reviewers: one extracts, one adjudicates. APPRAISE-AI is scored independently by both. Discrepancies are resolved by discussion or by a third reviewer. Reviewer attribution is opt-in.
As the registry grows across subspecialties, methodological details — particularly around APPRAISE-AI scoring conventions, controlled vocabularies, and update cadence — may be refined. Material changes will be versioned, dated, and announced. The schema follows semantic versioning; this page is the living source of truth.