Triplet V1
The First Foundation Model for Pathology Multi-scale Reasoning
Triplet V1 features cross-scale reasoning as its core capability, enabling the model to perform visual planning, step-by-step verification, and conclusion convergence like a pathologist, with traceable evidence chains.
Cross-scale Reasoning, Fully Explainable
Triplet features unique multi-scale explainable evidence chain training with stepwise evidence-based reasoning, enabling clinically auditable reasoning expressions.
Interactivity Reasoning, Explicit Traceable
Stepwise display from low-magnification tissue architecture to high-magnification cytological evidence; clinicians can modify and trace at any time, forming a closed-loop reasoning workflow.
Human-in-the-loop Data Flywheel
Human-in-the-loop data annotation: AI generation, clinician review, and feedback training build a data-to-AI capability flywheel for continuous improvement.
Why Multi-scale Reasoning
Pathology diagnosis is not about viewing a single image. It starts with low-mag global scanning, then mid-mag structural localization, and finally high-mag cytological confirmation. Triplet defines this cross-scale workflow as the core capability target and designs training and evaluation accordingly.
Global scan: Identify overall tissue architecture and abnormal region distribution as the starting point for multi-scale reasoning.

Whole slide image shows lung tissue specimen with large dark purple solid areas contrasting surrounding normal alveolar structures, suggesting locally high cellularity lesion.
Dark purple solid area with significantly increased cell density and locally back-to-back arrangement suggests suspicious neoplastic lesion.

At 10× magnification, dysplastic glands show complex distribution suggesting stromal invasion. Dense cellularity without diffuse necrosis or hemorrhage. Mild stromal inflammatory infiltration, no significant carbon deposition or pseudodense areas from alveolar collapse, excluding non-neoplastic dense shadows.
This region shows loss of normal alveolar structures with acinar and papillary growth patterns, suggesting marginal features of neoplastic proliferation with clear diagnostic value.

Tumor cells infiltrate in acinar and papillary patterns with irregular lumina, partially in back-to-back arrangement. Nuclei are hyperchromatic and variably sized with visible nucleoli and eosinophilic cytoplasm. Stromal fibrosis present without keratin pearl formation or perivascular cuffing. Cell adhesion preserved with some desquamation.
Significantly enlarged and hyperchromatic nuclei with increased nuclear-to-cytoplasmic ratio, irregular nuclear membranes, and nuclear crowding suggest high-grade atypia consistent with typical cancer cell cytology — a key region for tumor characterization.

Under high magnification, tumor cell nuclei are significantly enlarged and hyperchromatic with clearly visible nucleoli. Intracytoplasmic mucin vacuoles of varying sizes suggest adenocarcinoma features. No intercellular bridges or keratinization observed.
What Triplet V1 Can Do
Cross-scale Reasoning and Evidence Linking
Explicitly connects evidence across at least two magnifications, avoiding single-scale guessing.
Multi-region Heterogeneity Understanding
Learns contextual relationships among multiple ROIs along a Path to understand diagnostic significance for the whole WSI.
Structured Output (Clinical Review Ready)
Outputs Reason and Caption at each magnification, forming cross-scale reasoning chains and summaries.
Bilingual Output
Supports Chinese and English expression, with the primary focus on cross-scale reasoning and evidence chain capabilities.
Core Mechanism: Triplet Structure
Triplet organizes learning with a triplet data structure: multi-scale images + selection rationale and pathological features + diagnostic significance and findings of the path. Each path is a complete cross-scale Path (10x → 40x → 200x).
Data & Platform: From Annotation to Reasoning Data Pipeline
The Triplet data review platform addresses multi-center data distribution and modality diversity, building a high-quality, multi-scale, and explainable reasoning data system.
AI Generation Phase (Automated)
Global WSI analysis to generate an overview (low-mag structure, tissue distribution, heterogeneity).
Generate candidate Paths and build cross-scale Paths for each region (10x -> 40x -> 200x).
Generate Reason / Caption / reasoning chain / summary at each level.
Doctor Review Phase (Human-in-the-loop)
Correct unreasonable magnification motivations and inaccurate descriptions.
Revise cross-scale reasoning chain logic and final summaries.
After review approval, feed back into training for the next data iteration cycle.

Training & Evaluation
SFT teaches the model evidence pathways; RL makes "reasoning ability" reliably measurable.
SFT: Teaching Evidence Pathways
Cross-scale Paths (10x → 40x → 200x) are built for each region at multiple scales, learning Reason / Caption / reasoning chain / summary at each level, teaching the model "why to magnify, what is seen, and how to summarize".
RL: Making Reasoning Reliably Measurable
A leakage-aware mechanism suppresses text shortcuts: text-only adversarial inference is performed first, and if the correct answer can be obtained without viewing the image, it is flagged as leakage and iteratively corrected. GRPO alignment then rewards answer correctness and cross-scale reasoning consistency while constraining output format stability.



Experimental Results
Triplet V1 achieves significant leading performance on two core evaluation tasks.
Accuracy on cross-scale multi-image TripletVQA-Test
Task: Accuracy on cross-scale multi-image TripletVQA-Test
| Domain | Model | Correspondence | Confirmation | Localization | Explanation | Diagnosis | Average |
|---|---|---|---|---|---|---|---|
| General | Qwen2.5VL-7B | 0.418 | 0.473 | 0.711 | 0.443 | 0.637 | 0.536 |
| Medical | LLaVA-Med-7B | 0.259 | 0.169 | 0.284 | 0.209 | 0.249 | 0.234 |
| HuatuoGPT-V-7B | 0.378 | 0.458 | 0.657 | 0.468 | 0.557 | 0.503 | |
| Lingshu-7B | 0.473 | 0.637 | 0.627 | 0.483 | 0.612 | 0.566 | |
| Pathology | Quilt-LLaVA | 0.323 | 0.144 | 0.453 | 0.303 | 0.294 | 0.303 |
| CLOVER | 0.373 | 0.617 | 0.731 | 0.463 | 0.657 | 0.568 | |
| Patho-R1 | 0.318 | 0.403 | 0.597 | 0.562 | 0.687 | 0.513 | |
| Triplet V1 | 0.806 | 0.891 | 0.846 | 0.761 | 0.841 | 0.829 |

Question
Based on the cellular morphology, which layer of stratified squamous epithelium is most prevalent on the left side of the image?
Model Response Comparison
Case data from the Triplet-V1 paper, showing response comparisons across models on cross-scale reasoning tasks.
Online Trial
Upload or select a sample image to experience Triplet cross-scale reasoning output.
Image Input
Upload or use preset images
Analysis Result
Click "Analyze" to see the model output here.
Product Offerings
Three forms for system integration, workflow collaboration, and research-grade reinforced reasoning.
Triplet API
Integrate cross-scale reasoning into your system: VQA, structured descriptions, evidence chain output, and QC interfaces.
Triplet Agents (Collaborative)
Workflow-oriented: region selection -> evidence aggregation -> report drafting -> human doctor supervision loop.
Triplet Reasoning Edition (Research/Advanced)
Built on leakage-aware data curation + GRPO reinforcement learning for improved cross-scale consistency and controllable output.
FAQ
What makes Triplet V1 fundamentally different?
Cross-scale reasoning is the primary capability. Training, data, and evaluation are all designed around closed-loop evidence chains, avoiding single-scale black-box judgments.
How do you ensure data reliability?
Iterative human-in-the-loop: AI generates first, then doctors review and correct, feeding back into training to continuously improve data quality and traceability.
Apply for Triplet V1 Partnership
If you want to validate cross-scale reasoning in real pathology scenarios, contact our team for early access, API integration, and Agents collaboration plans.
