Democratizing Cardiovascular AI: A Multi-Collection RAG Architecture and Product Requirements for the Cardiology Intelligence Agent¶
Author: Adam Jones Date: March 2026 Version: 0.1.0 (Pre-Implementation) License: Apache 2.0
Part of the HCLS AI Factory -- an end-to-end precision medicine platform. https://github.com/ajones1923/hcls-ai-factory
Abstract¶
Cardiovascular disease (CVD) remains the leading cause of death globally, claiming approximately 17.9 million lives annually -- representing 32% of all deaths worldwide. Despite exponential growth in cardiovascular AI research (over 4,500 publications in 2025 alone), critical barriers persist in clinical translation: fragmented evidence across modalities, siloed data systems, lack of integrated genomic-imaging correlation, and prohibitive infrastructure costs that limit advanced cardiac AI to elite academic centers.
This paper presents the architectural design, clinical rationale, and product requirements for the Cardiology Intelligence Agent -- a multi-collection retrieval-augmented generation (RAG) system purpose-built for cardiovascular medicine. The agent will unify 12 specialized Milvus vector collections spanning cardiac imaging (echocardiography, cardiac CT, cardiac MRI, nuclear cardiology), electrophysiology (12-lead ECG, Holter monitoring, device interrogation), hemodynamics, heart failure management, valvular heart disease, preventive cardiology, interventional cardiology, and cardio-oncology -- alongside a shared genomic_evidence collection containing 3.5 million variant vectors from the HCLS AI Factory genomics pipeline.
The system extends the proven multi-collection RAG architecture established by five existing intelligence agents in the HCLS AI Factory (Precision Biomarker, Precision Oncology, CAR-T, Imaging, and Autoimmune agents), adapting it with cardiology-specific clinical workflows, risk calculators, cross-modal imaging-genomics triggers, and structured reporting aligned with ACC/AHA guidelines. Eight reference clinical workflows will cover the highest-impact cardiovascular use cases: coronary artery disease assessment, heart failure classification, valvular disease quantification, arrhythmia detection, cardiac MRI tissue characterization, stress test interpretation, preventive risk stratification, and cardio-oncology surveillance.
The agent will deploy on a single NVIDIA DGX Spark ($3,999) using BGE-small-en-v1.5 embeddings (384-dimensional, IVF_FLAT, COSINE), Claude Sonnet 4.6 for evidence synthesis, and four NVIDIA NIM microservices for on-device inference. Licensed under Apache 2.0, the platform will democratize access to integrated cardiovascular intelligence that currently requires multi-million-dollar institutional investments in informatics infrastructure.
Table of Contents¶
- Introduction
- The Cardiovascular Data Challenge
- Clinical Landscape and Market Analysis
- Existing HCLS AI Factory Architecture
- Cardiology Intelligence Agent Architecture
- Milvus Collection Design
- Clinical Workflows
- Cross-Modal Integration
- NIM Integration Strategy
- Knowledge Graph Design
- Query Expansion and Retrieval Strategy
- API and UI Design
- Clinical Decision Support Engines
- Reporting and Interoperability
- Product Requirements Document
- Data Acquisition Strategy
- Validation and Testing Strategy
- Regulatory Considerations
- DGX Compute Progression
- Implementation Roadmap
- Risk Analysis
- Competitive Landscape
- Discussion
- Conclusion
- References
1. Introduction¶
1.1 The Cardiovascular Disease Burden¶
Cardiovascular disease represents the single largest cause of mortality and morbidity worldwide. According to the World Health Organization and the American Heart Association (AHA) 2025 Heart Disease and Stroke Statistics Update:
- 17.9 million deaths annually -- 32% of all global deaths
- 523 million people currently living with cardiovascular disease
- $407 billion in direct and indirect costs in the United States alone (2024)
- Ischemic heart disease and stroke are the top two causes of death globally, responsible for a combined 15.2 million deaths per year
- Heart failure affects 64.3 million people globally, with 5-year mortality rates exceeding 50%
- Atrial fibrillation prevalence has reached 59 million cases worldwide, with a projected 72 million by 2030
Despite these staggering numbers, cardiovascular AI adoption in clinical practice remains in early stages. A 2025 survey by the American College of Cardiology (ACC) found that while 89% of cardiologists believe AI will transform their practice, only 23% currently use AI-assisted tools in routine clinical decision-making. The gap between potential and adoption is driven by three factors: fragmented data, prohibitive cost, and lack of integrated systems.
1.2 The Opportunity for Integrated Cardiovascular Intelligence¶
The cardiovascular domain is uniquely suited for an integrated AI intelligence agent because:
-
Multi-modal data convergence: Cardiology routinely integrates imaging (echo, CT, MRI, nuclear), electrophysiology (ECG, Holter), hemodynamics (catheterization), biomarkers (troponin, BNP, lipids), genomics (familial cardiomyopathy, channelopathies), and clinical risk scores -- all of which must be synthesized for optimal patient care.
-
Established clinical guidelines: ACC/AHA guidelines provide structured frameworks for risk stratification, treatment decisions, and follow-up protocols that can be encoded into clinical decision support logic.
-
Quantitative measurement-rich: Unlike many medical specialties, cardiology generates highly quantitative data (ejection fraction percentages, valve gradients in mmHg, calcium scores in Agatston units, QTc intervals in milliseconds) that is well-suited for structured RAG retrieval and comparison.
-
Strong cross-modal triggers: A cardiac imaging finding (e.g., unexplained left ventricular hypertrophy) frequently triggers genomic workup (hypertrophic cardiomyopathy gene panel), which may lead to drug therapy optimization -- a natural fit for the HCLS AI Factory's three-stage pipeline.
-
Large addressable market: Every health system has a cardiology department. Cardiovascular AI is projected to reach $2.8 billion by 2028 (CAGR 42.3%), making this the highest-value vertical for an intelligence agent.
1.3 Our Contribution¶
This paper presents the complete architectural blueprint and product requirements for the Cardiology Intelligence Agent, the sixth domain-specific intelligence agent in the HCLS AI Factory platform. Our contributions include:
- A 12-collection Milvus vector schema designed for the full spectrum of cardiovascular data: imaging, electrophysiology, hemodynamics, heart failure, valvular disease, preventive cardiology, interventional procedures, cardio-oncology, guidelines, trials, devices, and literature
- Eight reference clinical workflows covering coronary artery disease, heart failure, valvular disease, arrhythmia, cardiac MRI, stress testing, preventive risk stratification, and cardio-oncology surveillance
- A cardiology knowledge graph with structured data on 30+ cardiovascular conditions, 15+ imaging modalities/protocols, 20+ cardiac biomarkers, 25+ drug classes, and 50+ ACC/AHA guideline recommendations
- Cross-modal triggers linking cardiac imaging findings to genomic workup (familial hypercholesterolemia, cardiomyopathy gene panels, channelopathies) via the shared
genomic_evidencecollection - Clinical decision support engines implementing validated risk calculators (ASCVD, HEART, CHA₂DS₂-VASc, HAS-BLED, MAGGIC, EuroSCORE II)
- A comprehensive product requirements document with user stories, acceptance criteria, and implementation prioritization
- Deployment on a single NVIDIA DGX Spark ($3,999), maintaining the platform's commitment to accessible AI
2. The Cardiovascular Data Challenge¶
2.1 Data Fragmentation in Cardiovascular Medicine¶
Cardiovascular clinical practice generates data across at least fifteen distinct categories, each with its own structure, vocabulary, source systems, and update cadences:
-
Cardiac Imaging Literature -- PubMed abstracts, JACC imaging supplements, Circulation reviews, conference proceedings (ACC, AHA, ESC, SCMR, SCCT, ASE). Over 4,500 publications in 2025 alone on cardiovascular AI.
-
Echocardiography Data -- Chamber dimensions (LVIDd, LVIDs, LA volume), systolic function (LVEF, GLS, TAPSE, S'), diastolic function (E/A ratio, E/e', deceleration time), valve assessments (gradients, regurgitant volumes, effective orifice area), strain imaging (longitudinal, circumferential, radial). Structured per ASE guidelines.
-
Cardiac CT Data -- Coronary artery calcium scores (Agatston, volume, mass scores), CTA stenosis grading (CAD-RADS 0-5), plaque characterization (calcified, non-calcified, mixed, vulnerable), fractional flow reserve from CT (FFR-CT), myocardial perfusion CT, cardiac structural assessment.
-
Cardiac MRI Data -- Volumes and function (LVEF, RVEF, indexed volumes), tissue characterization (T1 mapping, T2 mapping, ECV), late gadolinium enhancement (LGE) patterns (ischemic vs non-ischemic), perfusion (stress, rest, MPR), feature tracking strain, 4D flow, parametric mapping.
-
Nuclear Cardiology -- SPECT MPI (stress/rest perfusion, TID ratio, transient ischemic dilation), PET MPI (MBF quantification, CFR), MUGA (LVEF for cardio-oncology), cardiac amyloid scintigraphy (Tc-99m PYP, DPD), FDG-PET for sarcoidosis/endocarditis.
-
Electrophysiology Data -- 12-lead ECG interpretation (rhythm, intervals, morphology, axis, ischemic changes), Holter/event monitor data (arrhythmia burden, HR variability), device interrogation (ICD, pacemaker, CRT), EP study results (ablation lesion sets, activation mapping), QTc monitoring.
-
Hemodynamic Data -- Right heart catheterization (PA pressures, wedge pressure, cardiac output, PVR, SVR), left heart catheterization (LVEDP, aortic valve gradient), coronary angiography (TIMI flow, stenosis percentage, FFR/iFR), structural intervention hemodynamics.
-
Heart Failure Data -- NYHA classification, ACC/AHA staging (A-D), biomarkers (NT-proBNP, BNP, hs-troponin), GDMT titration (beta-blocker, ACEi/ARB/ARNI, MRA, SGLT2i), device therapy (ICD, CRT), transplant evaluation (MELD-XI, SHFM, HFSS), LVAD management.
-
Valvular Heart Disease -- Severity grading (mild/moderate/severe per ASE criteria), hemodynamic quantification (EOA, DVI, PHT, regurgitant fraction), intervention criteria (surgical AVR vs TAVR, mitral repair vs replacement), surveillance protocols.
-
Preventive Cardiology -- Lipid panels (LDL-C, HDL-C, TG, Lp(a), ApoB), risk calculators (PCE/ASCVD, HEART, MESA, Reynolds), statin eligibility, PCSK9i criteria, coronary calcium for risk reclassification, inflammatory markers (hsCRP, IL-6), metabolic parameters.
-
Interventional Cardiology -- PCI procedural data (stent type, vessel, lesion characteristics, TIMI flow), structural intervention data (TAVR, MitraClip, WATCHMAN, PFO closure), complications, antiplatelet/anticoagulant management, CTO techniques.
-
Cardio-Oncology -- Baseline cardiac assessment protocols (pre-anthracycline, pre-immunotherapy, pre-targeted therapy), surveillance schedules (GLS monitoring, troponin trends), cardiotoxicity detection (CTRCD definitions), risk scores (HFA-ICOS), cardioprotective strategies.
-
Clinical Trials -- ClinicalTrials.gov cardiovascular entries (16,000+ active/completed), landmark trial results (PARADIGM-HF, DAPA-HF, EMPEROR-Reduced, ISCHEMIA, REVIVED-BCIS2), outcome data, subgroup analyses.
-
Cardiovascular Devices -- FDA-cleared AI/ML cardiac devices (ECG interpretation, echo measurement, CT calcium scoring, arrhythmia detection), implantable devices (ICDs, pacemakers, CRT, LVAD), wearable cardiac monitors (smartwatch ECG, continuous glucose monitors for cardiometabolic risk).
-
Cardiovascular Genomics -- Familial hypercholesterolemia (LDLR, PCSK9, APOB), hypertrophic cardiomyopathy (MYH7, MYBPC3, TNNT2, TNNI3), dilated cardiomyopathy (TTN, LMNA, RBM20), arrhythmogenic cardiomyopathy (PKP2, DSP, DSG2), channelopathies (SCN5A, KCNQ1, KCNH2, RYR2), aortopathies (FBN1, TGFBR1/2, SMAD3, ACTA2).
2.2 Why Existing Tools Fall Short¶
Current approaches to cardiovascular intelligence fail to address this fragmentation:
| Approach | Limitation |
|---|---|
| PubMed search | Keyword-based; misses semantic connections; no cross-modal integration; no structured clinical data |
| UpToDate / DynaMed | Expert-curated but static; no patient-specific reasoning; no imaging or genomic data integration; subscription-based |
| Commercial CVIS (Solas, Lumedx) | Vendor-locked; limited to institutional data; no literature integration; $500K-$2M+ implementation |
| EHR-integrated CDS (Epic BestPractice) | Rule-based; cannot synthesize unstructured evidence; no imaging AI; limited to institutional data |
| General AI assistants | No citation provenance; hallucination risk; no structured cardiovascular data; not FDA-aligned |
| Imaging-only AI (Arterys, HeartFlow) | Single-modality; no genomic integration; no guideline reasoning; cloud-dependent; expensive per-scan pricing |
The Cardiology Intelligence Agent addresses all six limitations simultaneously by combining multi-collection vector search, cross-modal genomic triggers, validated clinical decision support logic, and guideline-grounded LLM synthesis -- all on a $3,999 desktop device.
2.3 The Case for Multi-Collection RAG in Cardiology¶
A cardiologist evaluating a patient with new-onset heart failure must simultaneously consider:
- Imaging data: Echocardiographic LVEF, cardiac MRI showing late gadolinium enhancement pattern (ischemic vs non-ischemic)
- Electrophysiology: ECG showing LBBB morphology (CRT candidacy), QTc for drug safety
- Biomarkers: NT-proBNP trend, hs-troponin for myocardial injury, iron studies for IV iron candidacy
- Genomics: If age < 50 or family history, cardiomyopathy gene panel (TTN, LMNA, MYH7)
- Guidelines: ACC/AHA heart failure guidelines for GDMT initiation sequence
- Trials: DAPA-HF, EMPEROR-Reduced for SGLT2i evidence; PARADIGM-HF for ARNI
- Risk scores: MAGGIC score for prognosis, Seattle Heart Failure Model for transplant timing
No existing tool synthesizes all seven data dimensions into a single clinical narrative. A multi-collection RAG architecture -- where each data dimension has its own optimized Milvus collection with domain-specific schema fields -- enables parallel retrieval across all dimensions with a single query, followed by LLM synthesis into a coherent clinical recommendation.
3. Clinical Landscape and Market Analysis¶
3.1 Cardiovascular AI Market¶
The global cardiovascular AI market demonstrates exceptional growth dynamics:
| Metric | Value | Source |
|---|---|---|
| Market size (2024) | $1.4 billion | Grand View Research |
| Projected size (2028) | $2.8 billion | Grand View Research |
| CAGR (2024-2028) | 42.3% | Grand View Research |
| FDA-cleared cardiac AI devices (cumulative) | 180+ | FDA AI/ML database |
| Active cardiovascular AI clinical trials | 450+ | ClinicalTrials.gov |
| Annual CV AI publications | 4,500+ | PubMed (2025) |
| US cardiology practices | 25,000+ | ACC Census |
| US cardiologists | 35,000+ | AHA Statistics |
| Global cardiologists | 200,000+ | WHO estimates |
3.2 Competitive Analysis¶
| Competitor | Strengths | Gaps |
|---|---|---|
| HeartFlow (FFR-CT) | FDA-cleared, clinical validation | Single modality (CT), no genomics, cloud-only, $1,100/scan |
| Eko Health (ECG + auscultation) | Point-of-care, stethoscope integration | Limited to murmur/arrhythmia screening, no imaging AI |
| Viz.ai (Stroke + cardiac) | Real-time triage, HIPAA-compliant | Narrow scope (LVO stroke, PE), SaaS pricing |
| Ultromics (EchoGo) | FDA-cleared echo AI, GLS analysis | Echo-only, no cross-modal integration |
| Cleerly (Coronary CTA) | Plaque quantification, FDA pathway | CT-only, cloud-only, per-patient pricing |
| Caption Health (Echo guidance) | AI-guided acquisition, GE partnership | Acquisition guidance only, no interpretation |
| Tempus (Cardiology platform) | Multi-modal data, genomics integration | Proprietary, expensive, cloud-dependent |
Our differentiation: The Cardiology Intelligence Agent is the only system that combines (1) multi-modal imaging AI, (2) genomic integration, (3) literature RAG, (4) guideline-aligned clinical decision support, (5) validated risk calculators, and (6) on-device deployment -- all in an open-source, $3,999 package. No competitor addresses more than two of these six dimensions.
3.3 Target Users¶
| User Segment | Use Case | Pain Point Addressed |
|---|---|---|
| Community cardiologists | Evidence-based decision support | Limited access to sub-specialty expertise |
| Academic medical centers | Research and education | Fragmented data across systems |
| Heart failure programs | GDMT optimization, transplant evaluation | Manual chart review for risk stratification |
| Structural heart teams | TAVR/MitraClip planning | Multi-modal data synthesis |
| Cardio-oncology clinics | Cardiotoxicity surveillance | No integrated monitoring platform |
| Preventive cardiology | Risk stratification, statin eligibility | Calculator fatigue, guideline complexity |
| Clinical trial sites | Patient screening, endpoint adjudication | Manual eligibility assessment |
| Cardiovascular genomics | Variant interpretation in cardiac context | Limited cardiac-specific annotation |
4. Existing HCLS AI Factory Architecture¶
4.1 Platform Overview¶
The HCLS AI Factory is a three-stage precision medicine platform running on NVIDIA DGX Spark:
Stage 1: Genomics Pipeline (Parabricks + DeepVariant)
FASTQ → VCF → 3.56M annotated variants
|
Stage 2: RAG/Chat Pipeline (Milvus + Claude)
Variant interpretation, clinical significance
|
Stage 3: Drug Discovery Pipeline (BioNeMo + DiffDock)
Target → Lead compound → Docking → Drug-likeness
Five intelligence agents extend this core platform with domain-specific knowledge:
| Agent | Collections | Seed Vectors | Unique Capability |
|---|---|---|---|
| Precision Biomarker | 11 | 6,134 | Biological age calculators, biomarker panels |
| Precision Oncology | 10 | 609 | Molecular tumor board packets, trial matching |
| CAR-T Intelligence | 11 | 6,266 | CAR construct comparison, manufacturing optimization |
| Imaging Intelligence | 10 | 876 | NIM inference, DICOM workflows, 3D segmentation |
| Autoimmune Intelligence | 10 | ~500 | Autoantibody panels, flare prediction |
4.2 Shared Infrastructure¶
All agents share:
- Milvus 2.4 vector database (IVF_FLAT, COSINE, 384-dim)
- BGE-small-en-v1.5 embedding model (sentence-transformers)
- Claude Sonnet 4.6 (Anthropic) primary LLM
genomic_evidencecollection (3,561,170 variants, read-only)- Docker Compose orchestration
- FastAPI (REST) + Streamlit (UI) pattern
- lib/hcls_common shared library (23 modules)
4.3 Proven Patterns¶
The Cardiology Intelligence Agent will leverage battle-tested patterns from existing agents:
| Pattern | Proven In | Adaptation for Cardiology |
|---|---|---|
| Multi-collection parallel search | All 11 agents | 12 cardiology-specific collections |
| Knowledge graph augmentation | CAR-T, Biomarker | Cardiac conditions, drug classes, risk factors |
| Query expansion maps | CAR-T (12 maps), Biomarker | Cardiology terminology (e.g., "MI" → "myocardial infarction", "STEMI", "NSTEMI", "ACS") |
| Comparative analysis | CAR-T, Imaging | "TAVR vs SAVR", "Amiodarone vs Sotalol" |
| Cross-modal genomic triggers | Imaging (Lung-RADS → EGFR) | Imaging finding → cardiomyopathy gene panel |
| FHIR R4 export | Imaging | DiagnosticReport with cardiac SNOMED/LOINC codes |
| NIM inference workflows | Imaging (4 NIMs) | Cardiac-specific NIM models |
| Sidebar guided tour | Imaging | Cardiology demo flow |
| Pre-filled example queries | Imaging, Biomarker | Cardiology-specific starter questions |
5. Cardiology Intelligence Agent Architecture¶
5.1 System Diagram¶
+==========================================================================+
| PRESENTATION: Streamlit Cardiology Workbench (8536) |
| 10 Tabs | Evidence | Workflows | Imaging | Risk Calcs |
| FastAPI REST Server (8526) |
+==========================================================================+
| |
+==========================================================================+
| INTELLIGENCE: Cardiology RAG Engine |
| 12-collection parallel search |
| Knowledge graph (30 conditions, 20 biomarkers) |
| Query expansion (15 maps, cardiology terminology) |
| Comparative analysis ("TAVR vs SAVR") |
| Risk calculator engine (6 validated scores) |
| GDMT optimization engine |
+==========================================================================+
| |
+==========================================================================+
| INFERENCE: NIM Services (VISTA-3D, MAISI, VILA-M3, Llama-3) |
| 8 Clinical Workflows: |
| CAD | HF | Valve | Arrhythmia | CMR | Stress | |
| Prevention | Cardio-Onc |
+==========================================================================+
| |
+==========================================================================+
| DATA: Milvus 2.4 (12 cardiology collections + genomic) |
| BGE-small-en-v1.5 (384-dim, IVF_FLAT, COSINE) |
| PubMed, ClinicalTrials.gov, ACC/AHA guidelines |
| Curated seed data (echo, CT, MRI, ECG, hemodynamics) |
+==========================================================================+
5.2 Design Principles¶
- Guideline-first reasoning: Every recommendation traces to ACC/AHA/ESC guideline evidence levels (Class I/IIa/IIb/III, LOE A/B/C)
- Quantitative precision: Cardiac measurements retain units and reference ranges (LVEF 55-70%, E/e' < 14, QTc < 470ms)
- Cross-modal by default: Every significant finding is checked against genomic context
- Risk-stratified output: Severity badges and urgency routing aligned with clinical acuity
- Graceful degradation: Full functionality in mock mode without GPU or live NIM services
- Familiar patterns: Follows the same FastAPI + Streamlit + Milvus patterns as existing agents
5.3 Port Allocation¶
| Port | Service |
|---|---|
| 8526 | FastAPI REST Server |
| 8536 | Streamlit Cardiology Workbench |
| 19530 | Milvus (shared) |
| 8520 | NIM LLM (shared) |
| 8530 | NIM VISTA-3D (shared) |
| 8531 | NIM MAISI (shared) |
| 8532 | NIM VILA-M3 (shared) |
6. Milvus Collection Design¶
6.1 Index Configuration¶
| Parameter | Value |
|---|---|
| Index type | IVF_FLAT |
| Metric | COSINE |
| nlist / nprobe | 1024 / 16 |
| Dimension | 384 |
| Embedding model | BAAI/bge-small-en-v1.5 |
6.2 Collection Schemas¶
Collection 1: cardio_literature -- ~3,000 records¶
Published cardiovascular research papers, reviews, and meta-analyses.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | PubMed ID or unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| title | VARCHAR(500) | Paper title |
| text_chunk | VARCHAR(8000) | Abstract or text section |
| year | INT16 | Publication year |
| journal | VARCHAR(200) | Journal name (JACC, Circulation, EHJ, etc.) |
| cv_domain | VARCHAR(100) | Cardiovascular subdomain (imaging, EP, HF, valve, prevention) |
| modality | VARCHAR(50) | Imaging modality if applicable |
| study_type | VARCHAR(50) | RCT, meta-analysis, cohort, case-control, review |
| keywords | VARCHAR(500) | MeSH terms and author keywords |
Source: PubMed E-utilities with cardiovascular MeSH filters.
Collection 2: cardio_trials -- ~500 records¶
Cardiovascular clinical trials from ClinicalTrials.gov and landmark trial results.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | NCT number or trial identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| title | VARCHAR(500) | Official trial title |
| text_summary | VARCHAR(4000) | Trial summary including results |
| phase | VARCHAR(20) | Phase I-IV |
| status | VARCHAR(30) | Active, completed, recruiting |
| sponsor | VARCHAR(200) | Lead sponsor |
| cv_domain | VARCHAR(100) | CAD, HF, valve, arrhythmia, prevention |
| intervention | VARCHAR(300) | Drug, device, or procedure tested |
| primary_endpoint | VARCHAR(300) | Primary outcome measure |
| enrollment | INT32 | Number of participants |
| start_year | INT16 | Year trial began |
| outcome_summary | VARCHAR(2000) | Key results (if completed) |
| landmark | BOOL | Is this a landmark trial (PARADIGM-HF, DAPA-HF, etc.) |
Source: ClinicalTrials.gov V2 API with cardiovascular condition filters.
Collection 3: cardio_imaging -- ~200 records¶
Cardiac imaging protocols, findings, and measurements across all modalities.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Finding or protocol description |
| modality | VARCHAR(50) | echo, cardiac_ct, cardiac_mri, nuclear, cath |
| imaging_type | VARCHAR(100) | TTE, TEE, stress_echo, CTA, calcium_score, etc. |
| finding_category | VARCHAR(100) | Chamber size, systolic function, diastolic function, valve, plaque, LGE, perfusion |
| measurement_name | VARCHAR(100) | LVEF, GLS, E/e', calcium_score, stenosis_pct |
| measurement_value | VARCHAR(50) | Numeric value with units |
| reference_range | VARCHAR(100) | Normal range per guidelines |
| severity | VARCHAR(20) | Normal, mild, moderate, severe |
| guideline_source | VARCHAR(100) | ASE, SCCT, SCMR, ASNC |
| clinical_significance | VARCHAR(500) | Interpretation guidance |
Source: Curated from ASE/SCCT/SCMR/ASNC guideline documents and reference texts.
Collection 4: cardio_electrophysiology -- ~150 records¶
ECG interpretation, arrhythmia classification, and electrophysiology data.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | ECG pattern or arrhythmia description |
| ecg_pattern | VARCHAR(100) | LBBB, RBBB, LVH, ST_elevation, AF, VT, etc. |
| rhythm | VARCHAR(50) | Sinus, AF, flutter, SVT, VT, VF |
| rate_category | VARCHAR(30) | Bradycardia, normal, tachycardia |
| interval | VARCHAR(50) | PR, QRS, QT, QTc |
| interval_value_ms | FLOAT | Numeric interval value |
| clinical_significance | VARCHAR(500) | What this finding means |
| urgency | VARCHAR(20) | Routine, urgent, emergent |
| differential_diagnosis | VARCHAR(500) | DDx list |
| management | VARCHAR(500) | Recommended next steps |
Source: Curated from ACC/AHA/HRS guidelines and electrophysiology references.
Collection 5: cardio_heart_failure -- ~150 records¶
Heart failure classification, GDMT protocols, and management algorithms.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | HF management recommendation |
| hf_type | VARCHAR(30) | HFrEF, HFmrEF, HFpEF |
| acc_aha_stage | VARCHAR(5) | A, B, C, D |
| nyha_class | VARCHAR(5) | I, II, III, IV |
| drug_class | VARCHAR(100) | Beta-blocker, ARNI, MRA, SGLT2i, hydralazine/nitrate |
| drug_name | VARCHAR(100) | Specific medication |
| target_dose | VARCHAR(50) | Goal dose per guidelines |
| titration_protocol | VARCHAR(500) | How to uptitrate |
| evidence_level | VARCHAR(20) | Class I/IIa/IIb/III, LOE A/B/C |
| landmark_trial | VARCHAR(100) | Supporting trial name |
| contraindications | VARCHAR(500) | When not to use |
Source: ACC/AHA/HFSA 2022 Heart Failure Guidelines, ESC 2023 update.
Collection 6: cardio_valvular -- ~120 records¶
Valvular heart disease assessment, severity grading, and intervention criteria.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Valve assessment or recommendation |
| valve | VARCHAR(30) | Aortic, mitral, tricuspid, pulmonic |
| pathology | VARCHAR(50) | Stenosis, regurgitation, prolapse, endocarditis |
| severity | VARCHAR(20) | Mild, moderate, severe |
| quantitative_criteria | VARCHAR(500) | EOA, gradient, regurgitant volume, EROA, vena contracta |
| intervention_criteria | VARCHAR(500) | When to intervene per guidelines |
| intervention_type | VARCHAR(100) | SAVR, TAVR, mitral_repair, MitraClip, TMVr |
| sts_risk_threshold | VARCHAR(50) | Low (<4%), intermediate (4-8%), high (>8%) |
| evidence_level | VARCHAR(20) | Class/LOE |
| surveillance_protocol | VARCHAR(500) | Follow-up echo frequency |
Source: ACC/AHA 2020 Valvular Heart Disease Guidelines, ESC 2021 VHD Guidelines.
Collection 7: cardio_prevention -- ~150 records¶
Preventive cardiology: risk stratification, lipid management, and lifestyle intervention.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Prevention recommendation |
| risk_category | VARCHAR(50) | Low, borderline, intermediate, high, very_high |
| risk_calculator | VARCHAR(50) | PCE/ASCVD, MESA, Framingham, Reynolds |
| biomarker | VARCHAR(100) | LDL-C, Lp(a), ApoB, hsCRP, CAC |
| target_value | VARCHAR(50) | LDL < 70, LDL < 55 (very high risk) |
| therapy_class | VARCHAR(100) | Statin, ezetimibe, PCSK9i, bempedoic acid, inclisiran |
| therapy_name | VARCHAR(100) | Specific medication |
| evidence_level | VARCHAR(20) | Class/LOE |
| guideline_source | VARCHAR(100) | ACC/AHA 2018 Cholesterol, ESC 2019 Dyslipidemia |
| lifestyle_intervention | VARCHAR(500) | Diet, exercise, smoking, weight management |
Source: ACC/AHA 2019 Prevention Guidelines, 2018 Cholesterol Guidelines.
Collection 8: cardio_interventional -- ~100 records¶
Interventional cardiology procedures, techniques, and outcomes data.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Procedure description or outcome data |
| procedure_type | VARCHAR(100) | PCI, TAVR, MitraClip, WATCHMAN, PFO_closure, CTO_PCI |
| indication | VARCHAR(200) | Clinical indication |
| technique | VARCHAR(300) | Specific approach (radial, femoral, antegrade, retrograde) |
| device | VARCHAR(200) | Stent, valve, clip, occluder type |
| success_rate | VARCHAR(50) | Procedural success rate |
| complication_rate | VARCHAR(50) | Major adverse event rate |
| antithrombotic_protocol | VARCHAR(500) | Antiplatelet/anticoagulant regimen |
| evidence_level | VARCHAR(20) | Class/LOE |
| landmark_trial | VARCHAR(100) | Supporting trial |
Source: SCAI guidelines, ACC/AHA PCI and VHD guidelines.
Collection 9: cardio_oncology -- ~100 records¶
Cardio-oncology surveillance, cardiotoxicity detection, and cardioprotection.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Cardio-oncology recommendation |
| cancer_therapy | VARCHAR(100) | Anthracycline, trastuzumab, ICI, TKI, radiation |
| cardiotoxicity_type | VARCHAR(100) | CTRCD, myocarditis, QTc_prolongation, HTN, VTE |
| monitoring_protocol | VARCHAR(500) | Echo frequency, GLS thresholds, troponin schedule |
| gls_threshold | VARCHAR(50) | >15% relative decline = subclinical toxicity |
| risk_score | VARCHAR(50) | HFA-ICOS baseline risk |
| cardioprotective_strategy | VARCHAR(500) | Dexrazoxane, beta-blocker, ACEi, statin |
| when_to_hold | VARCHAR(500) | Criteria to hold/discontinue cancer therapy |
| evidence_level | VARCHAR(20) | Class/LOE |
| guideline_source | VARCHAR(100) | ESC 2022 Cardio-Oncology, ASCO |
Source: ESC 2022 Cardio-Oncology Guidelines, ASCO Cardiovascular Toxicity Guidelines.
Collection 10: cardio_devices -- ~80 records¶
FDA-cleared cardiovascular AI devices and implantable cardiac devices.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Device description and capabilities |
| device_name | VARCHAR(200) | Commercial name |
| manufacturer | VARCHAR(200) | Company |
| device_category | VARCHAR(50) | AI_software, implantable, wearable |
| modality | VARCHAR(50) | ECG, echo, CT, MRI, wearable |
| clinical_task | VARCHAR(100) | Detection, measurement, prediction |
| regulatory_status | VARCHAR(50) | FDA_cleared, CE_marked, investigational |
| clearance_date | VARCHAR(20) | Approval date |
| performance_summary | VARCHAR(500) | Sensitivity, specificity, AUC |
| intended_use | VARCHAR(500) | FDA-labeled intended use statement |
Source: FDA AI/ML-Enabled Medical Device database, company filings.
Collection 11: cardio_guidelines -- ~150 records¶
ACC/AHA/ESC/HRS clinical practice guidelines and focused updates.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Guideline recommendation |
| guideline_name | VARCHAR(300) | Full guideline title |
| organization | VARCHAR(50) | ACC/AHA, ESC, HRS, SCAI, ASE |
| year | INT16 | Publication year |
| cv_domain | VARCHAR(100) | CAD, HF, valve, arrhythmia, prevention |
| recommendation_class | VARCHAR(10) | I, IIa, IIb, III |
| level_of_evidence | VARCHAR(5) | A, B-R, B-NR, C-LD, C-EO |
| key_recommendation | VARCHAR(2000) | Specific recommendation text |
| clinical_scenario | VARCHAR(500) | When this recommendation applies |
| related_guidelines | VARCHAR(300) | Cross-references to other guidelines |
Source: ACC/AHA Practice Guidelines library, ESC Clinical Practice Guidelines.
Collection 12: cardio_hemodynamics -- ~80 records¶
Hemodynamic data: catheterization, pressure tracings, and derived calculations.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Unique identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Hemodynamic scenario or reference |
| measurement_type | VARCHAR(100) | PCWP, PA_pressure, CO, SVR, PVR, LVEDP |
| normal_range | VARCHAR(50) | Expected normal values |
| abnormal_pattern | VARCHAR(200) | What elevation/depression indicates |
| associated_conditions | VARCHAR(500) | Clinical conditions causing this pattern |
| calculation_formula | VARCHAR(300) | How to derive (e.g., SVR = 80*(MAP-RAP)/CO) |
| clinical_significance | VARCHAR(500) | Interpretation guidance |
| catheterization_type | VARCHAR(50) | Right heart, left heart, coronary |
Source: Curated from hemodynamic references and catheterization guidelines.
Collection 13: genomic_evidence -- 3,561,170 records (read-only)¶
Shared genomic variant collection from HCLS AI Factory Stage 1+2.
| Field | Type | Description |
|---|---|---|
| id | VARCHAR(64) | Variant identifier |
| embedding | FLOAT_VECTOR(384) | BGE-small-en-v1.5 embedding |
| text_summary | VARCHAR(4000) | Variant annotation |
| gene | VARCHAR(50) | Gene symbol |
| clinical_significance | VARCHAR(50) | Pathogenic, likely pathogenic, VUS, benign |
| disease_associations | VARCHAR(1000) | Associated diseases |
| am_pathogenicity | FLOAT | AlphaMissense pathogenicity score |
Purpose: Cross-modal triggers query this collection for cardiovascular-relevant genes (LDLR, MYH7, TTN, SCN5A, etc.).
6.3 Collection Search Weights¶
| Collection | Weight | Rationale |
|---|---|---|
| Literature | 0.16 | Largest corpus, broadest evidence |
| Guidelines | 0.14 | Highest clinical authority |
| Trials | 0.12 | Primary evidence source |
| Imaging | 0.10 | Multi-modal imaging data |
| Heart Failure | 0.08 | Common, high-impact domain |
| Electrophysiology | 0.08 | Arrhythmia management |
| Valvular | 0.07 | Structural heart disease |
| Prevention | 0.07 | Primary/secondary prevention |
| Hemodynamics | 0.05 | Invasive assessment |
| Interventional | 0.05 | Procedural decision-making |
| Cardio-Oncology | 0.04 | Growing subspecialty |
| Devices | 0.04 | Technology landscape |
6.4 Estimated Vector Counts¶
| Collection | Seed Records | Post-Ingest Target |
|---|---|---|
| cardio_literature | 200 | 3,000+ (PubMed ingest) |
| cardio_trials | 50 | 500+ (ClinicalTrials.gov ingest) |
| cardio_imaging | 200 | 200 |
| cardio_electrophysiology | 150 | 150 |
| cardio_heart_failure | 150 | 150 |
| cardio_valvular | 120 | 120 |
| cardio_prevention | 150 | 150 |
| cardio_interventional | 100 | 100 |
| cardio_oncology | 100 | 100 |
| cardio_devices | 80 | 80 |
| cardio_guidelines | 150 | 150 |
| cardio_hemodynamics | 80 | 80 |
| Total (owned) | ~1,530 | ~4,780+ |
| genomic_evidence (read-only) | -- | 3,561,170 |
7. Clinical Workflows¶
7.1 Workflow Architecture¶
All workflows follow the established BaseImagingWorkflow pattern: preprocess → infer → postprocess → WorkflowResult. Each workflow supports full mock mode with clinically realistic synthetic results.
7.2 Eight Reference Workflows¶
Workflow 1: Coronary Artery Disease Assessment¶
| Attribute | Value |
|---|---|
| Workflow ID | coronary_artery_disease |
| Input | Coronary CTA, calcium score CT, or catheterization data |
| Target Latency | < 3 minutes |
| Models | VISTA-3D (coronary segmentation), vessel analysis pipeline |
| Key Outputs | CAD-RADS classification (0-5), Agatston calcium score, per-vessel stenosis grading, plaque characterization, FFR-CT estimate |
| Severity Routing | CAD-RADS 4-5 → Urgent cardiology consult |
| Cross-Modal Trigger | CAD-RADS 4+ with age < 55 → FH gene panel (LDLR, PCSK9, APOB) |
| Guideline Alignment | ACC/AHA 2021 Chest Pain Guidelines, SCCT CAD-RADS 2.0 |
Clinical Decision Logic:
Calcium Score:
0 → Very low risk, no further imaging needed
1-99 → Low risk, consider statin, lifestyle
100-399 → Moderate risk, statin indicated, stress test if symptoms
≥400 → High risk, coronary CTA or functional testing
CAD-RADS:
0 → No plaque, no stenosis
1 → 1-24% stenosis, minimal
2 → 25-49%, mild
3 → 50-69%, moderate → consider functional testing
4A → 70-99% → catheterization or FFR-CT
4B → Left main ≥ 50% or 3-vessel ≥ 70% → surgical evaluation
5 → Total occlusion → viability assessment
Plaque features → High-risk if:
- Low-attenuation plaque (< 30 HU)
- Positive remodeling (RI > 1.1)
- Napkin-ring sign
- Spotty calcification
Workflow 2: Heart Failure Classification and GDMT Optimization¶
| Attribute | Value |
|---|---|
| Workflow ID | heart_failure_gdmt |
| Input | Echo LVEF, BNP/NT-proBNP, clinical parameters |
| Target Latency | < 30 seconds |
| Key Outputs | HF classification (HFrEF/HFmrEF/HFpEF), ACC/AHA stage, NYHA class, GDMT recommendations with target doses, MAGGIC score |
| Severity Routing | Stage D or NYHA IV → Advanced HF evaluation |
| Cross-Modal Trigger | Age < 50 + non-ischemic DCM → cardiomyopathy gene panel (TTN, LMNA, MYH7, MYBPC3) |
| Guideline Alignment | ACC/AHA 2022 HF Guidelines, HFSA 2024 Update |
GDMT Optimization Algorithm:
HFrEF (LVEF ≤ 40%):
Pillar 1: Beta-blocker (carvedilol, metoprolol succinate, bisoprolol)
Pillar 2: ARNI (sacubitril/valsartan) or ACEi/ARB
Pillar 3: MRA (spironolactone, eplerenone)
Pillar 4: SGLT2i (dapagliflozin, empagliflozin)
→ If African American + NYHA III-IV: Add hydralazine/isosorbide dinitrate
→ If LVEF ≤ 35% + NYHA II-III + QRS ≥ 150ms LBBB: CRT
→ If LVEF ≤ 35% + ≥ 40 days post-MI or ≥ 90 days on optimal GDMT: ICD
→ If Stage D refractory: LVAD or transplant evaluation
HFpEF (LVEF ≥ 50%):
SGLT2i (Class I, LOE A -- EMPEROR-Preserved, DELIVER)
Diuretics for congestion
Treat comorbidities (HTN, AF, obesity, sleep apnea, CAD)
GLP-1 RA if obesity + HFpEF (STEP-HFpEF)
Workflow 3: Valvular Heart Disease Quantification¶
| Attribute | Value |
|---|---|
| Workflow ID | valvular_disease |
| Input | Echocardiographic measurements, clinical parameters |
| Target Latency | < 30 seconds |
| Key Outputs | Valve severity grading, intervention threshold assessment, SAVR vs TAVR recommendation for AS, STS-PROM risk score |
| Severity Routing | Severe symptomatic AS or MR → Heart team evaluation |
| Cross-Modal Trigger | Bicuspid AV + aortic dilation → aortopathy gene panel (FBN1, TGFBR1/2, SMAD3, ACTA2) |
| Guideline Alignment | ACC/AHA 2020 VHD Guidelines, ESC 2021 VHD Guidelines |
Severity Grading Criteria (Aortic Stenosis):
| Parameter | Mild | Moderate | Severe |
|---|---|---|---|
| Peak velocity (m/s) | 2.0-2.9 | 3.0-3.9 | ≥ 4.0 |
| Mean gradient (mmHg) | < 20 | 20-39 | ≥ 40 |
| AVA (cm²) | > 1.5 | 1.0-1.5 | < 1.0 |
| Indexed AVA (cm²/m²) | > 0.85 | 0.60-0.85 | < 0.60 |
Workflow 4: Arrhythmia Detection and Management¶
| Attribute | Value |
|---|---|
| Workflow ID | arrhythmia_management |
| Input | ECG data, clinical parameters, cardiac imaging |
| Target Latency | < 15 seconds |
| Key Outputs | Rhythm classification, CHA₂DS₂-VASc score (if AF), HAS-BLED score, rate vs rhythm control recommendation, ablation candidacy |
| Severity Routing | VT/VF or complete heart block → Emergent |
| Cross-Modal Trigger | Unexplained VT in young patient → channelopathy panel (SCN5A, KCNQ1, KCNH2, RYR2) |
| Guideline Alignment | ACC/AHA/HRS 2023 AF Guidelines, 2017 VA/SCD Guidelines |
CHA₂DS₂-VASc Calculator:
C Congestive HF (or LVEF ≤ 40%) +1
H Hypertension +1
A₂ Age ≥ 75 +2
D Diabetes mellitus +1
S₂ Stroke/TIA/thromboembolism +2
V Vascular disease (PAD, prior MI) +1
A Age 65-74 +1
Sc Sex category (female) +1
Score 0 (males) or 1 (females): No anticoagulation
Score 1 (males): Consider anticoagulation
Score ≥ 2: Anticoagulation recommended (Class I)
Workflow 5: Cardiac MRI Tissue Characterization¶
| Attribute | Value |
|---|---|
| Workflow ID | cardiac_mri_tissue |
| Input | Cardiac MRI data (cine, T1/T2 mapping, LGE, perfusion) |
| Target Latency | < 5 minutes |
| Models | VISTA-3D (cardiac segmentation), parametric map analysis |
| Key Outputs | Biventricular volumes and function, LGE pattern (ischemic vs non-ischemic), T1/T2/ECV values, perfusion defect quantification, tissue diagnosis |
| Severity Routing | Active myocarditis or severe biventricular failure → Urgent |
| Cross-Modal Trigger | Non-ischemic LGE + LV dysfunction → cardiomyopathy gene panel |
| Guideline Alignment | SCMR 2020 Standardized Imaging Protocols, Lake Louise Criteria |
LGE Pattern Interpretation:
| Pattern | Location | Diagnosis |
|---|---|---|
| Subendocardial / transmural | Coronary territory | Ischemic cardiomyopathy (prior MI) |
| Mid-wall (septal) | Interventricular septum | Dilated cardiomyopathy, myocarditis |
| Epicardial / patchy | Lateral wall | Myocarditis (acute/chronic) |
| Diffuse subendocardial | Global | Cardiac amyloidosis |
| RV insertion point | Anterior/posterior | Pulmonary hypertension, HCM |
| Asymmetric septal | Basal septum | Hypertrophic cardiomyopathy |
| Basal inferolateral | Mid-wall | Cardiac sarcoidosis, Chagas |
| No LGE + elevated T1/ECV | Global | Early-stage amyloidosis or diffuse fibrosis |
Workflow 6: Stress Test Interpretation¶
| Attribute | Value |
|---|---|
| Workflow ID | stress_test_interpretation |
| Input | Exercise or pharmacologic stress test data (ECG, imaging) |
| Target Latency | < 30 seconds |
| Key Outputs | Duke Treadmill Score, ischemia assessment, functional capacity (METs), hemodynamic response, stress imaging interpretation |
| Severity Routing | High-risk DTS or large perfusion defect → Catheterization |
| Guideline Alignment | ACC/AHA 2021 Chest Pain Guidelines, ASNC Stress Testing Standards |
Duke Treadmill Score:
DTS = Exercise time (minutes) - (5 × max ST deviation mm) - (4 × angina index)
Angina index: 0 = none, 1 = non-limiting, 2 = exercise-limiting
Low risk: DTS ≥ +5 (annual mortality < 1%)
Moderate risk: DTS -10 to +4 (annual mortality 1-3%)
High risk: DTS < -10 (annual mortality ≥ 5%)
Workflow 7: Preventive Risk Stratification¶
| Attribute | Value |
|---|---|
| Workflow ID | preventive_risk_stratification |
| Input | Demographics, lipids, BP, diabetes status, smoking, family history |
| Target Latency | < 15 seconds |
| Key Outputs | 10-year ASCVD risk, risk category, statin benefit group, statin intensity recommendation, risk-enhancing factors, CAC-based reclassification guidance |
| Cross-Modal Trigger | 10-year ASCVD ≥ 7.5% + family history of premature CAD → FH gene panel (LDLR, PCSK9, APOB) |
| Guideline Alignment | ACC/AHA 2019 Primary Prevention, 2018 Cholesterol Guidelines |
Statin Decision Framework:
1. Clinical ASCVD? → High-intensity statin (Class I)
- If very high risk (recent ACS, multiple events): Add ezetimibe → if LDL still ≥ 55: Add PCSK9i
2. LDL ≥ 190 mg/dL? → High-intensity statin without risk calculation (Class I)
- Consider FH genetic testing
3. Diabetes, age 40-75? → Moderate-intensity statin (Class I)
- If multiple risk factors or 10y ASCVD ≥ 20%: High-intensity
4. 10-year ASCVD 7.5-19.9%? → Moderate-intensity statin (Class I)
- If risk decision uncertain: Obtain CAC score
- CAC = 0: Withhold statin (unless diabetes, FH, smoking)
- CAC 1-99: Favors statin
- CAC ≥ 100: Statin indicated
5. 10-year ASCVD 5-7.4%? → Consider if risk-enhancing factors present
Risk-enhancing factors: Family hx premature ASCVD, Lp(a) ≥ 50 mg/dL,
hsCRP ≥ 2.0, ABI < 0.9, metabolic syndrome, South Asian ancestry
Workflow 8: Cardio-Oncology Surveillance¶
| Attribute | Value |
|---|---|
| Workflow ID | cardio_oncology_surveillance |
| Input | Cancer therapy regimen, baseline cardiac function, risk factors |
| Target Latency | < 30 seconds |
| Key Outputs | HFA-ICOS baseline risk category, surveillance protocol (echo frequency, biomarker schedule), GLS monitoring thresholds, cardioprotective recommendations, criteria for therapy modification |
| Severity Routing | LVEF decline > 10% to < 50% or GLS decline > 15% → Urgent cardio-oncology consult |
| Guideline Alignment | ESC 2022 Cardio-Oncology Guidelines, ASCO CV Toxicity Guidelines |
Cardiotoxicity Monitoring Protocol:
Anthracycline-based chemotherapy:
Baseline: Echo (LVEF + GLS) + troponin + NT-proBNP
During: Echo every 2 cycles (or after cumulative dose milestones)
After: Echo at 3, 6, 12 months post-completion, then annually
GLS threshold: >15% relative decline from baseline = subclinical cardiotoxicity
LVEF threshold: Decline >10% to below 50% = CTRCD
HER2-targeted therapy (trastuzumab):
Echo every 3 months during therapy
If LVEF 40-49%: Continue with closer monitoring + cardioprotection
If LVEF < 40%: Hold therapy, refer cardio-oncology
Immune checkpoint inhibitors:
Baseline ECG + troponin
Troponin every cycle for first 4 cycles
If troponin rise: Urgent CMR to rule out myocarditis
ICI myocarditis mortality: 25-50% -- requires immediate therapy hold + high-dose steroids
8. Cross-Modal Integration¶
8.1 Cardiovascular Genomics Triggers¶
The Cardiology Intelligence Agent implements cross-modal triggers that automatically query the shared genomic_evidence collection (3.5M variants) when imaging or clinical findings suggest a genetic etiology:
| Trigger Condition | Gene Panel Queried | Clinical Rationale |
|---|---|---|
| Non-ischemic DCM, age < 50 | TTN, LMNA, RBM20, MYH7, MYBPC3, TNNT2, SCN5A, BAG3, PLN, FLNC, DSP | 25-40% of DCM has genetic basis; LMNA carriers need ICD regardless of LVEF |
| Unexplained LVH, septal hypertrophy | MYH7, MYBPC3, TNNT2, TNNI3, TPM1, ACTC1, MYL2, MYL3, GLA (Fabry) | HCM is most common inherited cardiac disease (1:500) |
| Unexplained VT or cardiac arrest | SCN5A, KCNQ1, KCNH2, RYR2, CASQ2, PKP2, DSP, DSG2, DSC2 | Channelopathy or ARVC screen; cascade screening for families |
| Premature CAD (< 55M / < 65F) or LDL > 190 | LDLR, PCSK9, APOB, LDLRAP1, LIPA | FH prevalence 1:250; early detection enables prevention |
| Aortic root dilation > 4.5 cm | FBN1, TGFBR1, TGFBR2, SMAD3, ACTA2, MYH11, PRKG1, LOX | Heritable thoracic aortic disease; surgical thresholds differ by genotype |
| Cardiac amyloid (LGE + elevated T1) | TTR (transthyretin), AL amyloid genes | TTR amyloidosis is treatable with tafamidis; gene-specific therapy |
| Bicuspid aortic valve + coarctation | NOTCH1, GATA4/5/6, SMAD6 | Congenital heart disease genetics |
8.2 Imaging ↔ Genomics ↔ Drug Discovery Pipeline¶
Cardiac Imaging Finding (Echo, CT, MRI)
|
v
[Cross-Modal Trigger] -- Clinical criteria met?
| |
YES NO
| |
v v
[Query genomic_evidence] [Standard RAG response]
(3.5M variant vectors)
|
v
[Variant Annotation]
ClinVar pathogenicity, AlphaMissense score
|
v
[HCLS AI Factory Stage 2: RAG Target ID]
Identify druggable targets from genetic findings
|
v
[HCLS AI Factory Stage 3: Drug Discovery]
BioNeMo molecule generation, DiffDock binding
|
v
[Clinical Output]
FHIR R4 DiagnosticReport with genomic enrichment
8.3 Integration with Other Agents¶
| Integration | Direction | Data Flow |
|---|---|---|
| Imaging Agent | Bidirectional | Shares cardiac CT/MRI workflow results; receives DICOM routing for cardiac studies |
| Precision Biomarker Agent | Inbound | Receives cardiac biomarker reference ranges and age-stratified norms |
| Precision Oncology Agent | Inbound | Receives cancer therapy regimens for cardio-oncology surveillance |
| CAR-T Intelligence Agent | Inbound | Receives CRS-related cardiac toxicity data |
| Genomics Pipeline | Read-only | Queries genomic_evidence for cardiovascular gene variants |
| Drug Discovery Pipeline | Outbound | Sends confirmed genetic targets for compound screening |
9. NIM Integration Strategy¶
9.1 Shared NIM Services¶
The Cardiology Agent will reuse the four NIM services already deployed by the Imaging Intelligence Agent:
| NIM | Port | Cardiology Application |
|---|---|---|
| VISTA-3D | 8530 | Cardiac chamber segmentation, coronary artery segmentation, pericardial fat quantification |
| MAISI | 8531 | Synthetic cardiac CT generation for training, rare pathology simulation |
| VILA-M3 | 8532 | Cardiac image interpretation, echo measurement validation, ECG pattern recognition |
| Llama-3 8B | 8520 | Clinical reasoning fallback when Claude API unavailable |
9.2 VISTA-3D Cardiac Applications¶
VISTA-3D's 132 anatomical classes include cardiac structures suitable for:
- Chamber volumetrics: LV, RV, LA, RA segmentation for volume and EF calculation
- Myocardial segmentation: LV myocardium for mass calculation and wall motion assessment
- Pericardial analysis: Pericardial effusion detection and quantification
- Great vessel assessment: Aortic root and ascending aorta measurement
- Coronary segmentation: Combined with vessel centerline analysis for stenosis grading
9.3 Fallback Logic¶
Request → NIM available? → Yes → Real NIM inference
→ No → Cloud NIM? → Yes → NVIDIA Cloud API
→ No → Mock enabled? → Yes → Synthetic result
→ No → Error
10. Knowledge Graph Design¶
10.1 Graph Structure¶
The cardiology knowledge graph will contain structured entries across six dictionaries:
| Dictionary | Entries | Content |
|---|---|---|
| Cardiovascular Conditions | ~35 | ICD-10 codes, diagnostic criteria, severity classification, imaging characteristics, guideline-based management |
| Cardiac Biomarkers | ~25 | Reference ranges, clinical cutoffs, kinetics, diagnostic/prognostic value |
| Drug Classes | ~30 | Mechanism, target dose, titration, contraindications, landmark trials |
| Risk Calculators | ~8 | Input variables, formula, risk categories, guideline recommendations per category |
| Imaging Protocols | ~20 | Modality, protocol parameters, normal values, abnormal patterns |
| Cardiovascular Genes | ~50 | Gene symbol, chromosome, associated conditions, inheritance, prevalence, clinical actionability |
10.2 Example Knowledge Graph Entries¶
Condition: Hypertrophic Cardiomyopathy (HCM)
{
"name": "Hypertrophic Cardiomyopathy",
"icd10": "I42.1",
"aliases": ["HCM", "HOCM", "IHSS", "hypertrophic obstructive cardiomyopathy"],
"prevalence": "1:500 (most common inherited cardiac disease)",
"inheritance": "Autosomal dominant, variable penetrance",
"diagnostic_criteria": {
"wall_thickness": "≥ 15 mm in any segment (or ≥ 13 mm with family history)",
"lvot_gradient": "≥ 30 mmHg at rest or provocation = obstructive",
"mri_lge": "Patchy mid-wall, RV insertion points, extensive LGE = worse prognosis"
},
"risk_stratification": {
"model": "ACC/AHA 2024 HCM Guidelines SCD Risk Calculator",
"high_risk_features": [
"Family history of SCD from HCM",
"Unexplained syncope",
"Massive LVH (≥ 30 mm)",
"NSVT on Holter",
"Abnormal BP response to exercise",
"Extensive LGE (≥ 15% LV mass)",
"LVEF < 50%",
"Apical aneurysm"
]
},
"genes": ["MYH7", "MYBPC3", "TNNT2", "TNNI3", "TPM1", "ACTC1", "MYL2", "MYL3"],
"treatment": {
"medical": "Beta-blockers (first-line), verapamil, disopyramide, mavacamten",
"interventional": "Septal myectomy (surgical) or alcohol septal ablation",
"device": "ICD if high SCD risk"
},
"imaging_modalities": ["echo", "cardiac_mri", "cardiac_ct"],
"cross_modal_trigger": True
}
Biomarker: NT-proBNP
{
"name": "NT-proBNP",
"full_name": "N-terminal pro-B-type natriuretic peptide",
"loinc_code": "33762-6",
"reference_ranges": {
"age_lt_50": {"normal": "< 300 pg/mL", "grey_zone": "300-450", "elevated": "> 450"},
"age_50_75": {"normal": "< 300 pg/mL", "grey_zone": "300-900", "elevated": "> 900"},
"age_gt_75": {"normal": "< 300 pg/mL", "grey_zone": "300-1800", "elevated": "> 1800"}
},
"clinical_use": "Heart failure diagnosis and prognosis, GDMT titration monitoring",
"kinetics": "Half-life ~120 min, less affected by obesity than BNP, not cleared by neprilysin",
"confounders": ["Renal dysfunction (↑)", "Obesity (↓)", "AF (↑)", "Pulmonary disease (↑)"],
"guideline_thresholds": {
"rule_out_hf": "< 300 pg/mL (NPV > 98%)",
"hospitalized_hf": "Admission > 1000 associated with higher mortality",
"gdmt_success": "> 30% reduction on optimal therapy"
}
}
11. Query Expansion and Retrieval Strategy¶
11.1 Cardiology-Specific Query Expansion Maps¶
Fifteen domain-specific expansion maps will map cardiovascular terminology to related terms:
| Map | Keywords → Terms | Example |
|---|---|---|
| Coronary Disease | 25 → 180 | "heart attack" → myocardial infarction, MI, STEMI, NSTEMI, ACS, acute coronary syndrome, troponin elevation |
| Heart Failure | 20 → 150 | "weak heart" → heart failure, HFrEF, HFpEF, reduced ejection fraction, systolic dysfunction, LVEF |
| Arrhythmia | 20 → 140 | "irregular heartbeat" → atrial fibrillation, AF, AFib, arrhythmia, flutter, SVT, palpitations |
| Valvular | 15 → 100 | "heart valve" → aortic stenosis, AS, mitral regurgitation, MR, TAVR, valve replacement |
| Lipids | 15 → 100 | "cholesterol" → LDL-C, HDL-C, triglycerides, statin, PCSK9, hyperlipidemia, dyslipidemia |
| Imaging Echo | 15 → 90 | "heart ultrasound" → echocardiogram, echo, TTE, TEE, LVEF, GLS, diastolic function |
| Imaging CT | 10 → 70 | "heart scan" → cardiac CT, CTA, calcium score, CAD-RADS, coronary CTA |
| Imaging MRI | 10 → 70 | "cardiac MRI" → CMR, LGE, T1 mapping, ECV, tissue characterization, parametric mapping |
| Electrophysiology | 15 → 100 | "ECG" → electrocardiogram, 12-lead, rhythm strip, QTc, ST segment, LBBB |
| Heart Failure Drugs | 15 → 90 | "entresto" → sacubitril/valsartan, ARNI, neprilysin inhibitor, PARADIGM-HF |
| Structural | 10 → 60 | "TAVR" → transcatheter aortic valve replacement, TAVI, Edwards SAPIEN, Medtronic CoreValve |
| Prevention | 10 → 70 | "heart risk" → ASCVD, cardiovascular risk, PCE, risk calculator, CAC score |
| Cardio-Oncology | 10 → 60 | "chemo heart" → cardiotoxicity, CTRCD, anthracycline, trastuzumab, GLS monitoring |
| Hemodynamics | 10 → 60 | "heart pressures" → right heart cath, Swan-Ganz, PCWP, wedge pressure, PVR, cardiac output |
| Devices | 10 → 60 | "pacemaker" → ICD, CRT, CRT-D, cardiac resynchronization, LVAD, defibrillator |
Total: 200 keywords → ~1,400 expanded terms
11.2 Comparative Analysis Detection¶
The RAG engine auto-detects comparative queries using the same pattern established in the CAR-T and Imaging agents:
Trigger patterns: "vs", "versus", "compared to", "compare", "difference between", "better than"
Cardiology-specific comparisons:
| Comparison | Clinical Relevance |
|---|---|
| "TAVR vs SAVR" | Aortic stenosis intervention strategy by risk category |
| "Amiodarone vs Sotalol" | AF rate/rhythm control |
| "DOAC vs Warfarin" | Anticoagulation strategy for AF |
| "PCI vs CABG" | Revascularization strategy for multivessel CAD |
| "CT vs MRI for cardiomyopathy" | Imaging modality selection |
| "Metoprolol vs Carvedilol" | Beta-blocker selection in HF |
| "Entresto vs Enalapril" | RAAS inhibition strategy in HFrEF |
| "Ablation vs Drugs for AF" | Rhythm control strategy |
12. API and UI Design¶
12.1 FastAPI Endpoints (Port 8526)¶
| Method | Path | Purpose |
|---|---|---|
| GET | /health |
Service health, collection stats, NIM status |
| GET | /collections |
List all collections with vector counts |
| POST | /query |
Full RAG query with evidence synthesis |
| POST | /search |
Evidence-only search (no LLM) |
| POST | /api/ask |
Chat-style question answering |
| POST | /find-related |
Cross-collection entity linking |
| GET | /workflows |
List available clinical workflows |
| POST | /workflow/{name}/run |
Execute a clinical workflow |
| GET | /demo-cases |
List pre-loaded demo cases |
| POST | /demo-cases/{id}/run |
Run a demo case |
| POST | /risk/calculate |
Calculate validated risk score |
| POST | /gdmt/optimize |
GDMT optimization recommendation |
| POST | /protocol/recommend |
Imaging protocol recommendation |
| POST | /reports/generate |
Generate report (markdown, JSON, PDF, FHIR) |
| GET | /knowledge/stats |
Knowledge graph statistics |
| GET | /metrics |
Prometheus-compatible metrics |
| GET | /nim/status |
NIM service availability |
12.2 Streamlit UI (Port 8536)¶
Ten-tab interface following the Imaging Agent's 9-tab pattern:
| Tab | Purpose |
|---|---|
| Evidence Explorer | Multi-collection RAG Q&A with evidence citations, pre-filled cardiology example queries, Plotly donut chart for collection contribution |
| Workflow Runner | 11 clinical workflows with pre-loaded demo cases, annotated cardiac images, 6-step pipeline animation, download buttons |
| Cardiac Imaging | Echo, CT, MRI, nuclear imaging gallery with AI annotations, before/after toggle, 3D volume viewer |
| Risk Calculators | Interactive ASCVD, HEART, CHA₂DS₂-VASc, HAS-BLED, MAGGIC, EuroSCORE II calculators with guideline recommendations |
| GDMT Optimizer | Heart failure medication optimizer: input LVEF, labs, vitals → get pillar-by-pillar titration plan |
| Device & AI Ecosystem | 80+ FDA-cleared cardiac AI devices, searchable by modality and clinical task |
| Protocol Advisor | Patient-specific cardiac imaging protocol recommendations |
| Reports & Export | Markdown, JSON, NVIDIA-branded PDF, FHIR R4 DiagnosticReport export |
| Patient 360 | Cross-modal cardiac dashboard: imaging + genomics + biomarkers + risk scores with interactive Plotly network graph |
| Guidelines & Trials | ACC/AHA guideline browser, landmark trial summaries, evidence level filtering |
12.3 Sidebar¶
| Section | Controls |
|---|---|
| Guided Tour | 10-step demo flow with dismiss button |
| NIM Services | 2x2 status grid (VISTA-3D, MAISI, VILA-M3, LLM) |
| Collection Stats | Metric widgets for each of 12 collections |
| Filters | CV domain dropdown, modality dropdown, year range slider |
| Collections to Search | Individual collection toggle checkboxes |
| Demo Mode | Load demo patient button |
12.4 Demo Cases¶
| ID | Title | Workflow | Key Features |
|---|---|---|---|
| DEMO-001 | Acute Chest Pain: STEMI Evaluation | coronary_artery_disease | ECG ST-elevation, troponin kinetics, cath lab activation |
| DEMO-002 | New Heart Failure: GDMT Initiation | heart_failure_gdmt | LVEF 25%, NYHA III, 4-pillar GDMT plan |
| DEMO-003 | Severe Aortic Stenosis: TAVR Evaluation | valvular_disease | AVA 0.8 cm², mean gradient 48 mmHg, STS risk |
| DEMO-004 | New-Onset AF: Stroke Prevention | arrhythmia_management | CHA₂DS₂-VASc 4, DOAC recommendation, rate control |
| DEMO-005 | Anthracycline Cardiotoxicity Surveillance | cardio_oncology_surveillance | Pre-chemo baseline, GLS monitoring protocol |
13. Clinical Decision Support Engines¶
13.1 Validated Risk Calculators¶
The agent will implement six validated cardiovascular risk calculators:
| Calculator | Use Case | Inputs | Output |
|---|---|---|---|
| ASCVD (PCE) | 10-year atherosclerotic CVD risk | Age, sex, race, TC, HDL, SBP, DM, smoking, HTN Rx | Risk %, statin recommendation |
| HEART Score | Chest pain risk stratification | History, ECG, age, risk factors, troponin | Score 0-10, risk category |
| CHA₂DS₂-VASc | AF stroke risk | CHF, HTN, age, DM, stroke, vascular, sex | Score 0-9, anticoagulation recommendation |
| HAS-BLED | AF bleeding risk on anticoagulation | HTN, renal/liver, stroke, bleeding, labile INR, elderly, drugs/alcohol | Score 0-9, risk category |
| MAGGIC | HF prognosis | Age, sex, LVEF, NYHA, SBP, BMI, creatinine, DM, COPD, HF duration, smoking, meds | 1- and 3-year mortality |
| EuroSCORE II | Cardiac surgery risk | Age, sex, renal function, extracardiac arteriopathy, poor mobility, prior cardiac surgery, COPD, active endocarditis, critical preop state, DM on insulin, NYHA, CCS angina, LVEF, recent MI, PA pressure, urgency, weight of procedure, thoracic aorta surgery | Operative mortality % |
13.2 GDMT Optimization Engine¶
The GDMT optimizer implements the ACC/AHA 2022 Heart Failure Guidelines algorithm:
class GDMTOptimizer:
"""Heart failure guideline-directed medical therapy optimizer.
Implements the 4-pillar HFrEF GDMT framework with:
- Current medication assessment
- Target dose calculation
- Titration schedule recommendation
- Contraindication checking
- Drug interaction screening
- Lab monitoring requirements (K+, Cr, eGFR)
"""
def optimize(
self,
lvef: float,
nyha_class: int,
current_meds: List[Medication],
labs: LabValues,
vitals: Vitals,
comorbidities: List[str],
) -> GDMTRecommendation:
...
14. Reporting and Interoperability¶
14.1 Export Formats¶
| Format | Use Case | Standards |
|---|---|---|
| Markdown | Clinical narrative, consultation notes | -- |
| JSON | Programmatic consumption, dashboards | -- |
| NVIDIA-themed clinical documentation | ReportLab | |
| FHIR R4 | EHR integration, interoperability | SNOMED CT, LOINC, DICOM |
14.2 FHIR R4 Cardiovascular Coding¶
| Element | Code System | Example Codes |
|---|---|---|
| Findings | SNOMED CT | 22298006 (MI), 84114007 (HF), 49436004 (AF), 60234000 (AS) |
| Observations | LOINC | 10230-1 (LVEF), 33762-6 (NT-proBNP), 2093-3 (Total cholesterol), 18262-6 (LDL) |
| Procedures | SNOMED CT | 232717009 (CABG), 26212005 (PCI), 448076004 (TAVR) |
| Medications | RxNorm | Sacubitril/valsartan, carvedilol, dapagliflozin, apixaban |
| Imaging Studies | DICOM | US (echo), CT, MR, NM (nuclear) |
| Risk Scores | Custom extension | ASCVD %, CHA₂DS₂-VASc, HEART, MAGGIC |
14.3 Structured Report Sections¶
Cardiovascular reports will follow ACC/AHA reporting standards:
- Clinical Context -- Indication, relevant history, medications
- Findings -- Organized by organ system (LV, RV, valves, great vessels, coronary)
- Measurements -- Quantitative data with reference ranges and severity grading
- Impression -- Synthesized clinical interpretation
- Recommendations -- Guideline-based next steps with evidence levels
- Genomic Enrichment -- Cross-modal genetic findings (if triggered)
- Risk Scores -- Calculated risk stratification with category
- Citations -- Evidence sources with relevance scores
15. Product Requirements Document¶
15.1 Product Vision¶
Vision Statement: Enable any cardiologist, anywhere, to access integrated cardiovascular intelligence combining imaging AI, genomic analysis, guideline-based decision support, and evidence synthesis -- on a single $3,999 device.
Target Users: Community cardiologists, academic cardiology fellows, heart failure programs, structural heart teams, cardio-oncology clinics, preventive cardiology practices, clinical trial sites.
15.2 User Stories¶
Epic 1: Evidence-Based Clinical Queries¶
| ID | User Story | Priority | Acceptance Criteria |
|---|---|---|---|
| US-001 | As a cardiologist, I want to ask clinical questions and receive evidence-grounded answers with citations, so that I can make informed decisions. | P0 | Query returns answer with ≥3 citations from ≥2 collections; response time < 30 sec |
| US-002 | As a cardiologist, I want comparative analysis ("TAVR vs SAVR") with structured tables, so that I can compare treatment options. | P0 | Comparative query auto-detected; side-by-side evidence display; structured comparison table |
| US-003 | As a fellow, I want pre-filled example queries for common scenarios, so that I can learn the system quickly. | P1 | ≥4 clickable example queries; each returns relevant results |
| US-004 | As a researcher, I want to filter by CV domain, modality, and year, so that I can narrow my evidence search. | P1 | Sidebar filters applied to all queries; results reflect applied filters |
Epic 2: Clinical Workflows¶
| ID | User Story | Priority | Acceptance Criteria |
|---|---|---|---|
| US-005 | As a cardiologist, I want to run CAD assessment with calcium score and CTA data, so that I get CAD-RADS classification and management recommendations. | P0 | Workflow returns CAD-RADS 0-5, per-vessel stenosis, plaque characterization, guideline recommendation |
| US-006 | As an HF specialist, I want GDMT optimization for a patient with HFrEF, so that I get a 4-pillar titration plan. | P0 | All 4 GDMT pillars assessed; current vs target dose displayed; titration schedule provided |
| US-007 | As a structural heart team member, I want valve severity grading from echo measurements, so that I can assess intervention criteria. | P0 | Severity grading matches ASE criteria; SAVR vs TAVR recommendation based on STS risk |
| US-008 | As an EP physician, I want arrhythmia classification with CHA₂DS₂-VASc, so that I can manage AF stroke risk. | P0 | CHA₂DS₂-VASc calculated correctly; anticoagulation recommendation per guidelines |
| US-009 | As a cardio-oncologist, I want surveillance protocol generation, so that I know when to order echo and biomarkers. | P1 | Protocol specific to cancer therapy; GLS and LVEF thresholds defined; cardioprotection recommendations |
Epic 3: Risk Calculators¶
| ID | User Story | Priority | Acceptance Criteria |
|---|---|---|---|
| US-010 | As a preventive cardiologist, I want to calculate 10-year ASCVD risk, so that I can determine statin eligibility. | P0 | Correct PCE calculation; risk category assignment; statin intensity recommendation |
| US-011 | As an ED physician, I want HEART score for chest pain, so that I can risk-stratify patients. | P0 | Score 0-10 calculated; risk category (low/mod/high); disposition recommendation |
| US-012 | As a cardiologist, I want interactive risk calculator forms, so that I can enter patient data directly in the UI. | P1 | Form inputs with validation; real-time score update; guideline recommendation display |
Epic 4: Cross-Modal Integration¶
| ID | User Story | Priority | Acceptance Criteria |
|---|---|---|---|
| US-013 | As a cardiomyopathy specialist, I want genetic triggers from cardiac imaging, so that DCM patients automatically get gene panel recommendations. | P0 | Non-ischemic DCM + age < 50 triggers gene panel; genomic hits displayed in Patient 360 |
| US-014 | As a cardiologist, I want a Patient 360 view combining imaging, genomics, biomarkers, and risk scores, so that I see the complete picture. | P1 | Interactive Plotly network graph; cross-modal connections displayed; drill-down to evidence |
Epic 5: Reporting and Export¶
| ID | User Story | Priority | Acceptance Criteria |
|---|---|---|---|
| US-015 | As a cardiologist, I want to export clinical reports as PDF, so that I can include them in patient records. | P0 | NVIDIA-branded PDF with clinical question, analysis, evidence, risk scores; download button |
| US-016 | As a health IT engineer, I want FHIR R4 DiagnosticReport export, so that findings integrate with our EHR. | P1 | Valid FHIR R4 Bundle; SNOMED CT + LOINC coded; passes FHIR validator |
| US-017 | As a cardiologist, I want 4 export formats (Markdown, JSON, PDF, FHIR), so that I can choose the right format for my use case. | P1 | All 4 formats functional; consistent content across formats |
Epic 6: Demo and Presentation¶
| ID | User Story | Priority | Acceptance Criteria |
|---|---|---|---|
| US-018 | As a demo presenter, I want 5 pre-loaded demo cases, so that I can run the demo without entering data. | P0 | 5 demo cases selectable from dropdown; each runs in < 30 seconds; realistic clinical output |
| US-019 | As a new user, I want a sidebar guided tour, so that I understand the 10-tab interface quickly. | P1 | Expandable tour with numbered steps; dismiss button; persists across session |
| US-020 | As a demo presenter, I want a cardiac imaging gallery, so that I can show impressive AI-annotated images. | P1 | Echo, CT, MRI images with AI overlays; before/after toggle; 3D slice viewer |
15.3 Non-Functional Requirements¶
| Requirement | Target | Rationale |
|---|---|---|
| RAG query latency | < 30 seconds end-to-end | Acceptable for clinical consultation workflow |
| Risk calculator latency | < 5 seconds | Near-instantaneous for interactive use |
| Workflow execution (mock) | < 10 seconds | Responsive demo experience |
| Availability | 99.5% uptime | Clinical support tool (not life-critical) |
| Memory footprint | < 16 GB (agent only) | Coexist with other agents on 128 GB DGX Spark |
| Seed data completeness | 1,500+ records across 12 collections | Sufficient for meaningful RAG retrieval |
| Unit test coverage | > 80% | Reliable development cycle |
| FHIR R4 compliance | Passes HL7 FHIR Validator | Interoperability requirement |
15.4 Prioritization Matrix¶
| Phase | Features | Timeline |
|---|---|---|
| Phase 1 (MVP) | RAG engine (12 collections), Evidence Explorer, 3 workflows (CAD, HF, arrhythmia), 3 risk calculators (ASCVD, CHA₂DS₂-VASc, HEART), 3 demo cases, PDF export | 4-6 weeks |
| Phase 2 (Complete) | All 11 workflows, all 6 calculators, GDMT optimizer, FHIR R4 export, cardiac imaging gallery, Patient 360, all 5 demo cases | 4-6 weeks |
| Phase 3 (Polish) | Guided tour, pipeline animation, cross-modal triggers, network graph, Guidelines & Trials tab, benchmark validation | 2-3 weeks |
16. Data Acquisition Strategy¶
16.1 Automated Ingest Pipelines¶
| Source | Collection(s) | Method | Update Cadence |
|---|---|---|---|
| PubMed (NCBI E-utilities) | cardio_literature | MeSH-filtered abstract retrieval | Weekly |
| ClinicalTrials.gov (V2 API) | cardio_trials | Cardiovascular condition filter | Weekly |
| ACC/AHA Guidelines PDFs | cardio_guidelines | Manual curation + embedding | Per guideline update |
| ASE/SCCT/SCMR references | cardio_imaging | Manual curation | Per guideline update |
16.2 Curated Seed Data¶
| Collection | Records | Curation Source |
|---|---|---|
| cardio_imaging | 200 | ASE, SCCT, SCMR, ASNC guidelines and reference texts |
| cardio_electrophysiology | 150 | ACC/AHA/HRS guidelines, EP textbooks |
| cardio_heart_failure | 150 | ACC/AHA 2022 HF Guidelines, landmark trial data |
| cardio_valvular | 120 | ACC/AHA 2020 VHD Guidelines |
| cardio_prevention | 150 | ACC/AHA 2019 Prevention, 2018 Cholesterol Guidelines |
| cardio_interventional | 100 | SCAI guidelines, procedural references |
| cardio_oncology | 100 | ESC 2022 Cardio-Oncology Guidelines |
| cardio_devices | 80 | FDA AI/ML database, manufacturer data |
| cardio_hemodynamics | 80 | Catheterization references and guidelines |
16.3 PubMed Search Strategy¶
MeSH Terms:
"Cardiovascular Diseases"[MeSH] OR "Heart Diseases"[MeSH] OR
"Coronary Artery Disease"[MeSH] OR "Heart Failure"[MeSH] OR
"Atrial Fibrillation"[MeSH] OR "Cardiomyopathies"[MeSH]
AND ("Artificial Intelligence"[MeSH] OR "Machine Learning"[MeSH] OR
"Deep Learning"[MeSH] OR "Neural Networks"[MeSH])
Filters: Published 2018-2026, English, Humans
Expected: 3,000-5,000 abstracts
17. Validation and Testing Strategy¶
17.1 Unit Tests¶
| Test Category | Target Count | Coverage |
|---|---|---|
| Collection schemas | 36 | 12 collections × 3 tests each |
| Risk calculators | 48 | 6 calculators × 8 test cases each |
| GDMT optimizer | 30 | Multiple HF phenotypes and medication combos |
| Workflow logic | 40 | 11 workflows × 5 test cases each |
| RAG engine | 20 | Query expansion, scoring, synthesis |
| Knowledge graph | 15 | Entity lookup, alias resolution |
| API endpoints | 30 | All REST endpoints |
| FHIR R4 export | 15 | Schema validation, coding accuracy |
| Cross-modal triggers | 12 | All trigger conditions |
| Total | ~250 |
17.2 Clinical Validation¶
| Validation Type | Method | Target |
|---|---|---|
| Risk calculator accuracy | Compare against published validation cohorts | < 1% deviation from reference implementations |
| GDMT recommendations | Review by board-certified HF cardiologist | 95%+ guideline concordance |
| Severity grading | Compare against ASE/ACC criteria | 100% match for standard inputs |
| Guideline citation accuracy | Verify against source guidelines | 100% accurate citations |
| FHIR R4 compliance | HL7 FHIR Validator | Zero validation errors |
17.3 End-to-End Validation Checks¶
| Check | Criteria |
|---|---|
| Health endpoint | Returns status=healthy, all 12 collections with non-zero counts |
| RAG query | Returns answer with ≥3 citations in < 30 seconds |
| Risk calculator | All 6 calculators return correct results for known inputs |
| Workflow execution | All 11 workflows complete in mock mode |
| FHIR export | Valid R4 Bundle passes FHIR validator |
| PDF export | Generates downloadable PDF with NVIDIA branding |
| Demo cases | All 5 demo cases execute successfully |
| Cross-modal trigger | Genomic query fires for qualifying conditions |
| Comparative analysis | "TAVR vs SAVR" produces structured comparison |
18. Regulatory Considerations¶
18.1 Intended Use Classification¶
The Cardiology Intelligence Agent is a clinical decision support (CDS) tool intended for use by licensed healthcare professionals. It is designed to:
- Retrieve and synthesize published cardiovascular evidence
- Calculate validated clinical risk scores
- Generate guideline-based recommendations
- Assist with (not replace) clinical decision-making
18.2 FDA CDS Exemption Criteria (21st Century Cures Act)¶
Under the 21st Century Cures Act, software functions meeting all four criteria are exempt from FDA device regulation:
| Criterion | Assessment |
|---|---|
| Not intended to acquire, process, or analyze a medical image or signal | Met -- RAG and risk calculation; does not process raw medical images or signals |
| Intended for displaying, analyzing, or printing medical information | Met -- Displays evidence and calculations |
| Intended for use by healthcare professional | Met -- Designed for cardiologists |
| Healthcare professional does not primarily rely on the software | Met -- Provides recommendations for review, not autonomous decisions |
Assessment: The core RAG and risk calculator functions likely qualify for CDS exemption. Clinical workflows involving NIM-based image analysis would require separate regulatory consideration.
18.3 Disclaimers¶
All outputs will include standard disclaimers:
This tool is for clinical decision support only and does not replace professional medical judgment. All findings require verification by a qualified cardiologist. Risk calculations are estimates based on population-level data and may not apply to individual patients. Not FDA-cleared for autonomous clinical decision-making. For research and educational purposes only.
19. DGX Compute Progression¶
| Phase | Hardware | Price | Scope |
|---|---|---|---|
| Phase 1 -- Proof Build | DGX Spark | $3,999 | All 11 workflows (mock/cloud NIM), 12 collections, 6 risk calculators |
| Phase 2 -- Departmental | 1-2x DGX B200 | $500K-$1M | Full NIM stack, live echo/CT/MRI processing, PACS integration |
| Phase 3 -- Multi-Site | 4-8x DGX B200 | $2M-$4M | NVIDIA FLARE federated learning across sites, population analytics |
| Phase 4 -- AI Factory | DGX SuperPOD | $7M-$60M+ | Thousands concurrent studies, real-time ICU monitoring, national registries |
20. Implementation Roadmap¶
Phase 1: Foundation (Weeks 1-6)¶
| Week | Deliverable |
|---|---|
| 1-2 | Repository scaffolding, Pydantic models, settings, Docker Compose, 12 collection schemas |
| 3-4 | Seed data curation (1,530 records), ingest pipelines (PubMed, ClinicalTrials.gov), embedding generation |
| 5-6 | RAG engine (parallel search, weighted scoring, query expansion, Claude synthesis), Evidence Explorer tab |
Phase 2: Clinical Intelligence (Weeks 7-12)¶
| Week | Deliverable |
|---|---|
| 7-8 | 3 priority workflows (CAD, HF, arrhythmia), risk calculators (ASCVD, CHA₂DS₂-VASc, HEART), demo cases |
| 9-10 | Remaining 5 workflows (valve, CMR, stress, prevention, cardio-onc), remaining calculators (HAS-BLED, MAGGIC, EuroSCORE II) |
| 11-12 | GDMT optimizer, cross-modal triggers, FHIR R4 export, PDF reports |
Phase 3: UI and Polish (Weeks 13-16)¶
| Week | Deliverable |
|---|---|
| 13-14 | 10-tab Streamlit UI, cardiac imaging gallery, Patient 360 network graph, Guidelines & Trials browser |
| 15-16 | Sidebar guided tour, pipeline animation, pre-filled examples, 5 demo cases finalized, documentation |
Phase 4: Integration and Validation (Weeks 17-18)¶
| Week | Deliverable |
|---|---|
| 17 | Integration with Imaging Agent (shared cardiac workflows), cross-agent triggers, Docker Compose integration |
| 18 | Clinical validation, end-to-end testing (250+ unit tests), documentation publication (demo guide, design doc, project bible) |
21. Risk Analysis¶
| Risk | Probability | Impact | Mitigation |
|---|---|---|---|
| Risk calculator implementation errors | Medium | High | Validate against published reference implementations; extensive unit testing |
| GDMT logic complexity | Medium | Medium | Start with HFrEF only (best-defined); expand to HFmrEF/HFpEF later |
| Insufficient seed data for niche collections | Medium | Medium | Focus on high-impact collections first; iteratively expand |
| NIM cardiac segmentation accuracy | Low | Medium | Mock mode fallback; VISTA-3D already supports cardiac structures |
| Guideline updates during development | Low | Low | Modular guideline collection; easy to update individual recommendations |
| Memory pressure with 12 collections + existing agents | Low | Medium | BGE-small (384-dim) is compact; Milvus handles multi-collection efficiently |
| Claude API rate limits during demo | Low | High | Llama-3 NIM fallback; response caching for demo scenarios |
22. Competitive Landscape¶
22.1 Positioning¶
The Cardiology Intelligence Agent occupies a unique position in the cardiovascular AI landscape:
Multi-Modal Integration
↑
|
[Cardiology Agent]
|
Cloud-only ←------+------→ On-device
|
[HeartFlow, Cleerly]
|
↓
Single-Modality
No existing product combines:
- Multi-modal cardiac imaging AI (echo + CT + MRI + nuclear + ECG)
- Genomic integration with cross-modal triggers
- RAG-grounded evidence synthesis with citations
- Validated risk calculators (ASCVD, HEART, CHA₂DS₂-VASc)
- GDMT optimization engine
- On-device deployment ($3,999)
- Open-source (Apache 2.0)
22.2 Defensibility¶
| Advantage | Defensibility |
|---|---|
| Multi-collection RAG architecture | High -- 11 agents prove the pattern across all major clinical domains |
| Cross-modal genomic triggers | High -- unique to HCLS AI Factory; requires integrated platform |
| On-device inference | High -- DGX Spark + NIM stack is NVIDIA-exclusive |
| Open source | Medium -- community contribution, institutional customization |
| Guideline-aligned CDS | Low -- guidelines are public; competitors can implement |
| Clinical validation | Medium -- requires domain expertise and clinical partnerships |
23. Discussion¶
23.1 Why Cardiology Is the Right Next Agent¶
Of all medical specialties, cardiology presents the strongest case for an integrated intelligence agent:
-
Data richness: No other specialty routinely integrates as many data modalities (imaging, EP, hemodynamics, biomarkers, genomics, risk scores) into a single clinical encounter.
-
Quantitative precision: Cardiovascular medicine is measurement-driven. LVEF, calcium scores, valve gradients, QTc intervals, and risk percentages are all inherently structured data -- ideal for RAG retrieval and clinical decision support.
-
Guideline density: The ACC/AHA and ESC produce more clinical practice guidelines than any other medical specialty. These guidelines are highly structured (Class I-III, LOE A-C) and directly mappable to clinical decision support rules.
-
Clinical workflow fit: Cardiologists already use structured reporting, risk calculators, and decision algorithms. The Cardiology Intelligence Agent enhances rather than disrupts existing workflow.
-
Market size: Cardiovascular AI is the largest segment of the medical AI market ($2.8B projected by 2028). Every health system has cardiologists. The addressable market is essentially universal.
-
Existing foundation: The HCLS AI Factory already has cardiac CT workflows (DEMO-003 in the Imaging Agent), coronary segmentation via VISTA-3D, and cardiovascular genomics in the shared variant collection. The Cardiology Agent builds on existing infrastructure rather than starting from scratch.
23.2 Limitations¶
-
Mock mode inference: Phase 1 uses clinically realistic mock results rather than live model inference. This is appropriate for proof-of-concept and demo but requires GPU deployment for clinical validation.
-
Guideline currency: Clinical guidelines are updated periodically. The agent requires manual curation to incorporate focused updates, although the modular collection architecture makes updates straightforward.
-
Risk calculator applicability: Validated risk calculators (PCE, HEART) were developed in specific populations and may not generalize equally across all demographic groups. The agent should display appropriate caveats.
-
Not a replacement for clinical judgment: The agent assists with evidence synthesis and calculation but cannot replace the nuanced clinical judgment of an experienced cardiologist, particularly in complex or atypical presentations.
23.3 Future Directions¶
-
Real-time ICU integration: Continuous hemodynamic monitoring with AI-driven trend analysis and early warning for cardiac decompensation.
-
Wearable data integration: Apple Watch ECG, continuous glucose monitoring, and remote blood pressure data feeding into longitudinal risk assessment.
-
Population health analytics: Cohort-level cardiovascular outcomes analysis across institutions using federated learning (NVIDIA FLARE).
-
Clinical trial matching: Automated screening of patients for cardiovascular clinical trial eligibility based on imaging, biomarker, and genomic criteria.
-
Cardiac digital twins: Patient-specific computational models combining imaging anatomy, electrophysiology, and hemodynamics for treatment simulation.
24. Conclusion¶
The Cardiology Intelligence Agent represents the natural next extension of the HCLS AI Factory platform into the highest-impact medical specialty. By leveraging proven multi-collection RAG patterns, shared infrastructure, and established NIM integration -- while adding cardiology-specific clinical workflows, validated risk calculators, GDMT optimization, and cross-modal genomic triggers -- the agent will deliver integrated cardiovascular intelligence that currently requires multi-million-dollar institutional investments.
The 12-collection Milvus architecture, 8 reference clinical workflows, 6 validated risk calculators, and GDMT optimization engine provide a comprehensive decision support platform that covers the full spectrum of cardiovascular practice: from preventive risk stratification through acute coronary syndromes to advanced heart failure management and cardio-oncology surveillance.
Deploying on a single NVIDIA DGX Spark ($3,999) ensures that this intelligence is accessible not just to elite academic centers but to community cardiologists, rural health systems, and resource-limited institutions worldwide. Combined with Apache 2.0 licensing and the HCLS AI Factory's proven open-source approach, the Cardiology Intelligence Agent will democratize cardiovascular AI in a way that no existing commercial product achieves.
25. References¶
- Virani SS, et al. Heart Disease and Stroke Statistics -- 2025 Update. Circulation. 2025;151:e1-e375.
- Heidenreich PA, et al. 2022 AHA/ACC/HFSA Guideline for the Management of Heart Failure. Circulation. 2022;145:e895-e1032.
- Writing Committee, et al. 2021 ACC/AHA/SCAI Guideline for Coronary Artery Revascularization. J Am Coll Cardiol. 2022;79:e21-e129.
- Otto CM, et al. 2020 ACC/AHA Guideline for the Management of Patients with Valvular Heart Disease. Circulation. 2021;143:e72-e227.
- January CT, et al. 2019 AHA/ACC/HRS Focused Update of the 2014 Guideline for Management of Atrial Fibrillation. Circulation. 2019;140:e125-e151.
- Arnett DK, et al. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease. Circulation. 2019;140:e596-e646.
- Grundy SM, et al. 2018 AHA/ACC Cholesterol Clinical Practice Guideline. Circulation. 2019;139:e1082-e1143.
- Lyon AR, et al. 2022 ESC Guidelines on Cardio-Oncology. Eur Heart J. 2022;43:4229-4361.
- Vahanian A, et al. 2021 ESC/EACTS Guidelines for the Management of Valvular Heart Disease. Eur Heart J. 2022;43:561-632.
- Hindricks G, et al. 2020 ESC Guidelines for the Diagnosis and Management of Atrial Fibrillation. Eur Heart J. 2021;42:373-498.
- Ommen SR, et al. 2024 AHA/ACC Guideline for the Diagnosis and Management of Hypertrophic Cardiomyopathy. Circulation. 2024;149:e1239-e1311.
- Cury RC, et al. CAD-RADS 2.0 -- 2022 Coronary Artery Disease Reporting and Data System. Radiology. 2022;305:209-221.
- Lang RM, et al. Recommendations for Cardiac Chamber Quantification by Echocardiography in Adults: An Update from the ASE. J Am Soc Echocardiogr. 2015;28:1-39.
- Kramer CM, et al. Standardized Cardiovascular Magnetic Resonance Imaging (CMR) Protocols: 2020 Update. J Cardiovasc Magn Reson. 2020;22:17.
- Goff DC, et al. 2013 ACC/AHA Guideline on the Assessment of Cardiovascular Risk. Circulation. 2014;129:S49-S73.
- Lip GY, et al. Refining Clinical Risk Stratification for Predicting Stroke and Thromboembolism in Atrial Fibrillation Using a Novel Risk Factor-Based Approach: The Euro Heart Survey. Chest. 2010;137:263-272.
- Six AJ, et al. The HEART Score for the Assessment of Patients with Chest Pain in the Emergency Department. Neth Heart J. 2008;16:191-196.
- Pocock SJ, et al. Predicting Survival in Heart Failure: A Risk Score Based on 39,372 Patients from 30 Studies (MAGGIC). Eur J Heart Fail. 2013;15:1082-1094.
- Nashef SA, et al. EuroSCORE II. Eur J Cardiothorac Surg. 2012;41:734-744.
- World Health Organization. Cardiovascular Diseases (CVDs) Fact Sheet. WHO, 2024.
- FDA. Artificial Intelligence and Machine Learning (AI/ML)-Enabled Medical Devices. FDA Database, 2025.
- McMurray JJ, et al. Angiotensin-Neprilysin Inhibition versus Enalapril in Heart Failure (PARADIGM-HF). N Engl J Med. 2014;371:993-1004.
- McMurray JJ, et al. Dapagliflozin in Patients with Heart Failure and Reduced Ejection Fraction (DAPA-HF). N Engl J Med. 2019;381:1995-2008.
- Packer M, et al. Cardiovascular and Renal Outcomes with Empagliflozin in Heart Failure (EMPEROR-Reduced). N Engl J Med. 2020;383:1413-1424.
- Solomon SD, et al. Dapagliflozin in Heart Failure with Mildly Reduced or Preserved Ejection Fraction (DELIVER). N Engl J Med. 2022;387:1089-1098.
- Maron DJ, et al. Initial Invasive or Conservative Strategy for Stable Coronary Disease (ISCHEMIA). N Engl J Med. 2020;382:1395-1407.
HCLS AI Factory -- Cardiology Intelligence Agent Research Paper and PRD Apache 2.0 License | March 2026 Author: Adam Jones