Neurology Intelligence Agent -- Project Bible¶

Version: 1.0.0 Date: 2026-03-22 Author: Adam Jones

1. Project Identity¶

Name: Neurology Intelligence Agent Codename: NeuroAgent Repository: ai_agent_adds/neurology_intelligence_agent/ Parent Project: HCLS AI Factory License: Apache 2.0 (Open Source)

Mission Statement¶

Deliver an AI-powered clinical decision support system that unifies fragmented neurological evidence -- spanning cerebrovascular, neurodegenerative, epilepsy, movement disorders, multiple sclerosis, headache, neuromuscular, and neuro-oncology domains -- into a single RAG-driven intelligence platform. Clinicians receive guideline-grounded, evidence-cited recommendations in under five seconds.

Why This Matters¶

3 billion people worldwide affected by neurological conditions (WHO 2024)
Neurological data is fragmented across siloed subspecialties with inconsistent terminology
Clinical decision-making requires synthesizing imaging, electrophysiology, genomics, and guideline evidence simultaneously
Time-critical conditions (acute stroke, status epilepticus) demand sub-minute decision support

2. Architecture Summary¶

Three-Layer Stack¶

Presentation     Streamlit UI (8529) + FastAPI REST (8528)
Intelligence     RAG Engine + Workflow Engine + Scale Calculators + Query Expansion
Data             Milvus 2.4 (14 collections, BGE-small 384-dim, IVF_FLAT/COSINE)

Core Design Decisions¶

Decision	Rationale
14 domain-specific collections (not 1 monolithic)	Workflow-specific weight boosting yields higher-relevance results
BGE-small-en-v1.5 (384-dim)	Best size-accuracy tradeoff for medical text on DGX Spark
IVF_FLAT with nlist=128	Sub-100ms search latency at 855K estimated records
Pydantic BaseSettings with `NEURO_` prefix	Type-safe config, env-driven, no secrets in code
ThreadPoolExecutor for parallel search	14 collections searched concurrently; reduces latency from 14x to ~1x
Claude Sonnet for synthesis	Strongest medical reasoning among available LLMs
Hoehn-Yahr + MDS-UPDRS + MoCA + NIHSS + ...	10 validated instruments prevent ad-hoc scoring

3. Directory Structure¶

neurology_intelligence_agent/
  api/
    __init__.py
    main.py                       # FastAPI application with lifespan
    routes/
      __init__.py
      neuro_clinical.py           # 15 clinical endpoints
      reports.py                  # Report generation
      events.py                   # SSE streaming
  app/
    __init__.py
    neuro_ui.py                   # Streamlit chat interface
  config/
    __init__.py
    settings.py                   # Pydantic BaseSettings (50+ params)
  data/
    cache/                        # Conversation persistence
    reference/                    # Reference data files
  docs/
    NEUROLOGY_INTELLIGENCE_AGENT_RESEARCH_PAPER.md
    PRODUCTION_READINESS_REPORT.md
    PROJECT_BIBLE.md              # This document
    ARCHITECTURE_GUIDE.md
    WHITE_PAPER.md
    DEPLOYMENT_GUIDE.md
    DEMO_GUIDE.md
    LEARNING_GUIDE_FOUNDATIONS.md
    LEARNING_GUIDE_ADVANCED.md
    INDEX.md
  scripts/
    setup_collections.py          # Create Milvus schemas
    seed_knowledge.py             # Populate knowledge base
    run_ingest.py                 # Data ingestion pipeline
    generate_docx.py              # DOCX report generation
  src/
    __init__.py
    agent.py                      # Agent orchestrator
    clinical_scales.py            # 10 scale calculators
    clinical_workflows.py         # 8 clinical workflows
    collections.py                # 14 collection schemas
    cross_modal.py                # Cross-agent triggers
    export.py                     # Report formats
    knowledge.py                  # Domain knowledge base
    metrics.py                    # Prometheus metrics
    models.py                     # Enums and Pydantic models
    query_expansion.py            # 251+ aliases, 16 synonym maps
    rag_engine.py                 # Multi-collection RAG
    scheduler.py                  # Ingest scheduler
    utils/
      __init__.py
    ingest/
      __init__.py
      base.py                     # Base ingest pipeline
      pubmed_neuro_parser.py      # PubMed parser
      neuroimaging_parser.py      # Imaging protocol parser
      eeg_parser.py               # EEG pattern parser
  tests/
    __init__.py
    conftest.py                   # Shared fixtures
    test_agent.py                 # 5 tests
    test_api.py                   # 8 tests
    test_clinical_scales.py       # 35 tests
    test_clinical_workflows.py    # 11 tests
    test_collections.py           # 15 tests
    test_integration.py           # 16 tests
    test_knowledge.py             # 30 tests
    test_models.py                # 55 tests
    test_query_expansion.py       # 5 tests
    test_rag_engine.py            # 4 tests
    test_settings.py              # 18 tests
    test_workflow_execution.py    # 7 tests
  docker-compose.yml              # Standalone deployment
  Dockerfile                      # Container image
  README.md                       # Quick-start guide
  requirements.txt                # Python dependencies

4. Data Model¶

Enums (18)¶

Enum	Values	Purpose
`NeuroWorkflowType`	9 values	Workflow routing
`NeuroDomain`	10 values	Domain classification
`StrokeType`	4 values	Stroke classification
`DementiaSubtype`	9 values	Dementia differential
`ATNStage`	8 values	Alzheimer's biomarker staging
`SeizureType`	9 values	ILAE 2017 seizure types
`EpilepsySyndrome`	10 values	Epilepsy syndrome identification
`MSPhenotype`	4 values	MS clinical phenotypes
`DMTCategory`	3 values	MS DMT efficacy tiers
`TumorGrade`	4 values	WHO 2021 CNS grades
`TumorMolecularMarker`	11 values	CNS tumor molecular markers
`ParkinsonsSubtype`	3 values	PD motor subtypes
`HeadacheType`	10 values	ICHD-3 headache types
`NMJPattern`	8 values	EMG/NCS pattern classification
`SeverityLevel`	5 values	Clinical severity
`EvidenceLevel`	4 values	AAN evidence classification
`GuidelineClass`	5 values	Recommendation classification
`ClinicalScaleType`	10 values	Scale calculator dispatch

Pydantic Models (12)¶

NeuroQuery, NeuroSearchResult, StrokeAssessment, DementiaAssessment, SeizureClassification, MSAssessment, TumorAssessment, HeadacheClassification, NeuromuscularAssessment, ScaleResult, WorkflowResult, NeuroResponse

5. Key Technical Specifications¶

Specification	Value
Embedding model	BAAI/bge-small-en-v1.5
Embedding dimension	384
Vector index type	IVF_FLAT
Similarity metric	COSINE
IVF nlist	128
Score threshold	0.4
Max conversation context	3 turns
Conversation TTL	24 hours
Citation high threshold	0.75
Citation medium threshold	0.60
LLM model	claude-sonnet-4-6
LLM max tokens	2048
LLM temperature	0.7

6. Dependency Map¶

Python Dependencies¶

Package	Purpose
`fastapi`	REST API framework
`uvicorn`	ASGI server
`pydantic` + `pydantic-settings`	Data validation and configuration
`pymilvus`	Milvus vector database client
`sentence-transformers`	BGE-small embedding model
`anthropic`	Claude LLM client
`streamlit`	Chat UI
`loguru`	Structured logging
`python-docx`	DOCX report generation

Infrastructure Dependencies¶

Service	Version	Purpose
Milvus	2.4	Vector database
etcd	3.5.5	Milvus metadata
MinIO	2023.03	Milvus object storage
Docker	24+	Container runtime
Python	3.10+	Runtime

7. Naming Conventions¶

Element	Convention	Example
Collection names	`neuro_` prefix, snake_case	`neuro_cerebrovascular`
Environment variables	`NEURO_` prefix, UPPER_SNAKE	`NEURO_MILVUS_HOST`
Enum values	snake_case strings	`acute_stroke`
API paths	lowercase, kebab-compatible	`/v1/neuro/stroke/triage`
Test files	`test_` prefix	`test_clinical_scales.py`
Scale classes	PascalCase + Calculator suffix	`NIHSSCalculator`
Workflow classes	PascalCase + Workflow suffix	`AcuteStrokeWorkflow`

8. Guideline Bodies Referenced¶

Organization	Abbreviation	Domains
American Academy of Neurology	AAN	All neurology domains
European Academy of Neurology	EAN	Cross-domain European guidelines
American Heart/Stroke Association	AHA/ASA	Cerebrovascular disease
International League Against Epilepsy	ILAE	Epilepsy classification and treatment
International Headache Society	IHS	ICHD-3 headache classification
Movement Disorder Society	MDS	PD, dystonia, tremor
National Comprehensive Cancer Network	NCCN	CNS tumors
World Health Organization	WHO	CNS tumor classification (2021)
NIA-AA	NIA-AA	ATN framework for Alzheimer's
American Clinical Neurophysiology Society	ACNS	EEG terminology

9. Quality Gates¶

Gate	Criterion	Status
Unit tests pass	209/209 tests green	Pass
Scale calculator accuracy	All 10 calculators match clinical definitions	Pass
Collection weight sums	Each workflow sums to ~1.0 (tolerance 0.02)	Pass
Model serialization	All 12 Pydantic models serialize/deserialize cleanly	Pass
API contract	All endpoints return documented response schemas	Pass
Configuration validation	Settings.validate() returns no critical issues	Pass
Docker build	`docker compose build` succeeds	Pass
Knowledge integrity	Drug count >= 42, gene count >= 38	Pass

10. Roadmap¶

v1.0 (Current -- March 2026)¶

14 collections, 8 workflows, 10 scales
Standalone and integrated Docker deployment
209 tests across 12 modules
Full documentation set

v1.1 (Planned -- Q2 2026)¶

EHR integration via FHIR R4 DiagnosticReport
Real-time EEG pattern classification
DWI/FLAIR MRI feature extraction
Structured medication interaction checking

v1.2 (Planned -- Q3 2026)¶

Multi-institutional knowledge federation
Longitudinal patient tracking
Automated guideline update ingestion
Clinical note summarization

11. Contact and Ownership¶

Role	Contact
Project Lead / Developer	Adam Jones
Platform	HCLS AI Factory
Repository	`ai_agent_adds/neurology_intelligence_agent/`
Documentation	`docs/` directory
Issues	GitHub Issues on hcls-ai-factory

Neurology Intelligence Agent -- Project Bible v1.0.0 HCLS AI Factory / GTC Europe 2026