epam/statgpt-backend
StatGPT Backend
What's novel
StatGPT Backend
Code Analysis
11 files read · 4 roundsA production-ready Python application that enables natural language queries over official statistics databases using SDMX standards with hybrid search combining vector, lexical, and LLM-based relevance scoring.
Strengths
Exceptional architectural patterns with clean separation between admin, app, and common layers. Implements sophisticated hybrid search architecture combining multiple retrieval strategies. Production-grade error handling, async-first design, enterprise security (OIDC), and comprehensive testing with real SDMX data.
Weaknesses
Some files are quite large which could benefit from further modularization. Hybrid search logic is intricate and would benefit from more targeted integration tests. A few magic numbers in configuration thresholds should be made configurable.
Score Breakdown
Signal breakdown
Innovation
Craft
Traction
Scope
Evidence
Commits
98
Contributors
10
Files
524
Active weeks
21
Repository
Language
Python
Stars
23
Forks
1
License
MIT