Files
BeyondCX_Insights/docs/TODO.md
sujucu70 75e7b9da3d feat: Add Streamlit dashboard with Blueprint compliance (v2.1.0)
Dashboard Features:
- 8 navigation sections: Overview, Outcomes, Poor CX, FCR, Churn, Agent, Call Explorer, Export
- Beyond Brand Identity styling (colors #6D84E3, Outfit font)
- RCA Sankey diagram (Driver → Outcome → Churn Risk flow)
- Correlation heatmaps (driver co-occurrence, driver-outcome)
- Outcome Deep Dive (root causes, correlation, duration analysis)
- Export functionality (Excel, HTML, JSON)

Blueprint Compliance:
- FCR: 4 categories (Primera Llamada/Rellamada × Sin/Con Riesgo de Fuga)
- Churn: Binary view (Sin Riesgo de Fuga / En Riesgo de Fuga)
- Agent: Talento Para Replicar / Oportunidades de Mejora
- Fixed FCR rate calculation (only FIRST_CALL counts as success)

Technical:
- Streamlit + Plotly for interactive visualizations
- Light theme configuration (.streamlit/config.toml)
- Fixed Plotly colorbar titlefont deprecation

Documentation:
- Updated PROJECT_CONTEXT.md, TODO.md, CHANGELOG.md
- Added 4 new technical decisions (TD-014 to TD-017)
- Created TROUBLESHOOTING.md with 10 common issues

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-19 16:27:30 +01:00

5.3 KiB
Raw Blame History

TODO.md

Lista priorizada de tareas pendientes


Checkpoints Completados

CP1: Project Setup & Contracts

  • Crear estructura de carpetas
  • Inicializar repo Git
  • Crear requirements.txt
  • Crear .env.example
  • Crear README.md
  • Crear config/rca_taxonomy.yaml
  • Crear config/settings.yaml
  • Crear schemas Pydantic

CP2: Transcription Module

  • Implementar Transcriber interface
  • Implementar AssemblyAITranscriber
  • Implementar modelos (Transcript, SpeakerTurn)
  • Tests unitarios
  • Notebook 01_transcription_validation.ipynb

CP3: RCA Schemas & Data Contracts

  • Implementar CallAnalysis
  • Implementar RCALabel, EvidenceSpan
  • Implementar Event
  • Separar observed vs inferred
  • Crear data/examples/

CP4: Feature & Event Extraction

  • Implementar event_detector.py
  • Implementar turn_metrics.py
  • Tests unitarios

CP5: Inference Engine

  • Crear prompt MAP único
  • Implementar LLMClient con JSON strict
  • Implementar BatchInference con resume
  • Tests de evidence obligatorio
  • Notebook 02_inference_validation.ipynb

CP6: Transcript Compression

  • Implementar CompressedTranscript
  • Validar reducción >60% tokens
  • Integrar en inference
  • Notebook 03_compression_validation.ipynb

CP7: Aggregation & RCA Trees

  • Implementar statistics.py
  • Definir severity_score con reglas explícitas
  • Implementar RCATreeBuilder
  • Notebook 04_aggregation_validation.ipynb

CP8: End-to-End Pipeline

  • Implementar CXInsightsPipeline
  • Implementar manifests por stage
  • Implementar resume
  • Implementar exports (JSON, Excel, PDF)
  • CLI principal
  • Notebook 05_full_pipeline_test.ipynb

CP-GAPS: v2.0 Blueprint Alignment (2026-01-19)

  • Gap Analysis vs BeyondCX Blueprints (4 docs Word)
  • Update rca_taxonomy.yaml with new driver categories
    • churn_risk drivers
    • fcr_failure drivers
    • agent_skills (positive + improvement_needed)
  • Update call_analysis.py models with new fields
    • FCRStatus enum
    • ChurnRisk enum
    • AgentClassification enum
    • DriverOrigin enum
    • AgentSkillIndicator model
    • Enhanced RCALabel with origin, corrective_action, replicable_practice
    • Updated CallAnalysis with new fields
  • Create prompt v2.0 (config/prompts/call_analysis/v2.0/)
    • system.txt
    • user.txt
    • schema.json
  • Update versions.yaml to active v2.0
  • Update prompt_manager.py with TaxonomyTexts
  • Update analyzer.py to parse new fields
  • Update aggregation models and statistics for v2.0
  • Update tests for v2.0 compatibility

CP-DASH: Streamlit Dashboard (2026-01-19)

  • Create dashboard structure (app.py, config.py, data_loader.py, components.py)
  • Implement Beyond Brand Identity styling
    • Colors: Black #000000, Blue #6D84E3, Grey #B1B1B0
    • Light theme configuration (.streamlit/config.toml)
    • Custom CSS with Outfit font
  • Implement 8 dashboard sections
    • Overview (KPIs, outcomes, drivers, FCR, churn)
    • Outcomes Analysis
    • Poor CX Analysis
    • FCR Analysis
    • Churn Risk Analysis
    • Agent Performance
    • Call Explorer
    • Export Insights
  • Advanced visualizations
    • RCA Sankey Diagram (Driver → Outcome → Churn Risk)
    • Correlation Heatmaps (co-occurrence, driver-outcome)
    • Outcome Deep Dive (root causes, correlation, duration)
  • Export functionality
    • Excel multi-sheet workbook
    • HTML executive summary report
    • JSON raw data export
  • Blueprint terminology compliance
    • FCR: 4 categorías (Primera Llamada/Rellamada × Sin/Con Riesgo)
    • Churn: Sin Riesgo de Fuga / En Riesgo de Fuga
    • Agent: Talento Para Replicar / Oportunidades de Mejora

Alta prioridad (Pendiente)

  • Run real benchmark with v2.0 - Ejecutar pipeline con 50-100 llamadas reales
  • Measure actual costs - Documentar costes reales STT + LLM
  • Validate v2.0 RCA accuracy - Manual review de 20 llamadas con nuevos campos
  • Documentation - Completar stubs en docs/
  • Test v2.0 with real transcripts - Validado con batch test-07 (30 llamadas)
  • Update exports for v2.0 - Dashboard incluye todos los campos nuevos
  • Dashboard Streamlit - Implementado con Beyond branding

Media prioridad (CP9 - Optional)

  • Caching por hash de transcript
  • Batch size benchmarks (encontrar óptimo)
  • Comparar STT providers (Whisper, Google)
  • Comparar LLM providers (Claude vs GPT-4o)
  • DuckDB para analytics de grandes batches

Baja prioridad (Fase 2)

  • Dashboard Streamlit (completado 2026-01-19)
  • Docker containerization
  • CI/CD pipeline
  • API REST (FastAPI)
  • Multi-idioma support
  • Real-time processing
  • Integración BeyondDiagnosticPrototipo
  • Campaign tracking (Blueprint KPI 2)
  • Customer value analysis (Blueprint Pilar 4)
  • Sales cycle optimization analysis

Backlog (Ideas)

  • Automatic prompt tuning based on validation results
  • A/B testing de prompts
  • Confidence calibration
  • Active learning loop
  • Cost anomaly detection

Última actualización: 2026-01-19 (v2.1 Dashboard + Blueprint Compliance completed)