EC Tech Docs
Set Up the Infrastructure
Initializing search
GitHub
Home
Getting started
Pipeline
Contribute
Infrastructure
Releases
References
EC Tech Docs
GitHub
Home
Getting started
Getting started
First steps
First steps
Matrix Pipeline
Tech Stack
Repository Structure
Installation
Local setup
Environments overview
Running the Matrix Pipeline
First full run
First full run
Data Access
Set Up Docker for Large Data Processing
Full Pipeline Run
First cluster run
First cluster run
Cluster Set-up
Cluster Specific Config
Full Pipeline Run on Cluster
Deep dive
Deep dive
GCP
GCP
GCP Set-up
GCP Environments
Kedro
Kedro
Kedro extensions
Kedro Cloud run on the Cluster
Kedro Environments
Kedro Environments
Base Environment
Test Environment
Sample environment guide
Cloud Environment
Jupyter
Jupyter
Kedro Pipeline in Jupyter Notebooks
Plugging into Cloud Environment Locally
Debugging
Debugging
Debugging our pipeline code
Walkthroughs
Walkthroughs
Running an experiment using previous data outputs
Developing a custom model in the pipeline
Fabricator: Declarative Synthetic Data Generation
Modify data version
New Data Source
Testing a change in the release pipeline
Data Catalog
Pipeline
Pipeline
Pipeline steps
Pipeline steps
Data release
Data release
Ingestion
Integration
Features
Features
Filtering
Embeddings
Modelling and matrix generation
Modelling and matrix generation
Modelling
Matrix generation
Matrix transformation
Evaluation
Cross run
Cross run
Run comparison pipeline
Other
Other
Preprocessing
Inference
Data
Data
Data API
Diseases List
EC Drug List
Ground Truth Lists
Knowledge Graph
Data science
Data science
Deep dive: Evaluation suite
Model Selection and Cross-Validation Techniques
Data engineering
Data engineering
Using Kedro to process datasets in batches asynchronously
Caching Approaches for API Based Enrichments
Feedback loop
Feedback loop
Flagging Review Pairs
Debugging
Debugging
Memory debugging for Embeddings node
Contribute
Contribute
Onboard to Matrix Platform
Get access to data
Contribution Standards
Documentation
Enhanced Changelog Generation Agent Instructions
Infrastructure
Infrastructure
Kubernetes Cluster
Public Data Zone
Observability Stack
LiteLLM Documentation
LiteLLM User Guide
LiteLLM Admin Guide
Adding a New Provider to LiteLLM
GCP Foundations
Identity-Aware Proxy (IAP) Architecture
Set Up the Infrastructure
Terraform modules
Terraform modules
Artifact Registry Module
Runbooks
Runbooks
Create a Release
Fix Github Actions worker
Running an experiment from a branch on the Every Cure Platform
Adding a new service via HTTPS
Running Argo Workflows locally [ in progress ]
Release Article Template Generation
Creating an OAuth Client
Restoring a backup
Secret Leak on GitHub Runbook
Architecture decision records
Architecture decision records
Automated Release Workflow
ADR: CI Optimization with GitHub Actions Self-Hosted Runners
ADR: Cross-Project Orchard Data Access Implementation
deploying LiteLLM
Process for Open Sourcing MATRIX repository
Improve testing through sampling
Main-Only Infrastructure Deployment Strategy
ADR: Make modelling/tuning CPU‑first; remove GPU dependency in the pipeline
Platform Refactor and Standardization.
Data Storage Setup for Open Sourcing
Secure private datasets
Switching pipeline runs to spot instances
GCP CloudBuild
GCP Billing Labels for Cost Management
GKE Safe Eviction Configuration
Spot Instance Implementation for Matrix Pipeline Infrastructure
Releases
Releases
Public Data Releases
Release History
Attribution
First-level Knowledge Sources
KG Primary Knowledge Sources
Archive
Archive
2026
2025
2024
References
References
Common Errors
Glossary
Set Up the Infrastructure
TODO: Fill this section