EC Tech Docs

Set Up the Infrastructure

Initializing search

GitHub

Home
Getting started
Pipeline
Contribute
Infrastructure
Releases
References

EC Tech Docs

GitHub

Home
Getting started
Getting started
- First steps
  First steps
- First full run
  First full run
- First cluster run
  First cluster run
- Deep dive
  Deep dive
  - GCP
    GCP
    
    GCP Set-up
    
    GCP Environments
  - Kedro
    Kedro
    
    Kedro extensions
    
    Kedro Cloud run on the Cluster
    
    Kedro Environments
    Kedro Environments
    
    Base Environment
    
    Test Environment
    
    Sample environment guide
    
    Cloud Environment
  - Jupyter
    Jupyter
    
    Kedro Pipeline in Jupyter Notebooks
    
    Plugging into Cloud Environment Locally
  - Debugging
    Debugging
    
    Debugging our pipeline code
  - Walkthroughs
    Walkthroughs
    
    Running an experiment using previous data outputs
    
    Developing a custom model in the pipeline
    
    Fabricator: Declarative Synthetic Data Generation
    
    Modify data version
    
    New Data Source
    
    Testing a change in the release pipeline
    
    Data Catalog
Pipeline
Pipeline
- Pipeline steps
  Pipeline steps
  - Data release
    Data release
    
    Ingestion
    
    Integration
  - Features
    Features
    
    Filtering
    
    Embeddings
  - Modelling and matrix generation
    Modelling and matrix generation
    
    Modelling
    
    Matrix generation
    
    Matrix transformation
    
    Evaluation
  - Cross run
    Cross run
    
    Run comparison pipeline
  - Other
    Other
    
    Preprocessing
    
    Inference
- Data
  Data
- Data science
  Data science
  - Deep dive: Evaluation suite
  - Model Selection and Cross-Validation Techniques
- Data engineering
  Data engineering
  - Using Kedro to process datasets in batches asynchronously
  - Caching Approaches for API Based Enrichments
- Feedback loop
  Feedback loop
  - Flagging Review Pairs
- Debugging
  Debugging
  - Memory debugging for Embeddings node
Contribute
Contribute
Infrastructure
Infrastructure
Releases
Releases
- Public Data Releases
- Release History
- Attribution
- First-level Knowledge Sources
- KG Primary Knowledge Sources
- Archive
  Archive
  - 2026
  - 2025
  - 2024
References
References
- Common Errors
- Glossary

Set Up the Infrastructure

TODO: Fill this section

July 23, 2026 July 23, 2026 GitHub

Made with Material for MkDocs