Skip to content
This repository has been archived by the owner on Aug 2, 2021. It is now read-only.

Stability area meta + timeline #1035

Open
21 of 51 tasks
zelig opened this issue Nov 30, 2018 · 2 comments
Open
21 of 51 tasks

Stability area meta + timeline #1035

zelig opened this issue Nov 30, 2018 · 2 comments

Comments

@zelig
Copy link
Member

zelig commented Nov 30, 2018

Tooling for network tests

User-experience benchmarks and metrics

Need more work to define

  • failure rate of accessing existing content long term
  • availability delay for different file sizes
    • earliest node can retrieve
    • all nodes can retrieve

Network testing

  • GlobalStore - shall we complete this, and add it as part of the Swarm app/deployment charts?
  • Chunk Explorer - shall we implement this tool - basically the ability to inspect chunks on given nodes?
  • Implement a tool to check if there is a path from one node to another based on syncing streams (server/client)?

Simulation framework

Mocking tools

k8s

  • Logging for k8s containers (maybe ELK / EFK - https://akomljen.com/get-kubernetes-logs-with-efk-stack-in-5-minutes/ ?)~
  • Container logs rotation PR#58~
  • Ingress for services~
    • for developers to individual Swarm pods~
    • for developers to Logging and Monitoring and Kubernetes Dashboard~
    • for users to Swarm, load balanced across all Swarm pods~
    • for monitoring, logging, and other auxiliary services~
  • SSL for all ingress services~
  • Authentication/Authorization for developers (AWS IAM access/secret keys, with no access to AWS services or to AWS console) to~
    • k8s / kubectl - it would be good if devs have access to this, so that they can port-forward and docker exec towards specific swarm containers.~
    • Monitoring~
    • Logging~
  • Auto-scaling of staging cluster
  • Documenting the setup
    • Terraform for the initial AWS EKS setup~
      • Variables, such as different launch groups, and instance types, and why we need them~
    • Kubernetes Dashboard~
    • Prometheus Monitoring
    • Logging
    • Geth application and how to deploy a new version
    • Swarm application and how to deploy a new version~
  • AWS Usage/Costs dashboard~
  • Monitoring of PV EBS drives - the dashboards that come with the current Monitoring stack don't have this.~
  • Hardening for:
    • Logging setup.
    • Metrics setup.
    • All Roles and Service Accounts created by external helm applications we use.
    • OS patching?
  • Upgrade strategy for k8s
@petyaracz
Copy link

upload speed for different file sizes

what's the diff between hash response and wait for local storage ?

@zelig
Copy link
Member Author

zelig commented Jan 8, 2019

@petyaracz disk storage is asyncronous, it does not finish strictly by the time the chunker (and the API) returns the swarm hash.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

3 participants