Skip to main content
Article·blog.healthchecks.io
infrastructuredevopsautomationdata-pipelinesentrepreneurshiphealthchecks.io

Healthchecks.io now uses self-hosted object storage

Healthchecks.io transitioned to self-hosted object storage for enhanced control, cost efficiency, and performance. AI practitioners can adopt this strategy to optimize their data infrastructure, improving data locality, privacy, and reducing egress costs for large datasets.

intermediate1 hour5 steps
The play
  1. Assess Current Data Storage Strategy
    Review your existing cloud data storage solutions (e.g., data lakes, model checkpoints, feature stores). Identify areas with high egress costs, data locality constraints, or performance bottlenecks that impact large AI/ML datasets.
  2. Understand Self-Hosting Benefits for AI/ML
    Research how self-hosted object storage solutions (e.g., MinIO, Ceph) can provide greater operational control, potential cost savings (especially for egress), improved data access latency, and enhanced data privacy for sensitive AI/ML workflows.
  3. Evaluate Self-Hosted Object Storage Solutions
    Explore leading self-hosted object storage platforms. Compare their features, scalability, deployment complexity, and the level of community or enterprise support available. Consider your team's existing infrastructure expertise and resources.
  4. Plan Operational Requirements and Expertise
    Outline the full scope of operational overhead for self-hosting, including hardware procurement, software deployment, ongoing maintenance, robust backup strategies, and disaster recovery. Estimate the internal expertise required or resources needed for training.
  5. Pilot a Self-Hosted Object Storage Instance
    Set up a small-scale pilot project. Deploy a chosen self-hosted object storage solution (e.g., MinIO on a VM or Kubernetes cluster) for a non-critical dataset. Test its feasibility, performance, and integration with your existing AI/ML pipelines.
Starter code
docker run \
  -p 9000:9000 \
  -p 9001:9001 \
  --name minio \
  -e "MINIO_ROOT_USER=minioadmin" \
  -e "MINIO_ROOT_PASSWORD=minioadmin" \
  quay.io/minio/minio server /data --console-address ":9001"
Source
Healthchecks.io now uses self-hosted object storage — Action Pack