SeaweedFS is a distributed storage system for blobs, objects, files, and data lake, to store and serve billions of files fast! Blob store has O(1) disk seek, local tiering, cloud tiering. Filer supports cross-cluster active-active replication, Kubernetes, POSIX, S3 API, encryption, Erasure Coding for warm storage, FUSE mount, Hadoop, WebDAV.

Last update: Jan 5, 2023


SeaweedFS

Sponsor SeaweedFS via Patreon

SeaweedFS is an independent Apache-licensed open source project with its ongoing development made possible entirely thanks to the support of these awesome backers. If you'd like to grow SeaweedFS even stronger, please consider joining our sponsors on Patreon.

Your support will be really appreciated by me and other supporters!

Quick Start with single binary

Download the latest binary from https://github.com/chrislusf/seaweedfs/releases and unzip it to get a single binary file, weed or weed.exe.
Run weed server -dir=/some/data/dir -s3 to start one master, one volume server, one filer, and one S3 gateway.

Also, to increase capacity, just add more volume servers by running weed volume -dir="/some/data/dir2" -mserver="<master_host>:9333" -port=8081 locally, or on a different machine, or on thousands of machines. That is it!
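With the server from the steps above running, the blob store can be exercised over plain HTTP. The following is a minimal, unofficial Go sketch: it asks the master for a file id, then posts the content to the volume server named in the reply. It assumes the master listens on localhost:9333 (the port used in the command above) and exposes the /dir/assign endpoint with fid and url reply fields as described in the upstream README. The helper names assign, upload, and parseAssignment are made up for illustration.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"mime/multipart"
	"net/http"
)

// assignment holds the two reply fields this sketch needs from /dir/assign.
type assignment struct {
	Fid string `json:"fid"` // file id, e.g. "3,01637037d6"
	URL string `json:"url"` // host:port of the volume server to upload to
}

// parseAssignment decodes the master's JSON reply and rejects incomplete ones.
func parseAssignment(raw []byte) (assignment, error) {
	var a assignment
	if err := json.Unmarshal(raw, &a); err != nil {
		return a, err
	}
	if a.Fid == "" || a.URL == "" {
		return a, fmt.Errorf("assign reply is missing fid or url: %s", raw)
	}
	return a, nil
}

// assign asks the master (assumed base URL, e.g. http://localhost:9333) for a file id.
func assign(master string) (assignment, error) {
	resp, err := http.Get(master + "/dir/assign")
	if err != nil {
		return assignment{}, err
	}
	defer resp.Body.Close()
	raw, err := io.ReadAll(resp.Body)
	if err != nil {
		return assignment{}, err
	}
	return parseAssignment(raw)
}

// upload posts content as a multipart form to http://<volume server>/<fid>.
func upload(a assignment, name string, content []byte) error {
	var body bytes.Buffer
	w := multipart.NewWriter(&body)
	part, err := w.CreateFormFile("file", name)
	if err != nil {
		return err
	}
	if _, err := part.Write(content); err != nil {
		return err
	}
	if err := w.Close(); err != nil {
		return err
	}
	resp, err := http.Post("http://"+a.URL+"/"+a.Fid, w.FormDataContentType(), &body)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	io.Copy(io.Discard, resp.Body)
	if resp.StatusCode >= 300 {
		return fmt.Errorf("upload: unexpected status %s", resp.Status)
	}
	return nil
}

func main() {
	a, err := assign("http://localhost:9333")
	if err != nil {
		fmt.Println("assign failed:", err)
		return
	}
	if err := upload(a, "hello.txt", []byte("hello seaweed")); err != nil {
		fmt.Println("upload failed:", err)
		return
	}
	fmt.Println("stored as", a.Fid)
}
```

The returned file id is what an application would save; reading the file back should then be a GET against the same volume server URL and file id.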

Quick Start for S3 API on Docker

docker run -p 8333:8333 chrislusf/seaweedfs server -s3

Introduction

SeaweedFS is a simple and highly scalable distributed file system. There are two objectives:

to store billions of files!
to serve the files fast!

SeaweedFS started as an Object Store to handle small files efficiently. Instead of managing all file metadata in a central master, the central master only manages volumes on volume servers, and these volume servers manage files and their metadata. This relieves concurrency pressure from the central master and spreads file metadata into volume servers, allowing faster file access (O(1), usually just one disk read operation).
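To make the split between master and volume servers concrete, here is a small Go sketch that takes apart a file id such as 661,33289a3ecd5edd, which appears in one of the issue titles on this page. It assumes the layout described in the upstream README: a decimal volume id, a comma, then a hex file key followed by an 8-hex-digit cookie. FileID and ParseFileID are hypothetical helpers, not SeaweedFS APIs. Only the volume id needs to be resolved through the master; the key and cookie are handled by the volume server itself.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// FileID is a hypothetical helper type for this sketch.
type FileID struct {
	VolumeID uint32 // resolved to a volume server via the master
	Key      uint64 // looked up by the volume server
	Cookie   uint32 // checked by the volume server against the stored value
}

// ParseFileID splits "<volume>,<hex key><8 hex cookie digits>" into its parts.
func ParseFileID(fid string) (FileID, error) {
	parts := strings.SplitN(fid, ",", 2)
	if len(parts) != 2 {
		return FileID{}, fmt.Errorf("file id %q: missing comma", fid)
	}
	rest := parts[1]
	if len(rest) <= 8 {
		return FileID{}, fmt.Errorf("file id %q: too short for key and cookie", fid)
	}
	vol, err := strconv.ParseUint(parts[0], 10, 32)
	if err != nil {
		return FileID{}, fmt.Errorf("file id %q: bad volume id: %w", fid, err)
	}
	key, err := strconv.ParseUint(rest[:len(rest)-8], 16, 64)
	if err != nil {
		return FileID{}, fmt.Errorf("file id %q: bad key: %w", fid, err)
	}
	cookie, err := strconv.ParseUint(rest[len(rest)-8:], 16, 32)
	if err != nil {
		return FileID{}, fmt.Errorf("file id %q: bad cookie: %w", fid, err)
	}
	return FileID{VolumeID: uint32(vol), Key: key, Cookie: uint32(cookie)}, nil
}

func main() {
	id, err := ParseFileID("661,33289a3ecd5edd")
	if err != nil {
		fmt.Println("parse failed:", err)
		return
	}
	// prints: volume=661 key=33289a cookie=1053646557
	fmt.Printf("volume=%d key=%x cookie=%d\n", id.VolumeID, id.Key, id.Cookie)
}
```

The decoded cookie, 1053646557, matches the "seen" value in that issue's log line, which is consistent with the assumed layout.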

There are only 40 bytes of disk storage overhead for each file's metadata. It is so simple with O(1) disk reads that you are welcome to challenge the performance with your actual use cases.
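As a rough illustration of what 40 bytes per file means at scale, the Go sketch below multiplies the figure out for a few file counts. It only restates the number quoted above; the function name and the chosen file counts are arbitrary.

```go
package main

import "fmt"

// metadataBytesPerFile is the per-file on-disk overhead quoted in the text.
const metadataBytesPerFile = 40

// metadataOverheadBytes returns the total metadata overhead for a file count.
func metadataOverheadBytes(files uint64) uint64 {
	return files * metadataBytesPerFile
}

func main() {
	for _, n := range []uint64{1_000_000, 1_000_000_000} {
		gb := float64(metadataOverheadBytes(n)) / 1e9
		fmt.Printf("%d files -> %.2f GB of metadata\n", n, gb)
	}
}
```

For one billion files this comes to 40 GB of metadata on disk, spread across the volume servers rather than held by the master.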

SeaweedFS started by implementing Facebook's Haystack design paper. SeaweedFS also implements erasure coding with ideas from f4: Facebook's Warm BLOB Storage System, and has a lot of similarities with Facebook's Tectonic Filesystem.

On top of the object store, an optional Filer can support directories and POSIX attributes. Filer is a separate, linearly scalable, stateless server with customizable metadata stores, e.g., MySQL, Postgres, Redis, Cassandra, HBase, MongoDB, Elasticsearch, LevelDB, RocksDB, SQLite, MemSQL, TiDB, Etcd, CockroachDB, etc.

For any distributed key-value store, large values can be offloaded to SeaweedFS. With its fast access speed and linearly scalable capacity, SeaweedFS can work as a distributed Key-Large-Value store.
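One way to picture the Key-Large-Value idea is a thin wrapper that keeps small values inline and stores only a file id for large ones. The Go sketch below is illustrative only: BlobStore, memBlobStore, KLV, and the size threshold are all hypothetical, and the in-memory fake stands in for SeaweedFS so the example runs without a cluster.

```go
package main

import (
	"errors"
	"fmt"
)

// BlobStore is a hypothetical interface standing in for SeaweedFS.
type BlobStore interface {
	Put(data []byte) (fid string, err error)
	Get(fid string) ([]byte, error)
}

// memBlobStore is an in-memory fake used only for this sketch.
type memBlobStore struct {
	next  int
	blobs map[string][]byte
}

func newMemBlobStore() *memBlobStore {
	return &memBlobStore{blobs: map[string][]byte{}}
}

func (m *memBlobStore) Put(data []byte) (string, error) {
	m.next++
	fid := fmt.Sprintf("fake-%d", m.next)
	m.blobs[fid] = append([]byte(nil), data...)
	return fid, nil
}

func (m *memBlobStore) Get(fid string) ([]byte, error) {
	b, ok := m.blobs[fid]
	if !ok {
		return nil, errors.New("blob not found: " + fid)
	}
	return b, nil
}

// entry is what the key-value store keeps: the value itself or a file id.
type entry struct {
	inline []byte
	fid    string
}

// KLV keeps small values inline and offloads large ones to the blob store.
type KLV struct {
	threshold int              // hypothetical cut-off in bytes
	kv        map[string]entry // stands in for the distributed key-value store
	blobs     BlobStore
}

func NewKLV(threshold int, blobs BlobStore) *KLV {
	return &KLV{threshold: threshold, kv: map[string]entry{}, blobs: blobs}
}

// Set stores value inline when it fits the threshold, otherwise offloads it.
func (s *KLV) Set(key string, value []byte) error {
	if len(value) <= s.threshold {
		s.kv[key] = entry{inline: append([]byte(nil), value...)}
		return nil
	}
	fid, err := s.blobs.Put(value)
	if err != nil {
		return err
	}
	s.kv[key] = entry{fid: fid}
	return nil
}

// Get returns the inline value or follows the file id into the blob store.
func (s *KLV) Get(key string) ([]byte, error) {
	e, ok := s.kv[key]
	if !ok {
		return nil, errors.New("key not found: " + key)
	}
	if e.fid == "" {
		return e.inline, nil
	}
	return s.blobs.Get(e.fid)
}

func main() {
	store := NewKLV(16, newMemBlobStore())
	store.Set("small", []byte("tiny"))
	store.Set("large", make([]byte, 1024))
	v, _ := store.Get("large")
	fmt.Println(len(v)) // prints 1024
}
```

The sketch does not delete the old blob when a key is overwritten; a real integration would need to handle that, along with failures between the two stores.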

SeaweedFS can transparently integrate with the cloud. With hot data on the local cluster and warm data in the cloud with O(1) access time, SeaweedFS can achieve both fast local access and elastic cloud storage capacity. What's more, the cloud storage access API cost is minimized. Faster and cheaper than direct cloud storage!

System	File Metadata	File Content Read	POSIX	REST API	Optimized for large number of small files
SeaweedFS	lookup volume id, cacheable	O(1) disk seek		Yes	Yes
SeaweedFS Filer	Linearly Scalable, Customizable	O(1) disk seek	FUSE	Yes	Yes
GlusterFS	hashing		FUSE, NFS
Ceph	hashing + rules		FUSE	Yes
MooseFS	in memory		FUSE		No
MinIO	separate meta file for each file			Yes	No

SeaweedFS	comparable to Ceph	advantage
Master	MDS	simpler
Volume	OSD	optimized for small files
Filer	Ceph FS	linearly scalable, Customizable, O(1) or O(logN)


Table of Contents

Quick Start with single binary

Quick Start for S3 API on Docker

Introduction

Additional Features

Filer Features

Kubernetes

Example: Using Seaweed Object Store

Start Master Server

Start Volume Servers

Write File

Save File Id

Read File

Rack-Aware and Data Center-Aware Replication

Allocate File Key on Specific Data Center

Other Features

Architecture

Master Server and Volume Server

Write and Read files

Storage Size

Saving memory

Tiered Storage to the cloud

Compared to Other File Systems

Compared to HDFS

Compared to GlusterFS, Ceph

Compared to GlusterFS

Compared to MooseFS

Compared to Ceph

Compared to MinIO

Dev Plan

Installation Guide

Disk Related Topics

Hard Drive Performance

Solid State Disk

Benchmark

License

Stargazers over time

Owner

Chris Lu

Comments

Lots of "volume_server_handlers_read.go:75] request /661,33289a3ecd5edd with unmaching cookie seen: 1053646557 expected: 1674074305 " error log in PROD env

Possible performance problems on some platforms

Slow performance in replication mode 010 while executing volume.fix.replication

Lots of "volume_server_handlers.go:75] read error: /5,1001e1b02c1b01" errors in our production server log files

Deploy structure

Corrupted Files in Cluster

Hot, warm, cold storage [feature]

brain split happened when network interrupts between dc

[bug:filer] Continues to stick not to the leader raft.Server: Not current leader (critical)

Filer hangs or restarts on deleting large buckets

[Emergency] Cluster failed after upgrading weedfs to version 2.62

Migration issue

Add useRaftHashicorp flag in helm chart

mount: can not ensure ordered file handle lock and unlock by orderedMutex

Add S3 ACL support (already passed tests)

more solid weed mount

It is not possible to request a file via master if the volume is in a read-only state

Can not upgrade from 2.xx to 3.xx
