flesh out README

2019-03-26 23:27:19 +01:00
parent 407bfbfb7e
commit 27172a170c
1 changed files with 50 additions and 4 deletions
--- a/README.md
+++ b/README.md
@ -3,12 +3,16 @@
 Mesh overlay network manager, using [wireguard](https://www.wireguard.com/).
 **⚠ WARNING**: since mesh membership is controlled by a mesh-wide pre-shared key, this effectively downgrades some of the
-security benefits from wireguard. See "security considerations" below for more info.
+security benefits from wireguard. See [security considerations](#security-considerations) below for more info.
 ## Quickstart
 Before starting, make sure [wireguard](https://www.wireguard.com/) is installed on all nodes.
 The following ports must be accessible between all nodes (see [configuration options](#configuration-options) to change these):
 - 51820 UDP
 - 7946 UDP and TCP
 Install `wesher` on all nodes with:
 ```
 $ go get github.com/costela/wesher
@ -34,7 +38,37 @@ the nodes already joined to the mesh cluster.
 *Note*: `wireguard`, and therefore `wesher`, need root access.
-## Overview
+## Features
 The `wesher` tool builds a cluster and manages the configuration of wireguard on each node to create peer-to-peer
 connections between all nodes, thus forming a full mesh VPN.
 This approach may not scale for hundreds of nodes (benchmarks accepted 😉), but is sufficiently performant to join
 several nodes across multiple cloud providers, or simply to secure inter-node comunication in a single public-cloud.
 ### Automatic Key management
 The wireguard private keys are created on startup for each node and the respective public keys are then broadcast
 across the cluster. 
 The control-plane cluster communication is secured with a pre-shared AES-256 bit key. This key can be be automatically
 created during startup of the first node in a cluster, or it can be provided (see [configuration](#configuration-options)).
 The cluster key must then be sent to other nodes via a out-of-band secure channel (e.g. ssh, cloud-init, etc).
 Once set, the cluster key is saved locally and reused on the next startup.
 ### Automatic IP address management
 The overlay IP address of each node is selected out of a private network (`10.0.0.0/8` by default) and is consistently
 hashed based on the hostname, meaning a host will always receive the same overlay IP address (see [limitations](#overlay-ip-collisions)
 of this approach below). The hostname is also used by the underlying cluster management (using [memberlist](https://github.com/hashicorp/memberlist))
 to identify nodes and must therefore be unique in the cluster.
 To ease intra-node communication, `wesher` also adds entries to `/etc/hosts` for each other node. See [configuration](#configuration-options)
 below for how to disable this behavior.
 ### Restoring state
 If a node in the cluster is restarted, it will attempt to re-join the last-known nodes using the same cluster key.
 This means a restart requires no manual intervention.
 ## Configuration options
@ -54,7 +88,7 @@ All options can be passed either as command-line flags or environment variables:
 ## Security considerations
-The decision of whom to allow in the mesh is made by [memberlist](github.com/hashicorp/memberlist) and is secured by a
+The decision of whom to allow in the mesh is made by [memberlist](https://github.com/hashicorp/memberlist) and is secured by a
 cluster-wide pre-shared key.
 Compromise of this key will allow an attacker to:
 - access services exposed on the overlay network
@ -62,7 +96,7 @@ Compromise of this key will allow an attacker to:
 It will not, however, allow the attacker access to decrypt the traffic between other nodes.
 This pre-shared key is currently static, set up during cluster bootstrapping, but will - in a future version - be
-rotated.
+rotated for improved security.
 ## Current known limitations
@ -72,3 +106,15 @@ Since the assignment of IPs on the overlay network is currently decided by the i
 naive hashing of the hostname, there can be no guarantee two hosts will not generate the same overlay IPs.
 This limitation may be worked around in a future version.
 ### Split-brain
 Once a cluster is joined, there is currently no way to distinguish a failed node from an intentionally removed one.
 This is partially by design: growing and shrinking your cluster dynamically (e.g. via autoscaling) should be as easy
 as possible.
 However, this does mean longer connection loss between any two parts of the cluster (e.g. across a WAN link between
 different cloud providers) can lead to a split-brain scenario where each side thinks the other side is simply "gone".
 There is currently no clean solution for this problem, but one could work around it by designating edge nodes which
 periodically restart `wesher` with the `--joinaddrs` option pointing to the other side.
 Future versions might include the notion of a "static" node to more cleanly avoid this.