nebula

Commit Graph

Author	SHA1	Message	Date
Wade Simmons	befce3f990	fix crash with `-test` (#602 ) When running in `-test` mode, `tun` is set to nil. So we should move the defer into the `!configTest` if block. panic: runtime error: invalid memory address or nil pointer dereference [signal SIGSEGV: segmentation violation code=0x1 addr=0x28 pc=0x54855c] goroutine 1 [running]: github.com/slackhq/nebula.Main.func3(0x4000135e80, {0x0, 0x0}) github.com/slackhq/nebula/main.go:176 +0x2c github.com/slackhq/nebula.Main(0x400022e060, 0x1, {0x76faa0, 0x5}, 0x4000230000, 0x0) github.com/slackhq/nebula/main.go:316 +0x2414 main.main() github.com/slackhq/nebula/cmd/nebula/main.go:54 +0x540	2021-12-06 14:06:16 -05:00
Nate Brown	48c47f5841	Warn if no lighthouses were configured on a non lighthouse node (#587 )	2021-11-30 10:31:33 -06:00
Nate Brown	467e605d5e	Push route handling into overlay, a few more nits fixed (#581 )	2021-11-12 11:19:28 -06:00
Nate Brown	e07524a654	Move all of tun into overlay (#577 )	2021-11-11 16:37:29 -06:00
Nate Brown	88ce0edf76	Start the overlay package with the old Inside interface (#576 )	2021-11-10 21:52:26 -06:00
Nate Brown	4453964e34	Move util to test, contextual errors to util (#575 )	2021-11-10 21:47:38 -06:00
Nate Brown	bcabcfdaca	Rework some things into packages (#489 )	2021-11-03 20:54:04 -05:00
brad-defined	6ae8ba26f7	Add a context object in nebula.Main to clean up on error (#550 )	2021-11-02 13:14:26 -05:00
Donatas Abraitis	32e2619323	Teardown tunnel automatically if peer's certificate expired (#370 )	2021-10-20 13:23:33 -05:00
Wade Simmons	ea2c186a77	remote_allow_ranges: allow inside CIDR specific remote_allow_lists (#540 ) This allows you to configure remote allow lists specific to different subnets of the inside CIDR. Example: remote_allow_ranges: 10.42.42.0/24: 192.168.0.0/16: true This would only allow hosts with a VPN IP in the 10.42.42.0/24 range to have private IPs (and thus don't connect over public IPs). The PR also refactors AllowList into RemoteAllowList and LocalAllowList to make it clearer which methods are allowed on which allow list.	2021-10-19 10:54:30 -04:00
brad-defined	7859140711	Only set serveDns if the host is also configured to be a lighthouse. (#433 )	2021-04-16 13:33:56 -05:00
brad-defined	17106f83a0	Ensure the Nebula device exists before attempting to bind to the Nebula IP (#375 )	2021-04-16 10:34:28 -05:00
Nathan Brown	710df6a876	Refactor remotes and handshaking to give every address a fair shot (#437 )	2021-04-14 13:50:09 -05:00
Nathan Brown	1499be3e40	Fix name resolution for host names in config (#431 )	2021-04-01 21:48:41 -05:00
Nathan Brown	64d8e5aa96	More LH cleanup (#429 )	2021-04-01 10:23:31 -05:00
Nathan Brown	883e09a392	Don't use a global ca pool (#426 )	2021-03-29 12:10:19 -05:00
Wade Simmons	a71541fb0b	export build version as a prometheus label (#405 ) This is how Prometheus recommends you do it, and how they do it themselves in their client. This makes it easy to see which versions you have deployed in your fleet, and query over it too.	2021-03-26 14:16:35 -04:00
Nathan Brown	3ea7e1b75f	Don't use a global logger (#423 )	2021-03-26 09:46:30 -05:00
Nathan Brown	7073d204a8	IPv6 support for outside (udp) (#369 )	2021-03-18 20:37:24 -05:00
Ryan Huber	73a5ed90b2	Do not allow someone to run a nebula lighthouse with an ephemeral port (#399 ) * Do not allow someone to run a nebula lighthouse with an ephemeral port * derp - we discover the port so we have to check the config setting * No context needed for this error * gofmt yourself * Revert "gofmt yourself" This reverts commit c01423498e3792f7acd69d7e691dce1edad81bcb. * Revert "No context needed for this error" This reverts commit 6792af6846d1200c564a4ad601a637535dd56c5b. * snip snap snip snap	2021-03-08 12:42:06 -08:00
Nathan Brown	b6234abfb3	Add a way to trigger punch backs via lighthouse (#394 )	2021-03-01 19:06:01 -06:00
Wade Simmons	2a4beb41b9	Routine-local conntrack cache (#391 ) Previously, every packet we see gets a lock on the conntrack table and updates it. When running with multiple routines, this can cause heavy lock contention and limit our ability for the threads to run independently. This change caches reads from the conntrack table for a very short period of time to reduce this lock contention. This cache will currently default to disabled unless you are running with multiple routines, in which case the default cache delay will be 1 second. This means that entries in the conntrack table may be up to 1 second out of date and remain in a routine local cache for up to 1 second longer than the global table. Instead of calling time.Now() for every packet, this cache system relies on a tick thread that updates the current cache "version" each tick. Every packet we check if the cache version is out of date, and reset the cache if so.	2021-03-01 19:52:17 -05:00
Wade Simmons	a0583ebdca	tun_disabled: reply to ICMP Echo Request (#342 ) This change allows a server running with `tun.disabled: true` (usually a lighthouse) to still reply to ICMP EchoRequest packets. This allows you to "ping" the lighthouse Nebula IP as a quick check to make sure the tunnel is up, even when running with tun.disabled. This is still gated by allowing `icmp` packets in the inbound firewall rules.	2021-03-01 11:09:41 -05:00
Wade Simmons	27d9a67dda	Proper multiqueue support for tun devices (#382 ) This change is for Linux only. Previously, when running with multiple tun.routines, we would only have one file descriptor. This change instead sets IFF_MULTI_QUEUE and opens a file descriptor for each routine. This allows us to process with multiple threads while preventing out of order packet reception issues. To attempt to distribute the flows across the queues, we try to write to the tun/UDP queue that corresponds with the one we read from. So if we read a packet from tun queue "2", we will write the outgoing encrypted packet to UDP queue "2". Because of the nature of how multi queue works with flows, a given host tunnel will be sticky to a given routine (so if you try to performance benchmark by only using one tunnel between two hosts, you are only going to be using a max of one thread for each direction). Because this system works much better when we can correlate flows between the tun and udp routines, we are deprecating the undocumented "tun.routines" and "listen.routines" parameters and introducing a new "routines" parameter that sets the value for both. If you use the old undocumented parameters, the max of the values will be used and a warning logged. Co-authored-by: Nate Brown <nbrown.us@gmail.com>	2021-02-25 15:01:14 -05:00
Nathan Brown	68e3e84fdc	More like a library (#279 )	2020-09-18 09:20:09 -05:00
forfuncsake	9b8b3c478b	Support startup without a tun device (#269 ) This commit adds support for Nebula to be started without creating a tun device. A node started in this mode still has a full "control plane", but no effective "data plane". Its use is suited to a lighthouse that has no need to partake in the mesh VPN. Consequently, creation of the tun device is the only reason nebula neesd to be started with elevated privileged, so this example lighthouse can also be run as a non-root user.	2020-08-10 09:15:55 -04:00
Wade Simmons	4756c9613d	trigger handshakes when lighthouse reply arrives (#246 ) Currently, we wait until the next timer tick to act on the lighthouse's reply to our HostQuery. This means we can easily add hundreds of milliseconds of unnecessary delay to the handshake. To fix this, we can introduce a channel to trigger an outbound handshake without waiting for the next timer tick. A few samples of cold ping time between two hosts that require a lighthouse lookup: before (v1.2.0): time=156 ms time=252 ms time=12.6 ms time=301 ms time=352 ms time=49.4 ms time=150 ms time=13.5 ms time=8.24 ms time=161 ms time=355 ms after: time=3.53 ms time=3.14 ms time=3.08 ms time=3.92 ms time=7.78 ms time=3.59 ms time=3.07 ms time=3.22 ms time=3.12 ms time=3.08 ms time=8.04 ms I recommend reviewing this PR by looking at each commit individually, as some refactoring was required that makes the diff a bit confusing when combined together.	2020-07-22 10:35:10 -04:00
Wade Simmons	aba42f9fa6	enforce the use of goimports (#248 ) * enforce the use of goimports Instead of enforcing `gofmt`, enforce `goimports`, which also asserts a separate section for non-builtin packages. * run `goimports` everywhere * exclude generated .pb.go files	2020-06-30 18:53:30 -04:00
Nathan Brown	41578ca971	Be more like a library to support mobile (#247 )	2020-06-30 13:48:58 -05:00
Wade Simmons	b37a91cfbc	add meta packet statistics (#230 ) This change add more metrics around "meta" (non "message" type packets). For lighthouse packets, we also record statistics around the specific lighthouse meta type. We don't keep statistics for the "message" type so that we don't slow down the fast path (and you can just look at metrics on the tun interface to find that information).	2020-06-26 13:45:48 -04:00
Wade Simmons	4f6313ebd3	fix config name for {remote,local}_allow_list (#219 ) This config option should be snake_case, not camelCase.	2020-04-08 16:20:12 -04:00
Wade Simmons	0a474e757b	Add lighthouse.{remoteAllowList,localAllowList} (#217 ) These settings make it possible to blacklist / whitelist IP addresses that are used for remote connections. `lighthouse.remoteAllowList` filters which remote IPs are allow when fetching from the lighthouse (or, if you are the lighthouse, which IPs you store and forward to querying hosts). By default, any remote IPs are allowed. You can provide CIDRs here with `true` to allow and `false` to deny. The most specific CIDR rule applies to each remote. If all rules are "allow", the default will be "deny", and vice-versa. If both "allow" and "deny" rules are present, then you MUST set a rule for "0.0.0.0/0" as the default. lighthouse: remoteAllowList: # Example to block IPs from this subnet from being used for remote IPs. "172.16.0.0/12": false # A more complicated example, allow public IPs but only private IPs from a specific subnet "0.0.0.0/0": true "10.0.0.0/8": false "10.42.42.0/24": true `lighthouse.localAllowList` has the same logic as above, but it applies to the local addresses we advertise to the lighthouse. Additionally, you can specify an `interfaces` map of regular expressions to match against interface names. The regexp must match the entire name. All interface rules must be either true or false (and the default rule will be the inverse). CIDR rules are matched after interface name rules. Default is all local IP addresses. lighthouse: localAllowList: # Example to blacklist docker interfaces. interfaces: 'docker.*': false # Example to only advertise IPs in this subnet to the lighthouse. "10.0.0.0/8": true	2020-04-08 15:36:43 -04:00
Wade Simmons	7cdbb14a18	Better config test (#177 ) * Better config test Previously, when using the config test option `-test`, we quit fairly earlier in the process and would not catch a variety of additional parsing errors (such as lighthouse IP addresses, local_range, the new check to make sure static hosts are in the certificate's subnet, etc). * run config test as part of smoke test * don't need privileges for configtest Co-authored-by: Nathan Brown <nate@slack-corp.com>	2020-04-06 11:35:32 -07:00
Felix Yan	9e2ff7df57	Correct typos in noise.go (#205 )	2020-03-30 11:23:55 -07:00
Ryan Huber	1297090af3	add configurable punching delay because of race-condition-y conntracks (#210 ) * add configurable punching delay because of race-condition-y conntracks * add changelog * fix tests * only do one punch per query * Coalesce punchy config * It is not is not set * Add tests Co-authored-by: Nate Brown <nbrown.us@gmail.com>	2020-03-27 11:26:39 -07:00
Wade Simmons	179a369130	add configuration options for HandshakeManager (#179 ) This change exposes the current constants we have defined for the handshake manager as configuration options. This will allow us to test and tweak with different intervals and wait rotations. # Handshake Manger Settings handshakes: # Total time to try a handshake = sequence of `try_interval * retries` # With 100ms interval and 20 retries it is 23.5 seconds try_interval: 100ms retries: 20 # wait_rotation is the number of handshake attempts to do before starting to try non-local IP addresses wait_rotation: 5	2020-02-21 16:25:11 -05:00
Wade Simmons	2d24ef7166	validate lighthouses and static hosts are in our subnet (#170 ) Validate all lighthouse.hosts and static_host_map VPN IPs are in the subnet defined in our cert. Exit with a fatal error if they are not in our subnet, as this is an invalid configuration (we will not have the proper routes set up to communicate with these hosts). This error case could occur for the following invalid example: nebula-cert sign -name "lighthouse" -ip "10.0.1.1/24" nebula-cert sign -name "host" -ip "10.0.2.1/24" config.yaml: static_host_map: "10.0.1.1": ["lighthouse.local:4242"] lighthouse: hosts: - "10.0.1.1" We will now return a fatal error for this config, since `10.0.1.1` is not in the host cert's subnet of `10.0.2.1/24`	2020-01-20 15:52:55 -05:00
Ryan Huber	9981510554	new mtu setting and const for default	2019-12-12 18:01:46 +00:00
Ryan Huber	f03d895ebf	don't steal error	2019-12-12 17:31:22 +00:00
Ryan Huber	9333a8e3b7	subnet support	2019-12-12 16:34:17 +00:00
Robin B	a086d60edc	Allow configuration of dns listener host/port (#74 ) * Allow configuration of dns listener host/port * Make DNS listen host/port configuration HUP-able	2019-12-11 17:42:55 -08:00
Nate Brown	9bd8cd2c11	Rebase on master, improve other fatal error messages	2019-12-11 11:08:39 -08:00
Nate Brown	1640a9bc77	Fail with a better error message if lh a hosts is unparsable	2019-12-09 16:53:56 -08:00
Alan Lam	61d9f241b9	Adds am_lighthouse warning msg (#43 ) * add warning message when am_lighthouse is enabled; update config templating	2019-11-24 09:32:08 -08:00
Ryan Huber	08915315ff	add tests and improve error	2019-11-23 23:55:23 +00:00
Ryan Huber	83d2550b2d	add an error (non fatal) when a lighthouse host has no static entry	2019-11-23 21:46:45 +00:00
Ryan Huber	6a460ba38b	remove old hmac function. superceded by ix_psk0	2019-11-23 16:50:36 +00:00
Nate Brown	3b1826740e	Improve tun/udp init error messages	2019-11-22 16:18:33 -08:00
Slack Security Team	f22b4b584d	Public Release	2019-11-19 17:00:20 +00:00

49 Commits