[ANNOUNCE] haproxy-1.8.0

Hi all,

After one year of intense development and almost one month of debugging,
polishing, and cross-review work trying to prevent our respective coworkers
from winning the first bug award, I'm pleased to announce that haproxy 1.8.0
is now officially released!

Since -rc4, a few last user-visible changes were brought :
  - by default the master worker exits if any of its processes dies. This
    is done so that when certain processes are dedicated to certain tasks,
    we're not left with some features not working anymore. Imagine having
    7 SSL offloaders chaining to 1 HTTP frontend, and the last one dying,
    you don't want to keep the 7 useless frontends. By quitting, we give
    a chance to a service manager to detect the problem and alert/restart
    the service. The behaviour is configurable though.

  - we were not happy with "thread-map" vs "cpu-map", making these difficult
    to configure. Now "thread-map" was removed and the feature was merged
    into "cpu-map" which also supports process ranges and cpu ranges for
    easier configuration.

  - haproxy can now be built with native systemd support using USE_SYSTEMD=1
    and starting it with -Ws (systemd-aware master-worker mode).

  - HTTP/2 will not schedule a graceful connection shutdown anymore when
    seeing a "Connection: close" header in a response. Instead a new HTTP
    action "reject" has been implemented to work like its TCP counter-part.

  - the HTTP/2 gateway code now properly reassembles split Cookie headers,
    as mandated by the specification. Not doing it was causing some issues
    with certain application servers, and absolutely needed to be addressed
    before claiming that it works.

And here is a high level overview of the new features contributed to 1.8
(warning, the list is huge) :

  - JSON stats (Simon Horman) : the stats socket's "show stat" and "show info"
    output can now be emitted in a structured JSON format which is more
    convenient than CSV for some modern data processing frameworks.

  - server templates (Frédéric Lécaille) : servers can be pre-provisionned
    in backends using a simple directive ("server-template"). It is then
    possible to configure them at runtime over the CLI or DNS, making it
    trivial to add/remove servers at run time without restarting. As a side
    effect of implementing this, all "server" keywords are now supported on
    the "default-server" line and it's possible to disable any of them using
    "no-<keyword>". All settings changed at runtime are present in the state
    file so that upon reload no information is lost.

  - dynamic cookies (Olivier Houchard) : a dynamic cookie can be generated
    on the fly based on the transport address of a newly added server. This
    is important to be able to use server templates in stateful environments.

  - per-certificate "bind" configuration (Emmanuel Hocdet) : all the SSL
    specific settings of the "bind" line may now be set per-certificate in
    the crtlist file. A common example involves requiring a client cert for
    certain domains only and not for others, all of them running on the same
    address:port.

  - pipelined and asynchronous SPOE (Christopher Faulet) : it's an important
    improvement to the Stream Processing Offload Engine that allows requests
    to be streamed over existing connections without having to wait for a
    previous response. It significantly increases the message rate and reduces
    the need for parallel connections. Two example WAFs were introduced as
    contributions to make use of this improvement (mod_security and
    mod_defender).

  - seamless reloads (Olivier Houchard) : in order to work around some issues
    faced on Linux causing a few RST to be emitted for incoming connections
    during a reload operations despite SO_REUSEPORT being used, it is now
    possible for the new haproxy process to connect to the previous one and
    to retrieve existing listening sockets so that they are never closed. Now
    no connection breakage will be observed during a reload operation anymore.

  - PCRE2 support (David Carlier) : this new version of PCRE seems to be
    making its way in some distros, so now we are compatible with it.

  - hard-stop-after (Cyril Bonté) : this new global setting forces old
    processes to quit after a delay consecutive to a soft reload operation.
    This is mostly used to avoid an accumulation of old processes in some
    environments where idle connections are kept with large timeouts.

  - support for OpenSSL asynchronous crypto engines (Grant Zhang) : this
    allows haproxy to defer the expensive crypto operations to external
    hardware engines. Not only can it significantly improve the performance,
    but it can also reduce the latency impact of slow crypto operations on
    all other operations since haproxy switches to other tasks while the
    engine is busy. This was successfully tested with Intel's QAT and with
    a home-made software engine. This requires OpenSSL 1.1.x.

  - replacement of the systemd-wrapper with a new master-worker model
    (William Lallemand) : this new model allows a master process to stay in
    the foreground on top of the multiple worker processes. This process
    knows the list of worker processes, can watch them to detect failures,
    can broadcast some signals it receives, and has access to the file
    system to reload if needed (yes, it even supports seamless upgardes to
    newer versions since it reloads using an execve() call). While initially
    designed as a replacement for the systemd-wrapper, it also proves useful
    in other environments and during development.

  - DNS autonomous resolver (Baptiste Assmann) : the DNS resolution used to
    be triggered by health checks. While easy and convenient, it was a bit
    limited and didn't allow to manage servers via the DNS, but only to detect
    address changes. With this change the DNS resolvers are now totally
    autonomous and can distribute the addresses they've received to multiple
    servers at once, and if multiple A records are present in a response, the
    advertised addresses will be optimally distributed to all the servers
    relying on the same record.

  - DNS SRV records (Olivier Houchard) : in order to go a bit further with
    DNS resolution, SRV records were implemented. The address, port and weight
    attributes will be applied to servers. New servers are automatically added
    provided there are enough available templates, and servers which disappear
    are automatically removed from the farm. By combining server templates and
    SRV records, it is now trivial to perform service discovery.

  - configurable severity output on the CLI : external tools connecting to
    haproxy's CLI had to know a lot of details about the output of certain
    actions since these messages were initially aimed at humans, and it was
    not envisionned that the socket would become a runtime API. This change
    offers an option to emit the severity level on each action's output so
    that external APIs can classify the output between success, information,
    warnings, errors etc.

  - TLS 1.3 with support for Early-Data (AKA 0-RTT) on both sides (Olivier
    Houchard) : TLS 1.3 introduces the notion of "Early-Data", which are
    data emitted during the handshake. This feature reduces the TLS handshake
    time by one round trip. When compiled with a TLS-1.3 compatible TLS
    library (OpenSSL 1.1.1-dev for now), haproxy can receive such requests,
    process them safely, and even respond before the handshake completes.
    Furthermore, when the client opts for this, it is also possible to pass
    the request to the server following the same principle. This way it is
    technically possible to fully process a client request in a single round
    trip.

  - multi-thread support (Christopher Faulet, Emeric Brun) : no more need
    to choose between having multiple independant processes performing
    their own checks or cascading two layers of processes to scale SSL.
    With multi-threading we get the best of both : a unified process state
    and multi-core scalability. Eventhough this first implementation focuses
    on stability over performance, it still scales fairly well, being almost
    linear on asymmetric crypto, which is where there's the most demand.
    This feature is enabled by default on platforms where it could be tested,
    ie Linux >= 2.6.28, Solaris, FreeBSD, OpenBSD >= 5.7. It is considered
    EXPERIMENTAL, which means that if you face a problem with it, you may
    be asked to disable it for the time it takes to solve the problem. It is
    also possible that certain fixes to come will have some side effects.

  - HTTP/2 (Willy Tarreau) : HTTP/2 is automatically detected and processed
    in HTTP frontends negociating the "h2" protocol name based on the ALPN
    or NPN TLS extensions. At the moment the HTTP/2 frames are converted to
    HTTP/1.1 requests before processing, so they will always appear as 1.1
    in the logs (and in server logs). No HTTP/2 is supported for now on the
    backend, though this is scheduled for the next steps. HTTP/2 support is
    still considered EXPERIMENTAL, so just like for multi-threading, in case
    of problem you may end up having to disable it for the time it takes to
    solve the issue.

  - small objects cache (William Lallemand) : we've been talking about this
    so-called "favicon cache" for many years now, so I'm pretty sure it will
    be welcome. To give a bit of context, we've often been criticized for
    not caching trivial responses from the servers, especially some slow
    application servers occasionally returning a small object (favicon.ico,
    main.css etc). While the obvious response is that installing a cache
    there is the best idea, it is sometimes perceived as overkill for just
    a few files. So what we've done here was to fill exactly that hole :
    have a *safe*, maintenance-free, small objects cache. In practice, if
    there is any doubt about a response's cachability, it will not cache.
    Same if the response contains a Vary header or is larger than a buffer.
    However this can bring huge benefits for situations where there's no
    argument against trivial caching. The intent is to keep it as simple and
    fast as possible so that it can always be faster than retrieving the same
    object from the next layer (possibly a full-featured cache). Note that I
    purposely asked William *not* to implement the purge on the CLI so that
    it remains maintenance-free and we don't see it abused where it should
    not be installed.

This version brings a total of 1208 commits authored by 54 persons. That's
almost the double of the number of commits of 1.7 (706) for a bit less people
(62 by then), though most of them are the same.

A few known limitations still apply to this release, but they are minor
enough to allow us to release and fix them later :
  - master-worker + daemon (-W -D) fails strangely on FreeBSD, and the
    workaround is even stranger. Since the master-worker was meant to replace
    systemd-wrapper, it's not needed on this platform so we'll take care of
    analysing the issue in depth. In the mean time, don't use -W on FreeBSD
    (nor on OpenBSD given that the issue involved the kqueue poller).

  - the CLI's "show sess" command is known for not being 100% thread-safe,
    so it's better to avoid using it if more than one thread is enabled. Note
    that it will not corrupt your system, it will most often work, but may
    either report occasional garbage or immediately crash. If it completes
    the dump you're safe. We'll work on it as well.

  - both the cache and HTTP compression use filters. It is not trivial to
    safely use them both, we still need to sort this out and either
    automatically deal with each corner case or document recommendations for
    safe use. For now, please do not enable compression with the cache (choose
    only one of them). Note that neither is enabled by default so if you don't
    know, you're safe.

  - device detection engines currently don't support multi-threading (but it's
    safe to build with it, there is a runtime check).

The outstanding amount of new features above proves that the new development
model we've adopted last year works much better than what we had in the past.
However I also noticed that it added a lot more pressure on a few person's
shoulders whose help has been invaluable in screening each and every report
so that the developers could stay focused on their tasks. And for this reason,
among the 466 persons who participated to discussions over the last year and
those animating the Discourse forums, I'd like to address special thanks to
the following ones who together responded to the vast majority of the threads
on the list, saving many of us from having to leave our code :
  - Aleksandar Lazic (aka Aleks)
  - Cyril Bonté
  - Daniel Schneller
  - Emmanuel Hocdet (aka Manu)
  - Igor Cicimov
  - Jarno Huuskonen
  - Pavlos Parissis
  - Thierry Fournier
  - Vincent Bernat

and a very special one for Lukas Tribus who in addition to providing a lot
of high quality answers on the mailing list has been tirelessly responding
to almost every question on Discourse, which is truly amazing (I'm starting
to suspect that there are several persons using the same name)! I'm totally
aware that saying "thank you" is not enough and that we'll definitely have
to see how to make your life easier as well guys, so that we can continue
to scale without adding you more burden!

I also noticed that the average quality of problem reports has significantly
increased over time, in part thanks to some long-time participants well used
to the process like Conrad Hoffmann, Pieter Baauw (aka PiBa-NL), Patrick
Hemmer, Dmitry Sivachenko or Jarno Huuskonen, and it's really great because
there's nothing more annoying than having to respond to a problem by always
starting to ask for the same information. So please keep up the good work
guys!

In my opinion we haven't emitted enough versions to make it easy for more
people to test, just like we haven't emitted enough stable releases, due to
all the people involved in the process being busy on their development. This
is something we'll have to address. I'll send a proposal of release schedule
for 1.9 some time later.

Now 1.9 opens with 1.9-dev0 so that we can go break things as usual :-)

Please find the usual URLs below :
   Site index       : http://www.haproxy.org/
   Discourse        : http://discourse.haproxy.org/
   Sources          : http://www.haproxy.org/download/1.8/src/
   Git repository   : http://git.haproxy.org/git/haproxy-1.8.git/
   Git Web browsing : http://git.haproxy.org/?p=haproxy-1.8.git
   Changelog        : http://www.haproxy.org/download/1.8/src/CHANGELOG
   Cyril's HTML doc : http://cbonte.github.io/haproxy-dconv/

By the way, I'm well known for messing up with each and every release, leaving
broken links here and there. So if something appears wrong, please report it!

Willy
---
Complete changelog from -rc5 to -final:
Christopher Faulet (15):
      MINOR: sample: Add "thread" sample fetch
      BUG/MINOR: Use crt_base instead of ca_base when crt is parsed on a server 
line
      BUG/MINOR: listener: Allow multiple "process" options on "bind" lines
      MINOR: config: Support a range to specify processes in "cpu-map" parameter
      MINOR: config: Slightly change how parse_process_number works
      MINOR: config: Export parse_process_number and use it wherever it's 
applicable
      MINOR: standard: Add my_ffsl function to get the position of the bit set 
to one
      MINOR: config: Add auto-increment feature for cpu-map
      MINOR: config: Support partial ranges in cpu-map directive
      MINOR:: config: Remove thread-map directive
      MINOR: config: Add the threads support in cpu-map directive
      MINOR: config: Add threads support for "process" option on "bind" lines
      MEDIUM: listener: Bind listeners on a thread subset if specified
      CLEANUP: debug: Use DPRINTF instead of fprintf into #ifdef 
DEBUG_FULL/#endif
      CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning

Emeric Brun (1):
      DOC: add initial peers protovol v2.0 documentation.

Emmanuel Hocdet (1):
      MINOR: ssl: Handle early data with BoringSSL

Eric Salama (2):
      CONTRIB: spoa_example: allow to compile outside HAProxy.
      CONTRIB: spoa_example: remove SPOE enums that are useless for clients

Lukas Tribus (2):
      BUG/MINOR: systemd: ignore daemon mode
      DOC: explain HTTP2 timeout behavior

Olivier Houchard (5):
      BUG/MINOR: ssl: Always start the handshake if we can't send early data.
      MINOR: ssl: Don't disable early data handling if we could not write.
      MINOR: ssl: Handle reading early data after writing better.
      MINOR: mux: Make sure every string is woken up after the handshake.
      MINOR/CLEANUP: proxy: rename "proxy" to "proxies_list"

Tim Duesterhus (1):
      MEDIUM: mworker: Add systemd `Type=notify` support

William Lallemand (17):
      BUG/MEDIUM: cache: free callback to remove from tree
      CLEANUP: cache: remove unused struct
      MEDIUM: cache: enable the HTTP analysers
      CLEANUP: cache: remove wrong comment
      CLEANUP: cache: reorder includes
      MEDIUM: shctx: use unsigned int for len and block_count
      MEDIUM: cache: "show cache" on the cli
      BUG/MEDIUM: cache: use key=0 as a condition for freeing
      BUG/MEDIUM: cache: refcount forbids to free the objects
      BUG/MEDIUM: cache fix cli_kws structure
      MEDIUM: cache: store sha1 for hashing the cache key
      BUG/MEDIUM: cache: free ressources in chn_end_analyze
      MINOR: cache: move the refcount decrease in the applet release
      MINOR: cache: replace a fprint() by an abort()
      MEDIUM: cache: max-age configuration keyword
      DOC: cache: configuration and management
      MAJOR: mworker: exits the master on failure

Willy Tarreau (49):
      BUG/MEDIUM: stream: don't automatically forward connect nor close
      BUG/MAJOR: stream: ensure analysers are always called upon close
      BUG/MINOR: stream-int: don't try to read again when CF_READ_DONTWAIT is 
set
      MINOR: threads/atomic: rename local variables in macros to avoid conflicts
      MINOR: threads/plock: rename local variables in macros to avoid conflicts
      MINOR: threads/atomic: implement pl_mb() in asm on x86
      MINOR: threads/atomic: implement pl_bts() on non-x86
      MINOR: threads/build: atomic: replace the few inlines with macros
      BUILD: threads/plock: fix a build issue on Clang without optimization
      BUILD: ebtree: don't redefine types u32/s32 in scope-aware trees
      BUILD: compiler: add a new type modifier __maybe_unused
      BUILD: h2: mark some inlined functions "unused"
      BUILD: server: check->desc always exists
      BUG/MEDIUM: h2: properly report connection errors in headers and data 
handlers
      MEDIUM: h2: add a function to emit an HTTP/1 request from a headers list
      MEDIUM: h2: change hpack_decode_headers() to only provide a list of 
headers
      BUG/MEDIUM: h2: always reassemble the Cookie request header field
      CONTRIB: spoa_example: remove bref, wordlist, cond_wordlist
      CONTRIB: spoa_example: remove last dependencies on type "sample"
      BUG/MEDIUM: deinit: correctly deinitialize the proxy and global listener 
tasks
      MINOR: pools: prepare functions to override malloc/free in pools
      MINOR: pools: implement DEBUG_UAF to detect use after free
      BUG/MEDIUM: threads/time: fix time drift correction
      BUG/MEDIUM: threads/time: maintain a common time reference between all 
threads
      BUG/MINOR: stream: fix tv_request calculation for applets
      BUG/MAJOR: h2: always remove a stream from the send list before freeing it
      BUG/MAJOR: threads/task: dequeue expired tasks under the WQ lock
      MINOR: http: implement the "http-request reject" rule
      MINOR: h2: send RST_STREAM before GOAWAY on reject
      MEDIUM: h2: don't gracefully close the connection anymore on Connection: 
close
      MINOR: h2: make use of client-fin timeout after GOAWAY
      MEDIUM: config: ensure that tune.bufsize is at least 16384 when using 
HTTP/2
      BUG/MEDIUM: stream: always release the stream-interface on abort
      CLEANUP: pools: rename all pool functions and pointers to remove this "2"
      DOC: update the roadmap file with the latest changes merged in 1.8
      DOC: fix mangled version in peers protocol documentation
      DOC: mention William as maintainer of the cache and master-worker
      DOC: add Christopher and Emeric as maintainers of the threads
      BUG/MINOR: threads: don't drop "extern" on the lock in include files
      MINOR: task: keep a pointer to the currently running task
      MINOR: task: align the rq and wq locks
      MINOR: fd: cache-align fdtab and fdcache locks
      MINOR: buffers: cache-align buffer_wq_lock
      CLEANUP: server: reorder some fields in struct server to save 40 bytes
      CLEANUP: proxy: slightly reorder the struct proxy to reduce holes
      CLEANUP: checks: remove 16 bytes of holes in struct check
      CLEANUP: cache: more efficiently pack the struct cache
      CLEANUP: fd: place the lock at the beginning of struct fdtab
      CLEANUP: pools: align pools on a cache line
      BUG/MAJOR: threads/queue: avoid recursive locking in 
pendconn_get_next_strm()
      BUILD: Makefile: reorder object files by size
      [RELEASE] Released version 1.8.0
---