Submitted:
25 October 2025
Posted:
28 October 2025
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. System Architecture
2.1. Concept Overview
- NGINX patched module with socket descriptor duplication.
- Acceptor module that transfers active connections via UNIX-domain sockets.
- Service Modules that perform request handling as independent processes.
2.2. Advantages over Conventional L7LB
- Local port exhaustion mitigation by offloading slow terminals.
- Dynamic and configurable L7 load balancing without restarts.
- No health-check overhead.
- Graceful restart support via socket FD passing.
- Reduced number of required application service processes.
3. Implementation
3.1. NGINX Modification
3.2. Acceptor Module
3.3. Example Service Module
4. Performance Evaluation
4.1. Environment
| Name | Description |
|---|---|
| nginx worker process | 4 |
| Benchmark tool | ApacheBench(ab) |
| ab -n 10000 -c 4 -k "http://127.0.0.1:8080/" | http keep alive |
| ab -n 10000 -c 4 "http://127.0.0.1:8080/" | no keep alive |
4.2. Results
5. Discussion
5.1. io-uring
5.2. Deterministic QPS and Spike Resistance
5.3. Definition and Analysis of Deterministic QPS
5.3.1. Definition
5.3.2. Mechanism
- The application never receives more simultaneous requests than it can process.
- The proxy never accumulates excessive pending ACCEPT/LISTEN queues, even under burst traffic.
- QPS becomes deterministic and directly measurable as a function of the application’s concurrency configuration.
5.3.3. Analysis
5.4. Port Exhaustion Mitigation
5.4.1. Definition
5.4.2. Problem Context
- Slow or idle clients occupy proxy ports for extended periods.
- Application-side latency indirectly causes TIME_WAIT and CLOSE_WAIT accumulation in the proxy.
- Eventually, ephemeral-port exhaustion limits connection acceptance capacity and induces cascading failures.
5.4.3. Proposed Mechanism
5.4.4. Analysis
- Stable port consumption: proportional to declared backend concurrency rather than client volume.
- Reduced TIME_WAIT/CLOSE_WAIT pressure: preventing socket-table inflation during long uptimes.
- Improved multi-service coexistence: isolation between independent service modules avoids cross-service port contention.
5.4.5. Considerations
5.5. Session Fairness and Multi-Core Affinity
6. Comparison to Conventional Architectures
6.1. Comparison with Katran
6.2. Comparison with TCP Splicing
| Feature | Traditional L7LB | Proposed Layer7 Proxy |
|---|---|---|
| TCP Direction | Client -> Server | Server <-> Acceptor |
| Port Exhaustion | Frequent | Eliminated |
| Health Checks | Required | Not required |
| Session Fairness | Limited | Deterministic |
| Restart Overhead | High | Graceful |
| QPS Determinism | Low | High |
6.3. Relation to L4 Load Balancers (e.g., Maglev)

7. Conclusions







Author Contributions
Funding
Conflicts of Interest
Use of Artificial Intelligence
References
- Neal, Cardwell; et al. : Fast ZC Rx Data Plane using io_uring, Netdev 0x17 (2023). https://netdevconf.info/0x17/ docs/netdev-0x17-paper24-talk-paper.
- A. Agarwal, A. A. Agarwal, A. Nikolaev, D. Rybkin, et al., Katran: A Scalable Layer-4 Load Balancer using XDP and eBPF, Meta (Facebook) Engineering Blog, 18. Available at: https://engineering.fb.com/2018/05/22/open-source/katran-a-scalable-network-load-balancer/. Open source implementation: https://github. 20 May.
- D. Ely, S. M. D. Ely, S. M. Bellovin, and M. Richardson, “TCP Splice: Asymmetric TCP Connection Splicing for Application Layer Gateways,” Proceedings of the 3rd USENIX Symposium on Internet Technologies and Systems (USITS’99), USENIX Association, 1999.
- D. E. Eisenbud, C. D. E. Eisenbud, C. Yi, C. Contavalli, C. Smith, R. Kononov, E. Mann-Hielscher, A. Cilingiroglu, B. Cheyney, W. Shang, and J. D. Hosein, Maglev: A Fast and Reliable Software Network Load Balancer, USENIX NSDI, 2016.






Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).