NGINX Architecture – An Insight (Part 1)

NGINX Architecture – An Insight (Part 1)

API and Microservices | Oct 09, 2014

3 MIN READ

In a traditional web server architecture each client connection is handled as a separate process or thread, and as the popularity of a website grows, and the number of concurrent connections increases—the web server slows down, delaying responses to the users. From the technical standpoint, spawning a separate process/thread requires switching CPU to a new task, and creating a new runtime context—which consumes additional memory and CPU time, and negatively impacts performance.
NGINX was developed with the thought of achieving 10x more performance and the optimized use of server resources—while being able to scale and support dynamic growth of a website. As a result NGINX became one the most well-known modular, event-driven, asynchronous, single-threaded web server and web proxy.
In NGINX users’ connections are processed in highly efficient runloops inside a limited number of single-threaded processes—called worker(s). Each worker can handle thousands of concurrent connections and requests per second.
Event-driven is basically about an approach to handle various tasks as “events”. Incoming connection is an event, disk read is an event and so on. The idea is to not waste server resources unless there’s an “event” to handle. Modern operating system can notify the web server about initiation or completion of a task, which in turn enables NGINX workers to use proper resources in a proper way. Server resources can be allocated and released dynamically, on-demand—resulting in optimized usage of network, memory and CPU.
Asynchronous means the runloop doesn’t get stuck on particular events—it sets condition for “alarms” from the operating system about particular “events” and continues to monitor the “event queue” for “alarms”. Only when there’s an alarm about an event, the runloop triggers actions (e.g. read/write from the network interface). In turn, specific actions always try to utilize non-blocking interfaces to the OS so that the worker doesn’t stop on handling a particular event. This way NGINX workers can use available shared resources concurrently in the most efficient manner.
Single-threaded means that many user connections can be handled by a single worker process which in turn helps to avoid excessive context switching—and leads to more efficient usage of memory and CPU.
Modular architecture helps developers to extend the set of the web server features without heavily modifying the NGINX core.

Worker Process
NGINX does not create a new process or thread for every connection. Worker process accepts the new requests from a shared listen queue and executes a highly efficient runloop across them—to process thousands of connections per worker. Worker gets notifications about events from the mechanisms in the OS kernel.
When NGINX is started, an initial set of listening sockets is created, workers then start to accept, read from and write to sockets when processing HTTP requests and responses.
As NGINX does not fork a process or thread per connection, the memory usage is very conservative and extremely efficient in most of the cases—it’s basically a true on-demand handling of memory. NGINX also conserves CPU cycles as there’s no ongoing create-destroy pattern for processes or threads.
In a nutshell what NGINX does can be described as orchestration of the underlying OS and hardware resources to server web clients—by checking the state of the network and storage events, initializing new connections, adding them to the runloop, and processing asynchronously until completion, at which point the connection is deallocated and removed from the runloop. Consequently NGINX helps to achieve moderate-to-low CPU usage under even most extreme workloads.
NGINX spawns several worker(s)—it’s typically a worker per CPU core—which in turn helps to scale across multiple CPUs. This approach helps the OS to schedule tasks across NGINX workers more evenly.
General recommendations for worker configuration might be as following:
• For the CPU-intensive workload—the number of NGINX worker(s) should be equal to number of CPU cores.
• For I/O-intensive workload—the number of worker(s) might be about two times the number of cores.
Thus NGINX is able to do more in less resources (e.g memory and CPU).

Table of Contents

– Sandeep Khuperkar I CTO and Director, Ashnik

Fine tuning Postgres to achieve 5,000 Queries per second!

Jan 15, 2021 | 5 MIN READ

Everything you need to know about Connection Pooling in Postgres

Jan 16, 2019 | 5 MIN READ

NGINX: An open source platform for high-performance web architectures

Apr 10, 2015 | 3 MIN READ

Cookie	Duration	Description
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Talking Open Source Podcast: Demystifying AI For Enterprise - Part 1 Watch Now!

Revolutionize Your CX with
Unified Observability

CloudOps Automation tool for Infrastructure monitoring and deployment.

Indonesia’s top digital credit service provider leverages Ashnik’s PostgreSQL expertise and services

Revolutionize Your CX with Unified Observability

Automate and monitor your PostgreSQL with ease.

The CloudOps Automation Tool for easy Infrastructure deployment and monitoring

Maximize Potential of Your Data with Streaming Data Pipeline Architecture

End-to-End Traceability and Unified Observability for the Modern Infrastructure

Watch: How to auto-scale in deployments using Kubernetes(K8s): A Technical Demo

NGINX Architecture – An Insight (Part 1)

– Sandeep Khuperkar I CTO and Director, Ashnik

Read More

Fine tuning Postgres to achieve 5,000 Queries per second!

Everything you need to know about Connection Pooling in Postgres

NGINX: An open source platform for high-performance web architectures

Products