Backend Development 54 min read

Mastering High-Concurrency C++ Servers: Multithreaded Network Programming Explained

This comprehensive guide explores C++ multithreaded network programming for high‑concurrency servers, covering threads, processes, socket basics, TCP/UDP protocols, thread pools, synchronization primitives, lock‑free structures, I/O multiplexing techniques, and a practical high‑performance chat server implementation.

Deepin Linux

Jul 5, 2025

Mastering High-Concurrency C++ Servers: Multithreaded Network Programming Explained

In the digital era, high concurrency is the norm for internet services such as e‑commerce flash sales, social media trends, and massive online games, demanding powerful concurrent processing capabilities. C++ provides superior performance, efficient execution, and precise control over system resources, making it a key choice for building high‑concurrency servers.

Part1 C++ Multithreaded Programming

1.1 What Are Threads and Processes?

A thread is the smallest unit of execution in an operating system, often called a lightweight process. It shares the process’s resources while maintaining its own execution context (program counter, registers, stack). A process is an independent unit of resource allocation with its own address space, code, and data. Multiple threads can exist within a single process, enabling parallel execution on multi‑core CPUs.

In Linux, the fork() function creates a new process by duplicating the calling process. The parent receives the child’s PID, while the child receives 0, allowing the program to distinguish execution paths.

#include <stdio.h>
#include <unistd.h>

int main() {
    pid_t pid;
    pid = fork();
    if (pid < 0) {
        perror("fork failed");
        return 1;
    } else if (pid == 0) {
        printf("I am the child process, my PID is %d, my parent's PID is %d
", getpid(), getppid());
    } else {
        printf("I am the parent process, my PID is %d, my child's PID is %d
", getpid(), pid);
    }
    return 0;
}

1.2 First Experience with the C++ Thread Library

Since C++11, the <thread> header provides portable thread creation. A thread is started by constructing a std::thread with a callable object.

Simple thread example:

#include <iostream>
#include <thread>

void printHello() {
    std::cout << "Hello from thread!" << std::endl;
}

int main() {
    std::thread t(printHello);
    std::cout << "Hello from main!" << std::endl;
    t.join();
    return 0;
}

A lambda can also be used to create a thread, making the code more concise.

#include <iostream>
#include <thread>

int main() {
    std::thread t([](){ std::cout << "Hello from thread created by lambda!" << std::endl; });
    std::cout << "Hello from main!" << std::endl;
    t.join();
    return 0;
}

Part2 Network Programming Basics

2.1 Detailed Socket Programming

A socket is an abstract data structure that provides a communication endpoint between applications. TCP sockets (SOCK_STREAM) guarantee ordered, reliable delivery, while UDP sockets (SOCK_DGRAM) offer fast, connection‑less transmission without delivery guarantees.

TCP server and client workflow:

Initialize socket and obtain file descriptor.

Server calls bind() to associate an IP address and port.

Server calls listen() to start listening.

Server calls accept() to wait for a client connection.

Client calls connect() to request a connection.

Data exchange via write() / read() (or send() / recv()).

When the client disconnects, the server receives EOF and eventually closes the socket.

Server code example (Linux):

#include <iostream>
#include <sys/socket.h>
#include <netinet/in.h>
#include <unistd.h>
#include <cstring>

#define PORT 8080

int main() {
    int server_fd, new_socket;
    struct sockaddr_in address;
    int opt = 1;
    int addrlen = sizeof(address);
    char buffer[1024] = {0};

    if ((server_fd = socket(AF_INET, SOCK_STREAM, 0)) == 0) {
        perror("socket failed");
        exit(EXIT_FAILURE);
    }
    if (setsockopt(server_fd, SOL_SOCKET, SO_REUSEADDR | SO_REUSEPORT, &opt, sizeof(opt))) {
        perror("setsockopt");
        exit(EXIT_FAILURE);
    }
    address.sin_family = AF_INET;
    address.sin_addr.s_addr = INADDR_ANY;
    address.sin_port = htons(PORT);
    if (bind(server_fd, (struct sockaddr *)&address, sizeof(address)) < 0) {
        perror("bind failed");
        exit(EXIT_FAILURE);
    }
    if (listen(server_fd, 3) < 0) {
        perror("listen");
        exit(EXIT_FAILURE);
    }
    if ((new_socket = accept(server_fd, (struct sockaddr *)&address, (socklen_t *)&addrlen)) < 0) {
        perror("accept");
        exit(EXIT_FAILURE);
    }
    int valread = read(new_socket, buffer, 1024);
    std::cout << "Received: " << buffer << std::endl;
    const char *response = "Hello from server";
    send(new_socket, response, strlen(response), 0);
    std::cout << "Response sent" << std::endl;
    close(new_socket);
    close(server_fd);
    return 0;
}

Client code example (Linux):

#include <iostream>
#include <sys/socket.h>
#include <arpa/inet.h>
#include <unistd.h>
#include <cstring>

#define PORT 8080

int main() {
    int sock = 0;
    struct sockaddr_in serv_addr;
    char buffer[1024] = {0};
    if ((sock = socket(AF_INET, SOCK_STREAM, 0)) < 0) {
        std::cerr << "Socket creation error" << std::endl;
        return -1;
    }
    serv_addr.sin_family = AF_INET;
    serv_addr.sin_port = htons(PORT);
    if (inet_pton(AF_INET, "127.0.0.1", &serv_addr.sin_addr) <= 0) {
        std::cerr << "Invalid address/ Address not supported" << std::endl;
        return -1;
    }
    if (connect(sock, (struct sockaddr *)&serv_addr, sizeof(serv_addr)) < 0) {
        std::cerr << "Connection Failed" << std::endl;
        return -1;
    }
    const char *message = "Hello from client";
    send(sock, message, strlen(message), 0);
    std::cout << "Message sent" << std::endl;
    int valread = read(sock, buffer, 1024);
    std::cout << "Received: " << buffer << std::endl;
    close(sock);
    return 0;
}

2.2 Network Protocols and Data Transmission

TCP/IP is the backbone of reliable data transfer. TCP establishes a connection via a three‑way handshake (SYN, SYN‑ACK, ACK) to synchronize sequence numbers and ensure ordered delivery. IP handles routing and addressing.

UDP provides connection‑less, low‑latency transmission suitable for real‑time applications such as video streaming or online gaming, where occasional packet loss is acceptable.

The three‑way handshake ensures both sides can send and receive data reliably:

Client sends SYN with initial sequence number.

Server replies with SYN‑ACK, acknowledging the client’s sequence and providing its own.

Client sends ACK, completing the connection.

During transmission, data passes through application, transport, network, and data‑link layers, where headers are added. Issues such as packet sticking (粘包) and splitting (拆包) can occur, especially with TCP streams. Common solutions include fixed‑length framing, delimiters, or prefixing each packet with its length.

Part3 C++ Multithreaded Network Programming Practice

3.1 Multithreaded Server Architecture Design

A common high‑concurrency architecture is the master‑worker model. The master thread accepts new connections and dispatches them to a pool of worker threads, which handle I/O and business logic. This separation improves scalability and keeps the acceptor responsive.

Thread pool implementation example:

#include <iostream>
#include <thread>
#include <vector>
#include <queue>
#include <mutex>
#include <condition_variable>
#include <functional>
#include <sys/socket.h>
#include <arpa/inet.h>
#include <unistd.h>

class ThreadPool {
public:
    ThreadPool(size_t numThreads) {
        for (size_t i = 0; i < numThreads; ++i) {
            threads.emplace_back([this] {
                while (true) {
                    std::function<void()> task;
                    {
                        std::unique_lock<std::mutex> lock(this->queueMutex);
                        this->condition.wait(lock, [this] { return this->stop || !this->tasks.empty(); });
                        if (this->stop && this->tasks.empty())
                            return;
                        task = std::move(this->tasks.front());
                        this->tasks.pop();
                    }
                    task();
                }
            });
        }
    }
    ~ThreadPool() {
        {
            std::unique_lock<std::mutex> lock(queueMutex);
            stop = true;
        }
        condition.notify_all();
        for (std::thread &thread : threads) {
            thread.join();
        }
    }
    template<class F, class... Args>
    void enqueue(F &&f, Args &&... args) {
        {
            std::unique_lock<std::mutex> lock(queueMutex);
            tasks.emplace(std::bind(std::forward<F>(f), std::forward<Args>(args)...));
        }
        condition.notify_one();
    }
private:
    std::vector<std::thread> threads;
    std::queue<std::function<void()>> tasks;
    std::mutex queueMutex;
    std::condition_variable condition;
    bool stop = false;
};

void handleClient(int clientSocket) {
    char buffer[1024] = {0};
    int valread = read(clientSocket, buffer, 1024);
    std::cout << "Received: " << buffer << std::endl;
    const char *response = "Hello from server";
    send(clientSocket, response, strlen(response), 0);
    std::cout << "Response sent" << std::endl;
    close(clientSocket);
}

int main() {
    int serverSocket = socket(AF_INET, SOCK_STREAM, 0);
    if (serverSocket == 0) {
        perror("socket failed");
        exit(EXIT_FAILURE);
    }
    struct sockaddr_in address;
    address.sin_family = AF_INET;
    address.sin_addr.s_addr = INADDR_ANY;
    address.sin_port = htons(8080);
    if (bind(serverSocket, (struct sockaddr *)&address, sizeof(address)) < 0) {
        perror("bind failed");
        exit(EXIT_FAILURE);
    }
    if (listen(serverSocket, 3) < 0) {
        perror("listen");
        exit(EXIT_FAILURE);
    }
    ThreadPool pool(4);
    while (true) {
        struct sockaddr_in clientAddress;
        socklen_t clientAddressLen = sizeof(clientAddress);
        int clientSocket = accept(serverSocket, (struct sockaddr *)&clientAddress, &clientAddressLen);
        if (clientSocket < 0) {
            perror("accept");
            continue;
        }
        pool.enqueue([clientSocket] { handleClient(clientSocket); });
    }
    close(serverSocket);
    return 0;
}

This design enables a server to handle many simultaneous connections efficiently.

3.2 Thread Synchronization and Mutual Exclusion

When multiple threads share resources, synchronization is essential to avoid data races. C++ provides std::mutex for exclusive locking and std::condition_variable for coordinated waiting.

Mutex example:

#include <iostream>
#include <mutex>
#include <thread>

std::mutex mtx;
int sharedResource = 0;

void increment() {
    for (int i = 0; i < 1000; ++i) {
        mtx.lock();
        ++sharedResource;
        mtx.unlock();
    }
}

int main() {
    std::thread t1(increment);
    std::thread t2(increment);
    t1.join();
    t2.join();
    std::cout << "Final value of sharedResource: " << sharedResource << std::endl;
    return 0;
}

Condition variable example (producer‑consumer):

#include <iostream>
#include <mutex>
#include <condition_variable>
#include <queue>
#include <thread>

std::mutex mtx;
std::condition_variable cv;
std::queue<int> dataQueue;

void producer() {
    for (int i = 0; i < 10; ++i) {
        std::unique_lock<std::mutex> lock(mtx);
        dataQueue.push(i);
        std::cout << "Produced: " << i << std::endl;
        lock.unlock();
        cv.notify_one();
        std::this_thread::sleep_for(std::chrono::milliseconds(100));
    }
}

void consumer() {
    while (true) {
        std::unique_lock<std::mutex> lock(mtx);
        cv.wait(lock, [] { return !dataQueue.empty(); });
        int data = dataQueue.front();
        dataQueue.pop();
        std::cout << "Consumed: " << data << std::endl;
        lock.unlock();
        if (data == 9) break;
    }
}

int main() {
    std::thread t1(producer);
    std::thread t2(consumer);
    t1.join();
    t2.join();
    return 0;
}

3.3 Concurrency Control Strategies

Beyond basic locks, lock‑free data structures and read‑write locks improve performance in read‑heavy scenarios.

Lock‑free list example (simplified):

#include <atomic>

template<typename T>
struct Node {
    T data;
    std::atomic<Node<T>*> next;
    Node(const T& value) : data(value), next(nullptr) {}
};

template<typename T>
class LockFreeList {
public:
    LockFreeList() : head(nullptr) {}
    bool insert(const T& value) {
        Node<T>* newNode = new Node<T>(value);
        Node<T>* prev = nullptr;
        Node<T>* curr = head.load();
        while (curr != nullptr && curr->data < value) {
            prev = curr;
            curr = curr->next.load();
        }
        if (curr != nullptr && curr->data == value) {
            delete newNode;
            return false;
        }
        newNode->next.store(curr);
        if (prev == nullptr) {
            while (!head.compare_exchange_weak(curr, newNode)) {
                if (curr != nullptr && curr->data < value) {
                    prev = curr;
                    curr = curr->next.load();
                } else {
                    break;
                }
            }
        } else {
            while (!prev->next.compare_exchange_weak(curr, newNode)) {
                if (curr != nullptr && curr->data < value) {
                    prev = curr;
                    curr = curr->next.load();
                } else {
                    break;
                }
            }
        }
        return true;
    }
private:
    std::atomic<Node<T>*> head;
};

Read‑write lock example using std::shared_mutex:

#include <iostream>
#include <shared_mutex>
#include <thread>

std::shared_mutex rwMutex;
int sharedData = 0;

void readData() {
    rwMutex.lock_shared();
    std::cout << "Read data: " << sharedData << std::endl;
    rwMutex.unlock_shared();
}

void writeData(int value) {
    rwMutex.lock();
    sharedData = value;
    std::cout << "Write data: " << value << std::endl;
    rwMutex.unlock();
}

int main() {
    std::thread t1(readData);
    std::thread t2(readData);
    std::thread t3(writeData, 42);
    std::thread t4(readData);
    t1.join(); t2.join(); t3.join(); t4.join();
    return 0;
}

3.4 I/O Multiplexing Techniques

I/O multiplexing allows a single thread to monitor many file descriptors. Three common mechanisms on Linux are select, poll, and epoll.

① select

select()

monitors sets of descriptors for readability, writability, or exceptions. It copies descriptor sets between user and kernel space, limiting scalability to ~1024 descriptors.

#include <sys/select.h>
#include <sys/time.h>
#include <sys/types.h>
#include <unistd.h>

int select(int nfds, fd_set *readfds, fd_set *writefds, fd_set *exceptfds, struct timeval *timeout);

Example usage (TCP echo server):

#include <iostream>
#include <sys/select.h>
#include <sys/socket.h>
#include <arpa/inet.h>
#include <unistd.h>
#include <cstring>

#define PORT 8080
#define MAX_CLIENTS 10

int main() {
    int server_socket = socket(AF_INET, SOCK_STREAM, 0);
    if (server_socket == -1) { perror("socket creation failed"); return 1; }
    struct sockaddr_in server_addr;
    server_addr.sin_family = AF_INET;
    server_addr.sin_port = htons(PORT);
    server_addr.sin_addr.s_addr = INADDR_ANY;
    if (bind(server_socket, (struct sockaddr *)&server_addr, sizeof(server_addr)) == -1) { perror("bind failed"); close(server_socket); return 1; }
    if (listen(server_socket, 3) == -1) { perror("listen failed"); close(server_socket); return 1; }
    fd_set read_fds;
    FD_ZERO(&read_fds);
    FD_SET(server_socket, &read_fds);
    int max_fd = server_socket;
    while (true) {
        fd_set temp_fds = read_fds;
        int activity = select(max_fd + 1, &temp_fds, NULL, NULL, NULL);
        if (activity == -1) { perror("select error"); break; }
        if (FD_ISSET(server_socket, &temp_fds)) {
            int client_socket = accept(server_socket, NULL, NULL);
            if (client_socket != -1) {
                FD_SET(client_socket, &read_fds);
                if (client_socket > max_fd) max_fd = client_socket;
            }
        }
        for (int i = 0; i <= max_fd; ++i) {
            if (FD_ISSET(i, &temp_fds) && i != server_socket) {
                char buffer[1024] = {0};
                int valread = read(i, buffer, sizeof(buffer));
                if (valread <= 0) { close(i); FD_CLR(i, &read_fds); }
                else { std::cout << "Received: " << buffer << std::endl; }
            }
        }
    }
    close(server_socket);
    return 0;
}

② poll

poll()

uses an array of struct pollfd without the descriptor‑set size limit of select, but still copies the entire array on each call.

#include <poll.h>

int poll(struct pollfd *fds, nfds_t nfds, int timeout);

③ epoll

epoll

(Linux 2.6+) provides edge‑triggered or level‑triggered notifications with a single kernel‑space event list, offering O(1) scalability.

#include <sys/epoll.h>

int epoll_create(int size); // size is ignored after 2.6.8
int epoll_ctl(int epfd, int op, int fd, struct epoll_event *event);
int epoll_wait(int epfd, struct epoll_event *events, int maxevents, int timeout);

Choosing between select, poll, and epoll depends on portability and the number of descriptors; epoll is preferred for high‑performance Linux servers.

Part4 Practical Case: High‑Concurrency Chat Server Implementation

4.1 Functional Requirements Analysis

The chat server must support user registration/login, group chat (broadcast to all members), private chat (one‑to‑one), and real‑time message push (e.g., via WebSocket). Usernames must be unique and passwords stored securely (e.g., hashed with SHA‑256).

4.2 Code Implementation and Analysis

Server code (uses the thread pool from Part3, an unordered_map to store online users, and a mutex for safety):

#include <iostream>
#include <thread>
#include <vector>
#include <queue>
#include <mutex>
#include <condition_variable>
#include <functional>
#include <sys/socket.h>
#include <arpa/inet.h>
#include <unistd.h>
#include <unordered_map>
#include <string>

class ThreadPool { /* same as previous implementation */ };

struct User { std::string username; int socketFd; };
std::unordered_map<int, User> users; // key: socket fd
std::mutex usersMutex;

void handleClient(int clientSocket) {
    char buffer[1024] = {0};
    int valread = read(clientSocket, buffer, 1024);
    if (valread < 0) { perror("read failed"); close(clientSocket); return; }
    std::string message(buffer, valread);
    size_t pos = message.find('|');
    if (pos == std::string::npos) { std::cerr << "Invalid message format" << std::endl; close(clientSocket); return; }
    std::string command = message.substr(0, pos);
    std::string data = message.substr(pos + 1);
    if (command == "login") {
        {
            std::lock_guard<std::mutex> lock(usersMutex);
            users[clientSocket] = {data, clientSocket};
            std::cout << "User " << data << " logged in" << std::endl;
        }
        const char* response = "Login successful";
        send(clientSocket, response, strlen(response), 0);
    } else if (command == "send") {
        pos = data.find('|');
        if (pos == std::string::npos) { std::cerr << "Invalid send message format" << std::endl; close(clientSocket); return; }
        std::string targetUser = data.substr(0, pos);
        std::string msgContent = data.substr(pos + 1);
        {
            std::lock_guard<std::mutex> lock(usersMutex);
            for (auto &pair : users) {
                if (pair.second.username == targetUser) {
                    std::string fullMessage = users[clientSocket].username + " says: " + msgContent;
                    send(pair.second.socketFd, fullMessage.c_str(), fullMessage.size(), 0);
                    break;
                }
            }
        }
    } else {
        std::cerr << "Unknown command: " << command << std::endl; close(clientSocket); return;
    }
    while (true) {
        valread = read(clientSocket, buffer, 1024);
        if (valread <= 0) break; // error or disconnect
        // Additional message handling (e.g., group chat) would go here
    }
    {
        std::lock_guard<std::mutex> lock(usersMutex);
        users.erase(clientSocket);
    }
    close(clientSocket);
}

int main() {
    int serverSocket = socket(AF_INET, SOCK_STREAM, 0);
    if (serverSocket == 0) { perror("socket failed"); exit(EXIT_FAILURE); }
    struct sockaddr_in address;
    address.sin_family = AF_INET;
    address.sin_addr.s_addr = INADDR_ANY;
    address.sin_port = htons(8080);
    if (bind(serverSocket, (struct sockaddr *)&address, sizeof(address)) < 0) { perror("bind failed"); exit(EXIT_FAILURE); }
    if (listen(serverSocket, 3) < 0) { perror("listen"); exit(EXIT_FAILURE); }
    ThreadPool pool(4);
    while (true) {
        struct sockaddr_in clientAddress;
        socklen_t clientAddressLen = sizeof(clientAddress);
        int clientSocket = accept(serverSocket, (struct sockaddr *)&clientAddress, &clientAddressLen);
        if (clientSocket < 0) { perror("accept"); continue; }
        pool.enqueue([clientSocket] { handleClient(clientSocket); });
    }
    close(serverSocket);
    return 0;
}

Client code (connects, logs in, and sends messages):

#include <iostream>
#include <sys/socket.h>
#include <arpa/inet.h>
#include <unistd.h>
#include <cstring>
#include <thread>

void receiveMessages(int socketFd) {
    char buffer[1024] = {0};
    while (true) {
        int valread = read(socketFd, buffer, 1024);
        if (valread <= 0) { perror("read failed"); break; }
        std::cout << "Received: " << std::string(buffer, valread) << std::endl;
    }
}

int main() {
    int sock = 0;
    struct sockaddr_in serv_addr;
    if ((sock = socket(AF_INET, SOCK_STREAM, 0)) < 0) { std::cerr << "Socket creation error" << std::endl; return -1; }
    serv_addr.sin_family = AF_INET;
    serv_addr.sin_port = htons(8080);
    if (inet_pton(AF_INET, "127.0.0.1", &serv_addr.sin_addr) <= 0) { std::cerr << "Invalid address" << std::endl; return -1; }
    if (connect(sock, (struct sockaddr *)&serv_addr, sizeof(serv_addr)) < 0) { std::cerr << "Connection Failed" << std::endl; return -1; }
    std::thread recvThread(receiveMessages, sock);
    char buffer[1024] = {0};
    while (true) {
        std::cout << "Enter message: ";
        std::cin.getline(buffer, 1024);
        if (send(sock, buffer, strlen(buffer), 0) < 0) { std::cerr << "Send failed" << std::endl; break; }
        if (strcmp(buffer, "exit") == 0) break;
    }
    recvThread.join();
    close(sock);
    return 0;
}

4.3 Performance Optimization and Testing

To achieve stable high performance, the server can reduce lock contention by replacing the std::unordered_map with a lock‑free hash table, and switch from blocking I/O to epoll for event‑driven handling. Load testing with tools such as Webbench shows that after these optimizations, response times remain low even with 100 concurrent users, and packet loss is eliminated.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

high concurrency multithreading I/O Multiplexing Network programming Socket C++

Written by

Deepin Linux

Research areas: Windows & Linux platforms, C/C++ backend development, embedded systems and Linux kernel, etc.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.