Mastering Linux Process Synchronization: Prevent Race Conditions with Mutexes, Semaphores, and More
This comprehensive guide explains why race conditions occur in Linux processes, explores the underlying concepts of critical sections and synchronization, and provides practical examples of atomic operations, mutexes, semaphores, condition variables, read‑write locks, and spinlocks to ensure safe concurrent programming.
Linux Process Synchronization Basics
In Linux, a process is the basic unit of resource allocation and independent execution. While multiple processes can run concurrently to improve system efficiency, unsynchronized access to shared resources (global variables, file descriptors, hardware devices) can cause race conditions, leading to crashes or incorrect results.
What Is Process Synchronization?
Process synchronization coordinates the execution order of multiple processes so that shared resources are accessed safely, eliminating race conditions at the source.
Understanding Race Conditions
When several processes manipulate the same resource without proper ordering, the non‑deterministic scheduling of the kernel can produce inconsistent states. A classic example is two processes incrementing a shared counter:
P1 reads count = 10.
P2 reads count = 10 before P1 writes back.
P1 increments its private copy to 11.
P2 increments its copy to 11 and writes it back to shared memory.
P1 then writes its 11, silently overwriting P2's update.
The final value is 11 instead of the expected 12, illustrating data inconsistency that can affect financial systems, databases, or web servers.
Core Techniques to Avoid Race Conditions
1. Atomic Operations
Atomic operations are indivisible: they complete as a single step that no other CPU or thread can observe half-finished. In user space, C11 provides <stdatomic.h>, C++ provides std::atomic, and GCC offers the __sync/__atomic builtins; inside the kernel, atomic_t with helpers such as atomic_inc() plays the same role. Example:
#include <stdio.h>
#include <pthread.h>

int shared_variable = 0;

void* increment(void* arg) {
    for (int i = 0; i < 10000; ++i) {
        __sync_fetch_and_add(&shared_variable, 1); // atomic increment
    }
    return NULL;
}

int main() {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, increment, NULL);
    pthread_create(&t2, NULL, increment, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("Final value: %d\n", shared_variable);
    return 0;
}

2. Mutex (Mutual Exclusion Lock)
A mutex protects a critical section so that only one thread can execute it at a time.
#include <stdio.h>
#include <pthread.h>

pthread_mutex_t mutex = PTHREAD_MUTEX_INITIALIZER;
int shared_variable = 0;

void* thread_func(void* arg) {
    for (int i = 0; i < 1000; ++i) {
        pthread_mutex_lock(&mutex);
        ++shared_variable;
        pthread_mutex_unlock(&mutex);
    }
    return NULL;
}

int main() {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, thread_func, NULL);
    pthread_create(&t2, NULL, thread_func, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("Final value: %d\n", shared_variable);
    return 0;
}

3. Semaphore
Semaphores control access to a pool of resources. A binary semaphore works like a mutex, while a counting semaphore can represent multiple identical resources.
#include <stdio.h>
#include <semaphore.h>
#include <pthread.h>

sem_t sem;
int shared_variable = 0;

void* thread_func(void* arg) {
    for (int i = 0; i < 1000; ++i) {
        sem_wait(&sem);
        ++shared_variable;
        sem_post(&sem);
    }
    return NULL;
}

int main() {
    pthread_t t1, t2;
    sem_init(&sem, 0, 1);
    pthread_create(&t1, NULL, thread_func, NULL);
    pthread_create(&t2, NULL, thread_func, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("Final value: %d\n", shared_variable);
    sem_destroy(&sem);
    return 0;
}

4. Condition Variable
Condition variables are used together with a mutex to block a thread until a specific condition becomes true, e.g., the classic producer‑consumer problem.
#include <stdio.h>
#include <stdlib.h>
#include <pthread.h>

#define BUFFER_SIZE 5

int buffer[BUFFER_SIZE];
int in = 0, out = 0;
pthread_mutex_t mutex = PTHREAD_MUTEX_INITIALIZER;
pthread_cond_t not_empty = PTHREAD_COND_INITIALIZER;
pthread_cond_t not_full = PTHREAD_COND_INITIALIZER;

void* producer(void* arg) {
    while (1) {
        int item = rand();
        pthread_mutex_lock(&mutex);
        while ((in + 1) % BUFFER_SIZE == out) {   // buffer full: wait
            pthread_cond_wait(&not_full, &mutex);
        }
        buffer[in] = item;
        in = (in + 1) % BUFFER_SIZE;
        pthread_cond_signal(&not_empty);
        pthread_mutex_unlock(&mutex);
    }
    return NULL;
}

void* consumer(void* arg) {
    while (1) {
        pthread_mutex_lock(&mutex);
        while (in == out) {                       // buffer empty: wait
            pthread_cond_wait(&not_empty, &mutex);
        }
        int item = buffer[out];
        out = (out + 1) % BUFFER_SIZE;
        pthread_cond_signal(&not_full);
        pthread_mutex_unlock(&mutex);
        printf("Consumed %d\n", item);
    }
    return NULL;
}

int main() {
    pthread_t prod, cons;
    pthread_create(&prod, NULL, producer, NULL);
    pthread_create(&cons, NULL, consumer, NULL);
    pthread_join(prod, NULL);
    pthread_join(cons, NULL);
    return 0;
}

5. Read‑Write Lock
Read‑write locks allow many concurrent readers but exclusive writers, ideal for read‑heavy workloads such as database queries.
#include <stdio.h>
#include <pthread.h>

pthread_rwlock_t rwlock = PTHREAD_RWLOCK_INITIALIZER;
int shared_data = 0;

void* reader(void* arg) {
    pthread_rwlock_rdlock(&rwlock);
    printf("Read %d\n", shared_data);
    pthread_rwlock_unlock(&rwlock);
    return NULL;
}

void* writer(void* arg) {
    pthread_rwlock_wrlock(&rwlock);
    ++shared_data;
    printf("Wrote %d\n", shared_data);
    pthread_rwlock_unlock(&rwlock);
    return NULL;
}

int main() {
    pthread_t r1, r2, w;
    pthread_create(&r1, NULL, reader, NULL);
    pthread_create(&r2, NULL, reader, NULL);
    pthread_create(&w, NULL, writer, NULL);
    pthread_join(r1, NULL);
    pthread_join(r2, NULL);
    pthread_join(w, NULL);
    return 0;
}

6. Spinlock
Spinlocks are useful when the lock is held for a very short time; the thread repeatedly checks the lock instead of sleeping.
#include <stdio.h>
#include <pthread.h>

pthread_spinlock_t spin;
int shared = 0;

void* thread_func(void* arg) {
    for (int i = 0; i < 1000; ++i) {
        pthread_spin_lock(&spin);
        ++shared;
        pthread_spin_unlock(&spin);
    }
    return NULL;
}

int main() {
    pthread_t t1, t2;
    pthread_spin_init(&spin, PTHREAD_PROCESS_PRIVATE);
    pthread_create(&t1, NULL, thread_func, NULL);
    pthread_create(&t2, NULL, thread_func, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("Final value: %d\n", shared);
    pthread_spin_destroy(&spin);
    return 0;
}

Kernel‑Level Implementation Details
Atomic Operations in the Kernel
On x86, the LOCK prefix combined with instructions like XADD or CMPXCHG guarantees atomicity across CPUs. The Linux kernel wraps these in functions such as atomic_inc() and atomic_add().
mov eax, value          ; load the increment value
mov ecx, memory_addr    ; load the address of the shared variable
lock xadd [ecx], eax    ; atomically add; the old value is returned in eax

Memory Barriers
Memory barriers (mfence, sfence, lfence on x86) prevent the CPU and compiler from reordering memory operations across them, ensuring, for example, that earlier writes become visible to other processors before later operations proceed.
Spinlock vs. Mutex in the Kernel
Spinlocks busy‑wait: the waiting CPU retries the lock in a tight loop, built on ldrex/strex (load‑exclusive/store‑exclusive) on ARM or LOCK‑prefixed instructions on x86. Mutexes instead put the waiting thread to sleep on a wait queue, which avoids wasting CPU cycles but pays context‑switch overhead on every contended acquisition.
Practical Considerations
Deadlock Prevention
Common strategies include acquiring all needed resources at once, imposing a global ordering on lock acquisition, avoiding nested locks, using non‑blocking or timed acquisition (pthread_mutex_trylock(), pthread_mutex_timedlock()), and employing higher‑level constructs such as recursive mutexes where available.
Performance Optimisation
Minimise lock hold time, choose the appropriate synchronization primitive (read‑write lock for read‑heavy workloads, spinlock for short critical sections), redesign algorithms to reduce contention, employ lock‑free data structures, use thread pools wisely, and consider asynchronous I/O to avoid blocking threads.
By understanding the underlying causes of race conditions and selecting the right synchronization mechanism, developers can write robust, high‑performance concurrent programs on Linux.
Deepin Linux
Research areas: Windows & Linux platforms, C/C++ backend development, embedded systems and Linux kernel, etc.