Virtual memory

Virtual memory is a memory-management technique that gives each program the illusion of having a large, contiguous private address space, while the OS and hardware translate those virtual addresses into physical addresses in actual RAM (or disk storage when RAM is full).

The mechanism does three things at once:

Extends apparent memory beyond physical RAM by using disk space (swapping/paging).
Isolates programs from each other — one program’s virtual address $0x1000$ maps to different physical memory than another’s.
Decouples programs from physical layout — a program doesn’t need to know what physical address it lives at.

The basic idea

Programs use virtual addresses. The hardware memory management unit (MMU) translates each virtual address into a physical address before sending it to the actual memory.

Program: ldw r2, 0x1000     →   MMU translates 0x1000 → 0x4823000   →   Memory access at 0x4823000

The translation is done via a Page table, a data structure (in main memory) that maps virtual pages to physical frames. A page is a fixed-size chunk (typically 4 KB); a frame is a chunk of physical memory the same size.

To translate virtual address $V$ :

Take the high-order bits of $V$ to identify the virtual page number.
Look that up in the Page table to get the physical frame number.
Concatenate frame number with the low-order “offset” bits of $V$ to get the physical address.

For a 32-bit address space the page table needs $2^{20}$ entries; for 64-bit, naive layout is impossibly large, so multi-level (hierarchical) and inverted page tables are used. See Page table for entry format, multi-level layouts, inverted/hashed variants, and the page-walk procedure.

What about memory not in RAM

If a program references a virtual page that isn’t currently in physical memory:

The page table entry’s “present” bit is 0.
The MMU raises a page fault exception.
The OS handler runs: finds the page on disk (in the swap file), evicts some other page from RAM if needed, reads the requested page into the freed frame, updates the page table, returns from the exception.
The program retries the access; this time the page is present, the access succeeds.

The whole sequence is invisible to the program — it just notices a slight pause. From the program’s perspective, all of memory is always available; it never has to manage what’s “in RAM” vs “on disk.”

This is what makes “virtual memory larger than physical RAM” possible — pages are paged in and out on demand.

Three big benefits

More memory than physically present. A 32-bit machine with 1 GB of RAM can run programs that use 4 GB of address space, with the OS swapping pages to disk as needed.
Isolation between programs. Each program’s virtual addresses are mapped through its own page table. Program A’s virtual $0x1000$ is independent of Program B’s virtual $0x1000$ . They can’t accidentally read or write each other’s memory.
Memory protection. Page table entries include read/write/execute permission bits. Trying to write to a read-only page or execute a non-executable page raises an exception.

Costs

Translation overhead. Every memory access has to go through the page table. Without help, this would mean two memory accesses per “memory access” (one to read the page table, one to read the actual data).
Page fault penalty. A miss that requires a disk read costs millions of CPU cycles.
Page table memory. The table itself takes up RAM.

The translation overhead is mitigated by the TLB, a small fast cache of recent translations. Most accesses hit the TLB and translate in zero extra time.

Modern OSes use virtual memory universally — every process runs with its own virtual address space, isolated from others, with the OS dynamically paging in and out as needed. It’s invisible to most programs but indispensable.

Idriss Rami — Notes

Explorer

Virtual memory

The basic idea

What about memory not in RAM

Three big benefits

Costs

Graph View

Table of Contents

Backlinks