What is a "thread" (really)?

T

12

369

I have been trying to find a good definition, and get an understanding, of what a thread really is.

It seems that I must be missing something obvious, but every time I read about what a thread is, it's almost a circular definition, a la "a thread is a thread of execution" or " a way to divide into running tasks". Uh uh. Huh?

It seems from what I have read that a thread is not really something concrete, like a process is. It is in fact just a concept. From what I understand of the way this works, a processor executes some commands for a program (which has been termed a thread of execution), then when it needs to switch to processing for some other program for a bit, it stores the state of the program it's currently executing for somewhere (Thread Local Storage) and then starts executing the other program's instructions. And back and forth. Such that, a thread is really just a concept for "one of the paths of execution" of a program that is currently running.

Unlike a process, which really is something - it is a conglomeration of resources, etc.

As an example of a definition that didn't really help me much . . .

From Wikipedia:

"A thread in computer science is short for a thread of execution. Threads are a way for a program to divide (termed "split") itself into two or more simultaneously (or pseudo-simultaneously) running tasks. Threads and processes differ from one operating system to another but, in general, a thread is contained inside a process and different threads in the same process share same resources while different processes in the same multitasking operating system do not."

So am I right? Wrong? What is a thread really?

Edit: Apparently a thread is also given its own call stack, so that is somewhat of a concrete thing.

Toritorie answered 5/3, 2011 at 5:16 Comment(8)

"Process" is no less of an abstract term. – Heida 5/3, 2011 at 5:19

Is thread local storage just the call stack for the thread? – Stableboy 6/3, 2015 at 6:16

The answers below are... abstract. In simpler terms (and glossing over some details): once upon a time, a computer program could only do one thing at once. So it did A, then after that B, then C, then... . In modern systems, this isn't ideal; for example you want to keep browsing the web while downloading a file. So programs now have one or more 'threads'. Each 'thread' can only do one thing at once, but different threads can do things simultaneously. Thread 1 can do A, then B, then C; thread 2 can do X, then Y, then Z. B can't start until A has finished, but A and X can happen at once. – Seasonal 27/9, 2018 at 23:27

@Seasonal that is great but how is that different from a process? – Felid 13/2, 2020 at 16:4

@Felid the basic difference between a thread and process (and really the most important difference) is that two or more threads can share the same spaces in memory, i.e use the same resources, whereas two processes must exist in different memory spaces. Does that make sense? – Cinthiacintron 27/2, 2020 at 8:17

@IkechukwuAnude as threads shares resources of process (which creates these threads) and each threads has it's own stack, does that mean the stack allocated to process is divided among all the different threads? If it's true, does that mean if we increase number of threads, size of stack per thread will decrease? – Lucillalucille 11/10, 2022 at 16:47

@MegaLegend found an answer yet? – Dan 10/12, 2023 at 19:36

@Dan As far as I have understood. Thread is really just an abstract idea. Say you have three functions that you want to execute concurrently. Then you will create three threads and load one function in each of the thread. You can consider thread as a way for OS to store information about how much of the loaded function (on that thread)has been executed. Since each function needs its own call stack, thus one is allocated to each thread (new stack is allocated to each thread and parent process stack is not divided among them). Some resources like code block are shared among each threads. – Lucillalucille 12/12, 2023 at 16:15

S

214

A thread is an independent set of values for the processor registers (for a single core). Since this includes the Instruction Pointer (aka Program Counter), it controls what executes in what order. It also includes the Stack Pointer, which had better point to a unique area of memory for each thread or else they will interfere with each other.

Threads are the software unit affected by control flow (function call, loop, goto), because those instructions operate on the Instruction Pointer, and that belongs to a particular thread. Threads are often scheduled according to some prioritization scheme (although it's possible to design a system with one thread per processor core, in which case every thread is always running and no scheduling is needed).

In fact the value of the Instruction Pointer and the instruction stored at that location is sufficient to determine a new value for the Instruction Pointer. For most instructions, this simply advances the IP by the size of the instruction, but control flow instructions change the IP in other, predictable ways. The sequence of values the IP takes on forms a path of execution weaving through the program code, giving rise to the name "thread".

Soliz answered 5/3, 2011 at 5:23 Comment(21)

+1. A thread isn't anything more "concrete" than a set of register values. – Aswan 5/3, 2011 at 5:24

What "set of values"? What are they? How do they define a thread? – Toritorie 5/3, 2011 at 5:25

@Richard: The exact list of CPU registers depends on the architecture, but instruction pointer and stack pointer are pretty much universal. They define a thread insofar as when this thread (set of register values) is loaded in the processor core, the thread is running. The processor is fetching instructions demanded by the thread and updating the thread registers. When a context switch is needed, the processor saves this set of register values into memory and loads a set belonging to a different thread, typically as part of the interrupt servicing logic. – Soliz 5/3, 2011 at 5:31

@BenVoigt For processors with multiple cores, is a thread not an "independent set of values for the processor registers"? – Stableboy 6/3, 2015 at 6:18

@committedandroider: No, a thread has state for only one core. To have multiple cores active, you need multiple threads. – Soliz 6/3, 2015 at 15:12

Hi thx @BenVoigt. A few clarifications that noobs like me may stumble over: what is meant by "processor registers"? What is meant by "instruction pointer" and "stack pointer"? – Zootechnics 13/4, 2016 at 12:30

Google-fu: Instruction Pointer -> is a register that holds the memory address of the instruction to be executed next. The CPU is hard-wired to read the instruction pointer and execute the instruction at that particular address. Processor Register -> The registers are the places where the values that the CPU is actually working on are located (e.g. variable values). Stack Pointer -> Is a small register that stores the address of the last program request in a stack – Eisenach 9/4, 2019 at 2:53

@LeviFuller During parallel execution of threads and processes, which instruction does the instruction pointer to? – Jeffery 1/2, 2020 at 20:31

@ajaysinghnegi: A single processor core doesn't do parallel execution, it does timeslicing. On a multicore system (parallel processing), you can't use the definite article "the" to talk about registers like the instruction pointer, because there are many copies, one in each CPU core. (Note that all multisocket systems are also multicore even if you don't have "multicore processor" units installed in the sockets, because the one core per socket still results in multiple cores in the system). – Soliz 17/2, 2020 at 16:1

With HyperThreading, "core" is often used to mean a pair of closely-coupled cores which share compute resources, but each of these two still has its own set of registers and control logic... for the purposes of this answer "processor core" means each hardware threading unit, regardless of whether it has shared or dedicated compute associated. – Soliz 17/2, 2020 at 16:1

A "set of values for the processor registers" is not a thread. That's a thread context. It's part of a thread, but no more so than all the variables that are exclusively used by the thread (including its entire stack) or, the kernel variables that describe the thread. A computer scientist would say that a "thread of execution" is a particular sequence of operations—one that is "threaded" through your program's code. I tell noobs that a thread is like an agent that carries out your instructions and, that you can have more than one such agent working for you at the same time. – Cellophane 21/9, 2020 at 20:42

@SolomonSlow: A thread context is what makes a thread a thread. All the other associated things are supplementary, not definitional. – Soliz 22/9, 2020 at 15:3

@BenVoigt, Yup, just like how the wheels are what make a car a car. A car isn't any use if it's got no wheels. All that other stuff—frame, suspension, drive train, steering—that's all just bells and whistles, but you ain't going anywhere if you've got no wheels. – Cellophane 22/9, 2020 at 16:14

@SolomonSlow: The kernel object associated with the thread, that enforces access control, allows you to find the thread by its ID, etc is more like power steering. You can have a car without power steering and it's still a car. But man doesn't that hydraulic assist make life convenient. Likewise the kernel structure contains thread-management data (aka metadata). It's closely coupled to the thread. But metadata isn't the thing, it's data about the thing. – Soliz 22/9, 2020 at 16:44

@BenVoigt Finally, which one is correct? Thread is a set ov values for the CPU registers or Thread is a sequence of instructions? Which one comes first? And which one should be deduced from the other? I'm confused :( – Utilitarianism 19/2, 2021 at 0:52

@SolomonSlow Finally, which one is correct? Thread is a set ov values for the CPU registers or Thread is a sequence of instructions? Which one comes first? And which one should be deduced from the other? I'm confused :( – Utilitarianism 19/2, 2021 at 0:53

@MohammadMehdiSarfejoo Why should only one answer be "correct?" Why should one be "derived from the other?" A computer scientist might have a different way of thinking about what "thread" means from the way a kernel developer thinks about it. – Cellophane 19/2, 2021 at 3:13

@SolomonSlow so You mean they both have same meaning and both of them are in the same direction although they are different ways of thinking about "thread". Did I understand this correctly? – Utilitarianism 19/2, 2021 at 14:27

@MohammadMehdiSarfejoo, At a higher level, they both are talking about the same thing, but they emphasize different aspects of it. A computer scientist is more interested in reasoning about sequences of instructions, in how they are interleaved, and in how they can interact with each other. A kernel developer is more interested in how the system keeps track of which sequence of instructions it is executing at any given moment and, in how it switches from one to another. – Cellophane 19/2, 2021 at 15:33

@MohammadMehdiSarfejoo: "which one should be deduced from the other?" is exactly what the last paragraph of my answer explains – Soliz 19/2, 2021 at 16:10

@BenVoigt is it like the context is what makes a thread that thread? – Wina 19/3, 2021 at 16:44

G

339

A thread is an execution context, which is all the information a CPU needs to execute a stream of instructions.

Suppose you're reading a book, and you want to take a break right now, but you want to be able to come back and resume reading from the exact point where you stopped. One way to achieve that is by jotting down the page number, line number, and word number. So your execution context for reading a book is these 3 numbers.

If you have a roommate, and she's using the same technique, she can take the book while you're not using it, and resume reading from where she stopped. Then you can take it back, and resume it from where you were.

Threads work in the same way. A CPU is giving you the illusion that it's doing multiple computations at the same time. It does that by spending a bit of time on each computation. It can do that because it has an execution context for each computation. Just like you can share a book with your friend, many tasks can share a CPU.

On a more technical level, an execution context (therefore a thread) consists of the values of the CPU's registers.

Last: threads are different from processes. A thread is a context of execution, while a process is a bunch of resources associated with a computation. A process can have one or many threads.

Clarification: the resources associated with a process include memory pages (all the threads in a process have the same view of the memory), file descriptors (e.g., open sockets), and security credentials (e.g., the ID of the user who started the process).

Gettysburg answered 5/3, 2011 at 5:29 Comment(2)

A better analogy would equate person with CPU (both do something), and equate book with address-space (both just exist). That way, bookmarks in different books are like threads in different processes. A single book with more than one bookmark would be the analog of a multi-threaded process, which is what people usually mean when they say "threads." It works for a single processor machine, but it breaks down somewhat when you talk about multi-processing. Nobody cares which CPU executes function f(), but it does matter which person reads chapter 11. – Cellophane 25/6, 2014 at 19:47

@pwnall, thanks a lot for digesting difficult concepts for others like me! Is multithreading involved in multiprocessing ( or running a process in parallel on many CPUs, in case I am using the wrong term)? – Broider 20/4, 2019 at 16:41