Should I use #include in headers?
Asked Answered
D

9

85

Is it necessary to #include some file, if inside a header (*.h), types defined in this file are used?

For instance, if I use GLib and wish to use the gchar basic type in a structure defined in my header, is it necessary to do a #include <glib.h>, knowing that I already have it in my *.c file?

If yes do I also have to put it between the #ifndef and #define or after the #define?

Doig answered 26/11, 2009 at 15:55 Comment(1)
See also: #256777Greaser
G
131

NASA's Goddard Space Flight Center (GSFC) rules for headers in C state that it must be possible to include a header in a source file as the only header, and that code using the facilities provided by that header will then compile.

This means that the header must be self-contained, idempotent and minimal:

  • self-contained — all necessary types are defined by including relevant headers if need be.
  • idempotent — compilations don't break even if it is included multiple times.
  • minimal — it doesn't define anything that is not needed by code that uses the header to access the facilities defined by the header.

The benefit of this rule is that if someone needs to use the header, they do not have to struggle to work out which other headers must also be included — they know that the header provides everything necessary.

The possible downside is that some headers might be included many times; that is why the multiple inclusion header guards are crucial (and why compilers try to avoid reincluding headers whenever possible).

Implementation

This rule means that if the header uses a type — such as 'FILE *' or 'size_t' — then it must ensure that the appropriate other header (<stdio.h> or <stddef.h> for example) should be included. A corollary, often forgotten, is that the header should not include any other header that is not needed by the user of the package in order to use the package. The header should be minimal, in other words.

Further, the GSFC rules provide a simple technique to ensure that this is what happens:

  • In the source file that defines the functionality, the header must be the first header listed.

Hence, suppose we have a Magic Sort.

magicsort.h

#ifndef MAGICSORT_H_INCLUDED
#define MAGICSORT_H_INCLUDED

#include <stddef.h>

typedef int (*Comparator)(const void *, const void *);
extern void magicsort(void *array, size_t number, size_t size, Comparator cmp);

#endif /* MAGICSORT_H_INCLUDED */

magicsort.c

#include <magicsort.h>

void magicsort(void *array, size_t number, size_t size, Comparator cmp)
{
    ...body of sort...
}

Note that the header must include some standard header that defines size_t; the smallest standard header that does so is <stddef.h>, though several others also do so (<stdio.h>, <stdlib.h>, <string.h>, possibly a few others).

Also, as mentioned before, if the implementation file needs some other headers, so be it, and it is entirely normal for some extra headers to be necessary. But the implementation file ('magicsort.c') should include them itself, and not rely on its header to include them. The header should only include what users of the software need; not what the implementers need.

Configuration headers

If your code uses a configuration header (GNU Autoconf and the generated 'config.h', for example), you may need to use this in 'magicsort.c':

#ifdef HAVE_CONFIG_H
#include "config.h"
#endif /* HAVE_CONFIG_H */

#include "magicsort.h"

...

This is the only time I know of that the module's private header is not the very first header in the implementation file. However, the conditional inclusion of 'config.h' should probably be in 'magicsort.h' itself.


Update 2011-05-01

The URL linked above is no longer functional (404). You can find the C++ standard (582-2003-004) at EverySpec.com; the C standard (582-2000-005) seems to be missing in action.

The guidelines from the C standard were:

§2.1 UNITS

(1) Code shall be structured as units, or as stand-alone header files.

(2) A unit shall consist of a single header file (.h) and one or more body (.c) files. Collectively the header and body files are referred to as the source files.

(3) A unit header file shall contain all pertinent information required by a client unit. A unit’s client needs to access only the header file in order to use the unit.

(4) The unit header file shall contain #include statements for all other headers required by the unit header. This lets clients use a unit by including a single header file.

(5) The unit body file shall contain an #include statement for the unit header, before all other #include statements. This lets the compiler verify that all required #include statements are in the header file.

(6) A body file shall contain only functions associated with one unit. One body file may not provide implementations for functions declared in different headers.

(7) All client units that use any part of a given unit U shall include the header file for unit U; this ensures that there is only one place where the entities in unit U are defined. Client units may call only the functions defined in the unit header; they may not call functions defined in the body but not declared in the header. Client units may not access variables declared in the body but not in the header.

A component contains one or more units. For example, a math library is a component that contains multiple units such as vector, matrix, and quaternion.

Stand-alone header files do not have associated bodies; for example, a common types header does not declare functions, so it needs no body.

Some reasons for having multiple body files for a unit:

  • Part of the body code is hardware or operating system dependent, but the rest is common.
  • The files are too large.
  • The unit is a common utility package, and some projects will only use a few of the functions. Putting each function in a separate file allows the linker to exclude the ones not used from the final image.

§2.1.1 Header include rationale

This standard requires a unit’s header to contain #include statements for all other headers required by the unit header. Placing #include for the unit header first in the unit body allows the compiler to verify that the header contains all required #include statements.

An alternate design, not permitted by this standard, allows no #include statements in headers; all #includes are done in the body files. Unit header files then must contain #ifdef statements that check that the required headers are included in the proper order.

One advantage of the alternate design is that the #include list in the body file is exactly the dependency list needed in a makefile, and this list is checked by the compiler. With the standard design, a tool must be used to generate the dependency list. However, all of the branch recommended development environments provide such a tool.

A major disadvantage of the alternate design is that if a unit’s required header list changes, each file that uses that unit must be edited to update the #include statement list. Also, the required header list for a compiler library unit may be different on different targets.

Another disadvantage of the alternate design is that compiler library header files, and other third party files, must be modified to add the required #ifdef statements.

A different common practice is to include all system header files before any project header files, in body files. This standard does not follow this practice, because some project header files may depend on system header files, either because they use the definitions in the system header, or because they want to override a system definition. Such project header files should contain #include statements for the system headers; if the body includes them first, the compiler does not check this.

GSFC Standard available via Internet Archive 2012-12-10

Information courtesy Eric S. Bullington:

The referenced NASA C coding standard can be accessed and downloaded via the Internet archive:

http://web.archive.org/web/20090412090730/http://software.gsfc.nasa.gov/assetsbytype.cfm?TypeAsset=Standard

Sequencing

The question also asks:

If yes, do I also have to put it (the #include lines) between the #ifndef and #define or after the #define.

The answer shows the correct mechanism — the nested includes, etc, should be after the #define (and the #define should be the second non-comment line in the header) — but it doesn't explain why that's correct.

Consider what happens if you place the #include between the #ifndef and #define. Suppose the other header itself includes various headers, perhaps even #include "magicsort.h" indirectly. If the second inclusion of magicsort.h occurs before #define MAGICSORT_H_INCLUDED, then the header will be included a second time before the types it defines are defined. So, in C89 and C99, any typedef type name will be erroneously redefined (C2011 allows them to be redefined to the same type), and you will get the overhead of processing the file multiple times, defeating the purpose of the header guard in the first place. This is also why the #define is the second line and is not written just before the #endif. The formula given is reliable:

#ifndef HEADERGUARDMACRO
#define HEADERGUARDMACRO

...original content of header — other #include lines, etc...

#endif /* HEADERGUARDMACRO */
Greaser answered 26/11, 2009 at 16:43 Comment(7)
Thanks, that answers my question. Just a detail: what the difference between #define MAGICSORT_H_INCLUDED and #define MAGICSORT_H_INCLUDED <b>1</b>Doig
There's no operational difference. Using #define X with no value specified means that X evaluates to an empty string except in a conditional context like #if X where it evaluates to 0; obviously, #define X 1 always expands to 1. As long as you always test with #ifdef X (or #if defined(X) or #elif defined(X)) and not with #if X, then you are fine with either.Greaser
In the original ANSI C standard it also stated that a conforming header file must stand alone(include everything it needs). In my C files I include its own header first, this helps to detect these non stand-alone headers (but only helps). It is better to catch them now than later as the effort is less.Arleyne
@richard: the situation is slightly more nuanced than that. Each standard C header may only provide the types, macros and function declarations cited in its section of the standard (plus any names reserved to the implementation, of course). However, some items (NULL, size_t, for example) are defined in a number of standard C headers. Therefore, the implementation must ensure that they are only defined once, regardless of the order the standard headers are included in the translation unit. How the implementation does that is its problem. Note: C99 allows <inttypes.h> to include <stdint.h>.Greaser
Including headers in headers can lead to a quite cryptic problems, when you happen to have circular dependencies around your headers. This problem is not apparent first time. The only symptom is that the compiler insists that something is undeclared despite the fact you included corresponding header file. It's quite difficult to trace down if you have to work on a 20 years old codebase, with a huge header dependency hell and #ifdef mess.Millerite
@Calmarius: Having spent a lot of time working on a 30-year old code base with many thousands of files and huge header dependency mess and #ifdef HELL, I can confidently say I'd far rather have self-contained headers than not. The situation you describe won't happen often; old code bases like that are typically stable. And the thought of having to edit several thousand files to add a new header to each one because there is a new header needed somewhere doesn't bear thinking about.Greaser
"NASA's Goddard Space Flight Center (GSFC) rules for headers in C..." I believe this here qualifies as proof that... we are all rocket scientists...Rubi
B
26

A good practice is to only put #includes in an include file if the include file needs them. If the definitions in a given include file are only used in the .c file then include it only in the .c file.

In your case, i would include it in the include file between the #ifdef/#endif.

This will minimize dependencies so that files that don't need a given include won't have to be recompiled if the include file changes.

Bear answered 26/11, 2009 at 16:10 Comment(6)
Bad advice. It will produce ugly C program (with #ifdef/#endif).Giverin
I think you misunderstood my answer. It is common practice to use #ifdef/#endif in .h files to avoid multiple includes of a .h file.Bear
That's what i read on the gnu site. But what's the difference between #include placed before #define (and after the #ifndef) and after the #define?Doig
No difference to the preprocessor as the statement will always get executed if the file has not been included yet. However you should not put anything between the #ifndef and the #define as it is a common pattern other programmers will be used to seeing.Owens
Another convention (though less common) is: #ifndef MY_HEADER // header contents #define MY_HEADER #endif This removes the need to follow #endif with e.g. // MY_HEADER for reference, as is common with the other conventionPastis
@JivanPal it's clever, though I think it would be preferable to stick to the most common conventionCherub
J
1

During compilation preprocessor just replaces #include directive by specified file content. To prevent endless loop it should use

#ifndef SOMEIDENTIFIER
#define SOMEIDENTIFIER
....header file body........
#endif

If some header was included into another header which was included to your file than it is not necessary to explicitly include it again, because it will be included into the file recursively

Jehias answered 26/11, 2009 at 16:11 Comment(0)
R
0

Usually, library developers protect their includes from multiple including with the #ifndef /#define / #endif "trick" so you don't have to do it.

Of course, you should check... but anyways the compiler will tell you at some point ;-) It is anyhow a good practice to check for multiple inclusions since it slows down the compilation cycle.

Romansh answered 26/11, 2009 at 15:57 Comment(0)
H
0

Yes it is necessary or the compiler will complain when it tries to compile code that it is not "aware" of. Think of #include's are a hint/nudge/elbow to the compiler to tell it to pick up the declarations, structures etc in order for a successful compile. The #ifdef/#endif header trick as pointed out by jldupont, is to speed up compilation of code.

It is used in instances where you have a C++ compiler and compiling plain C code as shown here Here is an example of the trick:

#ifndef __MY_HEADER_H__
#define __MY_HEADER_H__

#ifdef __cplusplus
extern "C" {
#endif


/* C code here such as structures, declarations etc. */

#ifdef __cplusplus
}
#endif

#endif /* __MY_HEADER_H__ */

Now, if this was included multiple times, the compiler will only include it once since the symbol __MY_HEADER_H__ is defined once, which speeds up compilation times. Notice the symbol cplusplus in the above example, that is the normal standard way of coping with C++ compiling if you have a C code lying around.

I have included the above to show this (despite not really relevant to the poster's original question). Hope this helps, Best regards, Tom.

PS: Sorry for letting anyone downvote this as I thought it would be useful tidbit for newcomers to C/C++. Leave a comment/criticisms etc as they are most welcome.

Hippocrates answered 26/11, 2009 at 16:13 Comment(5)
The ANSI C compiler doesn't complain about usage within headers of types defined elsewhere.Doig
@Victor: True, but if you only have a C++ compiler, this will enable the C++ compiler to compile plain old C fashioned code...hope this clarifies my point. Thanks for your comment anyway. :)Hippocrates
Using double-underscore in your code is a bad idea; all names using double underscore reserved for the implementation in C++, and names starting with double underscore are reserved for the implementation in C.Greaser
__cplusplus is a macro defined by the implementation if it is a c++ compiler, although you are correct for the include guard defineGrider
Note that you should not, in general, create function, variable, tag or macro names that start with an underscore. Part of C11 §7.1.3 Reserved identifiers says: — All identifiers that begin with an underscore and either an uppercase letter or another underscore are always reserved for any use.All identifiers that begin with an underscore are always reserved for use as identifiers with file scope in both the ordinary and tag name spaces. See also What does double underscore (__const) mean in C?Greaser
O
0

You need to include the header from your header, and there's no need to include it in the .c. Includes should go after the #define so they are not unnecessarily included multiple times. For example:

/* myHeader.h */
#ifndef MY_HEADER_H
#define MY_HEADER_H

#include <glib.h>

struct S
{
    gchar c;
};

#endif /* MY_HEADER_H */

and

/* myCode.c */
#include "myHeader.h"

void myFunction()
{
    struct S s;
    /* really exciting code goes here */
}
Owens answered 26/11, 2009 at 16:23 Comment(3)
so if i #include the glib.h in the header i don't need to do it in the .c file? I'm using GLib types and functions in the .c too!Doig
Yes - because the .c includes the .h which includes glib.h so effectively including glib.h in the .cOwens
As long as that header is then included in the .c file, yes - all include does is insert the contents of the file at compile time. So they get copied from glib.h to myheader.h which then copies them into mycode.cNadaba
P
-1

I use the following construct to be sure, that the needed include file is included before this include. I include all header files in source files only.

#ifndef INCLUDE_FILE_H
 #error "#include INCLUDE.h" must appear in source files before "#include THISFILE.h"
#endif
Pedrick answered 26/11, 2009 at 17:4 Comment(2)
Why not just type #include "INCLUDE.h"Arleyne
Instead of doing this, make sure that every header file has include guards. Then you can just include it without worrying about if some other file includes the same header.Kamkama
C
-1

What I normally do is make a single include file that includes all necessary dependencies in the right order. So I might have:

#ifndef _PROJECT_H_
#define _PROJECT_H_
#include <someDependency.h>
#include "my_project_types.h"
#include "my_project_implementation_prototypes.h"
#endif

All in project.h. Now project.h can be included anywhere with no order requirements but I still have the luxury of having my dependencies, types, and API function prototypes in different headers.

Crossquestion answered 26/11, 2009 at 18:11 Comment(1)
Note that you should not, in general, create function, variable, tag or macro names that start with an underscore. Part of C11 §7.1.3 Reserved identifiers says: — All identifiers that begin with an underscore and either an uppercase letter or another underscore are always reserved for any use.All identifiers that begin with an underscore are always reserved for use as identifiers with file scope in both the ordinary and tag name spaces. See also What does double underscore (__const) mean in C?Greaser
G
-2

Just include all external headers in one common header file in your project, e.g. global.h and include it in all your c files:

It can look like this:

#ifndef GLOBAL_GUARD
#define GLOBAL_GUARD

#include <glib.h>
/*...*/
typedef int  YOUR_INT_TYPE;
typedef char YOUR_CHAR_TYPE;
/*...*/
#endif

This file uses include guard to avoid multiple inclusions, illegal multiple definitions, etc.

Giverin answered 26/11, 2009 at 16:4 Comment(5)
I would rather name it something like <projectname>.h or something, but make sure to note that if your including a lot of files, it can really slow down the total compilation time.Physicality
I think however, that global.h is ok. I have seen such convention in many serious projects.Giverin
This is OK if by external headers you mean outside the current project and ones that won't change often. If it is used to include all headers outside the current module/library it will cause a lot of unnecessary re-compilation when one header is changed that is not required for all source files.Owens
Very good idea, but you tell me to include it in .c or .h <u>or</u> files which doesn't really answer my question. I put it once in the .c file. It doesn't seem necessary for the compiler to do it in the .h too.Doig
This is widely recognized as directly harmful practice, since it creates a tight coupling between every unrelated file in your whole project. Don't do this, it is poor and dangerous design.Kamkama

© 2022 - 2024 — McMap. All rights reserved.