How to extract the source filename without path and suffix at compile time?
Asked Answered
P

7

36

Using both gcc with -std=c11 and g++ with -std=c++14.

E.g. for a file named src/dir/Hello.cxx it should expand to something like e.g.:

const char basename[] = "Hello";

or

const char basename[] = getStaticBasename(__FILE__);

as where getStaticBasename() is a macro (for C sources) or constexpr function (for C++ sources) which results to "Hello".

I have to avoid splitting the string from __FILE__ at runtime, because the path and suffix must not be compiled into the executable in any way.

The solution must be without dependencies to huge libraries such as boost.

As I have no makefiles, solutions like this cannot be used in my case.

Did one have a solution for that?

Edit 2015-07-02:

  • I have no influence on how the compiler and linker was invoked (sometimes via makefile, sometimes from command line, or some IDE (Eclipse CDT managed make, Crossworks, Xcode et cetera. So the solution needs to be in code only.
  • My use case is to provide some kind of "generic region identifier" for a small footprint logging solution. The application code (which uses my logger) should only #include <Joe/Logger.h> and within the later calls to e.g. LOG_DEBUG(...) I'll implicitely take use of the automatically generated "generic region identifier".
  • My current solution is that the application code have to declare a JOE_LOG_FILE_REGION(Hello); (after #include <Joe/Logger.h>) before it could place LOG_DEBUG(...) in its code.
Periphrasis answered 25/6, 2015 at 12:14 Comment(10)
The C++ one is fairly easy. I can't think of how it would be done in C.Meade
I think it's best to split this question into two, as c++ can look very differently due to possibility of using constexprs.Roister
Question is not duplicated because you want to do it at compile time but one answer there is what you need: __FILE__ macro shows full pathRebutter
@DarkFalcon Why don't you add your C++ solution as an answer, or at least give some hints? Yes it won't answer the C part, but it's still better than nothing.Cystocarp
@JoachimPileborg: I'm afraid I lack the time (and maybe the smarts) to put that together, but here is an implementation by someone else which provides the harder bits (the substr): github.com/dscharrer/void/blob/master/c%2B%2B/…Meade
The makefile solution can be adapted to the command line. Just have some script to build your thing, or even use some script instead of g++Cheops
I need a solution that is independent on how the compiler is invoked.Periphrasis
Yep, I think I'll split it into two questions for C and C++Periphrasis
Maybe duplicate of https://mcmap.net/q/121606/-getting-base-name-of-the-source-file-at-compile-time/2506522Standup
Since C++20 there is std::source_location.Select
P
39

1. gcc builtin function can get the file name of a full path at compile time.

#define __FILENAME__ (__builtin_strrchr(__FILE__, '/') ? __builtin_strrchr(__FILE__, '/') + 1 : __FILE__)

or

#define __FILENAME__ (strrchr(__FILE__, '/') ? strrchr(__FILE__, '/') + 1 : __FILE__)

2. c++11 constexpr also can do this at compile time.

c++11 constexpr function can only use a return-statement.

example:

#include <stdio.h>

constexpr const char* str_end(const char *str) {
    return *str ? str_end(str + 1) : str;
}

constexpr bool str_slant(const char *str) {
    return *str == '/' ? true : (*str ? str_slant(str + 1) : false);
}

constexpr const char* r_slant(const char* str) {
    return *str == '/' ? (str + 1) : r_slant(str - 1);
}
constexpr const char* file_name(const char* str) {
    return str_slant(str) ? r_slant(str_end(str)) : str;
}

int main() {
    constexpr const char *const_file = file_name(__FILE__);
    puts(const_file);
    return 0;
}

source file name is foo/foo1/foo2/foo3/foo4.cpp

use g++ -o foo.exe foo/foo1/foo2/foo3/foo4.cpp -std=c++11 --save-temps to compile this file.

you can see this.

.file   "foo4.cpp"
        .section        .rodata
.LC0:
        .string "foo/foo1/foo2/foo3/foo4.cpp"
        .text
        .globl  main
        .type   main, @function
main:
.LFB4:
        .cfi_startproc
        pushq   %rbp
        .cfi_def_cfa_offset 16
        .cfi_offset 6, -16
        movq    %rsp, %rbp
        .cfi_def_cfa_register 6
        subq    $16, %rsp
        movq    $.LC0+19, -8(%rbp) 
        movl    $.LC0+19, %edi
        call    puts
        movl    $0, %eax
        leave
        .cfi_def_cfa 7, 8
        ret
        .cfi_endproc
.LFE4:
        .size   main, .-main
        .ident  "GCC: (Ubuntu 4.8.4-2ubuntu1~14.04.3) 4.8.4"
        .section        .note.GNU-stack,"",@progbits

movl $.LC0+19, %edi .LC0 + 19 is the address of file name string without path and suffix

3. c++14 constexpr function can do this in a simple way

#include <iostream>

constexpr const char* file_name(const char* path) {
    const char* file = path;
    while (*path) {
        if (*path++ == '/') {
            file = path;
        }
    }
    return file;
}

int main() {
    constexpr const char* file = file_name(__FILE__);
    std::cout << file << std::endl;
    return 0;
}

c++14 constexpr function can use loop and local variable.

the file_name function will replace with a address of const char * at compiler time. ~

Pugilist answered 7/7, 2016 at 4:5 Comment(7)
At least with gcc 6.3.0, the compiler does not allow __builtin_strrchr to be used in the initializer of a static variable, because "initializer element is not constant".Fendley
I've tried your example with plain strrchr and strstr and in both cases optimization took place, so it seems it is not necessary to use __builtin_strrchr. Compiler: "GCC: (Debian 6.3.0-18+deb9u1) 6.3.0 20170516".Asgard
cute, they're extending the preprocessor but at the same time they don't give it full eval power of switching to php, so it can't delete files/google things/fetch/send network requests from preprocessorBikaner
i tried to use your #define __FILENAME__ with constexpr const char* basename=__FILENAME__ but my g++ 4.8.5 complains about a function not being constexpr. so it seems that this solution depends on having compiler optimizations enabled.Chou
I prefer your c++ only solution because it is always compile time (and you don't have to worry about optimizer being turned off and then your #define is runtime).Chou
I wonder if compile times will be affected in projects that use the latter metafunction heavily.Mcsweeney
These solutions leave the suffix so they does not satisfy the question. Or am I missing something?Danell
D
12

extract the base filename at compile time with no preprocessor tricks and no external scripts? c++14? no problem sir.

#include <iostream>
#include <string>

using namespace std;

namespace detail {
    constexpr bool is_path_sep(char c) {
        return c == '/' || c == '\\';
    }

    constexpr const char* strip_path(const char* path)
    {
        auto lastname = path;
        for (auto p = path ; *p ; ++p) {
            if (is_path_sep(*p) && *(p+1)) lastname = p+1;
        }
        return lastname;
    }

    struct basename_impl
    {
        constexpr basename_impl(const char* begin, const char* end)
        : _begin(begin), _end(end)
        {}

        void write(std::ostream& os) const {
            os.write(_begin, _end - _begin);
        }

        std::string as_string() const {
            return std::string(_begin, _end);
        }

        const char* const _begin;
        const char* const _end;
    };

    inline std::ostream& operator<<(std::ostream& os, const basename_impl& bi) {
        bi.write(os);
        return os;
    }

    inline std::string to_string(const basename_impl& bi) {
        return bi.as_string();
    }

    constexpr const char* last_dot_of(const char* p) {
        const char* last_dot = nullptr;
        for ( ; *p ; ++p) {
            if (*p == '.')
                last_dot = p;
        }
        return last_dot ? last_dot : p;
    }
}

// the filename with extension but no path
constexpr auto filename = detail::strip_path(__FILE__);
constexpr auto basename = detail::basename_impl(filename, detail::last_dot_of(filename));

auto main() -> int
{
    cout << filename << endl;
    cout << basename << endl;

    cout << to_string(basename) << endl;

    return 0;
}
Detwiler answered 1/7, 2015 at 21:35 Comment(7)
plast points to the last path separator (or beginning of string if there is none) so you want to return one past the separator to return the filename. However, you also don't want to return an empty string if the path happens to end with a slash (it's a corner case I know, but still...)Detwiler
cute, this constexpr thing really wants to become it's own programming language like PHP.Bikaner
@Dmitry re "constexpr .. it's own programming language": myself and many articles I've read do consider constexpr it's own language. in addition there is the c++ templates... they are another language. preprocessor is another language.Chou
RichardHodges your implementation is about 2 or 3 times more lines of code than @pexeer's solution and uses two for loops, uses struct, uses std::string, uses operator overloading, uses std::ostream. what features/added-benefit does your implementation do compared to @pexeer's?Chou
@TrevorBoydSmith as far as I can see, I'm offering a little more utility in terms of providing ostream and to_stream overloads, and handling both windows and unix paths. However, peexer's solution was written a year after mine so he may have had more time to digest the question.Detwiler
@RichardHodges: More than that, yours is the only C++ solution that actually removes the suffix.Mcmahan
std::string_view in C++17 can greatly reduce the line of code (basename_impl).Suspicious
M
9

If you run gcc from the folder where the source file is located, you will get a different __FILE__ than if you pass an absolute path (i.e. handed to gcc through an IDE).

  • gcc test.c -otest.exe gives me __FILE__ as test.c.
  • gcc c:\tmp\test.c -otest.exe gives me __FILE__ as c:\tmp\test.c.

Perhaps calling gcc from the path where the source is located is sufficient as work-around?


EDIT

Here is a "dirty" but safe hack which removes the file extension in compile-time. Not really something I'd recommend, but it was fun to write :) So take it for what it is worth. It only works in C.

#include <stdio.h>

#define EXT_LENGTH (sizeof(".c") - 1) // -1 null term

typedef union
{
  char filename_no_nul [sizeof(__FILE__)-EXT_LENGTH-1]; // -1 null term
  char filename_nul    [sizeof(__FILE__)-EXT_LENGTH];
} remove_ext_t;

int main (void)
{
  const remove_ext_t file = { __FILE__ };

  puts(file.filename_nul);

  return 0;
}

The union allocates one member which is large enough to hold the full path minus extension and null terminator. And it allocates one member which is large enough to hold the full path minus extension, though with a null terminator.

The member which is too small to hold the full __FILE__ is initialized with as much of __FILE__ as can fit. This is ok in C but not allowed in C++. If __FILE__ contains test.c, the union member will now be initialized to contain test with no null terminator.

There will however still be trailing zeroes after that string, because this hack abuses the fact that the other union member has been initialized according to the rules of "aggregate/union" initialization. This rule forces any remaining items in the "aggregate" to be initialized as if they had static storage duration, i.e to zero. Which happens to be the value of the null terminator.

Madian answered 25/6, 2015 at 12:55 Comment(4)
This is really smart, what can I say? ;).Marishamariska
Yes, quite hackish, and I know some who would shoot me with a rope using such that. However, you could pack that into a header using 'static` and include in every file.Agora
There is text in the standard saying “If there are fewer initializers in a brace-enclosed list than there are elements or members of an aggregate, or fewer characters in a string literal used to initialize an array of known size than there are elements in the array, the remainder of the aggregate shall be initialized implicitly the same as objects that have static storage duration.” But this cannot literally be true, or union { int a, b; } = { 3 } would leave 0 in the union. It must be interpreted in light of…Flagelliform
… “When no designations are present, subobjects of the current object are initialized in order according to the type of the current object: array elements in increasing subscript order, structure members in declaration order, and the first named member of a union.”Flagelliform
M
5

It turns out to be very simple, you just need the #line preprocessor directive, example

#line 0 "Hello"

at the top of the file, this as is, if all you want is to hide the file name completely then

#line 0 ""

would work.

If you don't want to use Makefiles, you can use this

file=cfile;
content=$(sed -e "1s/^/#line 0 \"$file\"\n/" example/${file}.c);
echo $content | gcc -xc -O3 -o ${file} -

The -xc gcc flag above means (from gcc's documentation):

-x language:

Specify explicitly the language for the following input files (rather than letting the compiler choose a default based on the file name suffix). This option applies to all following input files until the next -x option. Possible values for language are:

          c  c-header  cpp-output
          c++  c++-header  c++-cpp-output
          objective-c  objective-c-header  objective-c-cpp-output
          objective-c++ objective-c++-header objective-c++-cpp-output
          assembler  assembler-with-cpp
          ada
          f77  f77-cpp-input f95  f95-cpp-input
          go
          java

If you don't have any sort of script that helps you building the source then there is no way to do it I think.

Also, you can see from the above quote of the gcc documentation, that you can save the files without any extension at all, and then combine @Lundin's original solution with this and use

gcc -xc -o file filename_without_extension

in this case __FILE__ would expand to "filename_without_extension", and you would achieve what you want, although you need to compile the file in the same directory where it lives, because otherwise it will contain the path to the file.

Marishamariska answered 25/6, 2015 at 12:35 Comment(10)
How do you put the string "hello" in the source file (i.e. in the #line directive)? The file name can be anything, not just hello.cpp.Giddy
That becomes a nightmare when renaming files.Agora
@Olaf which is why using a Makefile is the best solution.Marishamariska
This doesn't look very helpful. If you have to place the string of interest into the source, anyhow, you can just as well use it directly to initialize the variable.Psychopath
That would hardly help as such inside the code. (but would allow to pass a macro)Agora
@Olaf you are right, I don't think there is a solution that meets all the requirements. The good thing about this solution is that it doesn't require changing the __FILE__ macro if it was already used.Marishamariska
@JensGustedt You can of course downvote if you like.Marishamariska
If have required similar, but I pass it from SCons by -DFILE_BASE=.... The name is processed in Python/SCons that way.Agora
Why do you use scons? it's so complicated to use python tools, it's version 3 or 2 and all that cr*p... But yes, you are right, that would be a perfect solution, if the OP wanted to use build scripts of any kind. I don't like scons because my linux distro has a package manager rolled by my self, so if I need some new package I must compile it from source, which is a nightmare when the build tools are scons or waf.Marishamariska
What I meant is that you could just do gcc -O3 -o ${file} -DFILE_BASE="\"${file}\"" "${file}.c" directly without passing your source through sed and complicated stuff like that.Psychopath
V
3

The most voted solution does not rely answer the OP, because the complete file path is stored in the binary and only a pointer to the last part of the path is computed (from the last '/' character) and used.

See assembly output of the proposed solution in @pexeer answer:

.LC0:
        .string "/app/example.cpp"
main:
        push    rax
        mov     esi, OFFSET FLAT:.LC0+5
        mov     edi, OFFSET FLAT:_ZSt4cout
        call    std::basic_ostream<char, std::char_traits<char> >& std::operator<< <std::char_traits<char> >(std::basic_ostream<char, std::char_traits<char> >&, char const*)
        mov     rdi, rax
        call    std::basic_ostream<char, std::char_traits<char> >& std::endl<char, std::char_traits<char> >(std::basic_ostream<char, std::char_traits<char> >&)
        xor     eax, eax
        pop     rdx
        ret
_GLOBAL__sub_I_main:
        push    rax
        mov     edi, OFFSET FLAT:_ZStL8__ioinit
        call    std::ios_base::Init::Init() [complete object constructor]
        mov     edx, OFFSET FLAT:__dso_handle
        mov     esi, OFFSET FLAT:_ZStL8__ioinit
        pop     rcx
        mov     edi, OFFSET FLAT:_ZNSt8ios_base4InitD1Ev
        jmp     __cxa_atexit

To avoid storing the complete file path, you'll need something like this:

#include <iostream>
#include <utility>

constexpr const char* file_name(const char* path) {
    const char* file = path;
    while (*path) {
        if (*path++ == '/') {
            file = path;
        }
    }
    return file;
}

constexpr size_t file_length(const char * path) {
    size_t i = 0;
    const char * file = file_name(path);
    while (*file) { i ++; file++; }
    return i;
}

template<std::size_t... I>
const char * print_impl(std::index_sequence<I...>) {
    static const char file[file_length(__FILE__)+1] = { file_name(__FILE__)[I]...};
    return file;
}

inline const char* print_file() {
    return print_impl(std::make_index_sequence<file_length(__FILE__) + 1>());
}


int main() {
    std::cout<<print_file()<<std::endl;
    return 0;
}

and you'll get this assembly output (where the complete file path isn't stored):

main:
        push    rax
        mov     esi, OFFSET FLAT:print_impl<0ul, 1ul, 2ul, 3ul, 4ul, 5ul, 6ul, 7ul, 8ul, 9ul, 10ul, 11ul>(std::integer_sequence<unsigned long, 0ul, 1ul, 2ul, 3ul, 4ul, 5ul, 6ul, 7ul, 8ul, 9ul, 10ul, 11ul>)::file
        mov     edi, OFFSET FLAT:_ZSt4cout
        call    std::basic_ostream<char, std::char_traits<char> >& std::operator<< <std::char_traits<char> >(std::basic_ostream<char, std::char_traits<char> >&, char const*)
        mov     rdi, rax
        call    std::basic_ostream<char, std::char_traits<char> >& std::endl<char, std::char_traits<char> >(std::basic_ostream<char, std::char_traits<char> >&)
        xor     eax, eax
        pop     rdx
        ret
_GLOBAL__sub_I_main:
        push    rax
        mov     edi, OFFSET FLAT:_ZStL8__ioinit
        call    std::ios_base::Init::Init() [complete object constructor]
        mov     edx, OFFSET FLAT:__dso_handle
        mov     esi, OFFSET FLAT:_ZStL8__ioinit
        pop     rcx
        mov     edi, OFFSET FLAT:_ZNSt8ios_base4InitD1Ev
        jmp     __cxa_atexit
print_impl<0ul, 1ul, 2ul, 3ul, 4ul, 5ul, 6ul, 7ul, 8ul, 9ul, 10ul, 11ul>(std::integer_sequence<unsigned long, 0ul, 1ul, 2ul, 3ul, 4ul, 5ul, 6ul, 7ul, 8ul, 9ul, 10ul, 11ul>)::file:
        .string "example.cpp"

Example here

The basic idea here is to construct a statically initialized char array containing only the string you want (and not a pointer to a static char array containing the full file path). Deducing the file length is trivial but required since we can't call strlen in a constexpr function.

Then the trick is to use a integer sequence as indices in the file's pointed array (like natural declaration: const char f[] = {"str"[0], "str"[1], ...}). The integer sequence can be used in variadic template instantiation, so it must be called in such context.

GCC leaks the print_impl function as a symbol (so it's likely larger that the file's full path), but it can be stripped later on at linker step (or with strip --strip-all /path/to/binary)

Vasya answered 25/3, 2022 at 13:54 Comment(1)
this is the wayTrek
C
3

Unfortunately it seems everyone is busy removing the unwanted part of the path via various magical ways in code (--> most of them not working).

In my opinion the correct way is to tell the compiler to change/remove the path from the macro avoiding all need for tinkering. With gcc the parameter is called fmacro-prefix-map. You would use it like this:

-fmacro-prefix-map=/path/to/source/=

to change "/path/to/source/main.cpp" to just "main.cpp"

By the way: this also works for std::source_location and of course the full path (unaltered) is not stored in the resulting binary.

Coverup answered 30/6, 2023 at 12:43 Comment(1)
A RTFM answer that's very appropriate. The answers above will/can extract the basename without the extension (since they process the string) though.Vasya
F
0

I simplified @Lundin's solution to use an anonymous union:

#include <stdio.h>

#define BASE_FILE(ext) (((union { char const f[sizeof(__FILE__) - (sizeof(ext))]; }){__FILE__}).f)
#define BASE_C_FILE BASE_FILE(".c")

int main()
{
    puts(BASE_C_FILE);
    return 0;
}

If the extension is not .c, you can just call BASE_FILE("<extension>").

The main problem with this solution is that BASE_FILE(ext) expands to a char const[] instead of a string literal like __FILE__.

Also it causes a warning such as:

warning: initializer-string for array of ‘char’ is too long

depending on your compiler.

I also removed the non-null terminated string, as it's basically a segfault generator.

Flavin answered 8/10, 2023 at 11:13 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.