Benefits of compiling po files to mo

About

Asked 12/12, 2018 at 12:37 Answered 17/12, 2018 at 16:24

What's the benefit and primary reason for compiling GNU gettext .po (Portable Object) files to .mo (Machine Object) ?

I saw many programs reading/parsing .po directly.

I'm not using wordpress but on their docs it says:

https://codex.wordpress.org/I18n_for_WordPress_Developers

PO files are compiled to binary MO files, which give faster access to the strings at run-time

Is faster access true? PO can be read only once and cached in some hash table, the same probably goes for MO

Hawking answered 12/12, 2018 at 12:37 Comment(0)

There are several reasons:

You should always compile PO files with msgfmt --check which performs several important checks on the PO file, not only a syntax check. For example, if you you are using printf format strings, it will check that %-expansions in the translation match the original string. Failure to do so, may result in crashs at runtime. There are a lot more checks, depending on the (programming) language.
Reading a binary MO file is usually faster and simpler than parsing a textual PO file.
PO files often have translation entries that should not be used for production, for example fuzzy or obsolete entries.
Many PO parsers are buggy or incomplete.
It is part of the gettext API. Translations are expected to be located by default under /usr/share/locale/LOCALE/LC_MESSAGES/TEXTDOMAIN.mo and they are expected to be in MO format, not in PO format. That does, of course, not apply to the countless libraries that just implement a subset of the gettext API.

Jar answered 17/12, 2018 at 16:24 Comment(6)

Many programs load translation only once on startup. I don't think performance is that important unless loading it every time. – Hawking 17/12, 2018 at 16:53

Compiling C sources on the fly before executing binaries would also just contribute to their startup time, and still it is a customary thing to do. – Jar 20/12, 2018 at 14:1

See also reason #5 that I have just added. – Jar 20/12, 2018 at 14:2

@Hawking I think you are only looking at desktop applications or long-running daemons/services. But for CLI programs, the startup performance is important because they are often invoked from shell scripts, maybe even in loops. – Jar 8/7, 2020 at 19:54

"Translations are expected to be located under /usr/share/locale/LOCALE/LC_MESSAGES/TEXTDOMAIN.mo" - this seems very odd - why would applications not just keep their translations inside their own folder structure? What else would want to read them? – Greeson 12/8 at 14:34

@RobGrant, on Unix systems, applications do no have their own directory structure. The executable images are in one directory (for example /usr/bin) configuration in /etc, application data in /usr/share and so on. Besides, you can change the path to the translation files with the bindtextdomain() library function. – Jar 14/8 at 5:44

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Recommended topics

Hot tags