1. Changelog
1.1. Revision 11 - August 21st, 2023
-
Monday, February 6th, 2023 (see § 2.1 February 2023 Kona-Hybrid C++ meeting): WG21 (C++ Committee) votes in favor of
-
doing whatever C does with respect to keeping or removing
/if_empty
/prefix
, since there is an open National Body comment to do just that; and,suffix -
keeping the approved
-style of named constants as proposed by the AFNOR French National Body comment.__STDC_EMBED_ *
-
-
Monday, February 13th, 2023 (see § 2.2 January-February 2023 Virtual C meeting): WG14 (C Committee) reaffirms their direction in the working draft.
-
CWG Review!
-
Do not have both a prose explanation for the feature test macro, and then the actual specification with the editor’s note.
-
Remove redundant prose explanations for elements already shown in the grammar.
-
An
directive ➡ A#embed
directive.#embed -
directive ➡# embed
directive (no space between hash sign and#embed
directive name in wording).embed -
Rationale for needing at least one macro expansion per
directive (see § 7.1.1 Macro Expansion and the limit parameter).#embed -
Rationale for keeping the use of
as an operator a constraint violation for thedefined
directive (see § 7.1.2 defined and problematic undefined behavior).#embed -
Ensure that
parameters are expanded (and only their insides)limit (...)
-
1.2. Revision 10 - January 15th, 2023
-
Happy New Year 🥳🎊!
-
There were votes during the C++ Meeting to:
-
Make the previously-optional
for empty files and__has_embed
/if_empty
/prefix
parameters part of the core proposal.suffix -
Move the proposal to Core Working Group (CWG).
-
1.3. Revision 9 - October 15th, 2022
-
The C edition of the
paper was accepted into C23.#embed -
It was accepted with all of the optional components (
,if_empty
).__has_embed -
These will be kept optional in this paper, but this information may inform the C++ Standards Committee of its decisions.
-
1.4. Revision 8 - July 15th, 2022
-
Remove
spelling from C++ proposal (not the C proposal): C++ does not have a rule about__
being considered identical to__nodiscard__
in spelling, as explained in § 4.2.1.4 Why is `__param_name__` part of the grammar?.nodiscard -
Document discussion and reaction to discussion from June 2022 meeting in § 2.4 June 2022 Virtual C++ meeting.
-
Add an example for
withif_empty
.limit ( 0 ) -
Add some explanations and showings for
versus__has_embed
/suffix
/prefix
-style of parameters in § 6.1 `__has_embed(…) == 2`, `suffix`/`prefix`/`if_empty` parameters, both, or neither?.if_empty
1.5. Revision 7 - June 23rd, 2022
-
Add letter of support sent to the C Committee.
-
Add brief history.
-
Wording improvements:
-
Italicize definitions only once, not repeatedly, unless they are Words of Power.
-
Remove redundant/poorly copied text for embed parameters.
-
Move "Notes" into "Recommended Practice" sections.
-
Use textual descriptions for some of the mathematics.
-
-
Rename
tois_empty
, per C++ Standards Committee request.if_empty -
Make
/prefix
/suffix
optional, so as to be polled for inclusion in the next revision.if_empty -
Remove all mentions of "attribute", and produce solely parameter types.
-
Document discussion and reaction to discussion from June 2022 meeting in § 2.4 June 2022 Virtual C++ meeting.
1.6. Revision 6 - May 12th, 2022
-
Minor typo and grammar fixes in the wording.
1.7. Revision 5 - April 12th, 2022
-
Additional syntax changes based on feedback from Joseph Myers, Hubert Tong, Jens Maurer, other implementers, and users.
-
Minor wording tweaks and typo clean up.
-
An implementation available in Godbolt (since last revision as well and noted below).
-
The paper’s source code has been refactored:
-
Separated WG21 paper from WG14 paper.
-
Core paper together (rationale, reasoning), included in both C and C++ papers since rationale is identical.
-
-
Changed
to match feedback from last standards meeting, nominally that an empty resource returns__has_embed
instead of2
(but both decay to a truthy value during preprocessor conditional inclusion expressions). Modified the wording and the prose in § 4.4 `__has_embed` to match.1 -
The wording for the limit parameter (§ 7.2.6 Add a new sub-clause §15.4.2 under Resource Inclusion for Embed parameters [cpp.res.param]) adjusted to perform macro expansion, at least once. Exact wording may need help.
1.8. Revision 4 - June 15th, 2021
-
Vastly improve C++ wording after June 3rd, 2021 discussion.
-
Change syntax after comments from implementers of scanners / dependency trackers, and comments from implementers that were supported by users.
-
Add support for "named parameter" implementation extensions.
1.9. Revision 3 - April 15th, 2021
-
Added post C meeting fixes to prepare for hopeful success next meeting.
-
Added 2 more examples to C and C++ wording.
-
Vastly improved wording and reduced ambiguities in syntax and semantics.
-
Fixed various wording issues.
1.10. Revision 2 - October 25th, 2020
-
Added post C++ meeting notes and discussion.
-
Removed type or bit specifications from the
directive.#embed -
Moved "Type Flexibility" section and related notes to the Appendix as they are now unpursued.
1.11. Revision 1 - April 10th, 2020
-
Added post C meeting notes and discussion.
-
Added discussion of potential endianness.
-
Improved wording section at the end to be more detailed in handling preprocessor (which does not understand types).
1.12. Revision 0 - January 5th, 2020
-
Initial release.
2. Polls & Votes
The votes for the C++ Committee are as follows:
-
SF: Strongly in Favor
-
F: In Favor
-
N: Neutral
-
A: Against
-
SA: Strongly Against
2.1. February 2023 Kona-Hybrid C++ meeting
P1967 should include the macro’ed textual versions of has-embed-expressions results, as proposed (and as accepted by WG14).
SF | F | N | A | SA |
---|---|---|---|---|
20 | 12 | 2 | 1 | 0 |
Result: Consensus
P1967 should remove embed-standard-parameter
,
, and/or
, if WG14 also chooses to do so, for the purposes of C compatibility of this feature.
SF | F | N | A | SA |
---|---|---|---|---|
7 | 17 | 6 | 0 | 3 |
Result: Consensus
Forward P1967R10 (as modified by the above polls/WG14) to Core Working Group.
Result: Unanimous Consent
2.2. January-February 2023 Virtual C meeting
Week 1: January 23rd, 2023 - January 27th, 2023
AFNOR French National Body comment N3067, FR-130 was accepted, giving named constants in place of
,
, and
for the
returns with
,
,
.
Week 2: February 13th, 2023 - January 17th, 2023
AFNOR French National Body comment N3067, FR-130 was rejected, keeping the status quo of
as presented in this paper’s previous revisions and as accepted in its WG14 paper.
2.3. November 2022 Kona-Hybrid C++ meeting
Forward P1967R9, with both "optional" sections included to CWG for inclusion in C++26. This is as WG14 accepted.
SF | F | N | A | SA |
---|---|---|---|---|
7 | 5 | 2 | 2 | 2 |
Forward P1967R9, including section §7.3.6 (
with return value 2), but not §7.3.7 (
/
/
) to CWG for inclusion in C++26. This diverges from what was accepted by WG14.
SF | F | N | A | SA |
---|---|---|---|---|
6 | 6 | 4 | 3 | 2 |
The first poll had stronger consensus, so it was taken as the option to CWG.
2.4. June 2022 Virtual C++ meeting
"EWG encourages P1967 to define the form of vendor extensions as parameters to
?"
SF | F | N | A | SA |
---|---|---|---|---|
4 | 4 | 3 | 1 | 0 |
This was the result of consensus. The extensive discussion also made it clear that we must make sure that unrecognized embed parameters, due to them changing how an initializer may be formed, must be considered ill-formed. Users may get around this by using
. To dispel the notion that they may be optional, frontmatter wording was added to § 7.2.3 Add to the control-line production in §15.1 Preamble [cpp.pre] a new grammar production, as well as a supporting embed-parameter-seq production to make it clear the expectations.
Part of the discussion during this meeting was also whether or not the case for emptiness was useful. We moved the empty-based parameters to OPTIONAL pieces of wording, and expect to forward each of these on independent votes asides from the base proposal. This captures the sentiment of folks who may not have spoken up a lot during the meeting but nevertheless felt uneasy: we can simply go with whatever the poll says next meeting.
We took the feedback to rename
to
, since it is a better name for a "do-something-if-predicate-is-true" style attribute.
2.5. July 2021 Virtual C++ meeting
No votes were taken at this meeting, since it was mostly directional and about the changing of the syntax to better fit tools and scanners. In particular, it was more or less unanimously encouraged to:
-
re-do the syntax to be
instead of#embed header-name additional-tokens...
;#embed limit-parameter header-name -
the
should be reshaped into a parameter specification, giving both standard parameters (such as making a namedadditional - tokens
argument) and implementation-defined ones (such aslimit ( integer - constant )
);clang :: element_type ( short ) -
and, the wording should include some recommendation or specification for "as if by fread" to make wording easier.
All of these recommendations were incorporated below.
2.6. September 2020 Virtual C++ EWG Meeting
"We want
(no type name, no other specification) as a feature."
SF | F | N | A | SA |
---|---|---|---|---|
2 | 16 | 3 | 0 | 1 |
This vote gained the most consensus in the Committee. While there were some individuals who wanted to be able to specify a type, there was stronger interest in not specifying a type at all and always producing a list of integer literals suitable to be used anywhere an
was valid.
"We want to explore allowing an optional sequence of tokens to specify a type to
."
SF | F | N | A | SA |
---|---|---|---|---|
1 | 9 | 4 | 4 | 3 |
Further need was also expressed for
of different types of variables, so we would rather focus that ability into a sister feature,
. There was also an expression to augment
to handle arrays of data, which would be a follow-on proposal. There was a great amount of interest in the
direction, which means a paper should be written to follow up on it.
2.7. April 2020 Virtual C Meeting
"We want to have a proper preprocessor
over a
-based directive."
This had UNANIMOUS CONSENT to pursue a proper preprocessor directive and NOT use the
syntax. It is noted that the author deems this to be the best decision!
The following poll was later superseded in the C and C++ Committees.
"We want to specify embed as using
rather than
." (2-way poll.)
Y | N | A |
---|---|---|
10 | 2 | 3 |
-
Y: 10 bits-per-element (Ye)
-
N: 2 type-based (Nay)
-
A: 4 Abstain (Abstain)
This poll will be a bit harder to accommodate properly. Using a
that produces a numeric constant means that the max-length specifier is now ambiguous. The syntax of the directive may need to change to accommodate further exploration.
3. Introduction
For well over 40 years, people have been trying to plant data into executables for varying reasons. Whether it is to provide a base image with which to flash hardware in a hard reset, icons that get packaged with an application, or scripts that are intrinsically tied to the program at compilation time, there has always been a strong need to couple and ship binary data with an application.
Neither C nor C++ makes this easy for users to do, resulting in many individuals reaching for utilities such as `xxd`, writing python scripts, or engaging in highly platform-specific linker calls to set up `extern` variables pointing at their data. Each of these approaches come with benefits and drawbacks. For example, while working with the linker directly allows injection of very large amounts of data (5 MB and upwards), it does not allow accessing that data at any other point except runtime. Conversely, doing all of these things portably across systems and additionally maintaining the dependencies of all these resources and files in build systems both like and unlike `make` is a tedious task.
Thusly, we propose a new preprocessor directive whose sole purpose is to be `#include`, but for binary data: `#embed`.
3.1. Motivation
The reason this needs a new language feature is simple: current source-level encodings of "producing binary" to the compiler are incredibly inefficient both ergonomically and mechanically. Creating a brace-delimited list of numbers in C comes with baggage in the form of how numbers and lists are formatted. C’s preprocessor and the forcing of tokenization also forces an unavoidable cost to lexer and parser handling of values.
Therefore, using arrays with specific initialized values of any significant size becomes borderline impossible. One would think this old problem would be work-around-able in a succinct manner. Given how old this desire is (that comp.std.c thread is not even the oldest recorded feature request), proper solutions would have arisen. Unfortunately, that could not be farther from the truth. Even the compilers themselves suffer build time and memory usage degradation, as contributors to the LLVM compiler ran the gamut of the biggest problems that motivate this proposal in a matter of a week or two earlier this very year. Luke is not alone in his frustrations: developers all over suffer from the inability to include binary in their program quickly and perform exceptional gymnastics to get around the compiler’s inability to handle these cases.
C developer progress is impeded regarding the inability to handle this use case, and it leaves both old and new programmers wanting.
Finally, Microsoft has an ABI problem with its maximum string literal size that cannot be solved using string literals or anything treated like string literals, as the LLVM thread and the thread from Claire Xen make clear. It has also frustrated both C an C++ programmers alike, despite their best efforts. It was so frustrating that even extended-C-and-C++-compilers, like Circle, solve this problem with custom directives.
3.2. But How Expensive Is This?
Many different options as opposed to this proposal were seriously evaluated. Implementations were attempted in at least 2 production-use compilers, and more in private. To give an idea of usage and size, here are results for various compilers on a machine with the following specification:
-
Intel Core i7 @ 2.60 GHz
-
24.0 GB RAM
-
Debian Sid or Windows 10
-
Method: Execute command hundreds of times, stare extremely hard at `htop`/Task Manager
While `time` and `Measure-Command` work well for getting accurate timing information and can be run several times in a loop to produce a good average value, tracking memory consumption without intrusive efforts was much harder and thusly relied on OS reporting with fixed-interval probes. Memory usage is therefore approximate and may not represent the actual maximum of consumed memory. All of these are using the latest compiler built from source if available, or the latest technology preview if available. Optimizations at `-O2` (GCC & Clang style)/`/O2 /Ob2` (MSVC style) or equivalent were employed to generate the final executable.
3.2.1. Speed
Strategy | 40 kilobytes | 400 kilobytes | 4 megabytes | 40 megabytes |
---|---|---|---|---|
`#embed` GCC | 0.236 s | 0.231 s | 0.300 s | 1.069 s |
`xxd`-generated GCC | 0.406 s | 2.135 s | 23.567 s | 225.290 s |
`xxd`-generated Clang | 0.366 s | 1.063 s | 8.309 s | 83.250 s |
`xxd`-generated MSVC | 0.552 s | 3.806 s | 52.397 s | Out of Memory |
3.2.2. Memory Size
Strategy | 40 kilobytes | 400 kilobytes | 4 megabytes | 40 megabytes |
---|---|---|---|---|
`#embed` GCC | 17.26 MB | 17.96 MB | 53.42 MB | 341.72 MB |
`xxd`-generated GCC | 24.85 MB | 134.34 MB | 1,347.00 MB | 12,622.00 MB |
`xxd`-generated Clang | 41.83 MB | 103.76 MB | 718.00 MB | 7,116.00 MB |
`xxd`-generated MSVC | ~48.60 MB | ~477.30 MB | ~5,280.00 MB | Out of Memory |
3.2.3. Analysis
The numbers here are not reassuring that compiler developers can reduce the memory and compilation time burdens with regard to large initializer lists. Furthermore, privately owned compilers and other static analysis tools perform almost exponentially worse here, taking vastly more memory and thrashing CPUs to 100% for several minutes (to sometimes several hours if e.g. the Swap is engaged due to lack of main memory). Every compiler must always consume a certain amount of memory in a relationship directly linear to the number of tokens produced. After that, it is largely implementation-dependent what happens to the data.
The GNU Compiler Collection (GCC) uses a tree representation and has many places where it spawns extra "garbage", as its called in the various bug reports and work items from implementers. There has been a 16+ year effort on the part of GCC to reduce its memory usage and speed up initializers (C Bug Report and C++ Bug Report). Significant improvements have been made and there is plenty of room for GCC to improve here with respect to compiler and memory size. Somewhat unfortunately, one of the current changes in flight for GCC is the removal of all location information beyond the 256th initializer of large arrays in order to save on space. This technique is not viable for static analysis compilers that promise to recreate source code exactly as was written, and therefore discarding location or token information for large initializers is not a viable cross-implementation strategy.
LLVM’s Clang, on the other hand, is much more optimized. They maintain a much better scaling and ratio but still suffer the pain of their token overhead and Abstract Syntax Tree representation, though to a much lesser degree than GCC. A bug report was filed but talk from two prominent LLVM/Clang developers made it clear that optimizing things any further would require an extremely large refactor of parser internals with a lot of added functionality, with potentially dubious gains. As part of this proposal, the implementation provided does attempt to do some of these optimizations, and follows some of the work done in this post to try and prove memory and file size savings. (The savings in trying to optimize parsing large array literals were "around 10%", compared to the order-of-magnitude gains from `#embed` and similar techniques).
Microsoft Visual C (MSVC) scales the worst of all the compilers, even when given the benefit of being on its native operating system. Both Clang and GCC outperform MSVC on Windows 10 or WINE as of the time of writing.
Linker tricks on all platforms perform better with time (though slower than `#embed` implementation), but force the data to be optimizer-opaque (even on the most aggressive "Link Time Optimization" or "Whole Program Optimization" modes compilers had). Linker tricks are also exceptionally non-portable: whether it is the `incbin` assembly command supported by certain compilers, specific invocations of `rc.exe`/`objcopy` or others, non-portability plagues their usefulness in writing Cross-Platform C (see Appendix for listing of techniques). This makes C decidedly unlike the "portable assembler" advertised by its proponents (and my Professors and co-workers).
3.3. Support
To say that `#embed` enjoys broad C Community support is an understatement. In all the years we have written proposals for C and C++, this is the only one where someone physically mailed us a letter - from a different country - directly to the Standards Body to try and make a case for the feature directly, rather than what was already in the paper: