One of the satisfactions of recreational mathematics comes from finding better solutions for problems thought to have been already solved in the best possible way. Consider the following digital problem that appears as Number 81 in Henry Ernest Dudeney’s Amusements in Mathematics. (There is a Dover reprint of this 1917 book.) Nine digits (0 is excluded) are arranged in two groups. On the left a three-digit number is to be multiplied by a two-digit number. On the right both numbers have two digits each:
\[158\times 23 = 79\times 46\]In each case the product is the same: 3,634. How, Dudeney asked, can the same nine digits be arranged in the same pattern to produce as large a product as possible, and a product that is identical in both cases? Dudeney’s answer, which he said “is not to be found without the exercise of some judgment and patience,” was 5,568:
\[174\times 32 = 96\times 58\]Victor Meally of Dublin County in Ireland later greatly improved on Dudeney’s answer with 7,008:
\[584\times 12 = 96\times 73\]This remained the record until a Japanese reader found an even better solution. It is believed, although it has not yet been proved, to give the highest possible product. Can you find it without the aid of a computer?
With the aid of a computer (code), it’s easy to confirm that the Japanese reader’s solution is indeed the best of the 11 basic solutions (and Meally’s the runner-up):
134 * 29 = 58 * 67 = 3886
158 * 23 = 46 * 79 = 3634
138 * 27 = 54 * 69 = 3726
174 * 23 = 58 * 69 = 4002
146 * 29 = 58 * 73 = 4234
259 * 18 = 63 * 74 = 4662
186 * 27 = 54 * 93 = 5022
158 * 32 = 64 * 79 = 5056
174 * 32 = 58 * 96 = 5568
584 * 12 = 73 * 96 = 7008
532 * 14 = 76 * 98 = 7448
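That confirmation is a one-page brute force over the 9! arrangements of the digits; here is a sketch (the variable names are mine, not from the original code):

```python
from itertools import permutations

# Try every arrangement of the nine nonzero digits as abc x de = fg x hi,
# keeping each distinct solution once (the right-hand pair is unordered).
solutions = set()
for p in permutations('123456789'):
    abc, de = int(''.join(p[0:3])), int(''.join(p[3:5]))
    fg, hi = int(''.join(p[5:7])), int(''.join(p[7:9]))
    if abc * de == fg * hi:
        solutions.add((abc, de) + tuple(sorted((fg, hi))))

# Print the basic solutions in increasing order of product.
for abc, de, fg, hi in sorted(solutions, key=lambda s: s[0] * s[1]):
    print(f"{abc} * {de} = {fg} * {hi} = {abc * de}")
```

Running it reproduces exactly the eleven lines above.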
But now consider the ruminations of another Irishman, James Joyce’s Leopold Bloom, who in Chapter 17 of Ulysses recounts how he became aware of
the existence of a number computed to a relative degree of accuracy to be of such magnitude and of so many places, e.g., the 9th power of the 9th power of 9, that […] 33 closely printed volumes of 1000 pages each of innumerable quires and reams of India paper would have to be requisitioned in order to contain the complete tale of its printed integers […] the nucleus of the nebula of every digit of every series containing succinctly the potentiality of being raised to the utmost kinetic elaboration of any power of any of its powers.
There’s some confusion as to what number Joyce was really talking about (if any); but the mathematical community has apparently settled on \(9^{9^9}\), as seen in e.g. the “Ulysses sequence” OEIS A054382: \(\lceil\log_{10} 1^{1^1}\rceil, \lceil\log_{10} 2^{2^2}\rceil, \lceil\log_{10} 3^{3^3}\rceil,\ldots\)
The ninth element of this sequence — the number of decimal digits in \(9^{9^9}\) — is \(369\,693\,100\). If “closely printed” at the resolution of A Million Random Digits with 100,000 Normal Deviates, printed by the RAND Corporation (no relation) in 1955, the recording of the entire expansion of \(9^{9^9}\) would take up 148 thousand-page volumes. (Or, if you postulate a thousand physical pages, each printed on both sides: 74 volumes.)
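The digit count is a quick logarithm away. A sketch (the 2,500-digits-per-page figure is my assumption, matching RAND’s 50-line-by-50-digit layout):

```python
import math

# digits(N) = floor(log10 N) + 1; here N = 9^(9^9), so log10 N = 9^9 * log10(9).
# The fractional part is nowhere near an integer, so double precision is safe.
exponent = 9 ** 9                       # 387,420,489
digits = math.floor(exponent * math.log10(9)) + 1
print(digits)                           # 369693100

# RAND-style pages hold 50 lines of 50 digits = 2,500 digits per page.
per_volume = 2500 * 1000                # thousand-page volumes
print(math.ceil(digits / per_volume))   # 148
```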
Suppose we allow solutions of Dudeney’s problem to contain powers: not merely \(abc\times de = gh\times ij\), but for example \(ab^c\times d^e = g^h\times i^j\). Then there are five more basic solutions possible, the largest of which is
\[48^3 \times 9^1 = 2^7 \times 6^5\]The solutions are:
329 * 8^1 = 47 * 56 = 2632
574 * 9^1 = 63 * 82 = 5166
49^2 * 8^1 = 56 * 7^3 = 19208
1^67 * 8^5 = 2^9 * 4^3 = 32768 (also with 1^76)
48^3 * 9^1 = 2^7 * 6^5 = 995328
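These five are easy to machine-check; the sketch below just re-evaluates each claimed identity (with `^` written as Python’s `**`) and confirms the nine digits appear exactly once:

```python
import re

# (left side, right side, claimed product)
claims = [
    ("329 * 8**1",   "47 * 56",     2632),
    ("574 * 9**1",   "63 * 82",     5166),
    ("49**2 * 8**1", "56 * 7**3",   19208),
    ("1**67 * 8**5", "2**9 * 4**3", 32768),
    ("48**3 * 9**1", "2**7 * 6**5", 995328),
]
for lhs, rhs, product in claims:
    assert eval(lhs) == eval(rhs) == product
    digits = re.sub(r"\D", "", lhs + rhs)   # all nine digits, once each
    assert sorted(digits) == list("123456789")
print("all five check out")
```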
Clement Wood — compiler of The Best Irish Jokes (1926) — asserts, in the same Book of Mathematical Oddities (1927) which we previously mined in “Mathematical golf” (2023-03-23), that there are only two solutions to the double equality
29 * 6 = 58 * 3 = 174
39 * 4 = 78 * 2 = 156
Wood is correct, and admitting powers produces no further solutions to that puzzle.
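Wood’s constraint, as I read it from his two examples (my inference, not his wording), is that the six factor digits plus the three product digits use each of 1 through 9 exactly once. A brute-force check under that reading finds exactly those two solutions:

```python
from itertools import permutations

# Search ab * c = de * f = ghi, with all nine nonzero digits used once.
found = set()
for p in permutations('123456789'):
    ab, c = int(p[0] + p[1]), int(p[2])
    de, f = int(p[3] + p[4]), int(p[5])
    ghi = int(p[6] + p[7] + p[8])
    if ab * c == de * f == ghi:
        # A frozenset of the two factor pairs dedupes left/right swaps.
        found.add(frozenset({(ab, c), (de, f)}))

for pair in sorted(found, key=lambda s: min(s)):
    (a, b), (d, e) = sorted(pair)
    print(f"{a} * {b} = {d} * {e} = {a * b}")
```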
If you’re a fan of my “trivial relocatability” content, or just want to help relocation’s progress into the mainstream of C++, you might be particularly interested in these three GSoC sponsors:

Ste||ar HPX. Their ideas list includes a sequel to last summer’s successful project implementing hpx::uninitialized_relocate; this summer’s goal is to parallelize hpx::uninitialized_relocate (and perhaps also hpx::copy?) for ranges that overlap. Parallelizable relocation is a building block for parallel containers such as ParlayLib’s sequence<T>.
GCC. Their ideas list doesn’t include anything relocation-related — but you could propose something! For example: adding __is_trivially_relocatable(T) to the compiler, along these lines but better; implementing a way to mark types such as deque trivially relocatable (i.e. [[gnu::trivially_relocatable]]); and rewriting libstdc++’s __is_bitwise_relocatable in terms of your new __is_trivially_relocatable(T), so that aggregates containing deques get the same benefit as deque itself. (Godbolt.)
LLVM/Clang. Their ideas list doesn’t include anything relocation-related — but you could propose something! For example: Clang provides an __is_trivially_relocatable(T) builtin, but it often gives wrong answers (#69394, #77091). Fixing this would allow projects such as Folly, Abseil, and Qt to start conditionally using Clang’s builtin, say, in the Clang 19 timeframe. You could also implement relocation optimizations in libc++’s vector and swap_ranges.
Again, the application window for GSoC participants to submit a proposal application opens on March 18 and closes on April 2.
By “mathematical structure,” I mean something like how we make recipes and the metrics by which we might compare one recipe to another. For example, if our goal is to make Sandwich, we could do it like this:
Wave = Water + Wind
Steam = Fire + Water
Plant = Earth + Water
Sand = Earth + Wave
Tea = Plant + Steam
Sandwich = Sand + Tea
Or like this:
Wave = Water + Wind
Sand = Earth + Wave
Glass = Fire + Sand
Wine = Glass + Water
Sandwich = Sand + Wine
If we diagram these recipes, we find that the first one is shallower, in the sense that we combine only the primitive elements, elements made from the primitives (Wave, Steam, Plant), and elements made from those elements (Sand, Tea). But the second one is terser, in the sense that it is five lines long instead of six.
At first glance, the world of Infinite Craft forms a directed hypergraph with a lot of structure: each directed hyperedge connects a set of one or two vertices to a set of exactly one vertex. But a recipe isn’t merely a “path” on that hypergraph! Mathematicians define the hypergraph analogue of a “path” as a set of incident hyperedges — “incidence” meaning that the edges share at least one vertex. In this sense there is a “path” from the starting elements to Sandwich that consists of only a single hyperedge; namely,
Sandwich = Wind + Grilled Cheese
That definition of “path” doesn’t help us.
I asked MathOverflow, and they pointed me to another example of the same problem: addition chains. An “addition chain” for an integer \(n\) is a sequence starting with 1 and ending with \(n\), in which each subsequent element is the sum of two (not necessarily distinct) earlier elements. For example, we might make 31 in any of these three ways:
2 = 1 + 1 2 = 1 + 1 2 = 1 + 1
3 = 1 + 2 3 = 1 + 2 3 = 1 + 2
6 = 3 + 3 4 = 2 + 2 6 = 3 + 3
7 = 1 + 6 8 = 4 + 4 12 = 6 + 6
14 = 7 + 7 12 = 8 + 4 14 = 2 + 12
15 = 1 + 14 16 = 8 + 8 17 = 3 + 14
30 = 15 + 15 28 = 12 + 16 31 = 14 + 17
31 = 1 + 30 31 = 3 + 28
The middle chain is shallowest, but the right-hand one is tersest.
Addition chains are idiomatically written as just an increasing sequence of integers: (1 2 3 6 12 14 17 31). We don’t need to specify how each integer (say, 17) is constructed from the preceding elements, because it’s obvious. We could represent Infinite Craft recipes just as concisely — (Wave Sand Glass Wine Sandwich) — but that wouldn’t be very reader-friendly because it’s not obvious which two preceding elements combine to make, say, Wine.
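To see that the reconstruction really is mechanical for addition chains, here is a sketch that recovers a summand pair for each element of the compact notation:

```python
def expand(chain):
    """For each element after the first, find a pair of earlier
    elements that sums to it (for addition chains this is 'obvious')."""
    steps = []
    for k, n in enumerate(chain[1:], 1):
        prev = chain[:k]
        a, b = next((a, b) for a in prev for b in prev if a + b == n)
        steps.append((n, (a, b)))
    return steps

for n, (a, b) in expand((1, 2, 3, 6, 12, 14, 17, 31)):
    print(f"{n} = {a} + {b}")
```

For Infinite Craft recipes no such reconstruction is possible, which is exactly why the concise notation isn’t reader-friendly there.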
Finding the tersest addition chain is directly relevant to the world of computer programming. Suppose we want to calculate the 31st power of an unknown number in register A, using only multiplication. Then we can do any of:
mul A, A, B # A^2 mul A, A, B # A^2 mul A, A, B # A^2
mul A, B, C # A^3 mul A, B, C # A^3 mul A, B, C # A^3
mul C, C, D # A^6 mul B, B, D # A^4 mul C, C, D # A^6
mul A, D, E # A^7 mul D, D, E # A^8 mul D, D, E # A^12
mul E, E, F # A^14 mul D, E, F # A^12 mul B, E, F # A^14
mul A, F, G # A^15 mul E, E, G # A^16 mul C, F, G # A^17
mul G, G, H # A^30 mul F, G, H # A^28 mul F, G, R # A^31
mul A, H, R # A^31 mul C, H, R # A^31
Our “shallowness” metric translates into a measure of the data dependencies involved in these computations. The middle program, being the shallowest, is also the fastest on any machine with at least two multiplier units.
Another practically relevant metric for the “goodness” of a chain is its width: the number of registers it uses under an optimal register allocation (coloring). The left-hand recipe above is the narrowest, with width 2, whereas the others have width 3:
mul A, A, B # A^2 mul A, A, B # A^2 mul A, A, B # A^2
mul A, B, B # A^3 mul A, B, A # A^3 mul A, B, A # A^3
mul B, B, B # A^6 mul B, B, C # A^4 mul A, A, C # A^6
mul A, B, B # A^7 mul C, C, B # A^8 mul C, C, C # A^12
mul B, B, B # A^14 mul C, B, C # A^12 mul B, C, C # A^14
mul A, B, B # A^15 mul B, B, B # A^16 mul A, C, A # A^17
mul B, B, B # A^30 mul C, B, B # A^28 mul C, A, R # A^31
mul A, B, R # A^31 mul A, B, R # A^31
The left-hand recipe corresponds to Russian peasant multiplication, which always generates an addition chain of width 2. For in-depth coverage of various algorithms to generate addition chains, see Knuth Volume 2 §4.6.3 “Evaluation of Powers.”
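For concreteness, here is a sketch of the binary method as a chain generator; its output for 31 is exactly the left-hand chain above:

```python
def binary_chain(n):
    """Binary-method (Russian peasant) addition chain for n: a double
    for each bit after the leading 1, plus an add-one for each set bit.
    Always realizable in width 2."""
    chain = [1]
    for bit in bin(n)[3:]:          # bits of n after the leading 1
        chain.append(chain[-1] * 2)
        if bit == '1':
            chain.append(chain[-1] + 1)
    return chain

print(binary_chain(31))  # [1, 2, 3, 6, 7, 14, 15, 30, 31]
```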
Surprisingly, the “tersest chain” problem is non-trivial both in Infinite Craft and for addition chains. Knuth writes:
Several authors have published statements (without proof) that the binary method [that is, Russian peasant multiplication] actually gives the minimum possible number of multiplications. But that is not true. The smallest counterexample is \(n = 15\), when the binary method needs six multiplications, yet we can calculate \(y = x^3\) in two multiplications and \(x^{15} = y^5\) in three more, achieving the desired result with only five multiplications.
This suggests the algorithm Knuth calls the “factor method”; but yet again, you can find numbers whose optimal chain eludes both the binary method and the factor method! It appears that there is no fast (non-exponential-time) algorithm that generates an optimal addition chain for every input.
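Since no fast algorithm is known, finding a provably tersest chain means search. Here is a sketch of a depth-first search (restricted to increasing chains, which is safe, since every \(n\) has an increasing optimal chain); it confirms Knuth’s \(n = 15\) example needs only five additions:

```python
def shortest_chain(n):
    """Exhaustive search for a shortest increasing addition chain for n.
    Exponential time -- fine for small n, and a hint at why no fast
    general algorithm is known."""
    best = None
    def dfs(chain):
        nonlocal best
        if best is not None and len(chain) >= len(best):
            return  # can't beat the best chain found so far
        if chain[-1] == n:
            best = list(chain)
            return
        candidates = sorted({a + b for a in chain for b in chain
                             if chain[-1] < a + b <= n}, reverse=True)
        for s in candidates:
            dfs(chain + [s])
    dfs([1])
    return best

print(shortest_chain(15))  # a 6-element chain: five additions
```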
To get an intuitive sense of the difficulty — in particular, why no greedy algorithm helps — look again at our tersest route to Sandwich:
Wave = Water + Wind
Sand = Earth + Wave
Glass = Fire + Sand
Wine = Glass + Water
Sandwich = Sand + Wine
This route to Sandwich passes through Wine on the fourth step. Now, the tersest route to Wine itself is only three steps:
Plant = Earth + Water
Dandelion = Plant + Wind
Wine = Dandelion + Water
But if you make Wine by that route, you’ll never reach Sandwich in the optimum number of steps!
Similarly, our tersest route to 31 was (1 2 3 6 12 14 17 31), passing through 17 on the sixth step. There are two routes that make 17 in only five steps:
(1 2 4 8 9 17)
(1 2 4 8 16 17)
But if you make 17 by either of these routes, you’ll never reach 31 in the optimum number of steps!
Neill Clift of AdditionChains.com produced this example for me — my utmost thanks to him! According to Neill, there are exactly five optimal chains for 31 that contain the number 17: none of those chains contain (1 2 4 8). Meanwhile there are seventy-two other optimal chains for 31 that don’t contain 17 at all.
Still, knowing that Infinite Craft and addition chains are two examples of the same hypergraph structure doesn’t tell me whether there’s an accepted name for this particular hypergraph structure. If you have any leads, please pop over to MathOverflow and/or send me an email!
Observe that the addition-chain structure is commutative and associative, whereas the Infinite Craft structure is commutative but non-associative:
Plant = Water + Earth
Lava = Earth + Fire
Smoke = (Water + Earth) + Fire
Stone = Water + (Earth + Fire)
This makes much of Knuth’s discussion (particularly “Graphical representation” and the generation of equivalent dual addition chains) inapplicable to Infinite Craft.
To explore the Infinite Craft hypergraph offline — without stressing Neal’s backend or needing to evade his Cloudflare bot-detection filter — you can download a compressed database containing about 30,000 elements from Tom Fang’s GitHub, szdytom/infinite-craft-dictionary. Computing the tersest recipe for each element, and inventing a compact way to represent such a recipe in the database, is left as an exercise for the reader!
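As a starting point for that exercise, here is a sketch of iterative-deepening search over a toy recipe table (the table below is hard-coded from the recipes quoted earlier in this post; the real thing would load the downloaded database):

```python
from itertools import combinations_with_replacement

# Toy slice of the crafting table; keys are unordered ingredient pairs.
RECIPES = {frozenset(k): v for k, v in [
    (('Water', 'Wind'), 'Wave'),    (('Earth', 'Wave'), 'Sand'),
    (('Fire', 'Sand'), 'Glass'),    (('Glass', 'Water'), 'Wine'),
    (('Sand', 'Wine'), 'Sandwich'), (('Earth', 'Water'), 'Plant'),
    (('Fire', 'Water'), 'Steam'),   (('Plant', 'Steam'), 'Tea'),
    (('Sand', 'Tea'), 'Sandwich'),
]}

def tersest(target, start=('Water', 'Fire', 'Wind', 'Earth')):
    """Iterative deepening: the first depth at which DFS succeeds
    yields a recipe with the fewest lines."""
    def dfs(have, steps, budget):
        if steps and steps[-1][2] == target:
            return steps
        if budget == 0:
            return None
        for a, b in combinations_with_replacement(sorted(have), 2):
            new = RECIPES.get(frozenset((a, b)))
            if new is not None and new not in have:
                found = dfs(have | {new}, steps + [(a, b, new)], budget - 1)
                if found:
                    return found
        return None
    for depth in range(1, 12):
        found = dfs(frozenset(start), [], depth)
        if found:
            return found

for a, b, new in tersest('Sandwich'):
    print(f"{new} = {a} + {b}")
```

On this toy table it finds the five-line Wine route rather than the six-line Tea route, as it should.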
In “Sorting integer_sequence at compile time” (2024-02-19), I wrote:

I don’t know any good way to operate on a whole pack of Is... in value-space and then “return” them into type-space as Vector<Is...>.
So I had written a helper function to get each individual element of the result one by one, like this:
template<class> struct Sort;
template<int... Is> struct Sort<Vector<Is...>> {
static constexpr int GetNthElement(int i) {
int arr[] = {Is...};
std::ranges::sort(arr);
return arr[i];
}
template<class> struct Helper;
template<int... Indices> struct Helper<Vector<Indices...>> {
using type = Vector<GetNthElement(Indices)...>;
};
using type = Helper<Iota<sizeof...(Is)>>::type;
};
Alert reader “Chlorie” writes in to tell me the proper approach. We just need our helper to return a std::array of all the elements at once, and then — still — use a pack-expansion to unpack the elements out of that constexpr std::array and into Vector’s template argument list. (Here we use std::array, not int[], despite my usual advice, because we need something that can be returned from a function.) Godbolt:
template<int N> struct Array { int data[N]; };
template<class> struct Sort;
template<int... Is> struct Sort<Vector<Is...>> {
static constexpr auto sorted = []() {
Array<sizeof...(Is)> arr = {Is...};
std::ranges::sort(arr.data);
return arr;
}();
template<class> struct Helper;
template<int... Indices> struct Helper<Vector<Indices...>> {
using type = Vector<sorted.data[Indices]...>;
};
using type = Helper<Iota<sizeof...(Is)>>::type;
};
Now that sort is called only once, we have a truly \(O(n\log n)\) compile-time sorting operation.

Yesterday’s benchmark is still here; but I’ve updated it to include Chlorie’s proper \(O(n\log n)\) constexpr solution, using both std::sort and std::ranges::sort.

Here’s Clang trunk (the future Clang 19) with libc++, running on my MacBook. The new solutions are the orange and red lines closest to the X-axis. The orange and red lines higher up (unchanged from yesterday’s graph) are yesterday’s std::sort and std::ranges::sort solutions, which do basically \(n\) times as much work as today’s solutions.

Here’s GCC 12.2 with libstdc++, running on RHEL 7.1. There’s still a noticeable compile-time-perf difference between libstdc++’s sort and ranges::sort, but even ranges::sort (in orange) now wins handily against yesterday’s recursive-template selection sort (in black).
The exercises build gradually, from Prepend and Append, to RemoveFirst and RemoveAll, to Sort. The only things I’d have done differently in that sequence are to include PopFront (which is easier) before PopBack (which, unless I’m missing something, is harder); and to include Iota (i.e. std::make_index_sequence) after Length.

Now, one way to implement Sort is to implement a selection sort from scratch by combining Min, RemoveFirst, and Prepend. In fact that’s how Ondřej’s sample solution does it.
template<class> struct Sort;
template<> struct Sort<Vector<>> : TypeIdentity<Vector<>> {};
template<int... Is> struct Sort<Vector<Is...>> {
using Min = Min<Vector<Is...>>;
using Tail = RemoveFirst<Min::value, Vector<Is...>>::type;
using type = Prepend<Min::value, typename Sort<Tail>::type>::type;
};
But since as a Professional Software Engineer™ I have a (possibly irrational) allergy to writing sort algorithms by hand, and since C++20 gives us a constexpr-friendly std::sort, the way that first occurred to me was simply to do the sorting in value-space using std::sort, and then lower the answer back down into type-space. So that’s how I did it (Godbolt):
template<class> struct Sort;
template<int... Is> struct Sort<Vector<Is...>> {
static constexpr int GetNthElement(int i) {
int arr[] = {Is...};
std::ranges::sort(arr);
return arr[i];
}
template<class> struct Helper;
template<int... Indices> struct Helper<Vector<Indices...>> {
using type = Vector<GetNthElement(Indices)...>;
};
using type = Helper<Iota<sizeof...(Is)>>::type;
};
static_assert(std::same_as<Sort<Vector<2,4,1,3>>::type, Vector<1,2,3,4>>);
I don’t know any good way to operate on a whole pack of Is... in value-space and then “return” them into type-space as Vector<Is...>; but I do know the trick above, which is to return each element of the result individually and then glue those \(n\) individual constexpr results back together as Vector<GetNthElement(Indices)...>. This ends up doing the constexpr work \(n\) times instead of just once, so we expect it to be super inefficient: as shown, this is an \(O(n^2 \log n)\) sorting algorithm!

But the name of GetNthElement hints that we can replace std::ranges::sort(arr) with std::ranges::nth_element(arr, arr+i). This makes the whole sorting operation \(O(n^2)\), just like the open-coded selection sort above.

The good way I was missing has been pointed out by alert reader “Chlorie.” Basically it’s to return an array, capture it in a constexpr variable, and then use Vector<arr[Indices]...>. See “Sorting at compile time, part 2” (2024-02-20).
Which compile-time sort is faster? Naïvely, I expect the constexpr version to be faster, because it avoids “recursive templates”; see “Iteration is better than recursion” (2018-07-23). On the other hand, its “gluing individual elements” trick causes it to do the same work \(n\) times in a row, which sounds slow, even if each individual computation is constexpr and therefore fast. But, back on the first hand, we know that both versions have the same asymptotic complexity — even the dumbest version, calling std::ranges::sort \(n\) times, is only off by a factor of \(\log n\) (which, “for most practical purposes, is less than 64”). And we expect the compiler to do better at direct value-space constexpr evaluation than at instantiating \(O(n)\) intermediate class types. So I clearly cannot choose the implementation in front of you!

We also expect the compiler to do better at simple constexpr evaluation than at instantiating \(O(n)\) intermediate class types. But, on the other hand, std::sort and std::nth_element are hardly “simple” functions! Our STL vendor (whether it’s libc++, libstdc++, or Microsoft) will send those functions through several layers of indirection and customization — even a simple operation like dereferencing the raw pointer arr+i might cause the instantiation of std::iter_move and/or std::iter_swap. And if we use std::ranges::sort, then we’re touching everything from std::ranges::random_access_range to decltype(std::ranges::iter_move)::operator(). There’s no world in which std::ranges::sort is less complicated than our simple open-coded selection sort. So I clearly cannot choose the implementation in front of me!
Fortunately, C++ programmers have spent the last few years building up an immunity to long compile times.
To satisfy my curiosity, I wrote a little benchmark for this (here). The benchmark generates several random lists of integers of a given length \(n\) and calls Sort on them, then static_asserts that the output is what we expect. The benchmark script compares selection sort against four minor variations of the constexpr solution: std::sort, std::ranges::sort, std::nth_element, and std::ranges::nth_element. It also compares what happens if Vector<Is...> is a hand-coded empty struct, versus an alias for std::integer_sequence<int, Is...>. We always include the <algorithm> header, even when it’s not needed. We always compile with -O2, since that’s what we’d expect in production.
Here’s Clang trunk (the future Clang 19) with libc++, running on my MacBook.
Here’s GCC 12.2 with libstdc++, running on RHEL 7.1.
I don’t have the ability to run this benchmark on MSVC, but if you do, please send me your results! Get my Python script here.
Observation #1: The constexpr-STL-algorithm version really doesn’t care whether you use std::integer_sequence or Vector. This actually surprised me a little bit, because the constexpr version is the one that uses Iota, and I expected my hand-rolled Iota to be slower than the STL vendor’s make_integer_sequence (which takes advantage of the compiler’s builtin __make_integer_seq). On such small inputs, though, I can see how the speed of Iota is the least of our worries.
Observation #2: Below the two lines for selection sort, we see distinctive curves for \(O(n^2\log n)\) std::sort and \(O(n^2)\) std::nth_element. As predicted, nth_element handily beats sort. This also gives a vendor-independent answer to my biggest question: Is constexpr-evaluated std::sort faster than a hand-metaprogrammed selection sort? Yes, constexpr beats metaprogramming.
Observation #3: On Clang/libc++, the Classic and Ranges algorithms are extremely similar in performance. On GCC/libstdc++, the Ranges algorithms take a huge penalty. At first I chalked this up to how on libc++ the guts of std::sort and std::ranges::sort are literally the same template, parameterized by an _AlgPolicy. (At the LLVM Dev Meeting in October 2023, Konstantin Varlamov gave a 20-minute talk touching on this. Obviously in 20 minutes one can’t go deep into the details; but he did claim that this design is unique to libc++. So this was on my mind already.) However, libstdc++’s std::ranges::sort is also merely a thin wrapper around std::sort. It boils down to this:
It operator()(It first, Sentinel last, Comp comp = {}, Proj proj = {}) const {
auto lasti = std::ranges::next(first, last);
std::sort(first, lasti, detail::make_comparator_by_pasting_together(comp, proj));
return lasti;
}
So if there’s a slowness to instantiating or evaluating this, it must be in the part that creates the comparator by pasting together std::ranges::less and std::identity to come up with a type whose behavior one could summarize as “std::less<int>, but slower.” Both libc++ and libstdc++ need to do that pasting-together, so why should libstdc++ be so much worse?… Well, libc++ optimizes the “99% case” where Proj is std::identity. libstdc++, as far as I can tell as of this writing, does not.

So that’s why libstdc++’s std::ranges::sort(first, last) is vastly slower than their std::sort(first, last) — every call to the comparator turns into three or more calls to std::invoke!

In short, libstdc++’s Ranges algorithms suffer a huge constexpr penalty right now; but maybe they could easily fix that, by adding the same special case we see in libc++. For the time being, if your code uses STL algorithms at constexpr time, all else being equal, you should prefer std over std::ranges. In fact I’d give this advice to ordinary runtime code, too. In non-generic code that doesn’t need Ranges’ arcane features, why pay their cost?
See the followup for a better, faster solution.

My Auto scope-guard macro (“The Auto macro” (2018-08-11)) occasionally attracts feature requests. Usually, I say “No need!” The neat thing about Auto’s particular syntax is that it’s conceptually just a way to defer code to the end of a scope. Feature requests usually take the form of modifying the deferred code in some way — which is already (and more explicitly) allowed simply by… writing the code that way.
Before we look at those “not-a-bug” examples, let’s see the one feature request I’ve actually agreed with and adopted:

“My deferred code might throw, which smacks into the implicit noexcept of the Auto object’s destructor and terminates. That destructor should be noexcept(false)!”
Quite true! Instead of the internal lambda-holder looking like this:
template<class L>
class AtScopeExit {
L& m_lambda;
public:
AtScopeExit(L& action) : m_lambda(action) {}
~AtScopeExit() { m_lambda(); }
};
its destructor should really look like this:
~AtScopeExit() noexcept(false) { m_lambda(); }
That explicit noexcept(false) undoes the implicit noexcept that C++ places on most destructors.

Formally, noexceptness is propagated from the bases and data members into the destructor’s implicit noexcept-spec ([except.spec]/8); in other words, given struct Y { X x; ~Y() {} }, ~Y will have the same noexceptness as ~X even though ~Y() {} is user-provided. This is different from how it works for, say, move constructors, where a user-provided Y(Y&&) {} is implicitly non-noexcept even if X(X&&) is noexcept.

Adding noexcept(false) to the internal type’s destructor allows us to support code like this (Godbolt):
void example1() {
Auto(throw 42);
if (cond)
return; // here 42 is thrown
neverThrows();
// here 42 is thrown
}
Of course, if we’re exiting the scope because of an exception, and then the Auto’s code throws its own exception, the runtime will call std::terminate anyway (because you can’t propagate two exceptions at once). In that case, our addition of noexcept(false) is harmless but not helpful either.

Removing that implicit noexcept from the destructor arguably alters the meaning of this code (Godbolt):
void cleanup();
void test() {
Auto(cleanup());
}
Before, test couldn’t ever propagate an exception; if cleanup threw, it would smack into the destructor’s implicit noexcept and terminate. So there was only one way to exit from test. After, there are two ways to exit, so the behavior isn’t as simple — but GCC 13 generates the same text section for both (pushing the whole difference into the unwind tables), and Clang 17 actually generates smaller code for the new Auto! Anyway, if this matters to you, you can generate even smaller code — identical before and after this change — by declaring void cleanup() noexcept.
Now let’s see some “rejected” feature requests.
“My deferred code might throw, which terminates if there’s already an exception in flight. Therefore, Auto’s lambda should wrap my code in try/catch!”

“My deferred code might throw, which terminates if there’s already an exception in flight. Therefore, Auto’s lambda should wrap my code in an if, so it won’t run if there’s an exception in flight!”
The Auto-world answer is that you’re responsible for what code runs at the end of your scope; if you want a particular control-flow construct, just write it! Like this:
void example2() {
Auto(
try {
mightThrowC();
} catch (...) {}
);
mightThrowA();
mightThrowB();
}
void example3() {
int inflight = std::uncaught_exceptions();
Auto(
if (std::uncaught_exceptions() == inflight) {
mightThrowC();
}
);
mightThrowA();
mightThrowB();
}
But in most cases such code isn’t needed. Auto’s simple definition ensures that “you don’t pay for what you don’t use.”

“I should be able to name the Auto object and say guard.commit() at the end of my transaction; only uncommitted transactions should run the deferred code!”
The Auto-world answer is that conceptually there is no “Auto object”; Auto simulates a control-flow construct, not an object with state. Instead of something like this:
void fantasy4() {
FantasyTransactionGuard g(
puts("Transaction failed");
);
mightThrowA();
if (rand()) return;
mightThrowB();
g.commit(); // now the puts won't run!
}
in Auto-world you’d write the boolean variable explicitly in your code:
void example4(int k, int v) {
bool committed = false;
Auto(
if (!committed) {
puts("Transaction failed");
}
);
mightThrowA();
if (rand()) return;
mightThrowB();
committed = true; // now the puts won't run!
}
“Auto’s lambda captures everything by [&], but I need a version that captures by value instead!”

The Auto-world answer is that there is no “lambda”; Auto simulates a control-flow construct, not an object with state. If you need a copy of some variable i, just make a copy! Instead of something like this:
void fantasy5() {
static int counter = 0;
FantasyAutoCapturingByValue(
printf("finished operation %d\n", counter);
);
counter += rand();
// here the original value is printed
}
in Auto-world you’d write the copy operation explicitly in your code:
void example5() {
static int counter = 0;
int originalCounter = counter;
Auto(
printf("finished operation %d\n", originalCounter);
);
counter += rand();
// here the original value is printed
}
This keeps all the important control flow and data-copying visible in your code, while hiding only the very smallest amount of “magic” behind the macro.
Here’s a partial map of the starting elements’ neighborhood:
The game’s UI is quite different on mobile versus desktop; I like the mobile version much better.
It’s also easy to create your own UI for it, e.g. at the command line using Python requests to hit the neal.fun/api/infinite-craft/pair API endpoint. (Out of respect for his backend, I’ll refrain from posting the exact code. He’s got what looks like a proper rate-limiter, though.)
The cleverest part of the design is that the backend doesn’t go to the LLM every time; that would be expensive and slow. The backend (certainly looks as if it) keeps the entire history of the world in something like memcached. Every time you combine a pair, the backend looks to see if that pair has ever been combined before, and if so, it serves you the cached response. Only completely new pairings are sent to the LLM, and then the response cached for posterity. This means that the same pairing will give the same result every time, for every player. Cutely, this also allows the game to detect if you’ve just created a never-before-seen element! For example, everyone knows that Water plus Fire equals Steam, but maybe you’re the first person ever to see that Brunch plus Sandwich equals Sandbrunch. (Hey, it’s an LLM. They can’t all be winners.) So then the UI can display a little “First Discovery” flair on that element for you. Sadly, as of this writing the flair is shown only on desktop, not on mobile.
Infinite Craft is a sandbox game; but you can give yourself a goal, such as to create Math, or Potato, or Hamlet. (I’ve achieved Hamlet. Math and — surprisingly — Potato remain elusive.) Play competitively by racing to create an element in the shortest time, or with the smallest number of byproducts. Play on one device by taking turns à la pick-up sticks or Jenga: the first player to fail to create a new result on their turn loses. The possibilities, like the combinations, are endless!
Since the number of possible interactions grows quadratically with the number of elements you’ve already found, Infinite Craft reminds me of HyperRogue in that the number of positions reachable in \(n\) steps is counterintuitively large and the task of retracing one’s steps (say, to remember the recipe for Hamlet) is counterintuitively difficult.
Every C++ implementation is required to provide feature-test macros indicating which features it supports. These come in two flavors:
Core-language macros such as __cpp_generic_lambdas, specified in [cpp.predefined]. These are provided by the compiler vendor via the same mechanism as __FUNCTION__ and __GNUC__.

Library macros such as __cpp_lib_ranges_iota, specified in [version.syn]. These are provided by the library vendor via the same mechanism as assert and _GLIBCXX_RELEASE. They all begin with __cpp_lib_. The easiest way to get all of them at once is to #include <version> (since C++20).
Most WG21 proposal papers not only change the Standard in some way but also add a feature-test macro of the appropriate kind. This lets the end-user C++ programmer detect whether their vendor claims to fully implement that feature yet.
I’ve always thought that the primary reason for feature-testing was conditional preprocessing. We can write polyfills like this:
#include <version>
#if __cpp_lib_expected >= 202211L
#include <expected>
#else
#include <tl/expected.hpp>
namespace std {
using tl::expected;
using tl::unexpected;
}
#endif
Or we can conditionally enable some functionality of our own library:
template<class PairLike>
auto extract_x_coord(const PairLike& p) {
#if __cpp_structured_bindings >= 201606L
const auto& [x, y] = p;
return x;
#else
// We can still work for std::pair, at least
return p.first;
#endif
}
This implies that if a paper does some very minor tweak, such that the code to detect-and-work-around the absence of that feature would always cost more than simply avoiding the feature altogether, then the paper doesn’t really need an associated feature-test macro. Semi-example 1: We might think it’s “obvious” that the code above should instead be written like this, so that it’s portable even to compilers without structured bindings. This attitude would reduce the necessity for a feature-test macro:
template<class PairLike>
auto extract_x_coord(const PairLike& p) {
static_assert(std::tuple_size<PairLike>::value == 2,
"p must have exactly two parts");
using std::get;
return get<0>(p);
}
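For what it’s worth, the get<0> formulation really is more general, not just more portable. Here is the same template again, self-contained (with <array> and <tuple> pulled in purely for demonstration): written against the tuple protocol, one definition handles std::pair, std::tuple, std::array, and any user-defined type that provides its own ADL get.

```cpp
#include <array>
#include <tuple>
#include <utility>

// Works for anything that speaks the tuple protocol, no structured
// bindings (and therefore no feature-test macro) required.
template<class PairLike>
auto extract_x_coord(const PairLike& p) {
    static_assert(std::tuple_size<PairLike>::value == 2,
                  "p must have exactly two parts");
    using std::get;   // permits ADL to find a user-defined get(), too
    return get<0>(p);
}
```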
Semi-example 2: We vacillated up to the last minute on whether P2447 “std::span over initializer list” needed a feature-test macro or not, since we couldn’t imagine anybody writing conditional code like this:
void f(std::span<const int>);
int main() {
#if __cpp_lib_span_initializer_list >= 202311L
f({1,2,3});
#else
f({{1,2,3}});
#endif
}
However, in the end we did add that feature-test macro.
The other day, I heard a second plausible use-case for feature-test macros. This new use-case makes it a good idea to add a macro for pretty much every paper, no matter how small. The new idea is that you can write your code “at head,” using whatever features of modern C++ you care to use, with no #ifs at all; and then you can just assert to your build system that all the features you use are in fact implemented by your current platform.
So we simply write:
#include <expected>
#include <span>
template<class PairLike>
auto extract_x_coord(const PairLike& p) {
const auto& [x,y] = p;
return x;
}
void f(std::span<const int>);
int main() {
f({1,2,3});
}
and then in a companion file (which could be a file built only by CMake, or a unit test, or simply a header pulled into our build at any point) we assert the features we expect to be available:
#include <version>
static_assert(__cpp_lib_expected >= 202211L);
static_assert(__cpp_lib_span_initializer_list >= 202311L);
static_assert(__cpp_structured_bindings >= 201606L);
If all these assertions pass, then the platform is “modern enough” for our codebase. If any assertion fails, we simply tell the library-user to upgrade their C++ compiler and we’re done. No fallbacks, no polyfills, just a simple file full of features and version numbers — just like a pip requirements.txt file or the dependencies in a package.json.
In this model, there’s no cost at all to “detecting the absence of a feature,” because we never intend to work around that absence. It’s cheap to add a single line to the “requirements header.” So paper authors should be liberal in adding feature-test macros to their proposals.
Once a feature-test macro has been added to the Standard it can never be removed, by definition. Will the C++11-era system of named feature-test macros eventually collapse under its own weight? Perhaps.
The C++11 system has already buckled in one sense: The original idea was that each macro’s value could just be bumped every time a change was made to that feature. For example, __cpp_constexpr has been bumped eight times as more and more of the core language has been made constexpr-friendly. To take another relatively extreme example, __cpp_lib_ranges started at 201911L, then P2415 owning_view bumped it to 202110L, then P2602 “Poison Pills Are Too Toxic” bumped it to 202211L, and so on (omitting some intermediate bumps). But this system works only if all vendors can be relied on to implement P2415 before P2602, since there’s no value of the macro that would correspond to “I implement P2602 without P2415.” Many Ranges-relevant papers (e.g. P1206 ranges::to) introduce their own macros different from __cpp_lib_ranges, to allow vendors to implement them independently of any other changes to the same header.
Eventually, the good names might all be taken. I can imagine someday simply naming macros after paper numbers, e.g. __cpp_lib_p1234r6; but I think that day is still a long way off.
Why can a captureless lambda have a “deducing this” explicit object parameter of an arbitrary type, but I can’t do the same for a capturing lambda?
auto alpha = [](this std::any a) { return 42; };   // OK
auto beta = [&](this std::any a) { return 42; };   // ill-formed
GCC complains: “a lambda with captures may not have an explicit object parameter of an unrelated type.” (GCC’s diagnostic is — I think properly — SFINAE-friendly. Clang and MSVC — I think improperly — allow you to form the lambda type, and then error on any attempt to call it.)
This restriction exists by the following logic: A capturing lambda has captures, presumably, because it wants to use them. To use its captures, the lambda’s operator() must have access to the lambda object (because its captures are stored as data members of that object). Therefore, the operator() must have an object parameter of the lambda’s own type — or at least a type unambiguously derived from that type!
auto gamma = [x]() { return x + 1; };
is lowered by the compiler into basically
struct Gamma {
int x_;
auto operator()() const { return x_ + 1; }
};
auto gamma = Gamma{x};
It’s able to get at x_ (i.e. this->x_) only because it has a this parameter (an “implicit object parameter”) of type Gamma. Change the lambda to use either of the new C++23 syntaxes which fiddle with that object parameter, and you’ll find trouble. The static specifier removes the object parameter entirely:
auto delta = [x]() static { return x + 1; };
// is ill-formed...
struct Delta {
int x_;
static auto operator()() { return x_ + 1; }
// ...because this is ill-formed!
};
The this specifier fiddles with the object parameter’s type, either by pinning it down to one concrete type, or by making it a template parameter:
auto zeta = [x](this std::any self) { return x + 1; };
// is ill-formed...
struct Zeta {
int x_;
auto operator()(this std::any self) {
return static_cast<Zeta&>(self).x_ + 1;
// ...because this is ill-formed!
}
};
auto eta = [x](this auto self) { return x + 1; };
// can be ill-formed...
struct Eta {
int x_;
auto operator()(this auto self) {
return static_cast<Eta&>(self).x_ + 1;
// ...(roughly) whenever this is ill-formed!
}
};
But the lowering is not the reality!
In the above examples, I was careful to use the name x for the lambda’s capture (in the real C++23 code) and the name x_ for the struct’s data member (in the lowered code). That reminds us that the lowering operation isn’t a program transformation; you can’t blithely assume that everything inside the lambda-expression’s curly braces works exactly the same as it would in an ordinary member function. For example, the keyword this acts “differently” inside a lambda — which is to say, it acts the same as it does outside the lambda. That’s usually what we want.
struct Worker {
void run();
void sync_run_on_this_thread() {
this->run(); // OK
}
void sync_run_on_another_thread() {
std::thread([&]() {
this->run(); // OK
}).join();
}
};
It’s convenient that the two lines marked OK both mean the same thing. Inside a lambda, historically, there’s been no sense in letting this->run() mean “invoke the run method of this lambda object itself,” because that’s never been something you can do with a lambda. Starting in C++23, we can create types derived from lambda types whose operator() (thanks to “deducing this”) can actually get at the derived type. So we can now create puzzles like this…
return [&](this auto self) {
printf("%d %d %d\n", x, this->x, self.x);
};
When embedded at the line marked HERE in this program, the lambda above prints “1 2 3” (Godbolt) — that is, each name x refers to a different object.
int one = 1;
struct Enclosing {
int x = 2;
auto factory(int& x = one) {
// HERE
}
};
using Base = decltype(Enclosing().factory());
struct Derived : Base {
int x = 3;
Derived(Enclosing& e) : Base(e.factory()) {}
};
int main() {
auto e = Enclosing();
auto theta = Derived(e);
theta();
}
Of course, this assumes you find a C++23 compiler capable of compiling this program at all! Readers may recall Knuth’s “man or boy” test for ALGOL 60; this program seems to be a bit like that for C++23 at the moment.
Pack the 12 pentominoes into a 6x10 rectangle, then label each cell of that rectangle with a digit from 1 to 5 such that each pentomino contains all of 1 through 5, and, whenever two adjacent cells belong to different pentominoes, their labels’ sum is a prime number.
“marpocky” pointed out that the puzzle was unsolvable, because of the X pentomino: no matter how you label its legs, you arrive at a contradiction. For example, if two adjacent legs of the X are labeled 3 and 4, then the corner cell touching both legs must be labeled with some \(x\) such that \(x+3\) and \(x+4\) are both prime; this is impossible, since those two sums are consecutive integers and one of them must be an even number greater than 2.
Likewise, adjacent legs cannot be labeled (2,3), (2,5), or (4,5).
Suppose one of the legs of the X is labeled 2; then neither adjacent leg can be either 3 or 5, so they must be 1 and 4 in either order. With 1,2,4 accounted for, the fourth leg (opposite the 2) can’t be either 3 (because (3,4) is verboten) or 5 (because (4,5) is verboten). Ergo, none of the X’s four legs can be labeled 2.
Suppose one of the legs is labeled 3; then neither adjacent leg can be either 2 or 4 (because (2,3) and (3,4) are verboten), so they must be 1 and 5 in either order. With 1,3,5 accounted for, the fourth leg (opposite the 3) can’t be either 2 (because (2,5) is verboten) or 4 (because (4,5) is verboten). Ergo, none of the X’s four legs can be labeled 3.
Without 2 and 3, we’re left with only three labels for the X’s four legs; Q.E.D., the X can’t be labeled with these digits at all.
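This case analysis is easy to confirm mechanically. Here is a small brute-force sketch of my own (not from the thread): it recovers the forbidden pairs, those leg labels (a, b) for which no label x in 1..5 on the shared corner neighbor makes both x+a and x+b prime, and then exhaustively confirms that no assignment of four distinct digits to the X’s legs avoids all of them.

```cpp
#include <algorithm>
#include <set>
#include <utility>

bool isPrime(int n) {
    if (n < 2) return false;
    for (int d = 2; d * d <= n; ++d)
        if (n % d == 0) return false;
    return true;
}

// Pairs (a,b), a<b, such that no label x in 1..5 on the shared corner
// neighbor makes both x+a and x+b prime. Such a pair can never label
// two adjacent legs of the X.
std::set<std::pair<int, int>> forbiddenLegPairs() {
    std::set<std::pair<int, int>> bad;
    for (int a = 1; a <= 5; ++a) {
        for (int b = a + 1; b <= 5; ++b) {
            bool ok = false;
            for (int x = 1; x <= 5; ++x)
                if (isPrime(x + a) && isPrime(x + b)) ok = true;
            if (!ok) bad.insert({a, b});
        }
    }
    return bad;
}

// The X's four legs form a cycle; each cyclically consecutive pair of
// legs shares a corner neighbor. Can the legs get four distinct digits?
bool xPentominoIsLabelable() {
    auto bad = forbiddenLegPairs();
    int legs[4];
    for (legs[0] = 1; legs[0] <= 5; ++legs[0])
    for (legs[1] = 1; legs[1] <= 5; ++legs[1])
    for (legs[2] = 1; legs[2] <= 5; ++legs[2])
    for (legs[3] = 1; legs[3] <= 5; ++legs[3]) {
        bool distinct = true, ok = true;
        for (int i = 0; i < 4; ++i)
            for (int j = i + 1; j < 4; ++j)
                if (legs[i] == legs[j]) distinct = false;
        for (int i = 0; i < 4; ++i) {
            int a = legs[i], b = legs[(i + 1) % 4];
            if (bad.count({std::min(a, b), std::max(a, b)})) ok = false;
        }
        if (distinct && ok) return true;
    }
    return false;
}
```

The forbidden pairs come out to exactly {(2,3), (2,5), (3,4), (4,5)}, and the exhaustive search over leg assignments comes back empty, matching the hand argument above.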
However, “electricmaster23” quickly found a solution to the following variation:
Pack the 12 pentominoes into a 6x10 rectangle, then label each cell of that rectangle with a digit from 0 to 4 such that each pentomino contains all of 0 through 4, and, whenever two adjacent cells belong to different pentominoes, their labels’ sum is a prime number. (Neither 0 nor 1 is prime.)
There are 2339 possible packings to consider (up to rotation and reflection). I didn’t quickly find a listing of all 2339 packings of the 12 pentominoes into a 6x10 rectangle, so I’ve made my own listing, here. It starts with the line
FIIIIILLLLFFFNWWTTTLYFNNXWWTZZYYNXXXWTZVYUNUXPPZZVYUUUPPPVVV
which represents the packing
I wrote a little C program to find all the labelings of these 2339 graphs satisfying the sum-to-a-prime criterion. There are a lot of them! For the packing above, there are 12,347,672 possible labelings. This is the lexicographically first of them:
When we consider not just this one packing but also the other 2338 possible packings, we find that the total number of viable labelings satisfying the sum-to-a-prime criterion is truly astronomical: 20,116,548,805. Here’s the number of labelings found by my computer search (up to rotation and reflection — we ensure the F pentomino isn’t flipped and occupies the left side of the rectangle) where the upper right corner is occupied by:
F | 585,401,512   | U | 2,503,719,527 |
I | 3,674,641,616 | V | 2,290,775,642 |
L | 2,471,381,448 | W | 559,212,338   |
N | 1,182,035,984 | X | —             |
P | 2,405,917,776 | Y | 1,320,223,809 |
T | 2,419,068,152 | Z | 704,171,001   |
There are other labeling criteria we could investigate. Many possible criteria are simply unsolvable, just like the 12345-coloring criterion we started with:
Label with 1,2,3,4,5 such that adjacent cells belonging to different pentominoes always sum to k mod n. (No solutions; consider the X tile.)
Label with 1,2,3,4,5 such that adjacent cells belonging to different pentominoes always differ by at least 3. (No solutions; consider the X tile.)
Label with 1,2,3,4,5 such that adjacent cells belonging to different pentominoes always multiply to at least 6. (No solutions; consider the F tile’s 1-cell.)
But some are even more “interesting” (in the sense of having fewer solutions) than the 01234-coloring criterion. For example:
Label with 1,2,3,4,5 such that adjacent cells belonging to different pentominoes always differ by at most 1. There are 1,059,492,120 labelings that satisfy this criterion. 517 of the 2339 possible packings have no viable labeling at all. The packing with the fewest viable labelings (at 64) is:
Label with 1,2,3,4,5 such that adjacent cells belonging to different pentominoes always differ by at least 2. There are 101,275,328 labelings that satisfy this criterion. 91 of the 2339 possible packings have no viable labeling at all. The packing with the fewest viable labelings (at 48) is:
Label with 0,1,2,3,4 such that adjacent cells belonging to different pentominoes always sum to a power of two. (0 is not a power of two.) There are only 240 labelings that satisfy this criterion. 2331 of the 2339 possible packings have no viable labeling at all. Here’s a representative labeling from each viable packing:
In fact, we can completely abstract the criterion-construction part: Label with \(a_1, a_2, a_3, a_4, a_5\) such that the labels \((a_i, a_j)\) of adjacent cells belonging to different pentominoes satisfy \(R(a_i, a_j)\), for a given symmetric relation \(R\). There are only 544 possible values for \(R\) (that’s OEIS A000666), so we could just run through them and see if any of them produce a thrilling puzzle.
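That count of 544 can be double-checked by brute force. A sketch, assuming a symmetric relation on five labels is a symmetric 5x5 0/1 matrix (with loops, i.e. \(R(a_i, a_i)\), allowed) counted up to relabeling: there are only \(2^{15}\) such matrices, so we can canonicalize each one over all 120 permutations of the labels and count distinct canonical forms.

```cpp
#include <algorithm>
#include <array>
#include <set>

// Count symmetric relations on a 5-element set, up to relabeling.
// A symmetric 5x5 0/1 matrix with loops allowed has 15 free bits
// (the entries with i <= j); two relations are the same puzzle if
// some permutation of the labels carries one to the other.
int countSymmetricRelationsOn5() {
    std::set<int> canonical;
    for (int bits = 0; bits < (1 << 15); ++bits) {
        // Unpack the 15 bits into a symmetric matrix m.
        bool m[5][5] = {};
        int k = 0;
        for (int i = 0; i < 5; ++i)
            for (int j = i; j < 5; ++j)
                m[i][j] = m[j][i] = ((bits >> k++) & 1) != 0;
        // Canonical form: the minimum encoding over all 120 relabelings.
        int best = 1 << 15;
        std::array<int, 5> p = {0, 1, 2, 3, 4};
        do {
            int enc = 0, t = 0;
            for (int i = 0; i < 5; ++i)
                for (int j = i; j < 5; ++j, ++t)
                    if (m[p[i]][p[j]]) enc |= 1 << t;
            best = std::min(best, enc);
        } while (std::next_permutation(p.begin(), p.end()));
        canonical.insert(best);
    }
    return (int)canonical.size();
}
```

Each distinct canonical form is one essentially different relation \(R\), i.e. one essentially different puzzle criterion.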
Here’s Python code to generate those 544 relations. But to exhaustively explore all 544 puzzles corresponding to those 544 relations, you’ll need a smarter algorithm and/or a faster computer than I’ve got!
Looking at these pictures, it strikes me that the “right” presentation for this kind of puzzle would be to give the pentominoes cut free of the grid, with their cells still labeled; then the puzzle-solver’s task would be to fit them back into the grid such that the criterion was satisfied.
Can you fit these pentominoes back into a 6x10 grid such that adjacent cells belonging to different pentominoes always differ by at least 2? (Some pieces must be rotated, none reflected.)
Now, a really clever puzzle-constructor would present a single set of pieces that could be put back together in two different ways — one way satisfying the “at least 2” criterion and another way satisfying the “at most 1” criterion. I’m afraid I don’t have nearly that much patience.