Let's say I have these two overloads: <pre class="prettyprint"><code>void Log(const wchar_t* message) { // Do something } void Log(const std::wstring& message) { // Do something } </code></pre> Can I then in the first function add some compile-time verifiction that the passed argument is a string literal? EDIT: A clarification on why this would be good in my case; my current high-frequency logging uses only string literals and can hence be optimized a lot when there are non-heap allocation guarantees. The second overload doesn't exist today, but I might want to add it, but then I want to keep the first one for extreme scenarios. :)

You can't detect string literals directly but you can detect if the argument is an array of characters which is pretty close. However, you can't do it from the inside, you need to do it from the outside: <pre class="prettyprint"><code>template <std::size_t Size> void Log(wchar_t const (&message)[Size]) { // the message is probably a string literal Log(static_cast<wchar_t const*>(message); } </code></pre> The above function will take care of wide string literals and arrays of wide characters: <pre class="prettyprint"><code>Log(L"literal as demanded"); wchar_t non_literal[] = { "this is not a literal" }; Log(non_literal); // will still call the array version </code></pre> Note that the information about the string being a literal isn't as useful as one might hope for. I frequently think that the information could be used to avoid computing the string length but, unfortunately, string literals can still embed null characters which messes up static deduction of the string length.

So this grew out of Keith Thompson's answer... As far as I know, you can't restrict string literals to only normal functions, but you can do it to macro functions (through a trick). <pre class="prettyprint"><code>#include <iostream> #define LOG(arg) Log(L"" arg) void Log(const wchar_t *message) { std::wcout << "Log: " << message << "\n"; } int main() { const wchar_t *s = L"Not this message"; LOG(L"hello world"); // works LOG(s); // terrible looking compiler error } </code></pre> Basically, a compiler will convert <code>"abc" "def"</code> to look exactly like <code>"abcdef"</code>. And likewise, it will convert <code>"" "abc"</code> to <code>"abc"</code>. You can use this to your benefit in this case. <hr> I also saw this comment on the C++ Lounge, and that gave me another idea of how to do this, which gives a cleaner error message: <pre class="prettyprint"><code>#define LOG(arg) do { static_assert(true, arg); Log(arg); } while (false) </code></pre> Here, we use the fact that static_assert requires a string literal as it's second argument. The error that we get if we pass a variable instead is quite nice as well: <pre class="prettyprint"><code>foo.cc:12:9: error: expected string literal LOG(s); ^ foo.cc:3:43: note: expanded from macro 'LOG' #define LOG(arg) do { static_assert(true, arg); Log(arg); } while (false) </code></pre>

I believe the answer to your question is no -- but here's a way to do something similar. Define a macro, and use the <code>#</code> "stringification" operator to guarantee that only a string literal will be passed to the function (unless somebody bypasses the macro and calls the function directly). For example: <pre class="prettyprint"><code>#include <iostream> #define LOG(arg) Log(#arg) void Log(const char *message) { std::cout << "Log: " << message << "\n"; } int main() { const char *s = "Not this message"; LOG("hello world"); LOG(hello world); LOG(s); } </code></pre> The output is: <pre class="prettyprint"><code>Log: "hello world" Log: hello world Log: s </code></pre> The attempt to pass <code>s</code> to <code>LOG()</code> did not trigger a compile-time diagnostic, but it didn't pass that pointer to the <code>Log</code> function. There are at least two disadvantages to this approach. One is that it's easily bypassed; you may be able to avoid that by searching the source code for references to the actual function name. The other is that stringifying a string literal doesn't just give you the same string literal; the stringified version of <code>"hello, world"</code> is <code>"\"hello, world\""</code>. I suppose your <code>Log</code> function could strip out any <code>"</code> characters in the passed string. You may also want to handle backslash escapes; for example, <code>"\n"</code> (a 1-character string containing a newline) is stringified as <code>"\\n"</code> (a 2-character string containing a backslash and the letter <code>n</code>). But I think a better approach is not to rely on the compiler to diagnose calls with arguments other than string literals. Just use some other tool to scan the source code for calls to your <code>Log</code> function and report any calls where the first argument isn't a string literal. If you can enforce a particular layout for the calls (for example, the tokens <code>Log</code>, <code>(</code>, and a string literal on the same line), that shouldn't be too difficult.

Can a compilation error be forced if a string argument is not a string literal?

Tags:

c++

templates

Let's say I have these two overloads:

void Log(const wchar_t* message)
{
    // Do something
}

void Log(const std::wstring& message)
{
    // Do something
}

Can I then in the first function add some compile-time verifiction that the passed argument is a string literal?

EDIT: A clarification on why this would be good in my case; my current high-frequency logging uses only string literals and can hence be optimized a lot when there are non-heap allocation guarantees. The second overload doesn't exist today, but I might want to add it, but then I want to keep the first one for extreme scenarios. :)

331

asked Sep 01 '13 22:09

Johann Gerell

3 Answers

You can't detect string literals directly but you can detect if the argument is an array of characters which is pretty close. However, you can't do it from the inside, you need to do it from the outside:

template <std::size_t Size>
void Log(wchar_t const (&message)[Size]) {
    // the message is probably a string literal
    Log(static_cast<wchar_t const*>(message);
}

The above function will take care of wide string literals and arrays of wide characters:

Log(L"literal as demanded");
wchar_t non_literal[] = { "this is not a literal" };
Log(non_literal); // will still call the array version

Note that the information about the string being a literal isn't as useful as one might hope for. I frequently think that the information could be used to avoid computing the string length but, unfortunately, string literals can still embed null characters which messes up static deduction of the string length.

180

answered Oct 22 '22 14:10

Dietmar Kühl

So this grew out of Keith Thompson's answer... As far as I know, you can't restrict string literals to only normal functions, but you can do it to macro functions (through a trick).

#include <iostream>
#define LOG(arg) Log(L"" arg)

void Log(const wchar_t *message) {
    std::wcout << "Log: " << message << "\n";
}

int main() {
    const wchar_t *s = L"Not this message";
    LOG(L"hello world");  // works
    LOG(s);               // terrible looking compiler error
}

Basically, a compiler will convert "abc" "def" to look exactly like "abcdef". And likewise, it will convert "" "abc" to "abc". You can use this to your benefit in this case.

I also saw this comment on the C++ Lounge, and that gave me another idea of how to do this, which gives a cleaner error message:

#define LOG(arg) do { static_assert(true, arg); Log(arg); } while (false)

Here, we use the fact that static_assert requires a string literal as it's second argument. The error that we get if we pass a variable instead is quite nice as well:

foo.cc:12:9: error: expected string literal
    LOG(s);
        ^
foo.cc:3:43: note: expanded from macro 'LOG'
#define LOG(arg) do { static_assert(true, arg); Log(arg); } while (false)

answered Oct 22 '22 15:10

Bill Lynch

I believe the answer to your question is no -- but here's a way to do something similar.

Define a macro, and use the # "stringification" operator to guarantee that only a string literal will be passed to the function (unless somebody bypasses the macro and calls the function directly). For example:

#include <iostream>

#define LOG(arg) Log(#arg)

void Log(const char *message) {
    std::cout << "Log: " << message << "\n";
}

int main() {
    const char *s = "Not this message";
    LOG("hello world");
    LOG(hello world);
    LOG(s);
}

The output is:

Log: "hello world"
Log: hello world
Log: s

The attempt to pass s to LOG() did not trigger a compile-time diagnostic, but it didn't pass that pointer to the Log function.

There are at least two disadvantages to this approach.

One is that it's easily bypassed; you may be able to avoid that by searching the source code for references to the actual function name.

The other is that stringifying a string literal doesn't just give you the same string literal; the stringified version of "hello, world" is "\"hello, world\"". I suppose your Log function could strip out any " characters in the passed string. You may also want to handle backslash escapes; for example, "\n" (a 1-character string containing a newline) is stringified as "\\n" (a 2-character string containing a backslash and the letter n).

But I think a better approach is not to rely on the compiler to diagnose calls with arguments other than string literals. Just use some other tool to scan the source code for calls to your Log function and report any calls where the first argument isn't a string literal. If you can enforce a particular layout for the calls (for example, the tokens Log, (, and a string literal on the same line), that shouldn't be too difficult.

answered Oct 22 '22 13:10

Keith Thompson

Related questions
                            
                                Convert array to set in C++
                            
                                keyword "auto" C++ and "dynamic" C#
                            
                                Default size of std::vector / Programming books myth?
                            
                                What is the result of comparing a number with NaN?
                            
                                How to clear vector in C++ from memory [duplicate]
                            
                                Erasing() an element in a vector doesn't work
                            
                                Why doesn't C++ have a pointer to member function type?
                            
                                when should a member function be both const and volatile together?
                            
                                C++ Class Extension
                            
                                java.util.concurrent vs. Boost Threads library
                            
                                Dynamic memory allocation question
                            
                                Iterating over 2-dimensional STL vector c++
                            
                                How to use cv::imdecode, if the contents of an image file are in a char array?
                            
                                Understanding Iterators in the STL
                            
                                Win32 - Get Main Wnd Handle of application
                            
                                generic way to print out variable name in c++
                            
                                How to check whether a class has specified nested class definition or typedef in C++ 11?
                            
                                Q_ASSERT release build semantics
                            
                                c++ std::vector search for value
                            
                                How can i get the top n keys of std::map based on their values?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With