I am implementing a function-like procedural macro which takes a single string literal as an argument, but I don't know how to get the value of the string literal. If I print the variable, it shows a bunch of fields, which includes both the type and the value. They are clearly there, somewhere. How do I get them? <pre class="prettyprint"><code>extern crate proc_macro; use proc_macro::{TokenStream,TokenTree}; #[proc_macro] pub fn my_macro(input: TokenStream) -> TokenStream { let input: Vec<TokenTree> = input.into_iter().collect(); let literal = match &input.get(0) { Some(TokenTree::Literal(literal)) => literal, _ => panic!() }; // can't do anything with "literal" // println!("{:?}", literal.lit.symbol); says "unknown field" format!("{:?}", format!("{:?}", literal)).parse().unwrap() } </code></pre> <pre class="prettyprint"><code>#![feature(proc_macro_hygiene)] extern crate macros; fn main() { let value = macros::my_macro!("hahaha"); println!("it is {}", value); // prints "it is Literal { lit: Lit { kind: Str, symbol: "hahaha", suffix: None }, span: Span { lo: BytePos(100), hi: BytePos(108), ctxt: #0 } }" } </code></pre>

After running into the same problem countless times already, I finally wrote a library to help with this: <code>litrs</code> on crates.io. It compiles faster than <code>syn</code> and lets you inspect your literals. <pre class="prettyprint"><code>use std::convert::TryFrom; use litrs::StringLit; use proc_macro::TokenStream; use quote::quote; #[proc_macro] pub fn my_macro(input: TokenStream) -> TokenStream { let input = input.into_iter().collect::<Vec<_>>(); if input.len() != 1 { let msg = format!("expected exactly one input token, got {}", input.len()); return quote! { compile_error!(#msg) }.into(); } let string_lit = match StringLit::try_from(&input[0]) { // Error if the token is not a string literal Err(e) => return e.to_compile_error(), Ok(lit) => lit, }; // `StringLit::value` returns the actual string value represented by the // literal. Quotes are removed and escape sequences replaced with the // corresponding value. let v = string_lit.value(); // TODO: implement your logic here } </code></pre> See the documentation of <code>litrs</code> for more information. <hr> To obtain more information about a literal, <code>litrs</code> uses the <code>Display</code> impl of <code>Literal</code> to obtain a string representation (as it would be written in source code) and then parses that string. For example, if the string starts with <code>0x</code> one knows it has to be an integer literal, if it starts with <code>r#"</code> one knows it is a raw string literal. The crate <code>syn</code> does exactly the same. Of course, it seems a bit wasteful to write and run a second parser given that rustc already parsed the literal. Yes, that's unfortunate and having a better API in <code>proc_literal</code> would be preferable. But right now, I think <code>litrs</code> (or <code>syn</code> if you are using <code>syn</code> anyway) are the best solutions. <hr> (PS: I'm usually not a fan of promoting one's own libraries on Stack Overflow, but I am very familiar with the problem OP is having and I very much think <code>litrs</code> is the best tool for the job right now.)

If you're writing procedural macros, I'd recommend that you look into using the crates <code>syn</code> (for parsing) and <code>quote</code> (for code generation) instead of using <code>proc-macro</code> directly, since those are generally easier to deal with. In this case, you can use <code>syn::parse_macro_input</code> to parse a token stream into any syntatic element of Rust (such as literals, expressions, functions), and will also take care of error messages in case parsing fails. You can use <code>LitStr</code> to represent a string literal, if that's exactly what you need. The <code>.value()</code> function will give you a <code>String</code> with the contents of that literal. You can use <code>quote::quote</code> to generate the output of the macro, and use <code>#</code> to insert the contents of a variable into the generated code. <pre class="prettyprint lang-rust prettyprint-override"><code>use proc_macro::TokenStream; use syn::{parse_macro_input, LitStr}; use quote::quote; #[proc_macro] pub fn my_macro(input: TokenStream) -> TokenStream { // macro input must be `LitStr`, which is a string literal. // if not, a relevant error message will be generated. let input = parse_macro_input!(input as LitStr); // get value of the string literal. let str_value = input.value(); // do something with value... let str_value = str_value.to_uppercase(); // generate code, include `str_value` variable (automatically encodes // `String` as a string literal in the generated code) (quote!{ #str_value }).into() } </code></pre>

How do I get the value and type of a Literal in a procedural macro?

Tags:

rust

rust-proc-macros

I am implementing a function-like procedural macro which takes a single string literal as an argument, but I don't know how to get the value of the string literal.

If I print the variable, it shows a bunch of fields, which includes both the type and the value. They are clearly there, somewhere. How do I get them?

extern crate proc_macro;
use proc_macro::{TokenStream,TokenTree};

#[proc_macro]
pub fn my_macro(input: TokenStream) -> TokenStream {
    let input: Vec<TokenTree> = input.into_iter().collect();
    let literal = match &input.get(0) {
        Some(TokenTree::Literal(literal)) => literal,
        _ => panic!()
    };

    // can't do anything with "literal"
    // println!("{:?}", literal.lit.symbol); says "unknown field"

    format!("{:?}", format!("{:?}", literal)).parse().unwrap()
}

#![feature(proc_macro_hygiene)]
extern crate macros;

fn main() {
    let value = macros::my_macro!("hahaha");
    println!("it is {}", value);
    // prints "it is Literal { lit: Lit { kind: Str, symbol: "hahaha", suffix: None }, span: Span { lo: BytePos(100), hi: BytePos(108), ctxt: #0 } }"
}

617

asked Apr 12 '20 10:04

Pablo Tato Ramos

2 Answers

After running into the same problem countless times already, I finally wrote a library to help with this: litrs on crates.io. It compiles faster than syn and lets you inspect your literals.

use std::convert::TryFrom;
use litrs::StringLit;
use proc_macro::TokenStream;
use quote::quote;


#[proc_macro]
pub fn my_macro(input: TokenStream) -> TokenStream {
    let input = input.into_iter().collect::<Vec<_>>();
    if input.len() != 1 {
        let msg = format!("expected exactly one input token, got {}", input.len());
        return quote! { compile_error!(#msg) }.into();
    }

    let string_lit = match StringLit::try_from(&input[0]) {
        // Error if the token is not a string literal
        Err(e) => return e.to_compile_error(),
        Ok(lit) => lit,
    };

    // `StringLit::value` returns the actual string value represented by the
    // literal. Quotes are removed and escape sequences replaced with the
    // corresponding value.
    let v = string_lit.value();

    // TODO: implement your logic here
}

See the documentation of litrs for more information.

To obtain more information about a literal, litrs uses the Display impl of Literal to obtain a string representation (as it would be written in source code) and then parses that string. For example, if the string starts with 0x one knows it has to be an integer literal, if it starts with r#" one knows it is a raw string literal. The crate syn does exactly the same.

Of course, it seems a bit wasteful to write and run a second parser given that rustc already parsed the literal. Yes, that's unfortunate and having a better API in proc_literal would be preferable. But right now, I think litrs (or syn if you are using syn anyway) are the best solutions.

(PS: I'm usually not a fan of promoting one's own libraries on Stack Overflow, but I am very familiar with the problem OP is having and I very much think litrs is the best tool for the job right now.)

162

answered Oct 26 '22 14:10

Lukas Kalbertodt

If you're writing procedural macros, I'd recommend that you look into using the crates syn (for parsing) and quote (for code generation) instead of using proc-macro directly, since those are generally easier to deal with.

In this case, you can use syn::parse_macro_input to parse a token stream into any syntatic element of Rust (such as literals, expressions, functions), and will also take care of error messages in case parsing fails.

You can use LitStr to represent a string literal, if that's exactly what you need. The .value() function will give you a String with the contents of that literal.

You can use quote::quote to generate the output of the macro, and use # to insert the contents of a variable into the generated code.

use proc_macro::TokenStream;
use syn::{parse_macro_input, LitStr};
use quote::quote;

#[proc_macro]
pub fn my_macro(input: TokenStream) -> TokenStream {
    // macro input must be `LitStr`, which is a string literal.
    // if not, a relevant error message will be generated.
    let input = parse_macro_input!(input as LitStr);

    // get value of the string literal.
    let str_value = input.value();

    // do something with value...
    let str_value = str_value.to_uppercase();

    // generate code, include `str_value` variable (automatically encodes
    // `String` as a string literal in the generated code)
    (quote!{
        #str_value
    }).into()
}

answered Oct 26 '22 13:10

Frxstrem

Related questions
                            
                                Couldn't start client Rust Language Server
                            
                                What exactly is a Rust "toolchain"?
                            
                                Why can't None be cloned for a generic Option<T> when T doesn't implement Clone?
                            
                                What does `impl ... for` mean?
                            
                                Can I determine the zero value of generic types?
                            
                                How to check if function pointer passed from C is non-NULL
                            
                                How do I free a *char allocated via FFI in Rust?
                            
                                Return value from match to Err(e)
                            
                                Default generic parameter
                            
                                How do I get a substring between two patterns in Rust?
                            
                                How do I iterate over elements of a struct in Rust?
                            
                                Why do Rust's operators have the type Output variable? [duplicate]
                            
                                Why can't I mutably borrow a primitive from an enum?
                            
                                Why can I use Ok and Err directly without the Result:: prefix?
                            
                                Why does the compiler not complain that an iterator moved to a for loop is immutable?
                            
                                How do I multiply an integer and a floating value together and display the result as a floating value in Rust? [duplicate]
                            
                                Why does a &str not coerce to a &String when using Vec::contains?
                            
                                Is there a way to use existing structs as enum variants?
                            
                                Why is it common to use dynamic errors in rust, and not enums? Is it bad/not possible to use compile-time variants?
                            
                                Unable to stop my Docker container with Ctrl-C

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With