std.format

This package provides string formatting functionality using printf-style format strings.

Submodule  Function Name    Description
package    format           Converts its arguments according to a format string into a string.
package    sformat          Converts its arguments according to a format string into a buffer.
package    FormatException  Signals a problem while formatting.
write      formattedWrite   Converts its arguments according to a format string and writes the result to an output range.
write      formatValue      Formats a value of any type according to a format specifier and writes the result to an output range.
read       formattedRead    Reads an input range according to a format string and stores the read values into its arguments.
read       unformatValue    Reads a value from the given input range and converts it according to a format specifier.
spec       FormatSpec       A general handler for format strings.
spec       singleSpec       Helper function that returns a FormatSpec for a single format specifier.
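
As a rough orientation, the sketch below calls one function from each of the package, write and read groups listed above. The sample strings and variable names are illustrative assumptions only.

```d
import std.array : appender;
import std.format : format, sformat, formattedWrite, formattedRead;

void main()
{
    // format: returns a newly allocated string.
    assert(format("%s is %d years old", "Ann", 30) == "Ann is 30 years old");

    // sformat: writes into a caller-supplied buffer and returns the used slice.
    char[32] buf;
    assert(sformat(buf[], "%d apples", 3) == "3 apples");

    // formattedWrite: appends the result to an output range.
    auto w = appender!string();
    w.formattedWrite("%s-%s", 1, 2);
    assert(w.data == "1-2");

    // formattedRead: parses an input range and returns the number of values read.
    string input = "42 3.5";
    int i;
    double d;
    auto count = input.formattedRead("%d %f", i, d);
    assert(count == 2 && i == 42 && d == 3.5);
}
```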

Limitation: This package does not support localization, but adheres to the rounding mode of the floating point unit, if available.

Format Strings

The functions contained in this package use format strings. A format string describes the layout of another string for reading or writing purposes. A format string is composed of normal text interspersed with format specifiers. A format specifier starts with a percentage sign '%', optionally followed by one or more parameters and ends with a format indicator. A format indicator may be a simple format character or a compound indicator.

Format strings are composed according to the following grammar:

FormatString:
    FormatStringItem FormatString
FormatStringItem:
    Character
    FormatSpecifier
FormatSpecifier:
    '%' Parameters FormatIndicator

FormatIndicator:
    FormatCharacter
    CompoundIndicator
FormatCharacter:
    see remark below
CompoundIndicator:
    '(' FormatString '%)'
    '(' FormatString '%|' Delimiter '%)'
Delimiter:
    empty
    Character Delimiter

Parameters:
    Position Flags Width Precision Separator
Position:
    empty
    Integer '$'
    Integer ':' Integer '$'
    Integer ':' '$'
Flags:
    empty
    Flag Flags
Flag:
    '-'|'+'|' '|'0'|'#'|'='
Width:
    OptionalPositionalInteger
Precision:
    empty
    '.' OptionalPositionalInteger
Separator:
    empty
    ',' OptionalInteger
    ',' OptionalInteger '?'
OptionalInteger:
    empty
    Integer
    '*'
OptionalPositionalInteger:
    OptionalInteger
    '*' Integer '$'

Character:
    '%%'
    AnyCharacterExceptPercent
Integer:
    NonZeroDigit Digits
Digits:
    empty
    Digit Digits
NonZeroDigit:
    '1'|'2'|'3'|'4'|'5'|'6'|'7'|'8'|'9'
Digit:
    '0'|'1'|'2'|'3'|'4'|'5'|'6'|'7'|'8'|'9'

Note: FormatCharacter is unspecified. It can be any character that has no other purpose in this grammar, but it is recommended to assign (lower- and uppercase) letters.

Note: The Parameters of a CompoundIndicator are currently limited to a '-' flag.
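
To make the grammar concrete, the following sketch parses a single specifier with singleSpec and inspects the resulting FormatSpec. The specifier "%-10.3e" and the variable names are chosen for illustration only.

```d
import std.array : appender;
import std.format : singleSpec, formatValue;

void main()
{
    // "%-10.3e" = '%' Flags('-') Width(10) Precision('.' 3) FormatCharacter('e')
    auto fs = singleSpec("%-10.3e");
    assert(fs.flDash);          // the '-' flag
    assert(fs.width == 10);     // Width
    assert(fs.precision == 3);  // Precision
    assert(fs.spec == 'e');     // FormatCharacter

    // Apply the parsed specifier to a value.
    auto w = appender!string();
    w.formatValue(42.0, fs);
    assert(w.data == "4.200e+01 ");
}
```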

Format Indicator

The format indicator can either be a single character or an expression surrounded by '%(' and '%)'. It specifies the basic manner in which a value will be formatted and is the minimum requirement to format a value.

The following characters can be used as format characters:

FormatCharacter  Semantics
's'              To be formatted in a human readable format. Can be used with all types.
'c'              To be formatted as a character.
'd'              To be formatted as a signed decimal integer.
'u'              To be formatted as a decimal image of the underlying bit representation.
'b'              To be formatted as a binary image of the underlying bit representation.
'o'              To be formatted as an octal image of the underlying bit representation.
'x' / 'X'        To be formatted as a hexadecimal image of the underlying bit representation.
'e' / 'E'        To be formatted as a real number in decimal scientific notation.
'f' / 'F'        To be formatted as a real number in decimal natural notation.
'g' / 'G'        To be formatted as a real number in decimal short notation. Depending on the number, a scientific notation or a natural notation is used.
'a' / 'A'        To be formatted as a real number in hexadecimal scientific notation.
'r'              To be formatted as raw bytes. The output may not be printable and depends on endianness.
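
The format characters above can be tried out with a small sketch like the following. The input values are arbitrary, and the 'u' line assumes a 32-bit int argument.

```d
import std.format : format;

void main()
{
    assert(format("%s", 42) == "42");
    assert(format("%c", 'A') == "A");
    assert(format("%d", -42) == "-42");
    assert(format("%u", -1) == "4294967295"); // bit pattern of a 32-bit int
    assert(format("%b", 10) == "1010");
    assert(format("%o", 8) == "10");
    assert(format("%x", 255) == "ff");
    assert(format("%X", 255) == "FF");
    assert(format("%e", 1234.5) == "1.234500e+03");
    assert(format("%f", 1234.5) == "1234.500000");
    assert(format("%g", 1234.5) == "1234.5");   // natural notation
    assert(format("%g", 0.00001) == "1e-05");   // scientific notation
    assert(format("%a", 1.0) == "0x1p+0");
}
```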

The compound indicator can be used to describe compound types like arrays or structs in more detail. A compound type is enclosed within '%(' and '%)'. The enclosed sub-format string is applied to individual elements. The trailing portion of the sub-format string following the specifier for the element is interpreted as the delimiter, and is therefore omitted following the last element. The '%|' specifier may be used to explicitly indicate the start of the delimiter, so that the preceding portion of the string will be included following the last element.

The format string inside of the compound indicator should contain exactly one format specifier (two in case of associative arrays), which specifies the formatting mode of the elements of the compound type. This format specifier can be a compound indicator itself.

Note: Inside a compound indicator, strings and characters are escaped automatically. To avoid this behavior, use "%-(" instead of "%(".
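
A minimal sketch of the one-specifier and two-specifier cases follows. The associative array deliberately holds a single entry so that the result does not depend on iteration order; all values are illustrative.

```d
import std.format : format;

void main()
{
    // One specifier for array elements; ", " acts as the delimiter.
    assert(format("%(%02d, %)", [1, 2, 3]) == "01, 02, 03");

    // Two specifiers for an associative array: key first, then value.
    assert(format("%(%d squared is %d%)", [7: 49]) == "7 squared is 49");

    // Characters are quoted inside a compound indicator unless '-' is given.
    assert(format("%(%s%)", "ab") == "'a''b'");
    assert(format("%-(%s%)", "ab") == "ab");
}
```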

Flags

There are several flags that affect the outcome of the formatting.

Flag        Semantics
'-'         When the formatted result is shorter than the value given by the width parameter, the output is left justified. Without the '-' flag, the output remains right justified.
            There are two exceptions where the '-' flag has a different meaning: (1) with 'r' it selects little endian output and (2) in case of a compound indicator it means that no special handling of the members is applied.
'='         When the formatted result is shorter than the value given by the width parameter, the output is centered. If the central position is not possible it is moved slightly to the right. In this case, if the '-' flag is present in addition to the '=' flag, it is moved slightly to the left.
'+' / ' '   Applies to numerical values. By default, positive numbers are not formatted to include the + sign. With one of these two flags present, positive numbers are preceded by a plus sign or a space. When both flags are present, a plus sign is used.
            In case of 'r', a big endian format is used.
'0'         Is applied to numerical values that are printed right justified. If the zero flag is present, the space left to the number is filled with zeros instead of spaces.
'#'         Denotes that an alternative output must be used. This depends on the type to be formatted and the format character used. See the sections below for more information.
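
The effect of the most common flags can be sketched as follows. The '|' characters are only there to make the padding visible.

```d
import std.format : format;

void main()
{
    assert(format("|%5d|", 42) == "|   42|");   // default: right justified
    assert(format("|%-5d|", 42) == "|42   |");  // '-': left justified
    assert(format("|%05d|", 42) == "|00042|");  // '0': pad with zeros
    assert(format("%+d", 42) == "+42");         // '+': explicit plus sign
    assert(format("% d", 42) == " 42");         // ' ': space instead of a sign
    assert(format("%+ d", 42) == "+42");        // both: plus sign wins
    assert(format("%#x", 255) == "0xff");       // '#': alternative output
    assert(format("%#o", 8) == "010");
    assert(format("%#x", 0) == "0");            // no prefix for the value 0
}
```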

Width, Precision and Separator

The width parameter specifies the minimum width of the result.

The meaning of precision depends on the format indicator. For integers it denotes the minimum number of digits printed, for real numbers it denotes the number of fractional digits and for strings and compound types it denotes the maximum number of elements that are included in the output.

A separator is used for formatting numbers. If it is specified, the output is divided into chunks of three digits, separated by a ','. The number of digits in a chunk can be given explicitly by providing a number or a '*' after the ','.

In all three cases the number of digits can be replaced by a '*'. In this scenario, the next argument is used as the number of digits. If the argument is a negative number, the precision and separator parameters are considered unspecified. For width, the absolute value is used and the '-' flag is set.

The separator can also be followed by a '?'. In that case, an additional argument is used to specify the symbol that should be used to separate the chunks.
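
The following sketch illustrates how precision depends on the kind of value and how negative '*' arguments are treated. The values are arbitrary examples.

```d
import std.format : format;

void main()
{
    // Precision: minimum digits (integers), fractional digits (reals),
    // maximum number of elements (strings).
    assert(format("%.5d", 42) == "00042");
    assert(format("%.2f", 3.14159) == "3.14");
    assert(format("%.3s", "abcdef") == "abc");

    // Separator: chunks of three digits by default, or an explicit chunk size.
    assert(format("%,d", 1234567) == "1,234,567");
    assert(format("%,2d", 1234567) == "1,23,45,67");

    // A negative '*' width sets the '-' flag and uses the absolute value.
    assert(format("|%*d|", -5, 42) == "|42   |");

    // A negative '*' precision counts as unspecified (default of 6 applies).
    assert(format("%.*f", -1, 1.5) == "1.500000");
}
```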

Position

By default, the arguments are processed in the provided order. With the position parameter it is possible to address arguments directly. It is also possible to denote a series of arguments with two numbers separated by ':', that are all processed in the same way. The second number can be omitted. In that case the series ends with the last argument.

It's also possible to use positional arguments for width, precision and separator by adding a number and a '$' after the '*'.
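
A short sketch of positional addressing, including a positional width, under the assumption that every argument is referenced at least once:

```d
import std.format : format;

void main()
{
    // Address arguments directly, in any order.
    assert(format("%2$s %1$s", "world", "hello") == "hello world");

    // The same argument may be used more than once.
    assert(format("%1$s %1$s", "echo") == "echo echo");

    // Positional width: format argument 2 with the width taken from argument 1.
    assert(format("|%2$*1$d|", 5, 42) == "|   42|");
}
```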

Types

This section describes the result of combining types with format characters. It is organized in 2 subsections: a list of general information regarding the formatting of types in the presence of format characters and a table that contains details for every available combination of type and format character.

When formatting types, the following rules apply:

  • If the format character is upper case, the resulting string will be formatted using upper case letters.
  • The default precision for floating point numbers is 6 digits.
  • Rounding of floating point numbers adheres to the rounding mode of the floating point unit, if available.
  • The floating point values NaN and Infinity are formatted as nan and inf, possibly preceded by '+' or '-' sign.
  • Formatting reals is only supported for 64 bit reals and 80 bit reals. All other reals are cast to double before they are formatted. This will cause the result to be inf for very large numbers.
  • Characters and strings formatted with the 's' format character inside of compound types are surrounded by single and double quotes and unprintable characters are escaped. To avoid this, a '-' flag can be specified for the compound specifier (e.g. "%-(%s%)" instead of "%(%s%)").
  • Structs, unions, classes and interfaces are formatted by calling a toString method if available. See $(D module std.format.write) for more details.
  • Only part of these combinations can be used for reading. See $(D module std.format.read) for more detailed information.
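
Several of the rules above can be checked with a sketch like this one. The inputs are illustrative.

```d
import std.format : format;

void main()
{
    // Upper case format characters yield upper case output.
    assert(format("%x", 48879) == "beef");
    assert(format("%X", 48879) == "BEEF");

    // Default precision for floating point numbers is 6 digits.
    assert(format("%e", 1.5) == "1.500000e+00");
    assert(format("%E", 1.5) == "1.500000E+00");

    // Infinity and NaN, possibly with a sign.
    assert(format("%f", double.infinity) == "inf");
    assert(format("%F", double.infinity) == "INF");
    assert(format("%f", -double.infinity) == "-inf");
    assert(format("%+f", double.infinity) == "+inf");
    assert(format("%s", double.nan) == "nan");

    // Strings inside compound types are quoted and escaped unless '-' is used.
    assert(format("%s", ["a\tb"]) == `["a\tb"]`);
    assert(format("%-(%s%)", ["a\tb"]) == "a\tb");
}
```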

This table contains descriptions for every possible combination of type and format character:

Type
    Format Character
        Formatted as...

Null
    's'
        null

bool
    's'
        false or true
    'b', 'd', 'o', 'u', 'x', 'X'
        As the integrals 0 or 1 with the same format character.

        Please note that 'o' and 'x' with the '#' flag might produce unexpected results due to special handling of the value 0.
    'r'
        \0 or \1

Integral
    's', 'd'
        A signed decimal number. The '#' flag is ignored.
    'b', 'o', 'u', 'x', 'X'
        An unsigned binary, decimal, octal or hexadecimal number.

        In case of 'o' and 'x', the '#' flag denotes that the number must be preceded by 0 and 0x respectively, with the exception of the value 0, where this does not apply. For 'b' and 'u' the '#' flag has no effect.
    'e', 'E', 'f', 'F', 'g', 'G', 'a', 'A'
        As a floating point value with the same specifier.

        Default precision is large enough to add all digits of the integral value.

        In case of 'a' and 'A', the integral digit can be any hexadecimal digit.
    'r'
        Characters taken directly from the binary representation.

Floating Point
    'e', 'E'
        Scientific notation: Exactly one integral digit followed by a dot and fractional digits, followed by the exponent. The exponent is formatted as 'e' followed by a '+' or '-' sign, followed by at least two digits.

        When there are no fractional digits and the '#' flag is not present, the dot is omitted.
    'f', 'F'
        Natural notation: Integral digits followed by a dot and fractional digits.

        When there are no fractional digits and the '#' flag is not present, the dot is omitted.

        Please note: the difference between 'f' and 'F' is only visible for NaN and Infinity.
    's', 'g', 'G'
        Short notation: If the absolute value is larger than 10 ^^ precision or smaller than 0.0001, the scientific notation is used. If not, the natural notation is applied.

        In both cases precision denotes the count of all digits, including the integral digits. Trailing zeros (including a trailing dot) are removed.

        If the '#' flag is present, trailing zeros are not removed.
    'a', 'A'
        Hexadecimal scientific notation: 0x followed by 1 (or 0 in case of value zero or denormalized number) followed by a dot, fractional digits in hexadecimal notation and an exponent. The exponent is built from p, followed by a sign and the exponent in decimal notation.

        When there are no fractional digits and the '#' flag is not present, the dot is omitted.
    'r'
        Characters taken directly from the binary representation.

Character
    's', 'c'
        As the character.

        Inside of a compound indicator 's' is treated differently: The character is surrounded by single quotes and non printable characters are escaped. This can be avoided by preceding the compound indicator with a '-' flag (e.g. "%-(%s%)").
    'b', 'd', 'o', 'u', 'x', 'X'
        As the integral that represents the character.
    'r'
        Characters taken directly from the binary representation.

String
    's'
        The sequence of characters that form the string.

        Inside of a compound indicator the string is surrounded by double quotes and non printable characters are escaped. This can be avoided by preceding the compound indicator with a '-' flag (e.g. "%-(%s%)").
    'r'
        The sequence of characters, each formatted with 'r'.
    compound
        As an array of characters.

Array
    's'
        When the elements are characters, the array is formatted as a string. In all other cases the array is surrounded by square brackets and the elements are separated by a comma and a space. If the elements are strings, they are surrounded by double quotes and non printable characters are escaped.
    'r'
        The sequence of the elements, each formatted with 'r'.
    compound
        The sequence of the elements, each formatted according to the specifications given inside of the compound specifier.

Associative Array
    's'
        As a sequence of the elements in unpredictable order. The output is surrounded by square brackets. The elements are separated by a comma and a space. The elements are formatted as key:value.
    compound
        As a sequence of the elements in unpredictable order. Each element is formatted according to the specifications given inside of the compound specifier. The first specifier is used for formatting the key and the second specifier is used for formatting the value. The order can be changed with positional arguments. For example "%(%2$s (%1$s), %)" will write the value, followed by the key in parentheses.

Enum
    's'
        The name of the value. If the name is not available, the base value is used, preceded by a cast.
    All, but 's'
        Enums can be formatted with all format characters that can be used with the base value. In that case they are formatted like the base value.

Input Range
    's'
        When the elements of the range are characters, they are written like a string. In all other cases, the elements are enclosed by square brackets and separated by a comma and a space.
    'r'
        The sequence of the elements, each formatted with 'r'.
    compound
        The sequence of the elements, each formatted according to the specifications given inside of the compound specifier.

Struct
    's'
        When the struct has neither an applicable toString nor is an input range, it is formatted as follows: StructType(field1, field2, ...).

Class
    's'
        When the class has neither an applicable toString nor is an input range, it is formatted as the fully qualified name of the class.

Union
    's'
        When the union has neither an applicable toString nor is an input range, it is formatted as its base name.

Pointer
    's'
        A null pointer is formatted as 'null'. All other pointers are formatted as hexadecimal numbers with the format character 'X'.
    'x', 'X'
        Formatted as a hexadecimal number.

SIMD vector
    's'
        The array is surrounded by square brackets and the elements are separated by a comma and a space.
    'r'
        The sequence of the elements, each formatted with 'r'.
    compound
        The sequence of the elements, each formatted according to the specifications given inside of the compound specifier.

Delegate
    's', 'r', compound
        As the .stringof of this delegate treated as a string.

        Please note: The implementation is currently buggy and its use is discouraged.
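
A few of the combinations from the table are sketched below. The types Color and Point are hypothetical and declared only for this illustration.

```d
import std.format : format;
import std.range : iota;

enum Color { red, green }      // hypothetical enum for illustration

struct Point { int x; int y; } // hypothetical struct without toString

void main()
{
    assert(format("%s", null) == "null");
    assert(format("%s", true) == "true");
    assert(format("%d", true) == "1");
    assert(format("%s", 'A') == "A");
    assert(format("%d", 'A') == "65");
    assert(format("%s", "text") == "text");
    assert(format("%s", [1, 2, 3]) == "[1, 2, 3]");
    assert(format("%s", ["a", "b"]) == `["a", "b"]`);
    assert(format("%s", [1: 10]) == "[1:10]");
    assert(format("%s", Color.green) == "green");
    assert(format("%d", Color.green) == "1");
    assert(format("%s", iota(3)) == "[0, 1, 2]");
    assert(format("%s", Point(3, 4)) == "Point(3, 4)");
}
```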

Modules

read
module std.format.read

This is a submodule of std.format.

spec
module std.format.spec

This is a submodule of std.format.

write
module std.format.write

This is a submodule of std.format.

Public Imports

std.format.read
public import std.format.read;
std.format.spec
public import std.format.spec;
std.format.write
public import std.format.write;

Members

Classes

FormatException
class FormatException

Signals an issue encountered while formatting.
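
As a minimal sketch, a mismatch between the format string and the arguments can be observed like this (using assertThrown from std.exception):

```d
import std.exception : assertThrown;
import std.format : format, FormatException;

void main()
{
    // 'd' cannot be applied to a string argument.
    assertThrown!FormatException(format("%d", "foo"));

    // More format specifiers than arguments.
    assertThrown!FormatException(format("%s %s", 1));
}
```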

Functions

format
immutable(Char)[] format(Char[] fmt, Args args)
typeof(fmt) format(Args args)

Converts its arguments according to a format string into a string.

sformat
char[] sformat(char[] buf, const(Char)[] fmt, Args args)
char[] sformat(char[] buf, Args args)

Converts its arguments according to a format string into a buffer. The buffer has to be large enough to hold the formatted string.
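
The sketch below shows both overload styles of each function: the format string passed as a runtime argument and as a compile-time template argument. The buffer size of 20 is an arbitrary choice that is large enough for these outputs.

```d
import std.format : format, sformat;

void main()
{
    // Format string as a template argument: checked at compile time.
    assert(format!"%s has %d items"("cart", 3) == "cart has 3 items");

    // sformat writes into the buffer and returns the slice that was filled.
    char[20] buf;
    char[] result = sformat(buf[], "Here are %d %s.", 3, "apples");
    assert(result == "Here are 3 apples.");
    assert(result.ptr == buf.ptr); // no new allocation for the result

    // Template form of sformat.
    assert(sformat!"%7.2f %%"(buf[], 17.4285) == "  17.43 %");
}
```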

Examples

Simple use:

// Easiest way is to use `%s` everywhere:
assert(format("I got %s %s for %s euros.", 30, "eggs", 5.27) == "I got 30 eggs for 5.27 euros.");

// Other format characters provide more control:
assert(format("I got %b %(%X%) for %f euros.", 30, "eggs", 5.27) == "I got 11110 65676773 for 5.270000 euros.");

Compound specifiers allow formatting arrays and other compound types:

/*
The trailing end of the sub-format string following the specifier for
each item is interpreted as the array delimiter, and is therefore
omitted following the last array item:
 */
    assert(format("My items are %(%s %).", [1,2,3]) == "My items are 1 2 3.");
    assert(format("My items are %(%s, %).", [1,2,3]) == "My items are 1, 2, 3.");

/*
The "%|" delimiter specifier may be used to indicate where the
delimiter begins, so that the portion of the format string prior to
it will be retained in the last array element:
 */
    assert(format("My items are %(-%s-%|, %).", [1,2,3]) == "My items are -1-, -2-, -3-.");

/*
These compound format specifiers may be nested in the case of a
nested array argument:
 */
    auto mat = [[1, 2, 3],
                [4, 5, 6],
                [7, 8, 9]];

    assert(format("%(%(%d %) - %)", mat) == "1 2 3 - 4 5 6 - 7 8 9");
    assert(format("[%(%(%d %) - %)]", mat) == "[1 2 3 - 4 5 6 - 7 8 9]");
    assert(format("[%([%(%d %)]%| - %)]", mat) == "[[1 2 3] - [4 5 6] - [7 8 9]]");

/*
Strings and characters are escaped automatically inside compound
format specifiers. To avoid this behavior, use "%-(" instead of "%(":
 */
    assert(format("My friends are %s.", ["John", "Nancy"]) == `My friends are ["John", "Nancy"].`);
    assert(format("My friends are %(%s, %).", ["John", "Nancy"]) == `My friends are "John", "Nancy".`);
    assert(format("My friends are %-(%s, %).", ["John", "Nancy"]) == `My friends are John, Nancy.`);

Using parameters:

// Flags can be used to influence the outcome:
assert(format("%g != %+#g", 3.14, 3.14) == "3.14 != +3.14000");

// Width and precision help to arrange the formatted result:
assert(format(">%10.2f<", 1234.56789) == ">   1234.57<");

// Numbers can be grouped:
assert(format("%,4d", int.max) == "21,4748,3647");

// It's possible to specify the position of an argument:
assert(format("%3$s %1$s", 3, 17, 5) == "5 3");

Providing parameters as arguments:

// Width as argument
assert(format(">%*s<", 10, "abc") == ">       abc<");

// Precision as argument
assert(format(">%.*f<", 5, 123.2) == ">123.20000<");

// Grouping as argument
assert(format("%,*d", 1, int.max) == "2,1,4,7,4,8,3,6,4,7");

// Grouping separator as argument
assert(format("%,3?d", '_', int.max) == "2_147_483_647");

// All at once
assert(format("%*.*,*?d", 20, 15, 6, '/', int.max) == "   000/002147/483647");

Meta

Authors

Walter Bright, Andrei Alexandrescu, and Kenji Hara