unicode

A single entry point for looking up sets of Unicode code points by the name or alias of a block, script, or general category.

It uses the well-defined standard rules of property name lookup. This includes fuzzy matching of names, so that 'White_Space', 'white-SpAce' and 'whitespace' are all considered equal and yield the same set of white space characters.
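The fuzzy matching described above can be checked with a minimal sketch. It assumes the run-time call form `unicode(name)` from `std.uni` and uses the three spellings quoted in the text.

```d
import std.uni : unicode;

void main()
{
    // Three spellings of the same property name.
    auto a = unicode("White_Space");
    auto b = unicode("white-SpAce");
    auto c = unicode("whitespace");

    // Case, underscores and hyphens are ignored during lookup,
    // so all three names resolve to the same set.
    assert(a == b && b == c);

    // Membership is tested by indexing the set with a code point.
    assert(a[' ']);
    assert(a['\t']);
    assert(!a['x']);
}
```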

Members

Properties

opDispatch
auto opDispatch [@property getter]

Performs the lookup of a set of code points with compile-time correctness checking. This shortcut version combines three searches: across blocks, scripts, and common binary properties.
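A short sketch of the compile-time form: the property name is written as a member of `unicode`, so a misspelled name is expected to fail at compile time rather than at run time. The set names used here (`ASCII`, `in_latin1_Supplement`) are chosen for illustration.

```d
import std.uni : unicode;

void main()
{
    auto ascii = unicode.ASCII;
    assert(ascii['A']);
    assert(ascii['~']);
    assert(!ascii['\u00e0']); // 'à' is outside ASCII

    // Name matching ignores case...
    assert(ascii == unicode.ascII);

    // ...and underscores. Latin-1 Supplement is a block,
    // hence the "In" prefix in the shortcut form.
    auto latin = unicode.in_latin1_Supplement;
    assert(latin['\u00e0']);
    assert(!latin['$']);
}
```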

Static functions

opCall
auto opCall(C[] name)

The same lookup across blocks, scripts, or binary properties, but performed at run-time. This version is provided for cases where name is not known beforehand; otherwise compile-time checked opDispatch is typically a better choice.
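As an illustration of the run-time form, the sketch below looks up a set whose name is held in a variable. It also assumes that an unknown name causes an exception to be thrown; the misspelled name `InCyrilliac` is deliberate.

```d
import std.exception : collectException;
import std.uni : unicode;

void main()
{
    // The name is only known at run time, e.g. read from user input.
    string name = "In Latin 1 Supplement";
    auto latin = unicode(name);
    assert(latin['\u00e0']);

    // The result matches the compile-time checked lookup.
    assert(latin == unicode.in_latin1_Supplement);

    // A name that matches no set is reported by throwing.
    assert(collectException(unicode("InCyrilliac")) !is null);
}
```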

parseSet
CodepointSet parseSet(Range range, bool casefold)

Parses a Unicode code point set from the given range using the standard regex syntax '[...]'. The range is advanced, skipping over the regex set definition. The casefold parameter determines whether the set should be case-folded, that is, whether it includes both the lower-case and upper-case versions of any letters in the set.
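A minimal sketch of parsing a set from a string. It assumes the range is taken by reference (so a variable, not a literal, is passed) and that `casefold` can be supplied as the second argument; the pattern strings are made up for illustration.

```d
import std.uni : unicode;

void main()
{
    // The set definition is followed by unrelated text.
    string pat = "[a-zA-Z0-9]hello";
    auto set = unicode.parseSet(pat);

    // Spot-check a few code points from each sub-range.
    assert(set['a'] && set['A'] && set['9']);
    assert(!set['_']);

    // The range has been advanced past the closing ']'.
    assert(pat == "hello");

    // With case folding requested, a lower-case range
    // also admits the upper-case letters.
    string lower = "[a-z]";
    auto folded = unicode.parseSet(lower, true);
    assert(folded['q'] && folded['Q']);
}
```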

Structs

block
struct block

Narrows down the search for sets of code points to all Unicode blocks.
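For example, a block can be requested explicitly through `block`, which is expected to give the same set as the shortcut form with the "In" prefix. The Greek and Coptic block is used here purely as an illustration.

```d
import std.uni : unicode;

void main()
{
    // Explicit block lookup: no "In" prefix needed.
    auto greek = unicode.block.Greek_and_Coptic;

    // Same set as the shortcut form, which marks blocks with "In".
    assert(greek == unicode.InGreek_and_Coptic);

    // U+03B1 GREEK SMALL LETTER ALPHA lies inside the block.
    assert(greek['\u03b1']);
    assert(!greek['a']);
}
```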

hangulSyllableType
struct hangulSyllableType

Fetches the set of code points that have the given Hangul syllable type.
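A brief sketch of both lookup styles for Hangul syllable types. It assumes `hangulSyllableType` accepts a run-time string as well as a compile-time member name. Note that `L` here means the "leading jamo" syllable type, not the general category Letter that the shortcut `unicode.L` refers to.

```d
import std.uni : unicode;

void main()
{
    // Run-time lookup by name.
    auto leading = unicode.hangulSyllableType("L");

    // A stretch of leading jamo should all be members.
    foreach (ch; '\u1110' .. '\u115F')
        assert(leading[ch]);

    // Compile-time checked lookup gives the same set.
    assert(leading == unicode.hangulSyllableType.L);
}
```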

script
struct script

Narrows down the search for sets of code points to all Unicode scripts.
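The difference between a script and a block of the same name can be illustrated with Arabic. In this sketch the two sets overlap but are not equal, and the shortcut form resolves each one by its own naming convention.

```d
import std.uni : unicode;

void main()
{
    auto arabicScript = unicode.script.arabic;
    auto arabicBlock  = unicode.block.arabic;

    // U+0601 ARABIC SIGN SANAH belongs to both sets.
    assert(arabicBlock['\u0601']);
    assert(arabicScript['\u0601']);

    // The script and the block are nevertheless different sets.
    assert(arabicBlock != arabicScript);

    // Shortcut forms: "In" prefix selects the block, bare name the script.
    assert(arabicBlock == unicode.inArabic);
    assert(arabicScript == unicode.arabic);
}
```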

Meta