Top Description Fields Constructors Methods
jdk.internal.icu.text

pack-priv Class FilteredNormalizer2

extends Normalizer2
Class Inheritance
Imports
java.io.IOException

Normalization filtered by a UnicodeSet. Normalizes portions of the text contained in the filter set and leaves portions not contained in the filter set unchanged. Filtering is done via UnicodeSet.span(..., UnicodeSet.SpanCondition.SIMPLE). Not-in-the-filter text is treated as "is normalized" and "quick check yes". This class implements all of (and only) the Normalizer2 API. An instance of this class is unmodifiable/immutable.
Author
Markus W. Scherer
ICU Status
Stable since ICU 4.4.

Field Summary

Modifier and TypeField and Description
private Normalizer2
private UnicodeSet

Constructor Summary

AccessConstructor and Description
public
FilteredNormalizer2(Normalizer2
wrapped Normalizer2 instance
n2
,
UnicodeSet
UnicodeSet which determines the characters to be normalized
filterSet
)

Constructs a filtered normalizer wrapping any Normalizer2 instance and a filter set.

Method Summary

Modifier and TypeMethod and Description
public StringBuilder
append(StringBuilder
string, should be normalized
first
,
CharSequence
string, should be normalized
second
)

Implements abstract jdk.internal.icu.text.Normalizer2.append.

Appends the second string to the first string (merging them at the boundary) and returns the first string.
public int
getCombiningClass(int
code point
c
)

Overrides jdk.internal.icu.text.Normalizer2.getCombiningClass.

Gets the combining class of c.
public String
getDecomposition(int
code point
c
)

Implements abstract jdk.internal.icu.text.Normalizer2.getDecomposition.

Gets the decomposition mapping of c.
public boolean
hasBoundaryBefore(int
character to test
c
)

Implements abstract jdk.internal.icu.text.Normalizer2.hasBoundaryBefore.

Tests if the character always has a normalization boundary before it, regardless of context.
public boolean
isNormalized(CharSequence
input string
s
)

Implements abstract jdk.internal.icu.text.Normalizer2.isNormalized.

Tests if the string is normalized.
public StringBuilder
normalize(CharSequence
source string
src
,
StringBuilder
destination string; its contents is replaced with normalized src
dest
)

Implements abstract jdk.internal.icu.text.Normalizer2.normalize.

Writes the normalized form of the source string to the destination string (replacing its contents) and returns the destination string.
public Appendable
normalize(CharSequence
source string
src
,
Appendable
destination Appendable; gets normalized src appended
dest
)

Implements abstract jdk.internal.icu.text.Normalizer2.normalize.

Writes the normalized form of the source string to the destination Appendable and returns the destination Appendable.
private Appendable
public StringBuilder
normalizeSecondAndAppend(StringBuilder
string, should be normalized
first
,
CharSequence
string, will be normalized
second
)

Implements abstract jdk.internal.icu.text.Normalizer2.normalizeSecondAndAppend.

Appends the normalized form of the second string to the first string (merging them at the boundary) and returns the first string.
private StringBuilder
normalizeSecondAndAppend(StringBuilder first, CharSequence second, boolean doNormalize)

public int
spanQuickCheckYes(CharSequence
input string
s
)

Implements abstract jdk.internal.icu.text.Normalizer2.spanQuickCheckYes.

Returns the end of the normalized substring of the input string.
Inherited from jdk.internal.icu.text.Normalizer2:
getNFCInstancegetNFDInstancegetNFKCInstancegetNFKDInstancenormalize

Field Detail

norm2back to summary
private Normalizer2 norm2
setback to summary
private UnicodeSet set

Constructor Detail

FilteredNormalizer2back to summary
public FilteredNormalizer2(Normalizer2 n2, UnicodeSet filterSet)

Constructs a filtered normalizer wrapping any Normalizer2 instance and a filter set. Both are aliased and must not be modified or deleted while this object is used. The filter set should be frozen; otherwise the performance will suffer greatly.

Parameters
n2:Normalizer2

wrapped Normalizer2 instance

filterSet:UnicodeSet

UnicodeSet which determines the characters to be normalized

ICU Status
Stable since ICU 4.4.

Method Detail

appendback to summary
public StringBuilder append(StringBuilder first, CharSequence second)

Implements abstract jdk.internal.icu.text.Normalizer2.append.

Doc from jdk.internal.icu.text.Normalizer2.append.

Appends the second string to the first string (merging them at the boundary) and returns the first string. The result is normalized if both the strings were normalized. The first and second strings must be different objects.

Parameters
first:StringBuilder

string, should be normalized

second:CharSequence

string, should be normalized

Returns:StringBuilder

first

Annotations
@Override
ICU Status
Stable since ICU 4.4.
getCombiningClassback to summary
public int getCombiningClass(int c)

Overrides jdk.internal.icu.text.Normalizer2.getCombiningClass.

Doc from jdk.internal.icu.text.Normalizer2.getCombiningClass.

Gets the combining class of c. The default implementation returns 0 but all standard implementations return the Unicode Canonical_Combining_Class value.

Parameters
c:int

code point

Returns:int

c's combining class

Annotations
@Override
ICU Status
Stable since ICU 49.
getDecompositionback to summary
public String getDecomposition(int c)

Implements abstract jdk.internal.icu.text.Normalizer2.getDecomposition.

Doc from jdk.internal.icu.text.Normalizer2.getDecomposition.

Gets the decomposition mapping of c. Roughly equivalent to normalizing the String form of c on a DECOMPOSE Normalizer2 instance, but much faster, and except that this function returns null if c does not have a decomposition mapping in this instance's data. This function is independent of the mode of the Normalizer2.

Parameters
c:int

code point

Returns:String

c's decomposition mapping, if any; otherwise null

Annotations
@Override
ICU Status
Stable since ICU 4.6.
hasBoundaryBeforeback to summary
public boolean hasBoundaryBefore(int c)

Implements abstract jdk.internal.icu.text.Normalizer2.hasBoundaryBefore.

Doc from jdk.internal.icu.text.Normalizer2.hasBoundaryBefore.

Tests if the character always has a normalization boundary before it, regardless of context. If true, then the character does not normalization-interact with preceding characters. In other words, a string containing this character can be normalized by processing portions before this character and starting from this character independently. This is used for iterative normalization. See the class documentation for details.

Parameters
c:int

character to test

Returns:boolean

true if c has a normalization boundary before it

Annotations
@Override
ICU Status
Stable since ICU 4.4.
isNormalizedback to summary
public boolean isNormalized(CharSequence s)

Implements abstract jdk.internal.icu.text.Normalizer2.isNormalized.

Doc from jdk.internal.icu.text.Normalizer2.isNormalized.

Tests if the string is normalized. Internally, in cases where the quickCheck() method would return "maybe" (which is only possible for the two COMPOSE modes) this method resolves to "yes" or "no" to provide a definitive result, at the cost of doing more work in those cases.

Parameters
s:CharSequence

input string

Returns:boolean

true if s is normalized

Annotations
@Override
ICU Status
Stable since ICU 4.4.
normalizeback to summary
public StringBuilder normalize(CharSequence src, StringBuilder dest)

Implements abstract jdk.internal.icu.text.Normalizer2.normalize.

Doc from jdk.internal.icu.text.Normalizer2.normalize.

Writes the normalized form of the source string to the destination string (replacing its contents) and returns the destination string. The source and destination strings must be different objects.

Parameters
src:CharSequence

source string

dest:StringBuilder

destination string; its contents is replaced with normalized src

Returns:StringBuilder

dest

Annotations
@Override
ICU Status
Stable since ICU 4.4.
normalizeback to summary
public Appendable normalize(CharSequence src, Appendable dest)

Implements abstract jdk.internal.icu.text.Normalizer2.normalize.

Doc from jdk.internal.icu.text.Normalizer2.normalize.

Writes the normalized form of the source string to the destination Appendable and returns the destination Appendable. The source and destination strings must be different objects.

Any java.io.IOException is wrapped into a com.ibm.icu.util.ICUUncheckedIOException.

Parameters
src:CharSequence

source string

dest:Appendable

destination Appendable; gets normalized src appended

Returns:Appendable

dest

Annotations
@Override
ICU Status
Stable since ICU 4.6.
normalizeback to summary
private Appendable normalize(CharSequence src, Appendable dest, UnicodeSet.SpanCondition spanCondition)
normalizeSecondAndAppendback to summary
public StringBuilder normalizeSecondAndAppend(StringBuilder first, CharSequence second)

Implements abstract jdk.internal.icu.text.Normalizer2.normalizeSecondAndAppend.

Doc from jdk.internal.icu.text.Normalizer2.normalizeSecondAndAppend.

Appends the normalized form of the second string to the first string (merging them at the boundary) and returns the first string. The result is normalized if the first string was normalized. The first and second strings must be different objects.

Parameters
first:StringBuilder

string, should be normalized

second:CharSequence

string, will be normalized

Returns:StringBuilder

first

Annotations
@Override
ICU Status
Stable since ICU 4.4.
normalizeSecondAndAppendback to summary
private StringBuilder normalizeSecondAndAppend(StringBuilder first, CharSequence second, boolean doNormalize)
spanQuickCheckYesback to summary
public int spanQuickCheckYes(CharSequence s)

Implements abstract jdk.internal.icu.text.Normalizer2.spanQuickCheckYes.

Doc from jdk.internal.icu.text.Normalizer2.spanQuickCheckYes.

Returns the end of the normalized substring of the input string. In other words, with end=spanQuickCheckYes(s); the substring s.subSequence(0, end) will pass the quick check with a "yes" result.

The returned end index is usually one or more characters before the "no" or "maybe" character: The end index is at a normalization boundary. (See the class documentation for more about normalization boundaries.)

When the goal is a normalized string and most input strings are expected to be normalized already, then call this method, and if it returns a prefix shorter than the input string, copy that prefix and use normalizeSecondAndAppend() for the remainder.

Parameters
s:CharSequence

input string

Returns:int

"yes" span end index

Annotations
@Override
ICU Status
Stable since ICU 4.4.