Package | Description |
---|---|
org.kew.rmf.transformers | |
org.kew.rmf.transformers.authors | |
org.kew.rmf.transformers.botany | |
org.kew.rmf.transformers.collations | |
org.kew.rmf.transformers.collations.wcs | |
Modifier and Type | Class and Description |
---|---|
class |
A2BTransformer
A generic transformer that searches for all occurrences of a string
A2BTransformer.a and replaces each with a string A2BTransformer.b . |
class |
CapitalLettersExtractor
Returns only the capital letters in a string; other letters
are replaced by
RegexTransformer.replacement (default: a space). |
class |
CompositeTransformer
A CompositeTransformer holds a list of other transformers and
applies them in order to the given string.
|
class |
DictionaryRegexTransformer
Iterates over the keys of a
Dictionary object, using each key as a regular expression; where a
pattern matches, the string is transformed accordingly, using the
corresponding value of the
Dictionary. |
class |
DictionaryTransformer
Uses a
Dictionary object to look up a string in its
keys and returns the value if the key is found. |
class |
LowerCaseTransformer
Converts the input to lower case using the defined locale, default is English.
|
class |
NormaliseDiacriticalMarksTransformer
Removes diacritical marks from letters, intended for dealing with OCR errors
and language-ignorant string comparisons.
|
class |
RegexExtractorTransformer
A generic transformer that extracts all occurrences of a pattern (regEx)
in a string.
It takes two optional parameters, RegexExtractorTransformer.removeMultipleWhitespaces (default true) and
RegexExtractorTransformer.trimIt (default true). |
class |
RegexTransformer
A generic transformer that searches for all occurrences of a regular expression pattern (regEx)
RegexTransformer.pattern in a string and replaces each with a string RegexTransformer.replacement . |
class |
RemoveBracketedTextTransformer
Removes all text in brackets (round, square and curly) incl.
|
class |
RomanNumeralTransformer
This transformer splits a string into a series of words.
|
class |
SqueezeWhitespaceTransformer
A transformer to find multiple whitespace characters (space, tab etc) and
replace them with a single space.
|
class |
StringShrinkerTransformer
Crops/shrinks a string to a length of
StringShrinkerTransformer.length characters. |
class |
StripNonAlphabeticCharactersTransformer
This is a chain of three transformations:
(1) replace diacritical characters with their Latin equivalent (
NormaliseDiacriticalMarksTransformer );
(2) replace all non-alphabetic characters with StripNonAlphabeticCharactersTransformer.replacement (default: space);
(3) replace multiple whitespace occurrences with one whitespace.
Returns a trimmed result. |
class |
StripNonAlphanumericCharactersTransformer
This is a chain of three transformations:
(1) replace diacritical characters with their Latin equivalent (
NormaliseDiacriticalMarksTransformer );
(2) replace all non-alphanumeric characters with StripNonAlphanumericCharactersTransformer.replacement (default: space);
(3) replace multiple whitespace occurrences with one whitespace.
Returns a trimmed result. |
class |
StripNonAsciiAlphabeticCharactersTransformer
This transformer strips non ASCII alphabetic characters (A to Z, a to z), replacing
them with a space.
|
class |
StripNonAsciiAlphanumericCharactersTransformer
This transformer strips non ASCII alphanumeric characters (A to Z, a to z, 0 to 9), replacing
them with a space.
|
class |
StripNonNumericCharactersTransformer
This transformer strips non numeric characters, i.e.
|
class |
TitleCaseTransformer
Converts the input to title case.
|
class |
UpperCaseTransformer
Converts the input to upper case.
|
class |
YearRangeExtractorTransformer
Extracts year ranges from arbitrary strings.
|
class |
ZeroToBlankTransformer
This transformer translates zeros to blanks.
|
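
The three-step chain described for StripNonAlphabeticCharactersTransformer can be sketched with the JDK alone. This is an illustration under stated assumptions, not the library's implementation: the class `StripNonAlphabeticSketch` and its static `transform` helper are hypothetical names, and the regular expressions are one plausible reading of "diacritical", "non-alphabetic" and "whitespace".

```java
import java.text.Normalizer;
import java.util.regex.Matcher;

/** Illustrative sketch only; hypothetical class, not part of org.kew.rmf. */
final class StripNonAlphabeticSketch {

    /** Hypothetical helper mirroring the three documented steps. */
    static String transform(String s, String replacement) {
        // 1. Decompose accented letters (NFD) and drop the combining marks.
        String noMarks = Normalizer.normalize(s, Normalizer.Form.NFD)
                .replaceAll("\\p{M}+", "");
        // 2. Replace every non-letter character with the replacement string
        //    (quoted so that '$' or '\' in it are taken literally).
        String lettersOnly = noMarks.replaceAll("[^\\p{L}]",
                Matcher.quoteReplacement(replacement));
        // 3. Collapse whitespace runs to a single space, then trim.
        return lettersOnly.replaceAll("\\s+", " ").trim();
    }

    public static void main(String[] args) {
        // "M\u00fcller" is "Müller" written with a Unicode escape.
        System.out.println(transform("M\u00fcller-Arg. 1865", " ")); // Muller Arg
    }
}
```

Swapping the character class in step 2 for `[^\\p{L}\\p{N}]` would give the alphanumeric variant under the same assumptions.
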
Modifier and Type | Method and Description |
---|---|
List<Transformer> |
CompositeTransformer.getTransformers() |
Modifier and Type | Method and Description |
---|---|
void |
CompositeTransformer.setTransformers(List<Transformer> transformers) |
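
The getter/setter pair above suggests a plain list-backed composite. A minimal sketch of that pattern follows; `SketchTransformer` is a hypothetical stand-in for the library's `Transformer` type (whose exact interface is not shown on this page), and `CompositeSketch` and `demo` are illustrative names only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

/** Hypothetical stand-in for the library's Transformer type. */
interface SketchTransformer {
    String transform(String s);
}

/** Illustrative composite: applies its child transformers in list order. */
final class CompositeSketch implements SketchTransformer {
    private List<SketchTransformer> transformers = new ArrayList<>();

    public List<SketchTransformer> getTransformers() {
        return transformers;
    }

    public void setTransformers(List<SketchTransformer> transformers) {
        this.transformers = transformers;
    }

    @Override
    public String transform(String s) {
        String result = s;
        // Each transformer receives the output of the previous one.
        for (SketchTransformer t : transformers) {
            result = t.transform(result);
        }
        return result;
    }

    /** Builds a two-step composite (lower-case, squeeze whitespace) and runs it. */
    static String demo(String input) {
        SketchTransformer lower = s -> s.toLowerCase(Locale.ENGLISH);
        SketchTransformer squeeze = s -> s.replaceAll("\\s+", " ");
        CompositeSketch composite = new CompositeSketch();
        composite.setTransformers(Arrays.asList(lower, squeeze));
        return composite.transform(input);
    }

    public static void main(String[] args) {
        System.out.println(demo("Poa   ANNUA")); // poa annua
    }
}
```

Because the composite itself implements the same interface, composites can be nested inside other composites.
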
Modifier and Type | Class and Description |
---|---|
class |
CleanedPubAuthors
Removes the basionym author (StripBasionymAuthor), then the in-author
(StripInAuthor), then the ex-author (StripExAuthor).
|
class |
DotFDotCleaner
Removes " f.", ".f." after any alphanumeric Characters (incl.
|
class |
ShrunkAuthors
This transformer tries to identify all authors (accepts publishing-,
basionym-, ex-, in-) of plant names in a string and returns a string where
each of their surnames is shrunk/cropped to a length
of
ShrunkAuthors.setShrinkTo(int) . |
class |
ShrunkPubAuthors
This transformer tries to identify all publication authors of plant names in
a string and returns a string where each of their surnames is shrunk/cropped
to a length of
ShrunkPubAuthors.setShrinkTo(int) . |
class |
StripBasionymAuthorTransformer
This transformer translates author strings in the form "(Author1) Author2" to "Author2"
|
class |
StripExAuthorTransformer
This transformer translates author strings in the form "Author1 ex Author2" to "Author2"
|
class |
StripInAuthorTransformer
This transformer translates author strings in the form "Author1 in Author2" to "Author1"
|
class |
StripPublishingAuthorTransformer
This transformer translates author strings in the form "(Author1) Author2" to "Author1"
|
class |
SurnameExtractor
This transformer tries to identify all surnames of plant name authors in a string
and deletes everything else.
|
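
The "(Author1) Author2", "Author1 ex Author2" and "Author1 in Author2" forms described above can be illustrated with simple regular expressions. The sketch below is an assumption-laden approximation, not the library's code: `AuthorStripSketch` and its method names are hypothetical, and real author strings have edge cases that these patterns ignore.

```java
/** Illustrative sketch only; hypothetical class, not part of org.kew.rmf. */
final class AuthorStripSketch {

    /** "(Author1) Author2" -> "Author2": drop any parenthesised part. */
    static String stripBasionym(String s) {
        return s.replaceAll("\\([^)]*\\)\\s*", "").trim();
    }

    /** "Author1 in Author2" -> "Author1": drop " in " and what follows. */
    static String stripIn(String s) {
        return s.replaceFirst("\\s+in\\s.*$", "").trim();
    }

    /** "Author1 ex Author2" -> "Author2": drop everything up to the last " ex ". */
    static String stripEx(String s) {
        return s.replaceFirst("^.*\\sex\\s+", "").trim();
    }

    /** Order described for CleanedPubAuthors: basionym first, then in, then ex. */
    static String cleanedPubAuthors(String s) {
        return stripEx(stripIn(stripBasionym(s)));
    }

    public static void main(String[] args) {
        System.out.println(cleanedPubAuthors("(L.) Hook. ex Benth. in DC.")); // Benth.
    }
}
```
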
Modifier and Type | Class and Description |
---|---|
class |
EpithetTransformer
This transformer canonicalises epithets.
|
class |
FakeHybridSignCleaner
Deletes X and x that seem to be meant as hybrid signs.
X and x can be at the beginning of a string followed by a whitespace or
anywhere in the string if surrounded by whitespace.
|
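
The two positions named for FakeHybridSignCleaner (string start followed by whitespace, or anywhere when surrounded by whitespace) map naturally onto two regular expressions. The following is a sketch under that reading only; `FakeHybridSignSketch` and `clean` are hypothetical names, and how the real class treats the surrounding whitespace is an assumption.

```java
/** Illustrative sketch only; hypothetical class, not part of org.kew.rmf. */
final class FakeHybridSignSketch {

    static String clean(String s) {
        // Case 1: X or x at the very start, followed by whitespace.
        String noLeading = s.replaceFirst("^[Xx]\\s+", "");
        // Case 2: X or x preceded by whitespace and followed by whitespace.
        // The lookahead keeps the trailing whitespace so words stay separated.
        return noLeading.replaceAll("\\s+[Xx](?=\\s)", "");
    }

    public static void main(String[] args) {
        System.out.println(clean("x Salix alba"));   // Salix alba
        System.out.println(clean("Salix X rubens")); // Salix rubens
        System.out.println(clean("Xanthium"));       // Xanthium
    }
}
```

Names that merely begin with X, such as "Xanthium", are left alone because neither pattern matches without adjacent whitespace.
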
Modifier and Type | Class and Description |
---|---|
class |
CollationStructureTransformer |
Modifier and Type | Class and Description |
---|---|
class |
IssueExtractorTransformer |
class |
PageExtractorTransformer |
class |
SeriesExtractorTransformer |
class |
TabOrFigExtractorTransformer |
class |
VolExtractorTransformer |
class |
YearExtractorTransformer |
Copyright © 2012–2014 Royal Botanic Gardens, Kew. All rights reserved.