The following APIs accept Unicode objects and strings as input (the descriptions refer to both as strings) and return Unicode objects or integers as appropriate.
They all return NULL or -1 if an exception occurs.
Concatenate two strings, giving a new Unicode string.
Split a string giving a list of Unicode strings.
If sep is NULL, splitting will be done at all whitespace substrings. Otherwise, splits occur at the given separator.
At most maxsplit splits will be done. If negative, no limit is set.
Separators are not included in the resulting list.
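The splitting rules above can be tried out at the Python level. This is a minimal sketch, assuming the C function behaves like Python's str.split method; the sample strings and variable names are illustrative only.

```python
# Python-level analogue of the splitting rules; sample data is made up.
text = "alpha  beta\tgamma delta"

# No separator (the analogue of sep == NULL): split at whitespace runs.
all_fields = text.split()

# At most one split is done; the remainder stays in the last item.
limited = text.split(None, 1)

# Explicit separator; the separators themselves are dropped.
csv_fields = "a,b,,c".split(",")

# A negative maxsplit sets no limit.
unlimited = "a,b,c".split(",", -1)
```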
As above, but split at line breaks.
CRLF is considered to be one line break. Line breaks are not included in the resulting list.
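A short illustration of the CRLF rule, assuming the C function matches Python's str.splitlines method with its default arguments; the sample text is illustrative.

```python
# "\r\n" counts as a single line break, so no empty item appears between
# "one" and "two". Line breaks are not kept in the resulting items.
text = "one\r\ntwo\nthree\rfour"
lines = text.splitlines()
```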
Translate a string by applying a character mapping table to it and return the resulting Unicode object.
The mapping table must map Unicode ordinal integers to Unicode ordinal integers or None (causing deletion of the character).
Mapping tables need only provide the __getitem__ interface; dictionaries and sequences are typical examples. Unmapped character ordinals (ones which cause a LookupError) are left untouched and are copied as-is.
errors has the usual meaning for codecs. It may be NULL, in which case the default error handling is used.
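The mapping-table rules can be sketched with Python's str.translate method, on the assumption that it mirrors the C function. The tables below are illustrative, and the errors argument has no counterpart in this sketch.

```python
# Dictionary table: ordinal -> ordinal, or None to delete the character.
table = {ord("a"): ord("A"), ord("b"): None}
translated = "abcabc".translate(table)  # "c" is unmapped and copied as-is

# Sequence table: indexed by ordinal. Ordinals past the end of the list
# raise IndexError (a LookupError), so those characters are left untouched.
seq_table = list(range(128))
seq_table[ord("x")] = ord("y")
translated_seq = "x\u00e9".translate(seq_table)
```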
Join a sequence of strings using the given separator and return the resulting Unicode string.
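For comparison, the Python-level counterpart is the join method of the separator string. This sketch assumes the C function produces the same result; the data is illustrative.

```python
# The separator is placed between items, not before the first or after the last.
parts = ["usr", "local", "lib"]
joined = "/".join(parts)

# A one-item sequence yields the item unchanged.
single = ", ".join(["only"])
```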
Return 1 if substr matches str[start:end] at the given tail end (direction == -1 means to do a prefix match, direction == 1 a suffix match), 0 otherwise.
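The direction convention can be modelled in Python with startswith and endswith. The helper name tailmatch is hypothetical and exists only for this sketch; it is not part of any API.

```python
def tailmatch(s, substr, start, end, direction):
    """Hypothetical helper: return 1 on a match, 0 otherwise."""
    if direction == -1:
        # Prefix match against s[start:end].
        return int(s.startswith(substr, start, end))
    # direction == 1: suffix match against s[start:end].
    return int(s.endswith(substr, start, end))

s = "hello world"
prefix_hit = tailmatch(s, "lo", 3, 8, -1)   # s[3:8] is "lo wo"
suffix_hit = tailmatch(s, "wo", 3, 8, 1)
suffix_miss = tailmatch(s, "lo", 3, 8, 1)
```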
Return the first position of substr in str[start:end] using the given direction (direction == 1 means to do a forward search, direction == -1 a backward search), or -1 if no match is found.
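Forward and backward searches correspond roughly to Python's str.find and str.rfind. This is a sketch under that assumption; note that these Python methods report a miss as -1, since 0 is a valid match position.

```python
s = "abcabc"

forward = s.find("bc")         # forward search (direction == 1)
backward = s.rfind("bc")       # backward search (direction == -1)
windowed = s.find("bc", 2, 6)  # search restricted to s[2:6]
missing = s.find("zz")         # no match
```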
Count the number of occurrences of substr in str[start:end].
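A brief Python-level illustration using str.count, which is assumed here to behave like the C function. Python's method counts non-overlapping occurrences.

```python
whole = "banana".count("an")
sliced = "banana".count("a", 2, 6)   # counts within "nana"
non_overlapping = "aaaa".count("aa")
```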
Replace at most maxcount occurrences of substr in str with replstr and return the resulting Unicode object. maxcount == -1 means: replace all occurrences.
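The maxcount behavior can be seen with Python's str.replace, assuming it mirrors the C function; the strings are illustrative.

```python
s = "a-b-c-d"

first_two = s.replace("-", "+", 2)       # at most two replacements
all_of_them = s.replace("-", "+", -1)    # -1: replace every occurrence
```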
Compare two strings and return -1, 0, or 1 for less than, equal, and greater than, respectively.
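The three-way result can be modelled in Python as follows. The helper name compare is hypothetical, introduced only for this sketch, since current Python has no built-in three-way comparison function.

```python
def compare(left, right):
    """Hypothetical helper: -1, 0, or 1 for less than, equal, greater than."""
    # Booleans are integers in Python, so the difference is -1, 0, or 1.
    return (left > right) - (left < right)

less = compare("apple", "banana")
equal = compare("same", "same")
greater = compare("b", "a")
```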
Return a new string object from format and args; this is analogous to format % args. The args argument must be a tuple.
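The Python expression referred to here looks like this; the format string and values are illustrative.

```python
fmt = "%s has %d items"
args = ("list", 3)       # args is a tuple, as required
result = fmt % args

# A single value still goes in a one-element tuple.
single = "%s!" % ("done",)
```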
Check whether element is contained in container and return 1 or 0 accordingly.
element has to coerce to a one-element Unicode string. -1 is returned in case of an error.
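A Python-level model of the containment test, using the in operator with a one-character element. The helper name contains is hypothetical, and the sketch does not model the -1 error return.

```python
def contains(container, element):
    """Hypothetical helper: 1 if element occurs in container, else 0."""
    return int(element in container)

found = contains("hello", "e")
not_found = contains("hello", "z")
```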