char
Author: Heli Qi Affiliation: NAIST Date: 2022.07
CharTokenizer
Bases: Tokenizer
Tokenizer implementation that converts the input sentence string into a list of graphemes (characters).
Source code in speechain/tokenizer/char.py
text2tensor(text, no_sos=False, no_eos=False, return_tensor=True)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
text
|
str
|
|
required |
no_sos
|
bool
|
|
False
|
no_eos
|
bool
|
|
False
|
return_tensor
|
bool
|
|
True
|
Returns: