o vZŽh¢ã@s<ddlmZmZdd„Zdd„Zdd„Zdd „Zd d„ZdS) é)ÚRegexBuilderÚsymbolscCóttjdd„djS)zÓKeep tone-modifying punctuation by matching following character. Assumes the `tone_marks` pre-processor was run for cases where there might not be any space after a tone-modifying punctuation mark. cSó d |¡S)Nz(?<={}).©Úformat©Úx©r úM/var/www/auris/lib/python3.10/site-packages/gtts/tokenizer/tokenizer_cases.pyÚó ztone_marks..©Zpattern_argsZpattern_func)rrÚ TONE_MARKSÚregexr r r rÚ tone_markss ÿþrcCr)aJPeriod and comma case. Match if not preceded by "." and only if followed by space. Won't cut in the middle/after dotted abbreviations; won't cut numbers. Note: Won't match if a dotted abbreviation ends a sentence. Note: Won't match the end of a sentence if not followed by a space. cSr)Nz(?.r)rrÚPERIOD_COMMArr r r rÚperiod_commas þýrcCr)zColon case. Match a colon ":" only if not preceded by a digit. Mainly to prevent a cut in the middle of time notations e.g. 10:01 cSr)Nz (?.r)rrÚCOLONrr r r rÚcolon#s ÿþrcCs@d ttjƒttjƒttjƒttjƒ¡}t|dd„djS)z‚Match other punctuation. Match other punctuation to split on; punctuation that naturally inserts a break in speech. ÚcSr©Nz{}rrr r rr<r z#other_punctuation..r) ÚjoinÚsetrÚALL_PUNCrrrrr©Zpuncr r rÚother_punctuation/sÿþýÿrcCstj}t|dd„djS)z[Match all punctuation. Use as only tokenizer case to mimic gTTS 1.x tokenization. cSrrrrr r rrEr z(legacy_all_punctuation..r)rrrrrr r rÚlegacy_all_punctuation?srN)Zgtts.tokenizerrrrrrrrr r r rÚs