API Reference¶

This document provides technical details about the classes and methods available in the Nepali Unicoder package.

`Converter` Class¶

The Converter class is a simplified wrapper around the Engine for easy usage.

`init(self, mode: str = "roman")`¶

Initializes a new Converter. - mode: Either "roman" (default) or "preeti".

`convert(self, text: str) -> str`¶

Translates the input text to Unicode Devanagari. - text: The input string (Romanized Nepali or Preeti characters).

`Engine` Class¶

The core conversion logic is implemented in the Engine class.

`init(self, trie: Optional[Trie] = None, tokenizer: Optional[Tokenizer] = None, mode: str = "roman")`¶

Initializes the conversion engine. - Loads the appropriate character mappings into a Trie based on the mode. - Loads post_rules for contextual transformations in Preeti mode.

`Tokenizer` Class¶

The Tokenizer splits the input text into meaningful chunks (Tokens).

`tokenize(self, text: str, use_blocks: bool = True) -> List[Token]`¶

Splits text into tokens based on types: ROMAN, BLOCK, LITERAL, and NUMBER. - use_blocks: If True (default), recognizes {...} as "as-is" blocks. In Preeti mode, this is typically set to False by the Engine.

API Reference¶

Converter Class¶

__init__(self, mode: str = "roman")¶

convert(self, text: str) -> str¶

Engine Class¶

__init__(self, trie: Optional[Trie] = None, tokenizer: Optional[Tokenizer] = None, mode: str = "roman")¶

Tokenizer Class¶

tokenize(self, text: str, use_blocks: bool = True) -> List[Token]¶