Releasing V0.0.8.9 - Minor changes in romanizer rules
ranzaka committed Jun 23, 2024
1 parent 79ea208 commit 80d140f
Showing 3 changed files with 757 additions and 3 deletions.
2 changes: 1 addition & 1 deletion pyproject.toml
@@ -1,6 +1,6 @@
[project]
name = "sinlib"
-version = "0.0.8.8"
+version = "0.0.8.9"
description = "Sinhala NLP Toolkit"
authors = [
{ name = "Ransaka", email = "ransaka.ravihara@gmail.com" }
2 changes: 1 addition & 1 deletion src/sinlib/romanize.py
@@ -34,7 +34,7 @@ def __romanize(self, text: str):
self.tokenizer.token_id_to_token_map[c] for c in encodings
]
romanized_sinhala = [
-self.char_mapper.get(ch, ch if ch in NUBERS_AND_PUNKTS.union(" ") else None)
+self.char_mapper.get(ch, ch if ch in NUBERS_AND_PUNKTS.union(" ") else '')
for ch in decoded_sinhala_chars
]
romanized_sinhala = "".join(romanized_sinhala)
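
A likely motivation for swapping the `None` fallback for `''` is that the list is passed straight to `"".join(...)`, which accepts only strings. The sketch below reproduces that behaviour in isolation. The contents of `char_mapper` and `NUBERS_AND_PUNKTS` here are hypothetical stand-ins (the real tables are not part of this diff), and `romanize_chars` is an illustrative helper, not a function in sinlib.

```python
# Hypothetical stand-ins: the real sinlib tables are not shown in this commit.
NUBERS_AND_PUNKTS = set("0123456789.,!?")
char_mapper = {"ක": "ka", "ම": "ma"}


def romanize_chars(chars, fallback):
    """Map each character; unmapped characters outside the allowed set get `fallback`."""
    allowed = NUBERS_AND_PUNKTS.union(" ")
    return [char_mapper.get(ch, ch if ch in allowed else fallback) for ch in chars]


# "x" is neither in the mapper nor in the allowed set, so it hits the fallback.
chars = ["ක", "x", "ම", " ", "1"]

# Old fallback: a None entry makes str.join raise TypeError.
try:
    "".join(romanize_chars(chars, None))
    old_result = "joined"
except TypeError:
    old_result = "TypeError"

# New fallback: the unmapped character is dropped and the join succeeds.
new_result = "".join(romanize_chars(chars, ""))
```

With the empty-string fallback, characters the romanizer does not recognise are silently dropped from the output rather than aborting the whole call.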