Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

「皁/皂」字唔跟OpenCC標準 #196

Closed
russell-liu opened this issue Nov 12, 2024 · 3 comments · Fixed by #197
Closed

「皁/皂」字唔跟OpenCC標準 #196

russell-liu opened this issue Nov 12, 2024 · 3 comments · Fixed by #197
Assignees
Labels
dict 錯字錯音、詞條修正問題

Comments

@russell-liu
Copy link
Contributor

russell-liu commented Nov 12, 2024

問題描述

根據OpenCC標準: https://github.com/BYVoid/OpenCC/blob/ver.1.1.9/data/scheme/st_multi.txt#L166 ,「皂」專指肥皂;但essay-cantonese.txt同jyut6ping3.(words|phrase).dict.yaml入面「皂」完全冇出現,全部都係用「皁」。

修改意見

有肥皂意思時將「皁」改成「皂」。

@russell-liu russell-liu added the dict 錯字錯音、詞條修正問題 label Nov 12, 2024
@laubonghaudoi
Copy link
Member

多謝報告,可唔可以幫手開埋個PR改?

@russell-liu
Copy link
Contributor Author

可以,不過有兩個問題:

  1. essay-cantonese.txt之中淨係得個「皁」字個行點改?係唔改、改成「皂」、定添加「皂」?如果添加,詞頻點處理?
  2. essay-cantonese.txt嘅行列用咩標準排列?即係話,部份「皁」改爲「皂」後,係咪要將「皂」字頭詞排晒喺「皁」字頭詞後面?

@laubonghaudoi
Copy link
Member

應該係乜都唔需要理,直接 find and replace 就可以嘅,順序嗰啲全部保持原樣就得。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dict 錯字錯音、詞條修正問題
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants