Skip to content

Korean TN for Money and Telephone#324

Merged
tbartley94 merged 12 commits intoNVIDIA:ko_tn_staging_v1from
bbae0312:komoney-clean
Oct 24, 2025
Merged

Korean TN for Money and Telephone#324
tbartley94 merged 12 commits intoNVIDIA:ko_tn_staging_v1from
bbae0312:komoney-clean

Conversation

@bbae0312
Copy link

@bbae0312 bbae0312 commented Sep 16, 2025

What does this PR do ?

Add Korean Money TN (tagger + verbalizer) and Telephone TN (tagger + verbalizer)
Support KRW and major currencies with prefix/suffix (₩/KRW, US$, HK$, €, EUR, ¥, JPY, CAD, NZD, CHF, AED, Dh/DH/Dhs.).
Support telephone numbers.

Before your PR is "Ready for review"

Pre checks:

  • Have you signed your commits? Use git commit -s to sign.
  • Do all unittests finish successfully before sending PR?
    1. pytest or (if your machine does not have GPU) pytest --cpu from the root folder (given you marked your test cases accordingly @pytest.mark.run_only_on('CPU')).
    2. Sparrowhawk tests bash tools/text_processing_deployment/export_grammars.sh --MODE=test ...
  • If you are adding a new feature: Have you added test cases for both pytest and Sparrowhawk here.
  • Have you added __init__.py for every folder and subfolder, including data folder which has .TSV files?
  • Have you followed codeQL results and removed unused variables and imports (report is at the bottom of the PR in github review box) ?
  • Have you added the correct license header Copyright (c) 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved. to all newly added Python files?
  • If you copied nemo_text_processing/text_normalization/en/graph_utils.py your header's second line should be Copyright 2015 and onwards Google, Inc.. See an example here.
  • Remove import guards (try import: ... except: ...) if not already done.
  • If you added a new language or a new feature please update the NeMo documentation (lives in different repo).
  • Have you added your language support to tools/text_processing_deployment/pynini_export.py.

PR Type:

  • New Feature
  • Bugfix
  • Documentation
  • Test

If you haven't finished some of the above items you can still open "Draft" PR.

@bbae0312 bbae0312 changed the title feat(ko/money): Korean Money TN only; add data & tests; wire tagger/v… Korean TN for Money Sep 16, 2025
@bbae0312 bbae0312 changed the title Korean TN for Money Korean TN for Money and Telephone Sep 25, 2025
Copy link
Member

@tbartley94 tbartley94 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please unify documentation and elaborate when necessary. Should be properly informative with no knowledge of codebase.

Copy link
Member

@tbartley94 tbartley94 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please consult other code docs for example documentation.

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>
Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>
@github-actions
Copy link

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

@github-actions github-actions bot added the Stale label Oct 22, 2025
@tbartley94 tbartley94 merged commit fbfc92e into NVIDIA:ko_tn_staging_v1 Oct 24, 2025
2 checks passed
bbae0312 added a commit to bbae0312/NeMo-text-processing that referenced this pull request Feb 27, 2026
* feat(ko/money): Korean Money TN only; add data & tests; wire tagger/verbalizer

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix(ko/money): polish tagger/verbalizer & expand tests

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ko: add Telephone TN (tagger+verbalizer) + wire + tests; include money/test updates

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ko: refactor money/telephone taggers & verbalizers

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ko/money: use NEMO_NOT_QUOTE, lowercase space helper, trim mid optimizes

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ko: update money/telephone taggers and telephone verbalizer

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

* ko: update telephone taggers

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>

---------

Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Jinwoo Bae <bbae7050@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants