DING Chenchen | 丁 尘辰 | 丁 塵辰 | テイ ジンシン
Last updated: 2024/05/28
Affiliation & Contact
Education & Employment
- 2024-04 -- now :
Associate Professor, Multilingual Natural Language Processing Laboratory, Division of Information Science, NAIST
- 2022-04 -- now :
Senior Researcher (tenured), ATT, ASTREC, NICT
- 2018-04 -- 2022-03 :
Researcher (tenure-track), ATT, ASTREC, NICT
- 2015-04 -- 2018-03 :
Researcher (fixed-term), ATT, ASTREC, NICT
- 2015-03 :
Ph.D. in Engineering, University of Tsukuba
- 2014-04 -- 2014-09 :
Researcher (cooperative), Multi-Lingual Translation Laboratory, NICT
- 2012-07 -- 2013-03 :
Intern, Natural Language Computing Group, Microsoft Research Asia
- 2012-03 :
M.E., University of Tsukuba
Selected Refereed Publication
-
Journal
-
Yuqin Lin, Jianwu Dang, Longbiao Wang, Sheng Li, and Chenchen Ding.
Disordered Speech Recognition Considering Low Resources and Abnormal Articulation.
Speech Communication, Vol. 155, 103002, 2023.
-
Abhisek Chakrabarty, Raj Dabre, Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Low-resource Multilingual Neural Translation Using Linguistic Feature-based Relevance Mechanisms.
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 22 Issue 7, Article No. 191, 2023.
-
Xinglin Lyu, Junhui Li, Min Zhang, Chenchen Ding, Hideki Tanaka, and Masao Utiyama.
Refining History for Future-aware Neural Machine Translation.
IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol.31, pp 500 -- 512, 2023.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Inputting Writing Systems with Medium Complexity: A Generalized Input Method Editor AKKHARA and Case Study on Myanmar Script.
Journal of Natural Language Processing, Vol. 29 Issue 4, pp 1254 -- 1271, 2022.
-
Hour Kaing, Chenchen Ding*, Masao Utiyama, Eiichiro Sumita, Sethserey Sam, Sopheap Seng, Katsuhito Sudoh, and Satoshi Nakamura.
Towards Tokenization and Part-of-Speech Tagging for Khmer: Data and Discussion.
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 20 Issue 6, Article No. 104, 2021.
# * co-first and corresponding author
-
Chenchen Ding, Sann Su Su Yee, Win Pa Pa, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita.
A Burmese (Myanmar) Treebank: Guideline and Analysis.
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 19 Issue 3, Article No. 40, 2020.
-
Chenchen Ding, Hnin Thu Zar Aye, Win Pa Pa, Khin Thandar Nwet, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita.
Towards Burmese (Myanmar) Morphological Analysis: Syllable-based Tokenization and Part-of-Speech Tagging.
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 19 Issue 1, Article No. 5, 2019.
# The data in footnote 5 are old version. Please refer to the ALT page for the latest data.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
NOVA: A Feasible and Flexible Annotation System for Joint Tokenization and Part-of-Speech Tagging.
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 18 Issue 2, Article No. 17, 2018.
# The data in footnote 10 are old version. Please refer to the ALT page for the latest data.
-
Chenchen Ding, Ye Kyaw Thu, Masao Utiyama, and Eiichiro Sumita.
Word Segmentation for Burmese (Myanmar).
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 15 Issue 4, Article No. 22, 2016.
-
Chenchen Ding, Keisuke Sakanushi, Hirona Touji, and Mikio Yamamoto.
Inter-, Intra-, and Extra-Chunk Pre-Ordering for Statistical Japanese-to-English Machine Translation.
ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 15 Issue 3, Article No. 20, 2016.
# This work was mainly done in the doctoral course, though the acceptance and publication were far after my graduation.
-
Chenchen Ding and Mikio Yamamoto.
A Generative Dependency N-gram Language Model: Unsupervised Parameter Estimation and Application.
Journal of Natural Language Processing, Vol. 21, No. 5, pp. 981--1009, 2014.
# Some errors in the IJCNLP2013 paper are fixed.
-
Conference
-
Hour Kaing, Chenchen Ding, Hideki Tanaka, and Masao Utiyama.
Robust Neural Machine Translation for Abugidas by Glyph Perturbation.
In Proc. of EACL, Vol. 2, pp.311--318, 2024.
-
Jiannan Mao, Chenchen Ding, Hour Kaing, Hideki Tanaka, Masao Utiyama, and Tadahiro Matsumoto.
Improving Zero-Shot Dependency Parsing by Unsupervised Learning.
In Proc. of PACLIC, pp. 217--226, 2023.
-
Van-Hien Tran, Chenchen Ding, Hideki Tanaka, and Masao Utiyama.
Improving Embedding Transfer for Low-Resource Machine Translation.
In Proc. of MT Summit, pp. 123--134, 2023.
-
Qianying Liu, Yuhang Yang, Zhuo Gong, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, and Sadao Kurohashi.
Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition.
In Proc. of ICASSP, pp. 1--5, 2023.
-
Abhisek Chakrabarty, Raj Dabre, Chenchen Ding, Hideki Tanaka, Masao Utiyama, and Eiichiro Sumita.
FeatureBART: Linguistic Features Based Sequence-to-Sequence Pre-Training for Low-Resource NMT.
In Proc. of COLING, pp. 5014--5020, 2022.
-
Yongjie Lü, Longbiao Wang, Sheng li, Chenchen Ding, Jianwu Dang, and Kiyoshi Honda.
Compressing Transformer-Based ASR Model by Task-Driven Loss and Attention-Based Multi-Level Feature Distillation.
In Proc. of ICASSP, pp. 7992--7996, 2022.
-
Abhisek Chakrabarty, Raj Dabre, Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Improving Low-Resource NMT through Relevance Based Linguistic Features Incorporation.
In Proc. of COLING, pp. 4263--4274, 2020.
-
Yuqin Lin, Longbiao Wang, Sheng Li, Jianwu Dang, and Chenchen Ding.
Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription.
In Proc. of INTERSPEECH, pp. 4791--4795, 2020.
-
Hao Shi, Longbiao Wang, Sheng Li, Chenchen Ding, Meng Ge, Nan Li, Jianwu Dang, and Hiroshi Seki.
Singing Voice Extraction with Attention based Spectrograms Fusion.
In Proc. of INTERSPEECH, pp. 2412--2416, 2020.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
A Three-Parameter Rank-Frequency Relation in Natural Languages.
In Proc. of ACL, pp. 460--464, 2020.
# The $\gamma$ in this paper is not used consistently. Generally, $\gamma$ is always in the same scale of the rank $r$.
# In Fig. 1 and Sec. 3, the $\gamma$ is in logarithmic scale, where $\gamma = \log (r_{max}) / 2$ for initialization.
# In the axis of $\beta + \gamma$, the $\gamma$ is also in the logarithmic scale.
-
Aye Myat Mon, Chenchen Ding*, Hour Kaing, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita.
A Myanmar (Burmese)-English Named Entity Transliteration Dictionary.
In Proc. of LREC, pp. 2973--2976, 2020.
# * corresponding author
-
Yuqin Lin, Longbiao Wang, Jianwu Dang, Sheng Li, and Chenchen Ding.
End-to-End Articulatory Modeling for Dysarthria Articulatory Attribute Detection.
In Proc. of ICASSP, pp. 7349--7353, 2020.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
MY-AKKHARA: A Romanization-based Burmese (Myanmar) Input Method.
In Proc. of EMNLP-IJCNLP, System Demonstrations, pp. 157--162, 2019.
# The data in footnote 5 are old version. Please refer to the ALT page for the latest data.
# Please refer to the MY-AKKHARA page for more details.
We thank Prof. Win Pa Pa for her help in Myanmar translation.
-
Sheng Li, Xugang Lu, Chenchen Ding, Peng Shen, Tatsuya Kawahara, and Hisashi Kawai.
Investigating Radical-based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese.
In Proc. of INTERSPEECH, pp. 2200--2204, 2019.
-
Sheng Li, Chenchen Ding, Xugang Lu, Peng Shen, Tatsuya Kawahara, and Hisashi Kawai.
End-to-End Articulatory Attribute Modeling for Low-resource Multilingual Speech Recognition.
In Proc. of INTERSPEECH, pp. 2145--2149, 2019.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Simplified Abugidas.
In Proc. of ACL, Vol. 2, pp. 491--495, 2018.
# A newspaper article about this work:
アブギダ系文字、入力効率化, NICT先端研究/情通機構(88), 日刊工業新聞 23面, May 21, 2019.
-
Chenchen Ding, Win Pa Pa, Masao Utiyama, and Eiichiro Sumita.
Burmese (Myanmar) Name Romanization: A Sub-syllabic Segmentation Scheme for Statistical Solutions.
In Proc. of PACLING, CCIS 781, pp. 191--202, 2018.
#data (or please contact me to get the data)
-
Chenchen Ding, Vichet Chea, Masao Utiyama, Eiichiro Sumita, Sethserey Sam, and Sopheap Seng.
Statistical Khmer Name Romanization.
In Proc. of PACLING, CCIS 781, pp. 179--190, 2018.
#data (or please contact me to get the data)
# Best Paper Award of PACLING 2017
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Improving fast_align by Reordering.
In Proc. of EMNLP, pp. 1034--1039, 2015.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Document-level Re-ranking with Soft Lexical and Semantic Features for Statistical Machine Translation.
In Proc. of AMTA, Vol. 1, pp. 110--123, 2014.
-
Chenchen Ding and Yuki Arase.
Dependency Tree Abstraction for Long-Distance Reordering in Statistical Machine Translation.
In Proc. of EACL, pp. 424--433, 2014.
-
Chenchen Ding and Mikio Yamamoto.
An Unsupervised Parameter Estimation Algorithm for a Generative Dependency N-gram Language Model.
In Proc. of IJCNLP, pp. 516--524, 2013.
# Statement around footnote 5 is wrong. Please refer to footnote 4 of the JNLP2014 paper.
# Exp. (6) is wrong. Please refer to Exp. (12) of the JNLP2014 paper.
-
Workshop
-
Jiannan Mao, Chenchen Ding, Hour Kaing, Hideki Tanaka, Masao Utiyama, and Tadahiro Matsumoto.
Overcoming Early Saturation on Low-Resource Languages in Multilingual Dependency Parsing.
In Proc. of MWE-UD, pp. 63--69, 2024.
-
Chenchen Ding, Ye Kyaw Thu, Masao Utiyama, Andrew Finch, and Eiichiro Sumita.
Empirical Dependency-Based Head Finalization for Statistical Chinese-, English-, and French-to-Myanmar (Burmese) Machine Translation.
In Proc. of IWSLT, pp. 184--191, 2014.
# If some characters disappear in Fig. 5, Fig. 7, and appendix A, please try this one.
-
Chenchen Ding, Takashi Inui, and Mikio Yamamoto.
Long-Distance Hierarchical Structure Transformation Rules Utilizing Function Words.
In Proc. of IWSLT, pp. 159--166, 2011.
Other Publication
-
2024
-
Jiannan Mao, Chenchen Ding, Hour Kaing, Hideki Tanaka, Masao Utiyama, and Tadahiro Matsumoto.
Improving Zero-Shot Dependency Parsing by Unsupervised Learning.
言語処理学会第30回年次大会発表論文集, pp. 1645--1649, 2024.
-
Hour Kaing, Chenchen Ding, Haiyue Song, Jiannan Mao, Hideki Tanaka, and Masao Utiyama.
Robust Neural Machine Translation for Abugidas by Glyph Perturbation.
言語処理学会第30回年次大会発表論文集, pp. 560--565, 2024.
-
2022
-
2021
-
Kak Soky, Masato Mimura, Chenhui Chu, Tatsuya Kawahara, Sheng Li, Chenchen Ding, and Sethserey Sam.
TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies.
International Journal of Asian Language Processing, Vol. 31, No. 03/04, 2021.
-
Hour Kaing, Chenchen Ding, Katsuhito Sudoh, Masao Utiyama, Eiichiro Sumita, and Satoshi Nakamura.
Multi-Source Cross-Lingual Constituency Parsing.
In Proc. of ICON, pp. 341--346, 2021.
-
Dawei Liu, Longbiao Wang, Sheng Li, Haoyu Li, Chenchen Ding, Ju Zhang, and Jianwu Dang.
Exploring Effective Speech Representation via ASR for High-Quality End-to-End Multispeaker TTS.
In Proc. of ICONIP, CCIS 1517, pp. 110--118, 2021.
-
Kak Soky, Masato Mimura, Tatsuya Kawahara, Sheng Li, Chenchen Ding, Chenhui Chu, and Sethserey Sam.
Khmer Speech Translation Corpus of The Extraordinary Chambers in The Courts of Cambodia (ECCC).
In Proc. of O-COCOSDA, pp. 122--127, 2021.
-
Hour Kaing, Chenchen Ding, Masao Utiyama, Eiichiro Sumita, Katsuhito Sudoh, and Satoshi Nakamura.
Constituency Parsing by Cross-Lingual Delexicalization.
IEEE Access, Vol. 9, pp. 141571--141578, 2021.
-
Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi.
Overview of the 8th Workshop on Asian Translation.
In Proc. of WAT, pp. 1--45, 2021.
-
2020
-
Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi.
Overview of the 7th Workshop on Asian Translation.
In Proc. of WAT, pp. 1--44, 2020.
-
Aye Thida, Nway Nway Han, Sheinn Thawtar Oo, Sheng Li, and Chenchen Ding.
VOIS: The First Speech Therapy App in the World for Myanmar Hearing-Impaired Children.
In Proc. of O-COCOSDA, pp. 151--154, 2020.
-
2019
-
Rui Wang, Haipeng Sun, Kehai Chen, Chenchen Ding, Masao Utiyama, Eiichiro Sumita.
English-Myanmar Supervised and Unsupervised NMT: NICT’s Machine Translation Systems at WAT-2019.
In Proc. of WAT, pp. 90--93, 2019.
-
Benjamin Marie, Hour Kaing, Aye Myat Mon, Chenchen Ding, Atsushi Fujita, Masao Utiyama, Eiichiro Sumita.
Supervised and Unsupervised Machine Translation for Myanmar-English and Khmer-English.
In Proc. of WAT, pp. 68--75, 2019.
-
Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi.
Overview of the 6th Workshop on Asian Translation.
In Proc. of WAT, pp. 1--35, 2019.
-
丁 塵辰, 内山 将夫, 隅田 英一郎.
ローマ字によるビルマ文字入力方式.
言語処理学会第25回年次大会発表論文集, pp. 1427--1430, 2019.
-
Sann Su Su Yee, Chenchen Ding, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita.
Modifying NOVA-annotated Myanmar Data to Universal Part-of-Speech Tagset.
In Proc. of ICCA (Myanmar), pp. 230--237, 2019.
-
2018
-
Rui Wang, Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
English-Myanmar NMT and SMT with Pre-ordering: NICT’s Machine Translation Systems at WAT-2018.
In Proc. of PACLIC, pp. 972--974, 2018.
-
Toshiaki Nakazawa, Katsuhito Sudoh, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, and Sadao Kurohashi.
Overview of the 5th Workshop on Asian Translation.
In Proc. of PACLIC, pp. 904--944, 2018.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Structured Common Subsequences for Automatic Machine Translation Evaluation.
言語処理学会第24回年次大会発表論文集, pp. 837--840, 2018.
-
2017
-
Toshiaki Nakazawa, Shohei Higashiyama, Chenchen Ding, Hideya Mino, Isao Goto, Hideto Kazawa, Yusuke Oda, Graham Neubig, and Sadao Kurohashi.
Overview of the 4th Workshop on Asian Translation.
In Proc. of WAT, pp. 1--54, 2017.
-
Chenchen Ding, Vichet Chea, Win Pa Pa, Masao Utiyama, and Eiichiro Sumita.
Statistical Romanization for Abugida Scripts: Data and Experiment on Khmer and Burmese.
言語処理学会第23回年次大会発表論文集, pp. 234--237, 2017.
-
Hnin Thu Zar Aye, Chenchen Ding, Win Pa Pa, Khin Thandar Nwet, Masao Utiyama, and Eiichiro Sumita.
English-to-Myanmar Statistical Machine Translation Using a Language Model on Part-of-Speech in Decoding.
In Proc. of ICCA (Myanmar), pp. 409--415, 2017.
-
2016
-
Hour Kaing, Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Improving English-to-Khmer Statistical Machine Translation Using Part-of-Speech Information.
In Proc. of KNLP (Cambodia), 2016.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
Similar Southeast Asian Languages: Corpus-Based Case Study on Thai-Laotian and Malay-Indonesian.
In Proc. of WAT, pp. 149--156, 2016.
-
Toshiaki Nakazawa, Chenchen Ding, Hideya Mino, Isao Goto, Graham Neubig, and Sadao Kurohashi.
Overview of the 3rd Workshop on Asian Translation.
In Proc. of WAT, pp. 1--46, 2016.
-
Hammam Riza, Michael Purwoadi, Gunarso, Teduh Uliniansyah, Aw Ai Ti, Sharifah Mahani Aljunied,
Luong Chi Mai, Vu Tat Thang, Nguyen Phuong Thai, Vichet Chea, Rapid Sun, Sethserey Sam, Sopheap Seng,
Khin Mar Soe, Khin Thandar Nwet, Masao Utiyama, and Chenchen Ding.
Introduction of the Asian Language Treebank.
In Proc. of O-COCOSDA, pp. 1--6, 2016.
-
Chenchen Ding, Ye Kyaw Thu, Masao Utiyama, and Eiichiro Sumita.
Parsing Myanmar (Burmese) by Using Japanese as a Pivot. (camera-ready version)
In Proc. of ICCA (Myanmar), pp. 158--162, 2016.
-
2015
-
Chenchen Ding, Vichet Chea, Masao Utiyama, and Eiichiro Sumita.
Reverse Pre-reordering for SMT from Head-final to Head-initial Languages: A Case Study on Japanese-to-Khmer.
In Proc. of KNLP (Cambodia), 2015.
-
Vichet Chea, Ye Kyaw Thu, Chenchen Ding, Masao Utiyama, Andrew Finch, and Eiichiro Sumita.
Khmer Word Segmentation Using Conditional Random Fields.
In Proc. of KNLP (Cambodia), 2015.
-
Chenchen Ding, Masao Utiyama, and Eiichiro Sumita.
NICT at WAT 2015.
In Proc. of WAT, pp. 42--47, 2015.
-
Chenchen Ding, Ye Kyaw Thu, Eiichiro Sumita, and Yoshinori Sagisaka.
Transcribing Chinese into Myanmar (Burmese) Script.
In Proc. of ICCA (Myanmar), pp. 174--180, 2015.
# camera-ready version with a neater layout
-
2014
-
Chenchen Ding, Masao Utiyama, Eiichiro Sumita, and Mikio Yamamoto.
Word Order Does NOT Differ Significantly Between Chinese and Japanese.
In Proc. of WAT, pp. 77--82, 2014.
-
Chenchen Ding and Mikio Yamamoto.
To Filter Discontinuous Word Alignment for Statistical Machine Translation.
In Proc. of ICALIP (Shanghai), pp. 449--453, 2014.
# There is a typo in the title on the linked page... (camera-ready version, IEEE copyright)
-
谷口 正訓, 丁 塵辰, 山本 幹雄.
英日機械翻訳のための主辞後置事前並べ替えにおける限量詞移動の改善.
情報処理学会第13回情報科学技術フォーラム講演論文集, 第2分冊, pp. 251--252, 2014.
-
丁 塵辰, 酒主 佳祐, 通事 寛奈, 山本 幹雄.
統計的日英翻訳における依存構造に基づく事前並べ替えルール.
言語処理学会第20回年次大会発表論文集, pp. 963--966, 2014.
-
丁 塵辰, 内山 将夫, 吉田 光男, 山本 幹雄.
対訳文書からのモデル学習:日韓・韓日統計的機械翻訳.
言語処理学会第20回年次大会発表論文集, pp. 820--823, 2014.
-
Zhongyuan Zhu, Masanori Taniguchi, Chenchen Ding, and Mikio Yamamoto.
A Preordering Method Using Head-Restructured CFG Parse Tree for SMT.
言語処理学会第20回年次大会発表論文集, pp. 594--597, 2014.
-
2012