此界面旨在幫助研究人員获得普通話詞彙統計信息
An interface to aid researchers in accessing lexical statistics for Mandarin Chinese.
The goal of DoWLS is to provide word-level statistics, particularly of the phonological neighborhood variety, that vary due to syllable segmentation.
You have three choices for searching the database
1. If you would like to generate a wordlist based on word-level statistics then go to Generate a wordlist above. On that tab you can select which word-level descriptors you want, and enter ranges for numeric descriptors.
2. If you have a wordlist and need word-level statistics, go to Get word-level statistics above. On that tab you can enter your words and select the word-level descriptors you want
3. If you would like to use the raw database files then use the links below.
If you'd like help in mapping between IPA, sampa (ascii phonetic transcription), and pinyin, click on Pronunciation chart.
I'm currently having problems getting the website to let you download the full database. Until I fix this issue, feel free to contact me at karlneergaard at gmail dot com, and I will send it to you in both Excel and text files.
You can also download individual database files at this github link, where the files are available per each segmentation schema separately for words and nonwords. In each file, the lexical items are organized by their pronunciation in sampa (ascii phonetic transcription).
This website was partially made possible through funding from A*midex in association with the Laboratoire Parole et Langage.
The database relies on the syllable inventory created in Neergaard & Huang (2019). Please use the table below to help in mapping between the multiple transcriptions available in the database.
IPA | Sampa | Pinyin word | Sampa word | Ortho word | IPA | Sampa | Pinyin word | Sampa word | Ortho word | ||
---|---|---|---|---|---|---|---|---|---|---|---|
Vowels | a | a | ba3 | pa3 | 把 | Plosives | p | p | bu4 | pu4 | 不 |
ə | @ | she2 | S@2 | 蛇 | pʰ | P | pao3 | PaU3 | 跑 | ||
e | e | gei3 | keI3 | 给 | k | k | ge0 | k@0 | 个 | ||
ɛ | E | ye3 | iE3 | 也 | kʰ | K | ke4 | K@4 | 课 | ||
ɨ | ! | zhi1 | Z!1 | 之 | t | t | dou1 | toU1 | 都 | ||
i | i | di4 | ti4 | 第 | tʰ | T | ta1 | Ta1 | 他 | ||
ɪ | I | sui4 | sueI4 | 岁 | Fricatives | s | s | suo3 | suo3 | 所 | |
o | o | ruo4 | ruo4 | 若 | f | f | fang4 | faN4 | 放 | ||
ʊ | U | chou3 | CoU3 | 丑 | x | x | hui4 | xueI4 | 会 | ||
u | u | wo3 | uo3 | 我 | ʂ | S | shi4 | S!4 | 是 | ||
y | y | yuan2 | yEn2 | 元 | ɕ | X | xia4 | Xia4 | 下 | ||
Nasals | m | m | ma1 | ma1 | 妈 | Affricates | tɕ | J | jiu4 | JioU4 | 就 |
n | n | neng2 | n@N2 | 能 | tɕʰ | Q | qing3 | QiN3 | 请 | ||
ŋ | N | xiang3 | XiaN3 | 想 | tsʰ | c | cong2 | coN2 | 从 | ||
Liquids | l | l | lie4 | liE4 | 列 | tʂʰ | C | chu1 | Cu1 | 出 | |
ɹ | r | rang4 | raN4 | 让 | ts | z | zi4 | z!4 | 字 | ||
tʂ | Z | zhe4 | Z@4 | 这 |