I scrapped the Wiktionary Chinese Frequency list pages to build the list in other formats more suitable for my programs, uses.
Eric Streit 9760b021f4 correction typo dans le readme | 5 years ago | |
---|---|---|
Liste-mots-formatés | 5 years ago | |
Programmes | 5 years ago | |
.gitignore | 5 years ago | |
README.md | 5 years ago |
Wiktionary keeps pages on frequency lists for various languages.
I scrapped the Chinese frequency lists to build a list in different formats more suitable for my needs, like importing to Anki, adding sounds, import into a database (MongoDB).
Here is an example for the Json format:
{
"hanzi": "一",
"traditional": "一",
"pinyin": "yī",
"translation": "det.: one",
"frequence": 1,
"origine": "Wiktionary"
}
To import this list into MOngoDB was straightforward.
The CVS version is more suitable to be imported into Anki.
一 一 yī det.: one Wiktionary 1