I scrapped the Wiktionary Chinese Frequency list pages to build the list in other formats more suitable for my programs, uses.

Eric Streit 9760b021f4 correction typo dans le readme 5 years ago
Liste-mots-formatés 334820654e premier commit 5 years ago
Programmes 334820654e premier commit 5 years ago
.gitignore 334820654e premier commit 5 years ago
README.md 9760b021f4 correction typo dans le readme 5 years ago

README.md

Wiktionary Chinese Frequency list

Wiktionary keeps pages on frequency lists for various languages.

I scrapped the Chinese frequency lists to build a list in different formats more suitable for my needs, like importing to Anki, adding sounds, import into a database (MongoDB).

Here is an example for the Json format:

{
      "hanzi": "一",
      "traditional": "一",
      "pinyin": "yī",
      "translation": "det.: one",
      "frequence": 1,
      "origine": "Wiktionary"
    }

To import this list into MOngoDB was straightforward.

The CVS version is more suitable to be imported into Anki.

一   一   yī  det.: one   Wiktionary  1