I scrapped the Wiktionary Chinese Frequency list pages to build the list in other formats more suitable for my programs, uses.

Eric Streit 5ddb008d43 adding a README 6 years ago
Liste-mots-formatés 334820654e premier commit 6 years ago
Programmes 334820654e premier commit 6 years ago
.gitignore 334820654e premier commit 6 years ago
README.md 5ddb008d43 adding a README 6 years ago

README.md

Wiktionary Chinese Fraquency list

Wiktionary keeps pages on frequency lists for various languages.

I scrapped the Chinese frequency lists to build a list in different formats more suitable for my needs, like importing to Anki, adding sounds, import into a database (MongoDB).

Here is an example for the Json format:

{
      "hanzi": "一",
      "traditional": "一",
      "pinyin": "yī",
      "translation": "det.: one",
      "frequence": 1,
      "origine": "Wiktionary"
    }

To import this list into MOngoDB was straightforward.

The CVS version is more suitable to be imported into Anki.

        det.: one   Wiktionary  1