Cedict is a dictionary project for the Chinese language. This is the Chinese-English part of the project. There exist a French version and other languages too.
Here you'll find programs to convert the original format in json and csv format as well as the corresponding formatted files.

Eric Streit 279b558b25 ajout de programmes pour générer les fichiers pour EasyOCR 4 years ago
Liste-mots-formatés 65af6ea0c6 modification du programme pour traiter le pinyin dans les traductions 5 years ago
Programmes 279b558b25 ajout de programmes pour générer les fichiers pour EasyOCR 4 years ago
Programmes1 65af6ea0c6 modification du programme pour traiter le pinyin dans les traductions 5 years ago
.gitignore f84b16e836 premier commit 5 years ago
README.md 185898bc4a README 5 years ago

README.md

Cedict project

Cedict is a free Chinese-English dictionary project. The base is used by many projects, on desktop, web and mobile platforms.

I wrote some programs (nodejs/javascript) to transform the text-based format in formats more suitable to my projects; there exists a php prgram to convert the original format into a xml format.

My programs format the dictionary into a json and .csv format.

I process the parts written in numbered pinyin to accentued pinyin; here is an example, with the original line and the CVS formatted one:

 K金 K金 [K jin1] /see 開金|开金[kai1 jin1]/

 K金 K金  k jīn   see 開金|开金[kāi jīn]

Here is an excerpt of the json formatted file, with the same part as above:

{
    "hanzi": "K金",
    "traditional": "K金",
    "pinyin": "k jīn",
    "translations": [
      "see 開金|开金[kāi jīn]"
    ]
}