
Cedict project

Cedict is a free Chinese-English dictionary project. Its data is used by many projects on desktop, web, and mobile platforms.

I wrote some programs (Node.js/JavaScript) to transform the text-based format into formats more suitable for my projects; there also exists a PHP program to convert the original format into an XML format.

My programs convert the dictionary into JSON and CSV formats.

I convert the parts written in numbered pinyin to accented pinyin; here is an example, with the original line and the CSV-formatted one:

 K金 K金 [K jin1] /see 開金|开金[kai1 jin1]/

 K金 K金  k jīn   see 開金|开金[kāi jīn]
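The numbered-to-accented conversion can be sketched as follows. This is a minimal illustration, not the project's actual code: the names `convertSyllable`, `convertPinyin`, and `convertEmbedded` are hypothetical, the output is lowercased because the example above shows `K` becoming `k`, and `u:` is assumed to be the way the source writes `ü`.

```javascript
// Tone-marked forms of each vowel, indexed by tone number minus one (tones 1-4).
const TONE_MARKS = {
  a: ["ā", "á", "ǎ", "à"],
  e: ["ē", "é", "ě", "è"],
  i: ["ī", "í", "ǐ", "ì"],
  o: ["ō", "ó", "ǒ", "ò"],
  u: ["ū", "ú", "ǔ", "ù"],
  ü: ["ǖ", "ǘ", "ǚ", "ǜ"],
};

// Convert one numbered syllable, e.g. "jin1" -> "jīn" (hypothetical helper).
function convertSyllable(syllable) {
  const lower = syllable.toLowerCase(); // assumption: output is lowercased
  const match = /^([a-z:]+)([1-5])$/.exec(lower);
  if (!match) return lower; // not numbered pinyin, e.g. the bare letter "K"
  const letters = match[1].replace(/u:/g, "ü"); // assumption: "u:" stands for "ü"
  const tone = Number(match[2]);
  if (tone === 5) return letters; // neutral tone carries no mark

  // Placement rule: "a" or "e" takes the mark; in "ou" the "o" takes it;
  // otherwise the last vowel does.
  let index = letters.search(/[ae]/);
  if (index === -1) index = letters.indexOf("ou");
  if (index === -1) {
    for (let i = letters.length - 1; i >= 0; i--) {
      if ("iouü".includes(letters[i])) {
        index = i;
        break;
      }
    }
  }
  if (index === -1) return letters; // no vowel to mark
  const marked = TONE_MARKS[letters[index]][tone - 1];
  return letters.slice(0, index) + marked + letters.slice(index + 1);
}

// Convert a space-separated pinyin field, e.g. "K jin1" -> "k jīn".
function convertPinyin(text) {
  return text.split(" ").map(convertSyllable).join(" ");
}

// Convert pinyin embedded in square brackets inside a definition.
function convertEmbedded(text) {
  return text.replace(/\[([^\]]+)\]/g, (_, inner) => "[" + convertPinyin(inner) + "]");
}

console.log(convertPinyin("K jin1"));
console.log(convertEmbedded("see 開金|开金[kai1 jin1]"));
```

Using arrays of precomposed characters keeps each lookup to a single character, so the result does not rely on combining diacritics.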

Here is an excerpt of the JSON-formatted file, with the same entry as above:

{
    "hanzi": "K金",
    "traditional": "K金",
    "pinyin": "k jīn",
    "translations": [
      "see 開金|开金[kāi jīn]"
    ]
}
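Building such an object from a source line could look roughly like the sketch below. It is an illustration under stated assumptions, not the project's actual code: `parseLine` is a hypothetical name, the source line is taken to follow the `Traditional Simplified [pinyin] /definition/.../` layout, and `hanzi` is assumed to hold the simplified form (the example entry is identical in both scripts, so it does not settle this). The optional second argument is a hypothetical hook for a pinyin converter; by default the pinyin is left untouched.

```javascript
// Parse one source line of the form:
//   Traditional Simplified [pin1 yin1] /definition 1/definition 2/
// Returns null for lines that do not match (e.g. "#" comment lines).
function parseLine(line, convertPinyin = (text) => text) {
  const match = /^(\S+)\s+(\S+)\s+\[([^\]]*)\]\s+\/(.*)\/\s*$/.exec(line.trim());
  if (!match) return null;
  const [, traditional, simplified, pinyin, definitions] = match;
  return {
    hanzi: simplified, // assumption: "hanzi" is the simplified form
    traditional,
    pinyin: convertPinyin(pinyin),
    // Definitions are separated by "/" in the source line.
    translations: definitions.split("/"),
  };
}

const entry = parseLine(" K金 K金 [K jin1] /see 開金|开金[kai1 jin1]/");
console.log(JSON.stringify(entry, null, 2));
```

Passing a converter as the second argument, and applying the same conversion to bracketed pinyin inside each translation, would yield accented output like the excerpt above.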