The http://learnchinese888.com/ website provides vocabulary lists and examples lists.

Eric Streit bede949ece ajout de .gitignore 4 years ago
Listes-mots-formatés 4e8aff29b4 ajout du traitement des conversations, programmes, listes 5 years ago
Medias 99eabb3062 ajout des lsites de vocabulaires avec exemples, programmes d etraitement, readme 5 years ago
Programmes 4e8aff29b4 ajout du traitement des conversations, programmes, listes 5 years ago
.gitignore bede949ece ajout de .gitignore 4 years ago
A-FAIRE.txt 99eabb3062 ajout des lsites de vocabulaires avec exemples, programmes d etraitement, readme 5 years ago
README.md cdfc640dbb modification du README 5 years ago

README.md

Mandarin 888

The Learn Chinese 888 site provides hsk1 to hsk4 lists of words with English translation.

They provide for each word in the list one example.

They also have Conversations which are more complex examples for the different HSK levels.

The lists are provided in xls, pdf and doc formats.

I transformed them to have more fields like the traditional characters version of the hanzi, and to get more output formats like json, xml and csv.

Examples of the vocabulary part:

{
      "numero": "2",
      "hanzi": "机场",
      "traditional": "機場",
      "pinyin": "jīchǎng",
      "translation": "Airport.",
      "exampleHanzi": "国际机场",
      "exampleTraditional": "國際機場",
      "examplePinyin": "Guójì jīchǎng",
      "exampleTranslation": "International Airport.",
      "lesson": "HSK2",
      "origin": "Mandarin888"
    }

The conversation parts got audio with at least 2 voices at different speed. I still don't know how to integrate these parts into the csv, json and xml files.

I converted the original files into the usual json, csv and xml versions, added the traditional writing et some formatting.