I transformed the indexes of the Shtooka Chinese repositories in json format, so I can easily link the audio files to the Chinese words.

Eric Streit cd8fb028ef premier commit sur atom 3 years ago
Programmes cd8fb028ef premier commit sur atom 3 years ago
Repositories cd8fb028ef premier commit sur atom 3 years ago
.gitignore cd8fb028ef premier commit sur atom 3 years ago
README.md cd8fb028ef premier commit sur atom 3 years ago

README.md

The Shtooka repositories

The Shtooka project (or the copy of it on my server Yojik Shtooka backup) was created in 2006. The goal of the project was to record in high audio quality lists of words, sentences in different languages.

A lot of Chinese words were recorded, taggued with linguistic and technical information.

These recordings are downloadble and can be freely used.

I formatted the Chinese index files of the 4 Chinese collections, so I can easily connect the audio files and the words in different words lists, typically, HSK1 to HSK6.

I wrote the programs to read the files and make the connection automatiquely. The missing recordings are displayed in the console, and a script can use Google TTS to generate the missing sounds.