The Ottoman Text Archival Project has its own reversible transcription system. However, for word labels, this is an overkill and requires too much work from experts.

I’m looking for a one-to-one mapping between the different visual elements of a word and its representation in UTF-8. The labels should be simple to remember, yet distinctive enough to represent visual variations of words.

I’m considering creating letter+digit codes. The letter part will reflect the most similar sound, and the digit will reflect the visual variation. In this case, I need a Latin letter for each letter in the Ottoman alphabet, but there are not enough letters in the standard Turkish alphabet to denote all the letters of Ottoman. So, how can I solve this?