Something I’ve been thinking about recently is how difficult translation is. I’ve been slowly making my way through Douglas Hofstadter’s Gödel, Escher, Bach, and he has a whole chapter devoted to translation. However, I think he fails to recognize how meaning can be imbued or obscured by the act of translation. Equivalence is a very difficult property, and this holds true for translations. Proclaiming that two translation is a lost cause. Consider the following criteria for determining “equivalence” of two translations:
- Show that there is a one-to-one matching of semantically important words in the translation
- Show that all people who know both languages agree that both are semantically equivalent
- Show that all people who know one language have the same reaction to the translation in the language that they know.
These three things are a very basic test of equivalence, and yet they would require giant confirmations. I note that I relaxed requirement 1 quite a bit (by specifying semantically important words rather than all words). It could be relaxed further by requiring a matching to a common synonym. Regardless, it is already hard to prove “equivalence” solely on semantics without discussing other features of natural language (rhythm, assonance, alliteration, imagery, and everything else your 10th grade English teacher taught you). As an example of this, consider the Chinese proverb “富不过三代” which is roughly translated as “Wealth doesn’t last three generations.” Google spits out a translation of “Not rich for three generations” which to be fair is the literal translation of the characters. To give Google a break, it’s hard to reason about sayings and idioms and figurative language. However, even this simple Chinese phrase provides immense complexity for one of the most powerful translation tools in the world. The first result when searching “富不过三代 translation” on Google sends you to a site that provides a translation of “Great men’s sons seldom do well.” which is somewhat similar to the original proverb, says nothing about the subsequent (third) generation.
Clearly, translation is no simple feat. And yet on the other hand, we are constantly translating our own thoughts into words and sentences that are understandable to those around us. Think of how you might use the following phrases in different contexts:
- I would like some food please.
- Feed me!
- I am so hungry.
- Could you please pass the spaghetti?
All of these indicate that the speaker wants sustenance. However, they vary drastically in tone, formality, and length. And yet in conversation we never consciously recognize the near instant translation of our innermost thoughts into the words we think our audience will best understand.
I couldn’t write this post without talking about one of my favorite subjects: compilers. Compilers are the programs that take code written in one language (C, C++, Java, OCaml) and translate it into a machine understandable language (i.e. binary). This is usually done through various steps and intermediate representations of the program breaking down structures through small translations. However, if each step is considered a translation, then compilation has a lot of room for lost meaning. Consider for example, comments in a language. Comments provide very helpful documentation and describe what a programmer is trying to do. They’re usually removed immediately by the compiler as they contribute no semantic meaning. Should we consider two programs “equivalent” solely based on semantic equivalence or extensional equivalence? I know most software managers would not agree with you if you tried to argue your code without comments is fine because it’s extensionally equivalent to one with comments. Or consider programs that are semantically equivalent, but are on opposite sides of the great tabs vs. spaces debate. Those programs would be clearly semantically equivalent as well, but would be very different to certain people who feel strongly about their white space.
-Semantics