January 6 video montage7/1/2023 ![]() ![]() ![]() As you can see, the structure in Spanish is really close to the structure in English. This visualization was made using gradient descent to optimize a linear transformation between the source and destination language word vectors. A nice visualization of this feature is the one below from the paper “ Exploiting Similarities among Languages for Machine Translation“: I pretend to try to use the words found by Stephen Bax in the future to check if it is possible to capture some transformation that could lead to find similar structures with other languages. Word vectors can also be used (surprise) for translation, and this is the feature of the word vectors that I think that its most important when used to understand text where we know some of the words translations. These vectors also approximates same-meaning words together, allowing similarity queries like in the example below: > model.most_similar("man") We can see that those vectors can be used in vector operations to extract information about the regularities of the captured linguistic semantics. These word vectors, after trained, carry with them a lot of semantic meaning. ![]() Here is a visualization of this architecture from the TensorFlow site: It is a unsupervised technique that uses supervided learning tasks to learn the linguistic context of the words. Word embeddings are created by using a shallow neural network architecture. My idea when I heard about the work of Stephen Bax was to try to capture the patterns of the text using word2vec. There is also a transcription of the manuscript done thanks to the hard-work of many folks working on it since many moons ago. In 2014, Stephen Bax proposed a provisional, partial decoding of the manuscript, the video of his presentation is very interesting and I really recommend you to watch if you like this codex. This hypothesis states that the codex isn’t ciphered, it states that the codex was just written in an unknown language that disappeared due to a culture extinction. The manuscript itself is always subject of a lot of different hypothesis, including the one that I like the most which is the “culture extinction” hypothesis, supported in 2014 by Stephen Bax. The manuscript is known to be written in two different languages (Language A and Language B) and it is also known to be written by a group of people. Although the manuscript has been studied by some famous cryptographers of the World War I and II, nobody has deciphered it yet. The Voynich Manuscript is a hand-written codex written in an unknown system and carbon-dated to the early 15th century (1404–1438). ![]()
0 Comments
Leave a Reply. |