Contents Home Top Prev Next
Map Refs Pics Gloss Pages

Text Analysis

Introduction

The main mystery of the Voynich MS is clearly its unknown writing. This topic is addressed from three different aspects, on three (sets of) pages:

This page addresses the third part, the statistical analysis of the text. Such analyses have been made in many different ways over the last 100+ years, either in order to decipher the text, or just to better understand its properties. The purpose of this part of the web site is to present these analyses. Many of the material presented here includes hypotheses about the MS text or tentative conclusions. These are not the main focus here. It is only after taking into account all statistics that a complete explanation could be put together.

The present page is just an overview. The topic has been subdivided into five 'areas' each with a dedicated page, as follows:

Text Analysis: Table of Contents

1. Introductory information
2. Character statistics
3. Word structure
4. Word statistics
5. Sentences, paragraphs, sections

I need to start with some disclaimers: it has not been possible for me to read everything that has been written on this topic, and this section cannot therefore be complete. I will always be grateful for information about additional work that has been done.

It is difficult to present the multitude of analyses that have been performed in an orderly fashion. Beside the five general areas indicated above, there are a number of studies that cannot be classified so easily. These five areas of analysis are now summarised briefly.

1. Introductory information

This part introduces the most common concepts used in the analysis section: Currier languages, entropy, Zipf law, etc. The reader is reminded that the analysis of the script of the MS is investigated on a separate page. There is another page dedicated to the MS transcription.

Some additional words about transcription

The transcription alphabet used throughout the site is the Eva alphabet, for which a more detailed description is given here. In some places, I will use small graphic files for the Voynich characters. In the present analysis section, the Voynich characters are rendered by the "Voynich EVA Hand 1" True Type font created by Gabriel Landini. This is demonstrated below, using the first paragraph of text on folio 1r of the manuscript:

The following figure was created using the Eva True Type font. The Eva text representing this section is given below it. It is then repeated, but using the Eva True Type font for the rendition.

fachys ykal ar ataiin Shol Shory cThres y kor Sholdy
sory cThar or y kair chtaiin Shar are cThar cThar dan
syaiir Sheky or ykaiin Shod cThoary cThes daraiin sa
o'oiin oteey oteor roloty cTh*ar daiin otaiin or okan
sair y chear cThaiin cPhar cFhaiin    ydaraiShy
fachys ykal ar ataiin Shol Shory cThres y kor Sholdy
sory cThar or y kair chtaiin Shar are cThar cThar dan
syaiir Sheky or ykaiin Shod cThoary cThes daraiin sa
o'oiin oteey oteor roloty cTh*ar daiin otaiin or okan
sair y chear cThaiin cPhar cFhaiin    ydaraiShy

If the third part does not look like the Voynich script in the first picture, please see here.

The choice of the transcription alphabet will have an impact on numerical analysis done on the Voynich MS text. This is particularly important for the calculation of the word length distribution, since the number of charcters to represent one 'glyph' of the Voynich MS text is different for each alphabet. It does play a role in other statistics as well. In general, the Eva alphabet is not the most suitable for performing statistics.

2. Character statistics

This includes, among others:

3. + 4. Word statistics

When people talk about a 'word' in the Voynich MS, they refer to a string of characters separated from other such words by a space in the writing. Whether these strings of characters actually represent words as we understand it, is not certain.

The analysis of the apparent words in the Voynich MS is discussed in two separate sections. The first treats the word struture, a unique property of the Voynich MS text.

The second section includes:

5. Sentences, paragraphs, sections

This includes topics like:

Contents Home Top Prev Next
Map Refs Pics Gloss Pages
Copyright René Zandbergen, 2017
Comments, questions, suggestions? Your feedback is welcome.
Latest update: 05/03/2017