Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 52695 |
| Missing cells | 561161 |
| Missing cells (%) | 44.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.6 MiB |
| Average record size in memory | 192.0 B |
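The derived figures in this table can be recomputed from the raw counts. The sketch below does so in plain Python. The variable names are illustrative, and it assumes that the missing-cell percentage is taken over all cells (rows × columns) and that 1 MiB is 2^20 bytes; neither assumption is stated in the report itself.

```python
# Raw figures copied from the dataset statistics table.
n_variables = 24
n_observations = 52695
missing_cells = 561161
avg_record_bytes = 192.0

# Assumption: the percentage is computed over every cell (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size = observations x average record size, 1 MiB = 2**20 B.
total_mib = n_observations * avg_record_bytes / 2**20

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size in memory: {total_mib:.1f} MiB")
```

Both results agree with the reported values after rounding to one decimal place.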
Variable types
| Numeric | 2 |
|---|---|
| Categorical | 22 |
| Alert | Type |
|---|---|
| Name has a high cardinality: 28129 distinct values | High cardinality |
| Dates associated with name has a high cardinality: 2757 distinct values | High cardinality |
| All names has a high cardinality: 33026 distinct values | High cardinality |
| Title has a high cardinality: 50029 distinct values | High cardinality |
| Variant titles has a high cardinality: 1743 distinct values | High cardinality |
| Series title has a high cardinality: 157 distinct values | High cardinality |
| Number within series has a high cardinality: 110 distinct values | High cardinality |
| Country of publication has a high cardinality: 71 distinct values | High cardinality |
| Place of publication has a high cardinality: 3492 distinct values | High cardinality |
| Publisher has a high cardinality: 7263 distinct values | High cardinality |
| Date of publication has a high cardinality: 458 distinct values | High cardinality |
| Edition has a high cardinality: 1559 distinct values | High cardinality |
| Physical description has a high cardinality: 10735 distinct values | High cardinality |
| Dewey classification has a high cardinality: 67 distinct values | High cardinality |
| BL shelfmark has a high cardinality: 52345 distinct values | High cardinality |
| Topics has a high cardinality: 1208 distinct values | High cardinality |
| Genre has a high cardinality: 64 distinct values | High cardinality |
| Languages has a high cardinality: 109 distinct values | High cardinality |
| Notes has a high cardinality: 5667 distinct values | High cardinality |
| Name has 5143 (9.8%) missing values | Missing |
| Dates associated with name has 41870 (79.5%) missing values | Missing |
| Type of name has 5143 (9.8%) missing values | Missing |
| Role has 51015 (96.8%) missing values | Missing |
| All names has 3062 (5.8%) missing values | Missing |
| Variant titles has 46828 (88.9%) missing values | Missing |
| Series title has 52435 (99.5%) missing values | Missing |
| Number within series has 52584 (99.8%) missing values | Missing |
| Country of publication has 16235 (30.8%) missing values | Missing |
| Place of publication has 772 (1.5%) missing values | Missing |
| Publisher has 25208 (47.8%) missing values | Missing |
| Edition has 48497 (92.0%) missing values | Missing |
| Physical description has 12849 (24.4%) missing values | Missing |
| Dewey classification has 52617 (99.9%) missing values | Missing |
| Topics has 49559 (94.0%) missing values | Missing |
| Genre has 50722 (96.3%) missing values | Missing |
| Notes has 46119 (87.5%) missing values | Missing |
| Number within series is uniformly distributed | Uniform |
| Dewey classification is uniformly distributed | Uniform |
| BL shelfmark is uniformly distributed | Uniform |
| BL record ID has unique values | Unique |
Reproduction
| Analysis started | 2021-09-17 10:06:34.511002 |
|---|---|
| Analysis finished | 2021-09-17 10:06:44.598969 |
| Duration | 10.09 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
BL record ID
| Distinct | 52695 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14936555.58 |
| Minimum | 14602826 |
| Maximum | 16289062 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 14602826 |
|---|---|
| 5-th percentile | 14635415.7 |
| Q1 | 14811722.5 |
| median | 14829381 |
| Q3 | 14872527.5 |
| 95-th percentile | 16286365.3 |
| Maximum | 16289062 |
| Range | 1686236 |
| Interquartile range (IQR) | 60805 |
Descriptive statistics
| Standard deviation | 367921.3739 |
|---|---|
| Coefficient of variation (CV) | 0.02463227696 |
| Kurtosis | 8.42143983 |
| Mean | 14936555.58 |
| Median Absolute Deviation (MAD) | 23411 |
| Skewness | 3.08396255 |
| Sum | 7.870817963 × 10¹¹ |
| Variance | 1.353661374 × 10¹¹ |
| Monotonicity | Strictly increasing |
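Several of the quantile and descriptive statistics are simple functions of the others. As a rough consistency check, the sketch below recomputes them from the reported figures; the variable names are illustrative, and small differences are expected because the inputs are already rounded.

```python
import math

# Figures copied from the quantile and descriptive statistics tables.
minimum, maximum = 14602826, 16289062
q1, q3 = 14811722.5, 14872527.5
mean = 14936555.58
std = 367921.3739
n = 52695  # number of observations

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance
total = mean * n                  # Sum

print(value_range, iqr, round(cv, 6))
print(f"{variance:.4e} {total:.4e}")
print(math.isclose(cv, 0.02463227696, rel_tol=1e-7))
```

The recomputed range, IQR, CV, variance, and sum agree with the tabulated values to the precision shown.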
| Value | Count | Frequency (%) |
| 14602826 | 1 | < 0.1% |
| 14861274 | 1 | < 0.1% |
| 14861264 | 1 | < 0.1% |
| 14861265 | 1 | < 0.1% |
| 14861266 | 1 | < 0.1% |
| 14861267 | 1 | < 0.1% |
| 14861268 | 1 | < 0.1% |
| 14861269 | 1 | < 0.1% |
| 14861270 | 1 | < 0.1% |
| 14861271 | 1 | < 0.1% |
| Other values (52685) | 52685 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 14602826 | 1 | < 0.1% |
| 14602830 | 1 | < 0.1% |
| 14602831 | 1 | < 0.1% |
| 14602832 | 1 | < 0.1% |
| 14602833 | 1 | < 0.1% |
| 14602834 | 1 | < 0.1% |
| 14602835 | 1 | < 0.1% |
| 14602836 | 1 | < 0.1% |
| 14602837 | 1 | < 0.1% |
| 14602838 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 16289062 | 1 | < 0.1% |
| 16289061 | 1 | < 0.1% |
| 16289060 | 1 | < 0.1% |
| 16289059 | 1 | < 0.1% |
| 16289058 | 1 | < 0.1% |
| 16289057 | 1 | < 0.1% |
| 16289056 | 1 | < 0.1% |
| 16289055 | 1 | < 0.1% |
| 16289054 | 1 | < 0.1% |
| 16289053 | 1 | < 0.1% |
Type of resource
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| Monograph | 52556 |
|---|---|
| Monographic component part | 93 |
| Serial | 46 |
Length
| Max length | 26 |
|---|---|
| Median length | 9 |
| Mean length | 9.027384002 |
| Min length | 6 |
Characters and Unicode
| Total characters | 475698 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Monograph |
|---|---|
| 2nd row | Monograph |
| 3rd row | Monograph |
| 4th row | Monograph |
| 5th row | Monograph |
Common Values
| Value | Count | Frequency (%) |
| Monograph | 52556 | 99.7% |
| Monographic component part | 93 | 0.2% |
| Serial | 46 | 0.1% |
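The length and character statistics for this variable follow directly from the three value counts. The sketch below rebuilds them with the standard library only; the names are illustrative, and it assumes that each character is simply counted once per occurrence of the value that contains it.

```python
from collections import Counter

# Value counts copied from the "Common Values" table above.
value_counts = {
    "Monograph": 52556,
    "Monographic component part": 93,
    "Serial": 46,
}

n_values = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / n_values

# Weight each character by how often its value occurs.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

print(total_chars, round(mean_length, 9))
print(len(char_counts), char_counts["o"], char_counts[" "])
```

The totals reproduce the reported total characters, mean length, and distinct character count, and the per-character counts match the character frequency tables for this variable.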
Words
| Value | Count | Frequency (%) |
| monograph | 52556 | 99.4% |
| monographic | 93 | 0.2% |
| component | 93 | 0.2% |
| part | 93 | 0.2% |
| serial | 46 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 105484 | |
| n | 52835 | |
| p | 52835 | |
| r | 52788 | |
| a | 52788 | |
| M | 52649 | |
| g | 52649 | |
| h | 52649 | |
| c | 186 | < 0.1% |
| (space) | 186 | < 0.1% |
| Other values (6) | 649 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 422817 | |
| Uppercase Letter | 52695 | 11.1% |
| Space Separator | 186 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 105484 | |
| n | 52835 | |
| p | 52835 | |
| r | 52788 | |
| a | 52788 | |
| g | 52649 | |
| h | 52649 | |
| c | 186 | < 0.1% |
| t | 186 | < 0.1% |
| e | 139 | < 0.1% |
| Other values (3) | 278 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 52649 | |
| S | 46 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 186 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 475512 | |
| Common | 186 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 105484 | |
| n | 52835 | |
| p | 52835 | |
| r | 52788 | |
| a | 52788 | |
| M | 52649 | |
| g | 52649 | |
| h | 52649 | |
| c | 186 | < 0.1% |
| t | 186 | < 0.1% |
| Other values (5) | 463 | 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 186 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 475698 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 105484 | |
| n | 52835 | |
| p | 52835 | |
| r | 52788 | |
| a | 52788 | |
| M | 52649 | |
| g | 52649 | |
| h | 52649 | |
| c | 186 | < 0.1% |
| (space) | 186 | < 0.1% |
| Other values (6) | 649 | 0.1% |
Name
Categorical
| Distinct | 28129 |
|---|---|
| Distinct (%) | 59.2% |
| Missing | 5143 |
| Missing (%) | 9.8% |
| Memory size | 411.8 KiB |
| Great Britain, Hydrographic Department | 159 |
|---|---|
| Byron, George Gordon Byron, Baron | 154 |
| Scott, Walter, Sir | 109 |
| Wood, Henry, Mrs | 103 |
| Dickens, Charles | 74 |
| Other values (28124) | 46953 |
Length
| Max length | 223 |
|---|---|
| Median length | 20 |
| Mean length | 22.66573435 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1077801 |
|---|---|
| Distinct characters | 159 |
| Distinct categories | 10 |
| Distinct scripts | 3 |
| Distinct blocks | 8 |
Unique
| Unique | 21176 |
|---|---|
| Unique (%) | 44.5% |
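The percentages in this overview appear to use two different denominators: Missing (%) is relative to all rows, while Distinct (%) and Unique (%) are relative to the non-missing rows only. That reading is inferred from the figures rather than stated in the report, and "unique" is assumed to mean values that occur exactly once. A minimal check with illustrative variable names:

```python
n_rows = 52695     # number of observations in the dataset
missing = 5143     # missing values for this variable
distinct = 28129   # distinct non-missing values
unique = 21176     # assumed: values that occur exactly once

non_missing = n_rows - missing

missing_pct = 100 * missing / n_rows          # over all rows
distinct_pct = 100 * distinct / non_missing   # over non-missing rows
unique_pct = 100 * unique / non_missing       # over non-missing rows

print(f"{missing_pct:.1f}% {distinct_pct:.1f}% {unique_pct:.1f}%")
```

Under these assumptions the three results round to the reported 9.8%, 59.2%, and 44.5%.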
Sample
| 1st row | Yearsley, Ann |
|---|---|
| 2nd row | A, T. |
| 3rd row | Albert, Prince Consort, consort of Victoria, Queen of Great Britain |
| 4th row | Anslow, Robert |
| 5th row | Bellamy, James William |
Common Values
| Value | Count | Frequency (%) |
| Great Britain, Hydrographic Department | 159 | 0.3% |
| Byron, George Gordon Byron, Baron | 154 | 0.3% |
| Scott, Walter, Sir | 109 | 0.2% |
| Wood, Henry, Mrs | 103 | 0.2% |
| Dickens, Charles | 74 | 0.1% |
| Oliphant, Mrs (Margaret) | 74 | 0.1% |
| Marryat, Florence | 58 | 0.1% |
| Goldsmith, Oliver | 55 | 0.1% |
| Dryden, John | 50 | 0.1% |
| Ainsworth, William Harrison | 47 | 0.1% |
| Other values (28119) | 46669 | |
| (Missing) | 5143 | 9.8% |
Words
| Value | Count | Frequency (%) |
| john | 3376 | 2.2% |
| william | 3352 | 2.2% |
| of | 2797 | 1.8% |
| george | 1970 | 1.3% |
| henry | 1948 | 1.3% |
| charles | 1833 | 1.2% |
| james | 1824 | 1.2% |
| thomas | 1796 | 1.2% |
| de | 1626 | 1.1% |
| j | 1289 | 0.8% |
| Other values (22638) | 131290 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 105549 | 9.8% |
| e | 90721 | 8.4% |
| r | 73559 | 6.8% |
| a | 72314 | 6.7% |
| n | 60318 | 5.6% |
| o | 59621 | 5.5% |
| , | 57646 | 5.3% |
| i | 55437 | 5.1% |
| l | 50947 | 4.7% |
| s | 40060 | 3.7% |
| Other values (149) | 411629 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 746524 | |
| Uppercase Letter | 143261 | 13.3% |
| Space Separator | 105549 | 9.8% |
| Other Punctuation | 73308 | 6.8% |
| Open Punctuation | 3775 | 0.4% |
| Close Punctuation | 3775 | 0.4% |
| Dash Punctuation | 1160 | 0.1% |
| Decimal Number | 319 | < 0.1% |
| Nonspacing Mark | 89 | < 0.1% |
| Modifier Letter | 41 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 90721 | |
| r | 73559 | |
| a | 72314 | |
| n | 60318 | 8.1% |
| o | 59621 | 8.0% |
| i | 55437 | 7.4% |
| l | 50947 | 6.8% |
| s | 40060 | 5.4% |
| t | 38253 | 5.1% |
| h | 29021 | 3.9% |
| Other values (74) | 176273 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 11180 | 7.8% |
| J | 10877 | 7.6% |
| M | 10869 | 7.6% |
| H | 10219 | 7.1% |
| S | 9823 | 6.9% |
| B | 9735 | 6.8% |
| A | 9319 | 6.5% |
| W | 8982 | 6.3% |
| G | 8100 | 5.7% |
| R | 6940 | 4.8% |
| Other values (36) | 47217 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 89 | |
| 8 | 48 | |
| 2 | 38 | |
| 3 | 29 | 9.1% |
| 4 | 26 | 8.2% |
| 9 | 25 | 7.8% |
| 7 | 21 | 6.6% |
| 5 | 19 | 6.0% |
| 6 | 16 | 5.0% |
| 0 | 8 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 57646 | |
| . | 14844 | 20.2% |
| ' | 681 | 0.9% |
| * | 59 | 0.1% |
| ? | 46 | 0.1% |
| & | 32 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 43 | |
| ︡ | 43 | |
| ̡ | 2 | 2.2% |
| ̐ | 1 | 1.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 35 | |
| ʺ | 4 | 9.8% |
| ʿ | 2 | 4.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2939 | |
| [ | 836 | 22.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2939 | |
| ] | 836 | 22.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 105549 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1160 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 889785 | |
| Common | 187927 | 17.4% |
| Inherited | 89 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 90721 | 10.2% |
| r | 73559 | 8.3% |
| a | 72314 | 8.1% |
| n | 60318 | 6.8% |
| o | 59621 | 6.7% |
| i | 55437 | 6.2% |
| l | 50947 | 5.7% |
| s | 40060 | 4.5% |
| t | 38253 | 4.3% |
| h | 29021 | 3.3% |
| Other values (120) | 319534 |
Common
| Value | Count | Frequency (%) |
| (space) | 105549 | 56.2% |
| , | 57646 | |
| . | 14844 | 7.9% |
| ( | 2939 | 1.6% |
| ) | 2939 | 1.6% |
| - | 1160 | 0.6% |
| [ | 836 | 0.4% |
| ] | 836 | 0.4% |
| ' | 681 | 0.4% |
| 1 | 89 | < 0.1% |
| Other values (15) | 408 | 0.2% |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 43 | |
| ︡ | 43 | |
| ̡ | 2 | 2.2% |
| ̐ | 1 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1073574 | |
| Latin 1 Sup | 3653 | 0.3% |
| Latin Ext A | 430 | < 0.1% |
| Half Marks | 86 | < 0.1% |
| Modifier Letters | 41 | < 0.1% |
| Latin Ext Additional | 9 | < 0.1% |
| Latin Ext B | 5 | < 0.1% |
| Diacriticals | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 105549 | 9.8% |
| e | 90721 | 8.5% |
| r | 73559 | 6.9% |
| a | 72314 | 6.7% |
| n | 60318 | 5.6% |
| o | 59621 | 5.6% |
| , | 57646 | 5.4% |
| i | 55437 | 5.2% |
| l | 50947 | 4.7% |
| s | 40060 | 3.7% |
| Other values (64) | 407402 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 1569 | |
| á | 344 | 9.4% |
| è | 294 | 8.0% |
| ç | 243 | 6.7% |
| É | 221 | 6.0% |
| ó | 173 | 4.7% |
| í | 169 | 4.6% |
| ö | 132 | 3.6% |
| ü | 103 | 2.8% |
| ë | 50 | 1.4% |
| Other values (28) | 355 | 9.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 35 | |
| ʺ | 4 | 9.8% |
| ʿ | 2 | 4.9% |
Latin Ext A
| Value | Count | Frequency (%) |
| ĭ | 104 | |
| ł | 67 | |
| ń | 42 | |
| ī | 32 | 7.4% |
| ć | 21 | 4.9% |
| ē | 19 | 4.4% |
| š | 19 | 4.4% |
| č | 16 | 3.7% |
| ā | 15 | 3.5% |
| ő | 12 | 2.8% |
| Other values (20) | 83 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| Ḟ | 2 | |
| Ṭ | 1 | |
| ṇ | 1 | |
| ḍ | 1 | |
| ṅ | 1 | |
| ẏ | 1 | |
| ṕ | 1 | |
| ṭ | 1 |
Diacriticals
| Value | Count | Frequency (%) |
| ̡ | 2 | |
| ̐ | 1 |
Latin Ext B
| Value | Count | Frequency (%) |
| ǵ | 4 | |
| ǎ | 1 | 20.0% |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 43 | |
| ︡ | 43 |
Dates associated with name
Categorical
| Distinct | 2757 |
|---|---|
| Distinct (%) | 25.5% |
| Missing | 41870 |
| Missing (%) | 79.5% |
| Memory size | 411.8 KiB |
| 1788-1824 | 154 |
|---|---|
| 1771-1832 | 109 |
| 1814-1887 | 106 |
| 1812-1870 | 74 |
| 1828-1897 | 74 |
| Other values (2752) | 10308 |
Length
| Max length | 37 |
|---|---|
| Median length | 9 |
| Mean length | 9.588637413 |
| Min length | 4 |
Characters and Unicode
| Total characters | 103797 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1230 |
|---|---|
| Unique (%) | 11.4% |
Sample
| 1st row | 1753-1806 |
|---|---|
| 2nd row | 1819-1861 |
| 3rd row | 1600-1649 |
| 4th row | 1782-1865 |
| 5th row | 1772-1834 |
Common Values
| Value | Count | Frequency (%) |
| 1788-1824 | 154 | 0.3% |
| 1771-1832 | 109 | 0.2% |
| 1814-1887 | 106 | 0.2% |
| 1812-1870 | 74 | 0.1% |
| 1828-1897 | 74 | 0.1% |
| 1833-1899 | 58 | 0.1% |
| approximately 1730-1774 | 55 | 0.1% |
| 1805-1882 | 52 | 0.1% |
| 1631-1700 | 49 | 0.1% |
| 1865-1936 | 46 | 0.1% |
| Other values (2747) | 10048 | 19.1% |
| (Missing) | 41870 |
Words
| Value | Count | Frequency (%) |
| approximately | 424 | 3.7% |
| 1788-1824 | 155 | 1.3% |
| 1771-1832 | 109 | 0.9% |
| 1814-1887 | 106 | 0.9% |
| active | 95 | 0.8% |
| 1812-1870 | 74 | 0.6% |
| 1828-1897 | 74 | 0.6% |
| 1833-1899 | 58 | 0.5% |
| 1730-1774 | 55 | 0.5% |
| 1805-1882 | 52 | 0.5% |
| Other values (2735) | 10289 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 25752 | |
| 8 | 16814 | |
| - | 10795 | |
| 9 | 8057 | 7.8% |
| 7 | 7259 | 7.0% |
| 2 | 5009 | 4.8% |
| 0 | 4746 | 4.6% |
| 3 | 4605 | 4.4% |
| 4 | 4504 | 4.3% |
| 6 | 4375 | 4.2% |
| Other values (17) | 11881 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 85420 | |
| Dash Punctuation | 10797 | 10.4% |
| Lowercase Letter | 6913 | 6.7% |
| Space Separator | 666 | 0.6% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1057 | |
| p | 962 | |
| i | 576 | |
| t | 576 | |
| e | 576 | |
| r | 526 | |
| o | 526 | |
| x | 481 | |
| m | 481 | |
| l | 481 | |
| Other values (3) | 671 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 25752 | |
| 8 | 16814 | |
| 9 | 8057 | 9.4% |
| 7 | 7259 | 8.5% |
| 2 | 5009 | 5.9% |
| 0 | 4746 | 5.6% |
| 3 | 4605 | 5.4% |
| 4 | 4504 | 5.3% |
| 6 | 4375 | 5.1% |
| 5 | 4299 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10795 | |
| – | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 666 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 96884 | |
| Latin | 6913 | 6.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 25752 | |
| 8 | 16814 | |
| - | 10795 | |
| 9 | 8057 | 8.3% |
| 7 | 7259 | 7.5% |
| 2 | 5009 | 5.2% |
| 0 | 4746 | 4.9% |
| 3 | 4605 | 4.8% |
| 4 | 4504 | 4.6% |
| 6 | 4375 | 4.5% |
| Other values (4) | 4968 | 5.1% |
Latin
| Value | Count | Frequency (%) |
| a | 1057 | |
| p | 962 | |
| i | 576 | |
| t | 576 | |
| e | 576 | |
| r | 526 | |
| o | 526 | |
| x | 481 | |
| m | 481 | |
| l | 481 | |
| Other values (3) | 671 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103795 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 25752 | |
| 8 | 16814 | |
| - | 10795 | |
| 9 | 8057 | 7.8% |
| 7 | 7259 | 7.0% |
| 2 | 5009 | 4.8% |
| 0 | 4746 | 4.6% |
| 3 | 4605 | 4.4% |
| 4 | 4504 | 4.3% |
| 6 | 4375 | 4.2% |
| Other values (16) | 11879 |
Punctuation
| Value | Count | Frequency (%) |
| – | 2 |
Type of name
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5143 |
| Missing (%) | 9.8% |
| Memory size | 411.8 KiB |
| person | 45856 |
|---|---|
| organisation | 1693 |
| meeting/conference | 3 |
Length
| Max length | 18 |
|---|---|
| Median length | 6 |
| Mean length | 6.214375841 |
| Min length | 6 |
Characters and Unicode
| Total characters | 295506 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | person |
|---|---|
| 2nd row | person |
| 3rd row | person |
| 4th row | person |
| 5th row | person |
Common Values
| Value | Count | Frequency (%) |
| person | 45856 | 87.0% |
| organisation | 1693 | 3.2% |
| meeting/conference | 3 | < 0.1% |
| (Missing) | 5143 | 9.8% |
Words
| Value | Count | Frequency (%) |
| person | 45856 | |
| organisation | 1693 | 3.6% |
| meeting/conference | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 49251 | |
| o | 49245 | |
| r | 47552 | |
| s | 47549 | |
| e | 45871 | |
| p | 45856 | |
| i | 3389 | 1.1% |
| a | 3386 | 1.1% |
| g | 1696 | 0.6% |
| t | 1696 | 0.6% |
| Other values (4) | 15 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 295503 | |
| Other Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 49251 | |
| o | 49245 | |
| r | 47552 | |
| s | 47549 | |
| e | 45871 | |
| p | 45856 | |
| i | 3389 | 1.1% |
| a | 3386 | 1.1% |
| g | 1696 | 0.6% |
| t | 1696 | 0.6% |
| Other values (3) | 12 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 295503 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 49251 | |
| o | 49245 | |
| r | 47552 | |
| s | 47549 | |
| e | 45871 | |
| p | 45856 | |
| i | 3389 | 1.1% |
| a | 3386 | 1.1% |
| g | 1696 | 0.6% |
| t | 1696 | 0.6% |
| Other values (3) | 12 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| / | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 295506 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 49251 | |
| o | 49245 | |
| r | 47552 | |
| s | 47549 | |
| e | 45871 | |
| p | 45856 | |
| i | 3389 | 1.1% |
| a | 3386 | 1.1% |
| g | 1696 | 0.6% |
| t | 1696 | 0.6% |
| Other values (4) | 15 | < 0.1% |
Role
Categorical
| Distinct | 33 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 51015 |
| Missing (%) | 96.8% |
| Memory size | 411.8 KiB |
| author | 372 |
|---|---|
| writer | 332 |
| novelist | 292 |
| poet | 281 |
| publisher | 64 |
| Other values (28) | 339 |
Length
| Max length | 22 |
|---|---|
| Median length | 6 |
| Mean length | 6.592261905 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11075 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | writer |
|---|---|
| 2nd row | poet |
| 3rd row | bookseller |
| 4th row | poet |
| 5th row | poet |
Common Values
| Value | Count | Frequency (%) |
| author | 372 | 0.7% |
| writer | 332 | 0.6% |
| novelist | 292 | 0.6% |
| poet | 281 | 0.5% |
| publisher | 64 | 0.1% |
| editor | 62 | 0.1% |
| historian | 57 | 0.1% |
| engineer | 34 | 0.1% |
| printer | 31 | 0.1% |
| lecturer | 24 | < 0.1% |
| Other values (23) | 131 | 0.2% |
| (Missing) | 51015 |
Words
| Value | Count | Frequency (%) |
| author | 374 | |
| writer | 333 | |
| novelist | 292 | |
| poet | 287 | |
| publisher | 65 | 3.8% |
| editor | 62 | 3.6% |
| historian | 57 | 3.4% |
| engineer | 34 | 2.0% |
| printer | 31 | 1.8% |
| lecturer | 24 | 1.4% |
| Other values (21) | 141 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1579 | |
| r | 1546 | |
| e | 1367 | |
| o | 1194 | |
| i | 982 | |
| h | 534 | 4.8% |
| a | 517 | 4.7% |
| n | 504 | 4.6% |
| s | 493 | 4.5% |
| u | 486 | 4.4% |
| Other values (16) | 1873 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11045 | |
| Space Separator | 20 | 0.2% |
| Other Punctuation | 9 | 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1579 | |
| r | 1546 | |
| e | 1367 | |
| o | 1194 | |
| i | 982 | |
| h | 534 | 4.8% |
| a | 517 | 4.7% |
| n | 504 | 4.6% |
| s | 493 | 4.5% |
| u | 486 | 4.4% |
| Other values (13) | 1843 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 20 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 9 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11046 | |
| Common | 29 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1579 | |
| r | 1546 | |
| e | 1367 | |
| o | 1194 | |
| i | 982 | |
| h | 534 | 4.8% |
| a | 517 | 4.7% |
| n | 504 | 4.6% |
| s | 493 | 4.5% |
| u | 486 | 4.4% |
| Other values (14) | 1844 |
Common
| Value | Count | Frequency (%) |
| (space) | 20 | 69.0% |
| ; | 9 | 31.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11075 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1579 | |
| r | 1546 | |
| e | 1367 | |
| o | 1194 | |
| i | 982 | |
| h | 534 | 4.8% |
| a | 517 | 4.7% |
| n | 504 | 4.6% |
| s | 493 | 4.5% |
| u | 486 | 4.4% |
| Other values (16) | 1873 |
All names
Categorical
| Distinct | 33026 |
|---|---|
| Distinct (%) | 66.5% |
| Missing | 3062 |
| Missing (%) | 5.8% |
| Memory size | 411.8 KiB |
| Byron, George Gordon Byron, Baron, 1788-1824 [person] | 126 |
|---|---|
| Wood, Henry, Mrs, 1814-1887 [person] | 103 |
| Oliphant, Mrs (Margaret), 1828-1897 [person] | 81 |
| Scott, Walter, Sir, 1771-1832 [person] | 78 |
| Great Britain, Hydrographic Department [organisation] | 69 |
| Other values (33021) | 49176 |
Length
| Max length | 623 |
|---|---|
| Median length | 34 |
| Mean length | 43.43241392 |
| Min length | 13 |
Characters and Unicode
| Total characters | 2155681 |
|---|---|
| Distinct characters | 200 |
| Distinct categories | 11 |
| Distinct scripts | 5 |
| Distinct blocks | 11 |
Unique
| Unique | 26633 |
|---|---|
| Unique (%) | 53.7% |
Sample
| 1st row | More, Hannah, 1745-1833 [person] ; Yearsley, Ann, 1753-1806 [person] |
|---|---|
| 2nd row | Oldham, John, 1653-1683 [person] ; A, T. [person] |
| 3rd row | Plimsoll, Joseph [person] ; Albert, Prince Consort, consort of Victoria, Queen of Great Britain, 1819-1861 [person] |
| 4th row | Anslow, Robert [person] |
| 5th row | Swift, Jonathan, 1667-1745 [person] |
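The sample rows suggest that each cell holds one or more entries separated by ` ; `, with each entry ending in a bracketed type such as `[person]`. The sketch below splits a cell on that basis. The helper `split_all_names` and the pattern `ENTRY` are hypothetical names introduced here, and the format is inferred from the samples only, so real data may contain cases this does not handle.

```python
import re

# Hypothetical pattern: everything up to a trailing "[type]" is the name part.
ENTRY = re.compile(r"^(?P<name>.*?)\s*\[(?P<type>[^\]]+)\]$")

def split_all_names(cell):
    """Split one cell into (name, type) pairs; type is None if not found."""
    pairs = []
    for part in cell.split(" ; "):
        part = part.strip()
        match = ENTRY.match(part)
        if match:
            pairs.append((match.group("name"), match.group("type")))
        else:
            pairs.append((part, None))
    return pairs

sample = "Oldham, John, 1653-1683 [person] ; A, T. [person]"
for name, kind in split_all_names(sample):
    print(kind, "|", name)
```

The name part still includes any dates (for example `Oldham, John, 1653-1683`); separating those would need a further, equally tentative rule.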
Common Values
| Value | Count | Frequency (%) |
| Byron, George Gordon Byron, Baron, 1788-1824 [person] | 126 | 0.2% |
| Wood, Henry, Mrs, 1814-1887 [person] | 103 | 0.2% |
| Oliphant, Mrs (Margaret), 1828-1897 [person] | 81 | 0.2% |
| Scott, Walter, Sir, 1771-1832 [person] | 78 | 0.1% |
| Great Britain, Hydrographic Department [organisation] | 69 | 0.1% |
| Marryat, Florence, 1833-1899 [person] | 58 | 0.1% |
| Payn, James, 1830-1898 [person] | 54 | 0.1% |
| Dryden, John, 1631-1700 [person] | 42 | 0.1% |
| Fenn, George Manville [person] | 42 | 0.1% |
| Braddon, M. E. (Mary Elizabeth), 1835-1915 [person] | 42 | 0.1% |
| Other values (33016) | 48938 | |
| (Missing) | 3062 | 5.8% |
Words
| Value | Count | Frequency (%) |
| person | 58866 | 20.6% |
| 11319 | 4.0% | |
| william | 4376 | 1.5% |
| john | 4361 | 1.5% |
| of | 3910 | 1.4% |
| george | 2609 | 0.9% |
| henry | 2573 | 0.9% |
| charles | 2348 | 0.8% |
| james | 2337 | 0.8% |
| thomas | 2276 | 0.8% |
| Other values (29674) | 190419 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 235761 | 10.9% |
| e | 176894 | 8.2% |
| r | 157517 | 7.3% |
| o | 141342 | 6.6% |
| n | 140514 | 6.5% |
| s | 112312 | 5.2% |
| a | 98590 | 4.6% |
| , | 90670 | 4.2% |
| i | 77015 | 3.6% |
| p | 71051 | 3.3% |
| Other values (190) | 854015 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1353621 | |
| Space Separator | 235761 | 10.9% |
| Uppercase Letter | 182930 | 8.5% |
| Other Punctuation | 121385 | 5.6% |
| Decimal Number | 115067 | 5.3% |
| Open Punctuation | 65402 | 3.0% |
| Close Punctuation | 65402 | 3.0% |
| Dash Punctuation | 15930 | 0.7% |
| Nonspacing Mark | 123 | < 0.1% |
| Modifier Letter | 52 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 176894 | |
| r | 157517 | |
| o | 141342 | |
| n | 140514 | |
| s | 112312 | |
| a | 98590 | 7.3% |
| i | 77015 | 5.7% |
| p | 71051 | 5.2% |
| l | 66493 | 4.9% |
| t | 54074 | 4.0% |
| Other values (93) | 257819 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14298 | 7.8% |
| J | 14017 | 7.7% |
| M | 13565 | 7.4% |
| H | 13003 | 7.1% |
| S | 12677 | 6.9% |
| B | 12512 | 6.8% |
| A | 11769 | 6.4% |
| W | 11578 | 6.3% |
| G | 10359 | 5.7% |
| R | 8855 | 4.8% |
| Other values (45) | 60297 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 34796 | |
| 8 | 22308 | |
| 9 | 10555 | 9.2% |
| 7 | 10283 | 8.9% |
| 2 | 6666 | 5.8% |
| 0 | 6217 | 5.4% |
| 3 | 6187 | 5.4% |
| 6 | 6129 | 5.3% |
| 4 | 6001 | 5.2% |
| 5 | 5925 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 90670 | |
| . | 18421 | 15.2% |
| ; | 11276 | 9.3% |
| ' | 850 | 0.7% |
| * | 69 | 0.1% |
| ? | 57 | < 0.1% |
| & | 34 | < 0.1% |
| : | 4 | < 0.1% |
| / | 4 | < 0.1% |
Other Letter
| Value | Count | Frequency (%) |
| ל | 2 | |
| ו | 1 | |
| י | 1 | |
| א | 1 | |
| ב | 1 | |
| ר | 1 | |
| ט | 1 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 59 | |
| ︡ | 59 | |
| ̡ | 2 | 1.6% |
| ̐ | 1 | 0.8% |
| ̇ | 1 | 0.8% |
| ̢ | 1 | 0.8% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 44 | |
| ʺ | 5 | 9.6% |
| ʿ | 3 | 5.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15928 | |
| – | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 61765 | |
| ( | 3637 | 5.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 61765 | |
| ) | 3637 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 235761 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1536473 | |
| Common | 618999 | |
| Inherited | 123 | < 0.1% |
| Cyrillic | 78 | < 0.1% |
| Hebrew | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 176894 | |
| r | 157517 | 10.3% |
| o | 141342 | 9.2% |
| n | 140514 | 9.1% |
| s | 112312 | 7.3% |
| a | 98590 | 6.4% |
| i | 77015 | 5.0% |
| p | 71051 | 4.6% |
| l | 66493 | 4.3% |
| t | 54074 | 3.5% |
| Other values (125) | 440671 |
Common
| Value | Count | Frequency (%) |
| (space) | 235761 | 38.1% |
| , | 90670 | 14.6% |
| [ | 61765 | 10.0% |
| ] | 61765 | 10.0% |
| 1 | 34796 | 5.6% |
| 8 | 22308 | 3.6% |
| . | 18421 | 3.0% |
| - | 15928 | 2.6% |
| ; | 11276 | 1.8% |
| 9 | 10555 | 1.7% |
| Other values (19) | 55754 | 9.0% |
Cyrillic
| Value | Count | Frequency (%) |
| и | 15 | |
| л | 6 | 7.7% |
| е | 5 | 6.4% |
| в | 5 | 6.4% |
| а | 5 | 6.4% |
| к | 4 | 5.1% |
| Н | 4 | 5.1% |
| ч | 4 | 5.1% |
| м | 3 | 3.8% |
| й | 3 | 3.8% |
| Other values (13) | 24 |
Hebrew
| Value | Count | Frequency (%) |
| ל | 2 | |
| ו | 1 | |
| י | 1 | |
| א | 1 | |
| ב | 1 | |
| ר | 1 | |
| ט | 1 |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 59 | |
| ︡ | 59 | |
| ̡ | 2 | 1.6% |
| ̐ | 1 | 0.8% |
| ̇ | 1 | 0.8% |
| ̢ | 1 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2150164 | |
| Latin 1 Sup | 4706 | 0.2% |
| Latin Ext A | 530 | < 0.1% |
| Half Marks | 118 | < 0.1% |
| Cyrillic | 78 | < 0.1% |
| Modifier Letters | 52 | < 0.1% |
| Latin Ext Additional | 13 | < 0.1% |
| Hebrew | 8 | < 0.1% |
| Diacriticals | 5 | < 0.1% |
| Latin Ext B | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 235761 | 11.0% |
| e | 176894 | 8.2% |
| r | 157517 | 7.3% |
| o | 141342 | 6.6% |
| n | 140514 | 6.5% |
| s | 112312 | 5.2% |
| a | 98590 | 4.6% |
| , | 90670 | 4.2% |
| i | 77015 | 3.6% |
| p | 71051 | 3.3% |
| Other values (67) | 848498 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 2019 | |
| á | 424 | 9.0% |
| è | 385 | 8.2% |
| ç | 331 | 7.0% |
| É | 299 | 6.4% |
| ó | 221 | 4.7% |
| í | 216 | 4.6% |
| ö | 170 | 3.6% |
| ü | 121 | 2.6% |
| ë | 67 | 1.4% |
| Other values (29) | 453 | 9.6% |
Hebrew
| Value | Count | Frequency (%) |
| ל | 2 | |
| ו | 1 | |
| י | 1 | |
| א | 1 | |
| ב | 1 | |
| ר | 1 | |
| ט | 1 |
Punctuation
| Value | Count | Frequency (%) |
| – | 2 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 44 | |
| ʺ | 5 | 9.6% |
| ʿ | 3 | 5.8% |
Latin Ext A
| Value | Count | Frequency (%) |
| ĭ | 123 | |
| ł | 88 | |
| ń | 52 | |
| ī | 41 | 7.7% |
| š | 27 | 5.1% |
| ć | 25 | 4.7% |
| č | 21 | 4.0% |
| ē | 20 | 3.8% |
| ā | 18 | 3.4% |
| ő | 15 | 2.8% |
| Other values (22) | 100 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| Ḟ | 4 | |
| Ṭ | 1 | 7.7% |
| ṇ | 1 | 7.7% |
| ḍ | 1 | 7.7% |
| ṅ | 1 | 7.7% |
| ḣ | 1 | 7.7% |
| Ṫ | 1 | 7.7% |
| ẏ | 1 | 7.7% |
| ṕ | 1 | 7.7% |
| ṭ | 1 | 7.7% |
Diacriticals
| Value | Count | Frequency (%) |
| ̡ | 2 | |
| ̐ | 1 | |
| ̇ | 1 | |
| ̢ | 1 |
Latin Ext B
| Value | Count | Frequency (%) |
| ǵ | 4 | |
| ǎ | 1 | 20.0% |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 59 | |
| ︡ | 59 |
Cyrillic
| Value | Count | Frequency (%) |
| и | 15 | |
| л | 6 | 7.7% |
| е | 5 | 6.4% |
| в | 5 | 6.4% |
| а | 5 | 6.4% |
| к | 4 | 5.1% |
| Н | 4 | 5.1% |
| ч | 4 | 5.1% |
| м | 3 | 3.8% |
| й | 3 | 3.8% |
| Other values (13) | 24 |
Title
Categorical
| Distinct | 50029 |
|---|---|
| Distinct (%) | 94.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| Poems | 240 |
|---|---|
| Cook's Handbook for London. With two maps | 17 |
| Poems on several occasions | 14 |
| Poems, etc | 13 |
| Verses | 13 |
| Other values (50024) | 52398 |
Length
| Max length | 1407 |
|---|---|
| Median length | 68 |
| Mean length | 84.80402315 |
| Min length | 3 |
Characters and Unicode
| Total characters | 4468748 |
|---|---|
| Distinct characters | 381 |
| Distinct categories | 16 |
| Distinct scripts | 7 |
| Distinct blocks | 15 |
Unique
| Unique | 48139 |
|---|---|
| Unique (%) | 91.4% |
Sample
| 1st row | Poems on several occasions [With a prefatory letter by Hannah More.] |
|---|---|
| 2nd row | A Satyr against Vertue. (A poem: supposed to be spoken by a Town-Hector [By John Oldham. The preface signed: T. A.]) |
| 3rd row | The Aeronaut, a poem; founded almost entirely, upon a statement, printed in the newspapers, of a voyage from Dublin, in October, 1812 |
| 4th row | The Prince Albert, a poem [By Joseph Plimsoll.] |
| 5th row | The Defeat of the Spanish Armada, A.D. 1588. A tercentenary ballad, A.D. 1888 |
Common Values
| Value | Count | Frequency (%) |
| Poems | 240 | 0.5% |
| Cook's Handbook for London. With two maps | 17 | < 0.1% |
| Poems on several occasions | 14 | < 0.1% |
| Poems, etc | 13 | < 0.1% |
| Verses | 13 | < 0.1% |
| Miscellaneous Poems | 12 | < 0.1% |
| Sonnets | 11 | < 0.1% |
| Poems on various subjects | 10 | < 0.1% |
| The Bride of Abydos. A Turkish tale | 9 | < 0.1% |
| Childe Harold's Pilgrimage. A romaunt [Cantos I and II. With fourteen other poems.] | 9 | < 0.1% |
| Other values (50019) | 52347 |
Words
| Value | Count | Frequency (%) |
| the | 49372 | 6.8% |
| of | 39302 | 5.4% |
| a | 26175 | 3.6% |
| and | 23740 | 3.3% |
| (blank) | 18542 | 2.5% |
| in | 14087 | 1.9% |
| by | 13718 | 1.9% |
| with | 11198 | 1.5% |
| etc | 9660 | 1.3% |
| de | 7584 | 1.0% |
| Other values (66559) | 516320 |
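A word table like the one above could be built roughly as follows. This is a sketch under stated assumptions: tokens are split on whitespace, lowercased, and stripped of leading and trailing ASCII punctuation. Under those assumptions a token made only of punctuation (for example `&`) collapses to an empty string, which is one possible origin of a row with no visible value. `word_counts` is a hypothetical helper, not the report tool's API.

```python
import string
from collections import Counter


def word_counts(values):
    """Count lowercased whitespace tokens with edge punctuation stripped."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            # A punctuation-only token such as "&" becomes "" here.
            counts[token.strip(string.punctuation)] += 1
    return counts


# Two strings that appear elsewhere in this report.
counts = word_counts([
    "The Bride of Abydos. A Turkish tale",
    "Reprints of Rare Tracts & Imprints, etc",
])
print(counts.most_common(3))
```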
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 677003 | |
| e | 404608 | 9.1% |
| t | 266325 | 6.0% |
| a | 252898 | 5.7% |
| i | 252203 | 5.6% |
| o | 251210 | 5.6% |
| n | 247714 | 5.5% |
| r | 229145 | 5.1% |
| s | 202728 | 4.5% |
| h | 136476 | 3.1% |
| Other values (371) | 1548438 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3189838 | |
| Space Separator | 677003 | 15.1% |
| Uppercase Letter | 319723 | 7.2% |
| Other Punctuation | 197421 | 4.4% |
| Decimal Number | 42642 | 1.0% |
| Open Punctuation | 16242 | 0.4% |
| Close Punctuation | 16242 | 0.4% |
| Dash Punctuation | 9209 | 0.2% |
| Nonspacing Mark | 288 | < 0.1% |
| Other Letter | 60 | < 0.1% |
| Other values (6) | 80 | < 0.1% |
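The category names in this table correspond to Unicode general categories. A minimal sketch of such a tally, using Python's standard `unicodedata` module, is shown below. The `CATEGORY_NAMES` mapping covers only a subset of the two-letter codes, and `category_counts` is a hypothetical helper; neither is taken from the tool that produced this report.

```python
import unicodedata
from collections import Counter

# Readable names for some two-letter general-category codes (subset only).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lo": "Other Letter",
    "Lm": "Modifier Letter",
    "Mn": "Nonspacing Mark",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Tally the general category of every character in every value."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# One title from the Sample table of this column.
counts = category_counts(["The Prince Albert, a poem [By Joseph Plimsoll.]"])
print(counts.most_common())
```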
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 404608 | |
| t | 266325 | 8.3% |
| a | 252898 | 7.9% |
| i | 252203 | 7.9% |
| o | 251210 | 7.9% |
| n | 247714 | 7.8% |
| r | 229145 | 7.2% |
| s | 202728 | 6.4% |
| h | 136476 | 4.3% |
| l | 136461 | 4.3% |
| Other values (181) | 810070 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 31191 | 9.8% |
| T | 25856 | 8.1% |
| S | 25161 | 7.9% |
| C | 21730 | 6.8% |
| B | 19026 | 6.0% |
| W | 18145 | 5.7% |
| M | 17647 | 5.5% |
| P | 16860 | 5.3% |
| H | 15006 | 4.7% |
| L | 14227 | 4.4% |
| Other values (103) | 114874 |
Other Letter
| Value | Count | Frequency (%) |
| º | 29 | |
| ל | 5 | 8.3% |
| ב | 4 | 6.7% |
| מ | 3 | 5.0% |
| ת | 3 | 5.0% |
| ה | 3 | 5.0% |
| ש | 2 | 3.3% |
| א | 2 | 3.3% |
| פ | 1 | 1.7% |
| ע | 1 | 1.7% |
| Other values (7) | 7 | 11.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 118136 | |
| , | 49865 | |
| ' | 16011 | 8.1% |
| : | 6015 | 3.0% |
| ; | 5505 | 2.8% |
| & | 913 | 0.5% |
| ? | 354 | 0.2% |
| ! | 306 | 0.2% |
| * | 275 | 0.1% |
| / | 26 | < 0.1% |
| Other values (6) | 15 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 129 | |
| ︡ | 129 | |
| ̡ | 10 | 3.5% |
| ̂ | 4 | 1.4% |
| ̈ | 3 | 1.0% |
| ͡ | 3 | 1.0% |
| ̀ | 2 | 0.7% |
| ̒ | 2 | 0.7% |
| ̃ | 1 | 0.3% |
| ̤ | 1 | 0.3% |
| Other values (4) | 4 | 1.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 12669 | |
| 8 | 8218 | |
| 7 | 3398 | 8.0% |
| 6 | 3059 | 7.2% |
| 2 | 2746 | 6.4% |
| 5 | 2643 | 6.2% |
| 4 | 2592 | 6.1% |
| 9 | 2468 | 5.8% |
| 0 | 2451 | 5.7% |
| 3 | 2398 | 5.6% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 27 | |
| ⁿ | 12 | |
| ʿ | 1 | 2.4% |
| ʺ | 1 | 2.4% |
Private Use
| Value | Count | Frequency (%) |
| | 10 | |
| | 10 | |
| | 1 | 4.5% |
| | 1 | 4.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 13806 | |
| ( | 2436 | 15.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 13806 | |
| ) | 2436 | 15.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 7 | |
| $ | 1 | 12.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 | |
| ✠ | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 677003 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9209 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 4 |
Other Number
| Value | Count | Frequency (%) |
| ¹ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3469955 | |
| Common | 958805 | 21.5% |
| Cyrillic | 36080 | 0.8% |
| Greek | 3567 | 0.1% |
| Inherited | 288 | < 0.1% |
| Hebrew | 31 | < 0.1% |
| Unknown | 22 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 404608 | 11.7% |
| t | 266325 | 7.7% |
| a | 252898 | 7.3% |
| i | 252203 | 7.3% |
| o | 251210 | 7.2% |
| n | 247714 | 7.1% |
| r | 229145 | 6.6% |
| s | 202728 | 5.8% |
| h | 136476 | 3.9% |
| l | 136461 | 3.9% |
| Other values (147) | 1090187 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 3329 | 9.2% |
| а | 2688 | 7.5% |
| и | 2517 | 7.0% |
| е | 2397 | 6.6% |
| с | 2349 | 6.5% |
| н | 1974 | 5.5% |
| р | 1916 | 5.3% |
| т | 1673 | 4.6% |
| в | 1476 | 4.1% |
| к | 1435 | 4.0% |
| Other values (66) | 14326 |
Greek
| Value | Count | Frequency (%) |
| α | 402 | 11.3% |
| ι | 306 | 8.6% |
| ο | 292 | 8.2% |
| τ | 248 | 7.0% |
| ν | 232 | 6.5% |
| ρ | 195 | 5.5% |
| ε | 176 | 4.9% |
| η | 144 | 4.0% |
| ς | 144 | 4.0% |
| κ | 142 | 4.0% |
| Other values (63) | 1286 |
Common
| Value | Count | Frequency (%) |
| (space) | 677003 | |
| . | 118136 | 12.3% |
| , | 49865 | 5.2% |
| ' | 16011 | 1.7% |
| [ | 13806 | 1.4% |
| ] | 13806 | 1.4% |
| 1 | 12669 | 1.3% |
| - | 9209 | 1.0% |
| 8 | 8218 | 0.9% |
| : | 6015 | 0.6% |
| Other values (31) | 34067 | 3.6% |
Hebrew
| Value | Count | Frequency (%) |
| ל | 5 | |
| ב | 4 | |
| מ | 3 | |
| ת | 3 | |
| ה | 3 | |
| ש | 2 | 6.5% |
| א | 2 | 6.5% |
| פ | 1 | 3.2% |
| ע | 1 | 3.2% |
| ו | 1 | 3.2% |
| Other values (6) | 6 |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 129 | |
| ︡ | 129 | |
| ̡ | 10 | 3.5% |
| ̂ | 4 | 1.4% |
| ̈ | 3 | 1.0% |
| ͡ | 3 | 1.0% |
| ̀ | 2 | 0.7% |
| ̒ | 2 | 0.7% |
| ̃ | 1 | 0.3% |
| ̤ | 1 | 0.3% |
| Other values (4) | 4 | 1.4% |
Unknown
| Value | Count | Frequency (%) |
| | 10 | |
| | 10 | |
| | 1 | 4.5% |
| | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4408283 | |
| Cyrillic | 36080 | 0.8% |
| Latin 1 Sup | 19287 | 0.4% |
| None | 3378 | 0.1% |
| Latin Ext A | 1122 | < 0.1% |
| Half Marks | 258 | < 0.1% |
| Greek Ext | 201 | < 0.1% |
| Hebrew | 31 | < 0.1% |
| Diacriticals | 30 | < 0.1% |
| Modifier Letters | 29 | < 0.1% |
| Other values (5) | 49 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 677003 | |
| e | 404608 | 9.2% |
| t | 266325 | 6.0% |
| a | 252898 | 5.7% |
| i | 252203 | 5.7% |
| o | 251210 | 5.7% |
| n | 247714 | 5.6% |
| r | 229145 | 5.2% |
| s | 202728 | 4.6% |
| h | 136476 | 3.1% |
| Other values (70) | 1487973 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 7561 | |
| ü | 1499 | 7.8% |
| ä | 1496 | 7.8% |
| è | 1439 | 7.5% |
| ö | 1235 | 6.4% |
| á | 1062 | 5.5% |
| à | 969 | 5.0% |
| æ | 590 | 3.1% |
| ó | 565 | 2.9% |
| É | 515 | 2.7% |
| Other values (44) | 2356 | 12.2% |
Latin Ext A
| Value | Count | Frequency (%) |
| ł | 180 | |
| œ | 148 | |
| ę | 87 | 7.8% |
| ī | 78 | 7.0% |
| ő | 77 | 6.9% |
| ĭ | 60 | 5.3% |
| ě | 58 | 5.2% |
| ń | 46 | 4.1% |
| ż | 43 | 3.8% |
| ą | 41 | 3.7% |
| Other values (34) | 304 |
None
| Value | Count | Frequency (%) |
| α | 402 | 11.9% |
| ι | 306 | 9.1% |
| ο | 292 | 8.6% |
| τ | 248 | 7.3% |
| ν | 232 | 6.9% |
| ρ | 195 | 5.8% |
| ε | 176 | 5.2% |
| η | 144 | 4.3% |
| ς | 144 | 4.3% |
| κ | 142 | 4.2% |
| Other values (38) | 1097 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ṣ | 2 | |
| ṇ | 2 | |
| ḟ | 2 | |
| ḥ | 1 | |
| ṫ | 1 | |
| ṭ | 1 | |
| ṅ | 1 | |
| ḿ | 1 | |
| ṃ | 1 | |
| ṡ | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 27 | |
| ʿ | 1 | 3.4% |
| ʺ | 1 | 3.4% |
Greek Ext
| Value | Count | Frequency (%) |
| ἐ | 38 | |
| ἀ | 25 | |
| Ἀ | 24 | |
| ἱ | 14 | 7.0% |
| ὐ | 12 | 6.0% |
| ὑ | 11 | 5.5% |
| Ἱ | 10 | 5.0% |
| ῳ | 7 | 3.5% |
| Ἡ | 7 | 3.5% |
| ἰ | 7 | 3.5% |
| Other values (16) | 46 |
Punctuation
| Value | Count | Frequency (%) |
| † | 8 | |
| ′ | 2 | 20.0% |
Hebrew
| Value | Count | Frequency (%) |
| ל | 5 | |
| ב | 4 | |
| מ | 3 | |
| ת | 3 | |
| ה | 3 | |
| ש | 2 | 6.5% |
| א | 2 | 6.5% |
| פ | 1 | 3.2% |
| ע | 1 | 3.2% |
| ו | 1 | 3.2% |
| Other values (6) | 6 |
Diacriticals
| Value | Count | Frequency (%) |
| ̡ | 10 | |
| ̂ | 4 | 13.3% |
| ̈ | 3 | 10.0% |
| ͡ | 3 | 10.0% |
| ̀ | 2 | 6.7% |
| ̒ | 2 | 6.7% |
| ̃ | 1 | 3.3% |
| ̤ | 1 | 3.3% |
| ̔ | 1 | 3.3% |
| ͅ | 1 | 3.3% |
| Other values (2) | 2 | 6.7% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 3329 | 9.2% |
| а | 2688 | 7.5% |
| и | 2517 | 7.0% |
| е | 2397 | 6.6% |
| с | 2349 | 6.5% |
| н | 1974 | 5.5% |
| р | 1916 | 5.3% |
| т | 1673 | 4.6% |
| в | 1476 | 4.1% |
| к | 1435 | 4.0% |
| Other values (66) | 14326 |
Dingbats
| Value | Count | Frequency (%) |
| ✠ | 1 |
Latin Ext B
| Value | Count | Frequency (%) |
| ǔ | 1 | |
| ǵ | 1 | |
| ǒ | 1 |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 129 | |
| ︡ | 129 |
PUA
| Value | Count | Frequency (%) |
| | 10 | |
| | 10 | |
| | 1 | 4.5% |
| | 1 | 4.5% |
| Distinct | 1743 |
|---|---|
| Distinct (%) | 29.7% |
| Missing | 46828 |
| Missing (%) | 88.9% |
| Memory size | 411.8 KiB |
| Single Works | |
|---|---|
| Appendix | |
| Works | 142 |
| Appendix. Miscellaneous | 123 |
| Smaller Collections | 94 |
| Other values (1738) |
Length
| Max length | 814 |
|---|---|
| Median length | 18 |
| Mean length | 30.2754389 |
| Min length | 3 |
Characters and Unicode
| Total characters | 177626 |
|---|---|
| Distinct characters | 194 |
| Distinct categories | 11 |
| Distinct scripts | 5 |
| Distinct blocks | 8 |
Unique
| Unique | 1325 |
|---|---|
| Unique (%) | 22.6% |
Sample
| 1st row | Appendix |
|---|---|
| 2nd row | Appendix. I. Contemporary Satires, Eulogies, etc |
| 3rd row | Appendix. Elegies |
| 4th row | Poetry. Selections |
| 5th row | Single Works. Britannia Rediviva |
Common Values
| Value | Count | Frequency (%) |
| Single Works | 1206 | 2.3% |
| Appendix | 899 | 1.7% |
| Works | 142 | 0.3% |
| Appendix. Miscellaneous | 123 | 0.2% |
| Smaller Collections | 94 | 0.2% |
| Collections | 88 | 0.2% |
| Works. Selections | 45 | 0.1% |
| Appendix. Topography and Travels | 43 | 0.1% |
| Poetical Works | 42 | 0.1% |
| Plays. Single Plays | 35 | 0.1% |
| Other values (1733) | 3150 | 6.0% |
| (Missing) | 46828 |
Words
| Value | Count | Frequency (%) |
| works | 2519 | 9.6% |
| single | 1914 | 7.3% |
| appendix | 1665 | 6.4% |
| the | 838 | 3.2% |
| of | 772 | 3.0% |
| (blank) | 674 | 2.6% |
| and | 619 | 2.4% |
| collections | 373 | 1.4% |
| by | 314 | 1.2% |
| miscellaneous | 313 | 1.2% |
| Other values (4288) | 16139 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 20273 | 11.4% |
| e | 15080 | 8.5% |
| i | 12311 | 6.9% |
| n | 11193 | 6.3% |
| o | 10954 | 6.2% |
| s | 9814 | 5.5% |
| r | 9196 | 5.2% |
| a | 7951 | 4.5% |
| l | 7848 | 4.4% |
| t | 7226 | 4.1% |
| Other values (184) | 65780 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 131172 | |
| Space Separator | 20273 | 11.4% |
| Uppercase Letter | 18082 | 10.2% |
| Other Punctuation | 6052 | 3.4% |
| Decimal Number | 1391 | 0.8% |
| Dash Punctuation | 192 | 0.1% |
| Open Punctuation | 144 | 0.1% |
| Close Punctuation | 144 | 0.1% |
| Nonspacing Mark | 123 | 0.1% |
| Other Letter | 28 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15080 | |
| i | 12311 | 9.4% |
| n | 11193 | 8.5% |
| o | 10954 | 8.4% |
| s | 9814 | 7.5% |
| r | 9196 | 7.0% |
| a | 7951 | 6.1% |
| l | 7848 | 6.0% |
| t | 7226 | 5.5% |
| d | 5244 | 4.0% |
| Other values (84) | 34355 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3205 | |
| W | 2690 | |
| A | 2258 | |
| C | 1092 | 6.0% |
| I | 987 | 5.5% |
| P | 948 | 5.2% |
| T | 831 | 4.6% |
| M | 749 | 4.1% |
| E | 721 | 4.0% |
| B | 645 | 3.6% |
| Other values (46) | 3956 |
Other Letter
| Value | Count | Frequency (%) |
| י | 6 | |
| ו | 3 | |
| ר | 3 | |
| º | 3 | |
| מ | 2 | 7.1% |
| ד | 2 | 7.1% |
| ת | 2 | 7.1% |
| נ | 1 | 3.6% |
| ש | 1 | 3.6% |
| ם | 1 | 3.6% |
| Other values (4) | 4 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 430 | |
| 8 | 242 | |
| 2 | 108 | 7.8% |
| 7 | 104 | 7.5% |
| 0 | 99 | 7.1% |
| 6 | 97 | 7.0% |
| 5 | 93 | 6.7% |
| 4 | 87 | 6.3% |
| 3 | 73 | 5.2% |
| 9 | 58 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4236 | |
| , | 941 | 15.5% |
| ; | 514 | 8.5% |
| ' | 294 | 4.9% |
| : | 49 | 0.8% |
| & | 10 | 0.2% |
| ? | 7 | 0.1% |
| ! | 1 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 59 | |
| ︡ | 59 | |
| ͡ | 4 | 3.3% |
| ́ | 1 | 0.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 97 | |
| ( | 47 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 97 | |
| ) | 47 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 18 | |
| ʿ | 7 | 28.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 20273 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 192 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 146219 | |
| Common | 28221 | 15.9% |
| Cyrillic | 3038 | 1.7% |
| Inherited | 123 | 0.1% |
| Hebrew | 25 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 15080 | 10.3% |
| i | 12311 | 8.4% |
| n | 11193 | 7.7% |
| o | 10954 | 7.5% |
| s | 9814 | 6.7% |
| r | 9196 | 6.3% |
| a | 7951 | 5.4% |
| l | 7848 | 5.4% |
| t | 7226 | 4.9% |
| d | 5244 | 3.6% |
| Other values (82) | 49402 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 271 | 8.9% |
| и | 256 | 8.4% |
| а | 201 | 6.6% |
| с | 188 | 6.2% |
| е | 176 | 5.8% |
| н | 165 | 5.4% |
| т | 154 | 5.1% |
| р | 146 | 4.8% |
| в | 129 | 4.2% |
| л | 102 | 3.4% |
| Other values (49) | 1250 |
Common
| Value | Count | Frequency (%) |
| (space) | 20273 | |
| . | 4236 | 15.0% |
| , | 941 | 3.3% |
| ; | 514 | 1.8% |
| 1 | 430 | 1.5% |
| ' | 294 | 1.0% |
| 8 | 242 | 0.9% |
| - | 192 | 0.7% |
| 2 | 108 | 0.4% |
| 7 | 104 | 0.4% |
| Other values (16) | 887 | 3.1% |
Hebrew
| Value | Count | Frequency (%) |
| י | 6 | |
| ו | 3 | |
| ר | 3 | |
| מ | 2 | 8.0% |
| ד | 2 | 8.0% |
| ת | 2 | 8.0% |
| נ | 1 | 4.0% |
| ש | 1 | 4.0% |
| ם | 1 | 4.0% |
| ל | 1 | 4.0% |
| Other values (3) | 3 |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 59 | |
| ︡ | 59 | |
| ͡ | 4 | 3.3% |
| ́ | 1 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 173932 | |
| Cyrillic | 3038 | 1.7% |
| Latin 1 Sup | 402 | 0.2% |
| Half Marks | 118 | 0.1% |
| Latin Ext A | 81 | < 0.1% |
| Hebrew | 25 | < 0.1% |
| Modifier Letters | 25 | < 0.1% |
| Diacriticals | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 20273 | 11.7% |
| e | 15080 | 8.7% |
| i | 12311 | 7.1% |
| n | 11193 | 6.4% |
| o | 10954 | 6.3% |
| s | 9814 | 5.6% |
| r | 9196 | 5.3% |
| a | 7951 | 4.6% |
| l | 7848 | 4.5% |
| t | 7226 | 4.2% |
| Other values (66) | 62086 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 167 | |
| æ | 39 | 9.7% |
| à | 26 | 6.5% |
| ö | 21 | 5.2% |
| ä | 20 | 5.0% |
| ü | 19 | 4.7% |
| è | 18 | 4.5% |
| É | 15 | 3.7% |
| ç | 15 | 3.7% |
| á | 15 | 3.7% |
| Other values (16) | 47 | 11.7% |
Hebrew
| Value | Count | Frequency (%) |
| י | 6 | |
| ו | 3 | |
| ר | 3 | |
| מ | 2 | 8.0% |
| ד | 2 | 8.0% |
| ת | 2 | 8.0% |
| נ | 1 | 4.0% |
| ש | 1 | 4.0% |
| ם | 1 | 4.0% |
| ל | 1 | 4.0% |
| Other values (3) | 3 |
Latin Ext A
| Value | Count | Frequency (%) |
| ĭ | 33 | |
| ā | 15 | |
| ī | 13 | 16.0% |
| ń | 4 | 4.9% |
| ś | 4 | 4.9% |
| ł | 3 | 3.7% |
| ė | 2 | 2.5% |
| Œ | 1 | 1.2% |
| ź | 1 | 1.2% |
| ū | 1 | 1.2% |
| Other values (4) | 4 | 4.9% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 18 | |
| ʿ | 7 | 28.0% |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 59 | |
| ︡ | 59 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 271 | 8.9% |
| и | 256 | 8.4% |
| а | 201 | 6.6% |
| с | 188 | 6.2% |
| е | 176 | 5.8% |
| н | 165 | 5.4% |
| т | 154 | 5.1% |
| р | 146 | 4.8% |
| в | 129 | 4.2% |
| л | 102 | 3.4% |
| Other values (49) | 1250 |
Diacriticals
| Value | Count | Frequency (%) |
| ͡ | 4 | |
| ́ | 1 | 20.0% |
| Distinct | 157 |
|---|---|
| Distinct (%) | 60.4% |
| Missing | 52435 |
| Missing (%) | 99.5% |
| Memory size | 411.8 KiB |
| Bell's English Classics | 19 |
|---|---|
| The works of Charles Dickens | 18 |
| Thomas Hardy's works. The Wessex novels | 15 |
| Sailing Directions. America | 9 |
| Routledge's sixpenny novels | 5 |
| Other values (152) |
Length
| Max length | 104 |
|---|---|
| Median length | 30.5 |
| Mean length | 37.33076923 |
| Min length | 9 |
Characters and Unicode
| Total characters | 9706 |
|---|---|
| Distinct characters | 123 |
| Distinct categories | 8 |
| Distinct scripts | 3 |
| Distinct blocks | 4 |
Unique
| Unique | 128 |
|---|---|
| Unique (%) | 49.2% |
Sample
| 1st row | Reprints of Rare Tracts & Imprints, etc |
|---|---|
| 2nd row | The illustrated English poems |
| 3rd row | Duncombe's 'Minor British Drama' |
| 4th row | Duncombe and Co.'s 'Minor British Drama' |
| 5th row | Dicks' Standard Plays |
Common Values
| Value | Count | Frequency (%) |
| Bell's English Classics | 19 | < 0.1% |
| The works of Charles Dickens | 18 | < 0.1% |
| Thomas Hardy's works. The Wessex novels | 15 | < 0.1% |
| Sailing Directions. America | 9 | < 0.1% |
| Routledge's sixpenny novels | 5 | < 0.1% |
| Collection de documents relatifs à l'histoire de Paris pendant la Révolution française | 5 | < 0.1% |
| Way-about Series | 4 | < 0.1% |
| Macmillan's Illustrated standard novels | 4 | < 0.1% |
| The Romance of History | 4 | < 0.1% |
| Recueil de voyages et de documents pour servir à l'histoire de la géographie | 4 | < 0.1% |
| Other values (147) | 173 | 0.3% |
| (Missing) | 52435 |
Words
| Value | Count | Frequency (%) |
| the | 62 | 4.5% |
| of | 50 | 3.6% |
| de | 46 | 3.3% |
| works | 40 | 2.9% |
| english | 33 | 2.4% |
| classics | 27 | 1.9% |
| novels | 27 | 1.9% |
| documents | 24 | 1.7% |
| series | 20 | 1.4% |
| bell's | 20 | 1.4% |
| Other values (432) | 1043 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1132 | 11.7% |
| e | 904 | 9.3% |
| s | 796 | 8.2% |
| i | 633 | 6.5% |
| r | 542 | 5.6% |
| a | 515 | 5.3% |
| o | 498 | 5.1% |
| n | 482 | 5.0% |
| l | 455 | 4.7% |
| t | 418 | 4.3% |
| Other values (113) | 3331 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7490 | |
| Space Separator | 1132 | 11.7% |
| Uppercase Letter | 755 | 7.8% |
| Other Punctuation | 241 | 2.5% |
| Decimal Number | 65 | 0.7% |
| Dash Punctuation | 17 | 0.2% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 904 | |
| s | 796 | |
| i | 633 | 8.5% |
| r | 542 | 7.2% |
| a | 515 | 6.9% |
| o | 498 | 6.6% |
| n | 482 | 6.4% |
| l | 455 | 6.1% |
| t | 418 | 5.6% |
| c | 275 | 3.7% |
| Other values (62) | 1972 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 94 | |
| C | 70 | 9.3% |
| S | 68 | 9.0% |
| E | 56 | 7.4% |
| D | 56 | 7.4% |
| H | 53 | 7.0% |
| B | 47 | 6.2% |
| P | 45 | 6.0% |
| R | 41 | 5.4% |
| A | 36 | 4.8% |
| Other values (22) | 189 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20 | |
| 8 | 11 | |
| 2 | 10 | |
| 7 | 6 | 9.2% |
| 3 | 6 | 9.2% |
| 5 | 5 | 7.7% |
| 9 | 3 | 4.6% |
| 4 | 2 | 3.1% |
| 6 | 2 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 106 | |
| . | 99 | |
| , | 30 | 12.4% |
| & | 2 | 0.8% |
| : | 2 | 0.8% |
| ; | 2 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1132 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8063 | |
| Common | 1461 | 15.1% |
| Cyrillic | 182 | 1.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 904 | 11.2% |
| s | 796 | 9.9% |
| i | 633 | 7.9% |
| r | 542 | 6.7% |
| a | 515 | 6.4% |
| o | 498 | 6.2% |
| n | 482 | 6.0% |
| l | 455 | 5.6% |
| t | 418 | 5.2% |
| c | 275 | 3.4% |
| Other values (59) | 2545 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 17 | 9.3% |
| т | 17 | 9.3% |
| о | 16 | 8.8% |
| е | 15 | 8.2% |
| с | 14 | 7.7% |
| и | 12 | 6.6% |
| к | 11 | 6.0% |
| р | 11 | 6.0% |
| п | 6 | 3.3% |
| м | 6 | 3.3% |
| Other values (25) | 57 |
Common
| Value | Count | Frequency (%) |
| (space) | 1132 | |
| ' | 106 | 7.3% |
| . | 99 | 6.8% |
| , | 30 | 2.1% |
| 1 | 20 | 1.4% |
| - | 17 | 1.2% |
| 8 | 11 | 0.8% |
| 2 | 10 | 0.7% |
| 7 | 6 | 0.4% |
| 3 | 6 | 0.4% |
| Other values (9) | 24 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9383 | |
| Cyrillic | 182 | 1.9% |
| Latin 1 Sup | 134 | 1.4% |
| Latin Ext A | 7 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1132 | 12.1% |
| e | 904 | 9.6% |
| s | 796 | 8.5% |
| i | 633 | 6.7% |
| r | 542 | 5.8% |
| a | 515 | 5.5% |
| o | 498 | 5.3% |
| n | 482 | 5.1% |
| l | 455 | 4.8% |
| t | 418 | 4.5% |
| Other values (59) | 3008 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 59 | |
| à | 19 | 14.2% |
| ö | 18 | 13.4% |
| ä | 7 | 5.2% |
| ñ | 6 | 4.5% |
| á | 6 | 4.5% |
| ç | 5 | 3.7% |
| ó | 4 | 3.0% |
| æ | 3 | 2.2% |
| å | 3 | 2.2% |
| Other values (3) | 4 | 3.0% |
Latin Ext A
| Value | Count | Frequency (%) |
| ĭ | 2 | |
| ż | 1 | |
| ő | 1 | |
| ů | 1 | |
| č | 1 | |
| ī | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 17 | 9.3% |
| т | 17 | 9.3% |
| о | 16 | 8.8% |
| е | 15 | 8.2% |
| с | 14 | 7.7% |
| и | 12 | 6.6% |
| к | 11 | 6.0% |
| р | 11 | 6.0% |
| п | 6 | 3.3% |
| м | 6 | 3.3% |
| Other values (25) | 57 |
| Distinct | 110 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 52584 |
| Missing (%) | 99.8% |
| Memory size | 411.8 KiB |
| number 4 [Way-about Series] | 2 |
|---|---|
| volume 26, number 168 [Parliamentary Papers. House of Commons. Session 1831-32] | 1 |
| number 1-5 [Série A. Opérations électorales de 1789] | 1 |
| 29, 2 [Bibliothek der neuesten und wichtigsten Reisebeschreibungen] | 1 |
| Band 7 [Historische Bibliothek] | 1 |
| Other values (105) |
Length
| Max length | 115 |
|---|---|
| Median length | 50 |
| Mean length | 51.56756757 |
| Min length | 23 |
Characters and Unicode
| Total characters | 5724 |
|---|---|
| Distinct characters | 100 |
| Distinct categories | 8 |
| Distinct scripts | 3 |
| Distinct blocks | 4 |
Unique
| Unique | 109 |
|---|---|
| Unique (%) | 98.2% |
Sample
| 1st row | volume 4 [Reprints of Rare Tracts & Imprints, etc] |
|---|---|
| 2nd row | number 19 [Duncombe's 'Minor British Drama'] |
| 3rd row | number 7 [Duncombe and Co.'s 'Minor British Drama'] |
| 4th row | number 956 [Dicks' Standard Plays] |
| 5th row | number 1 [Chants for Socialists] |
Common Values
| Value | Count | Frequency (%) |
| number 4 [Way-about Series] | 2 | < 0.1% |
| volume 26, number 168 [Parliamentary Papers. House of Commons. Session 1831-32] | 1 | < 0.1% |
| number 1-5 [Série A. Opérations électorales de 1789] | 1 | < 0.1% |
| 29, 2 [Bibliothek der neuesten und wichtigsten Reisebeschreibungen] | 1 | < 0.1% |
| Band 7 [Historische Bibliothek] | 1 | < 0.1% |
| 13 [Bibljoteka historyczna] | 1 | < 0.1% |
| volume 383 [Collection of Ancient and Modern British Authors] | 1 | < 0.1% |
| number 6 [Записки Императорской Академіи Наукъ. том. 8. прил] | 1 | < 0.1% |
| volume 1861, 1862 [Archæologia Cambrensis. Supplement] | 1 | < 0.1% |
| number 4 [Koninklijke Vlaamse Academie voor Taal- en Letterkunde. Publicaties. reeks 5] | 1 | < 0.1% |
| Other values (100) | 100 | 0.2% |
| (Missing) | 52584 |
Words
| Value | Count | Frequency (%) |
| volume | 45 | 5.2% |
| number | 44 | 5.0% |
| the | 25 | 2.9% |
| works | 21 | 2.4% |
| de | 21 | 2.4% |
| thomas | 19 | 2.2% |
| novels | 18 | 2.1% |
| hardy's | 17 | 1.9% |
| wessex | 17 | 1.9% |
| of | 17 | 1.9% |
| Other values (300) | 629 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 762 | 13.3% |
| e | 517 | 9.0% |
| s | 325 | 5.7% |
| r | 319 | 5.6% |
| o | 288 | 5.0% |
| n | 274 | 4.8% |
| i | 272 | 4.8% |
| a | 264 | 4.6% |
| t | 199 | 3.5% |
| l | 194 | 3.4% |
| Other values (90) | 2310 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3945 | |
| Space Separator | 762 | 13.3% |
| Uppercase Letter | 377 | 6.6% |
| Decimal Number | 260 | 4.5% |
| Other Punctuation | 140 | 2.4% |
| Open Punctuation | 111 | 1.9% |
| Close Punctuation | 111 | 1.9% |
| Dash Punctuation | 18 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 517 | |
| s | 325 | 8.2% |
| r | 319 | 8.1% |
| o | 288 | 7.3% |
| n | 274 | 6.9% |
| i | 272 | 6.9% |
| a | 264 | 6.7% |
| t | 199 | 5.0% |
| l | 194 | 4.9% |
| u | 191 | 4.8% |
| Other values (42) | 1102 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 53 | |
| S | 46 | |
| H | 39 | |
| B | 28 | 7.4% |
| P | 26 | 6.9% |
| W | 23 | 6.1% |
| R | 22 | 5.8% |
| A | 17 | 4.5% |
| C | 14 | 3.7% |
| D | 13 | 3.4% |
| Other values (19) | 96 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 60 | |
| 3 | 42 | |
| 2 | 34 | |
| 4 | 25 | |
| 8 | 22 | 8.5% |
| 5 | 21 | 8.1% |
| 6 | 18 | 6.9% |
| 7 | 16 | 6.2% |
| 9 | 14 | 5.4% |
| 0 | 8 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 55 | |
| ' | 44 | |
| , | 38 | |
| & | 2 | 1.4% |
| : | 1 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 762 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 111 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 111 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4282 | |
| Common | 1402 | 24.5% |
| Cyrillic | 40 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 517 | 12.1% |
| s | 325 | 7.6% |
| r | 319 | 7.4% |
| o | 288 | 6.7% |
| n | 274 | 6.4% |
| i | 272 | 6.4% |
| a | 264 | 6.2% |
| t | 199 | 4.6% |
| l | 194 | 4.5% |
| u | 191 | 4.5% |
| Other values (51) | 1439 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 4 | 10.0% |
| и | 4 | 10.0% |
| к | 4 | 10.0% |
| п | 3 | 7.5% |
| м | 3 | 7.5% |
| р | 3 | 7.5% |
| о | 3 | 7.5% |
| с | 2 | 5.0% |
| е | 2 | 5.0% |
| т | 2 | 5.0% |
| Other values (10) | 10 |
Common
| Value | Count | Frequency (%) |
| (space) | 762 | |
| [ | 111 | 7.9% |
| ] | 111 | 7.9% |
| 1 | 60 | 4.3% |
| . | 55 | 3.9% |
| ' | 44 | 3.1% |
| 3 | 42 | 3.0% |
| , | 38 | 2.7% |
| 2 | 34 | 2.4% |
| 4 | 25 | 1.8% |
| Other values (9) | 120 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5636 | |
| Latin 1 Sup | 45 | 0.8% |
| Cyrillic | 40 | 0.7% |
| Latin Ext A | 3 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 762 | 13.5% |
| e | 517 | 9.2% |
| s | 325 | 5.8% |
| r | 319 | 5.7% |
| o | 288 | 5.1% |
| n | 274 | 4.9% |
| i | 272 | 4.8% |
| a | 264 | 4.7% |
| t | 199 | 3.5% |
| l | 194 | 3.4% |
| Other values (59) | 2222 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 20 | |
| ö | 9 | |
| ä | 5 | 11.1% |
| à | 5 | 11.1% |
| á | 2 | 4.4% |
| É | 1 | 2.2% |
| å | 1 | 2.2% |
| æ | 1 | 2.2% |
| ü | 1 | 2.2% |
Cyrillic
| Value | Count | Frequency (%) |
| а | 4 | 10.0% |
| и | 4 | 10.0% |
| к | 4 | 10.0% |
| п | 3 | 7.5% |
| м | 3 | 7.5% |
| р | 3 | 7.5% |
| о | 3 | 7.5% |
| с | 2 | 5.0% |
| е | 2 | 5.0% |
| т | 2 | 5.0% |
| Other values (10) | 10 |
Latin Ext A
| Value | Count | Frequency (%) |
| ĭ | 2 | |
| ī | 1 |
| Distinct | 71 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 16235 |
| Missing (%) | 30.8% |
| Memory size | 411.8 KiB |
| England | |
|---|---|
| United States of America | 2298 |
| Scotland | 1649 |
| England ; Scotland | 621 |
| Ireland | 434 |
| Other values (66) | 1174 |
Length
| Max length | 45 |
|---|---|
| Median length | 7 |
| Mean length | 8.624821722 |
| Min length | 5 |
Characters and Unicode
| Total characters | 314461 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 29 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | England |
|---|---|
| 2nd row | England |
| 3rd row | Ireland |
| 4th row | England |
| 5th row | England |
Common Values
| Value | Count | Frequency (%) |
| England | 30284 | |
| United States of America | 2298 | 4.4% |
| Scotland | 1649 | 3.1% |
| England ; Scotland | 621 | 1.2% |
| Ireland | 434 | 0.8% |
| England ; United States of America | 394 | 0.7% |
| Italy | 119 | 0.2% |
| France | 92 | 0.2% |
| Wales | 69 | 0.1% |
| Russia | 58 | 0.1% |
| Other values (61) | 442 | 0.8% |
| (Missing) | 16235 |
Words
| Value | Count | Frequency (%) |
| england | 31380 | |
| united | 2699 | 5.8% |
| states | 2698 | 5.8% |
| of | 2698 | 5.8% |
| america | 2698 | 5.8% |
| scotland | 2275 | 4.9% |
| (blank) | 1109 | 2.4% |
| ireland | 539 | 1.2% |
| italy | 119 | 0.3% |
| france | 92 | 0.2% |
| Other values (52) | 548 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 68603 | |
| a | 40278 | |
| d | 36956 | |
| l | 34485 | |
| g | 31429 | |
| E | 31380 | |
| t | 10613 | 3.4% |
| (space) | 10395 | 3.3% |
| e | 9058 | 2.9% |
| i | 5648 | 1.8% |
| Other values (37) | 35616 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 259907 | |
| Uppercase Letter | 43049 | 13.7% |
| Space Separator | 10395 | 3.3% |
| Other Punctuation | 1109 | 0.4% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 68603 | |
| a | 40278 | |
| d | 36956 | |
| l | 34485 | |
| g | 31429 | |
| t | 10613 | 4.1% |
| e | 9058 | 3.5% |
| i | 5648 | 2.2% |
| c | 5135 | 2.0% |
| o | 5124 | 2.0% |
| Other values (14) | 12578 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 31380 | |
| S | 5010 | 11.6% |
| A | 2733 | 6.3% |
| U | 2704 | 6.3% |
| I | 683 | 1.6% |
| F | 92 | 0.2% |
| W | 73 | 0.2% |
| N | 73 | 0.2% |
| G | 68 | 0.2% |
| R | 60 | 0.1% |
| Other values (10) | 173 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10395 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1109 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 302956 | |
| Common | 11505 | 3.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 68603 | |
| a | 40278 | |
| d | 36956 | |
| l | 34485 | |
| g | 31429 | |
| E | 31380 | |
| t | 10613 | 3.5% |
| e | 9058 | 3.0% |
| i | 5648 | 1.9% |
| c | 5135 | 1.7% |
| Other values (34) | 29371 |
Common
| Value | Count | Frequency (%) |
| (space) | 10395 | |
| ; | 1109 | 9.6% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 314461 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 68603 | |
| a | 40278 | |
| d | 36956 | |
| l | 34485 | |
| g | 31429 | |
| E | 31380 | |
| t | 10613 | 3.4% |
| (space) | 10395 | 3.3% |
| e | 9058 | 2.9% |
| i | 5648 | 1.8% |
| Other values (37) | 35616 |
| Distinct | 3492 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 772 |
| Missing (%) | 1.5% |
| Memory size | 411.8 KiB |
| London | |
|---|---|
| Paris | 2132 |
| New York | 1179 |
| Edinburgh | 1026 |
| Edinburgh ; London | 524 |
| Other values (3487) |
Length
| Max length | 288 |
|---|---|
| Median length | 6 |
| Mean length | 7.625272037 |
| Min length | 4 |
Characters and Unicode
| Total characters | 395927 |
|---|---|
| Distinct characters | 193 |
| Distinct categories | 10 |
| Distinct scripts | 5 |
| Distinct blocks | 9 |
Unique
| Unique | 1995 |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | London |
|---|---|
| 2nd row | London |
| 3rd row | Dublin |
| 4th row | Plymouth |
| 5th row | London |
Common Values
| Value | Count | Frequency (%) |
| London | 26743 | 50.8% |
| Paris | 2132 | 4.0% |
| New York | 1179 | 2.2% |
| Edinburgh | 1026 | 1.9% |
| Edinburgh ; London | 524 | 1.0% |
| Leipzig | 506 | 1.0% |
| Philadelphia | 450 | 0.9% |
| Berlin | 427 | 0.8% |
| Dublin | 424 | 0.8% |
| London ; New York | 356 | 0.7% |
| Other values (3482) | 18156 | 34.5% |
| (Missing) | 772 | 1.5% |
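The layout of the Common Values table above (top values, an aggregated "Other values (n)" row, then "(Missing)", with every share taken over all observations) can be reproduced with a short sketch. The function name `common_values`, its `top` parameter, and the toy input are illustrative assumptions, not the report generator's own code.

```python
from collections import Counter

def common_values(values, top=3):
    """Return (label, count, share) rows: top values, the rest, then missing."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    ranked = counts.most_common()
    rows = list(ranked[:top])
    rest = ranked[top:]
    if rest:
        # Everything outside the top rows collapses into one aggregate row.
        rows.append((f"Other values ({len(rest)})", sum(n for _, n in rest)))
    missing = len(values) - len(present)
    if missing:
        rows.append(("(Missing)", missing))
    # Shares are relative to all observations, missing ones included.
    return [(label, n, n / len(values)) for label, n in rows]

# Toy data (hypothetical), not the catalogue itself.
places = ["London"] * 4 + ["Paris"] * 2 + ["Dublin", "Berlin", None]
rows = common_values(places, top=2)
for label, n, share in rows:
    print(f"| {label} | {n} | {share:.1%} |")
```

Dividing by the full observation count rather than the non-missing count matches the table above, where Paris at 2132 of 52695 observations is reported as 4.0%.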
Words
| Value | Count | Frequency (%) |
| london | 29044 | 46.7% |
| (empty) | 2977 | 4.8% |
| paris | 2310 | 3.7% |
| york | 1797 | 2.9% |
| new | 1779 | 2.9% |
| edinburgh | 1698 | 2.7% |
| boston | 659 | 1.1% |
| leipzig | 628 | 1.0% |
| dublin | 488 | 0.8% |
| philadelphia | 477 | 0.8% |
| Other values (2815) | 20394 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 71179 | |
| o | 70489 | |
| d | 35620 | 9.0% |
| L | 30903 | 7.8% |
| e | 17066 | 4.3% |
| r | 16616 | 4.2% |
| a | 15280 | 3.9% |
| i | 15003 | 3.8% |
| s | 11178 | 2.8% |
| (space) | 10328 | 2.6% |
| Other values (183) | 102265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 321702 | |
| Uppercase Letter | 59157 | 14.9% |
| Space Separator | 10328 | 2.6% |
| Other Punctuation | 4242 | 1.1% |
| Dash Punctuation | 460 | 0.1% |
| Decimal Number | 26 | < 0.1% |
| Nonspacing Mark | 4 | < 0.1% |
| Modifier Letter | 4 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
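The category names in the table above are Unicode general categories. A minimal sketch of how such per-category character counts can be derived with Python's standard `unicodedata` module follows; the `CATEGORY_NAMES` mapping is a hand-written subset chosen for illustration, and the three sample values are taken from the Common Values table rather than the full column.

```python
import unicodedata
from collections import Counter

# Long names for a few general-category codes. This mapping is an
# illustrative subset, not the report generator's own table.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

sample = ["London", "Edinburgh ; London", "New York"]
counts = category_counts(sample)
total = sum(counts.values())
for name, n in counts.most_common():
    print(f"| {name} | {n} | {n / total:.1%} |")
```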
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 71179 | |
| o | 70489 | |
| d | 35620 | |
| e | 17066 | 5.3% |
| r | 16616 | 5.2% |
| a | 15280 | 4.7% |
| i | 15003 | 4.7% |
| s | 11178 | 3.5% |
| t | 8631 | 2.7% |
| l | 8176 | 2.5% |
| Other values (107) | 52464 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 30903 | |
| P | 3757 | 6.4% |
| B | 3271 | 5.5% |
| N | 2528 | 4.3% |
| M | 2017 | 3.4% |
| E | 1921 | 3.2% |
| Y | 1883 | 3.2% |
| C | 1879 | 3.2% |
| S | 1500 | 2.5% |
| D | 1101 | 1.9% |
| Other values (48) | 8397 | 14.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 2 | 6 | |
| 6 | 5 | |
| 8 | 5 | |
| 3 | 1 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 2956 | |
| , | 1161 | 27.4% |
| ' | 106 | 2.5% |
| & | 19 | 0.4% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 1 | |
| ︡ | 1 | |
| ̒ | 1 | |
| ̤ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10328 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 460 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 4 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 374952 | |
| Common | 15064 | 3.8% |
| Cyrillic | 5558 | 1.4% |
| Greek | 349 | 0.1% |
| Inherited | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 71179 | |
| o | 70489 | |
| d | 35620 | |
| L | 30903 | 8.2% |
| e | 17066 | 4.6% |
| r | 16616 | 4.4% |
| a | 15280 | 4.1% |
| i | 15003 | 4.0% |
| s | 11178 | 3.0% |
| t | 8631 | 2.3% |
| Other values (86) | 82987 |
Cyrillic
| Value | Count | Frequency (%) |
| е | 651 | |
| р | 632 | |
| т | 464 | 8.3% |
| а | 398 | 7.2% |
| ъ | 352 | 6.3% |
| к | 322 | 5.8% |
| г | 313 | 5.6% |
| у | 310 | 5.6% |
| б | 302 | 5.4% |
| С | 277 | 5.0% |
| Other values (37) | 1537 |
Greek
| Value | Count | Frequency (%) |
| ν | 46 | |
| ι | 41 | |
| η | 37 | |
| θ | 33 | |
| Ἀ | 32 | |
| α | 32 | |
| ς | 28 | |
| ο | 15 | 4.3% |
| ρ | 10 | 2.9% |
| σ | 10 | 2.9% |
| Other values (22) | 65 |
Common
| Value | Count | Frequency (%) |
| (space) | 10328 | 68.6% |
| ; | 2956 | 19.6% |
| , | 1161 | 7.7% |
| - | 460 | 3.1% |
| ' | 106 | 0.7% |
| & | 19 | 0.1% |
| 1 | 9 | 0.1% |
| 2 | 6 | < 0.1% |
| 6 | 5 | < 0.1% |
| 8 | 5 | < 0.1% |
| Other values (4) | 9 | 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 1 | |
| ︡ | 1 | |
| ̒ | 1 | |
| ̤ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 388753 | |
| Cyrillic | 5558 | 1.4% |
| Latin 1 Sup | 1194 | 0.3% |
| None | 305 | 0.1% |
| Latin Ext A | 65 | < 0.1% |
| Greek Ext | 44 | < 0.1% |
| Modifier Letters | 4 | < 0.1% |
| Half Marks | 2 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 71179 | |
| o | 70489 | |
| d | 35620 | 9.2% |
| L | 30903 | 7.9% |
| e | 17066 | 4.4% |
| r | 16616 | 4.3% |
| a | 15280 | 3.9% |
| i | 15003 | 3.9% |
| s | 11178 | 2.9% |
| (space) | 10328 | 2.7% |
| Other values (55) | 95091 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| ü | 289 | |
| é | 232 | |
| ø | 190 | |
| ö | 133 | |
| á | 72 | 6.0% |
| è | 65 | 5.4% |
| æ | 38 | 3.2% |
| ó | 37 | 3.1% |
| â | 33 | 2.8% |
| É | 18 | 1.5% |
| Other values (18) | 87 | 7.3% |
Latin Ext A
| Value | Count | Frequency (%) |
| ń | 25 | |
| ě | 8 | 12.3% |
| ł | 7 | 10.8% |
| ż | 5 | 7.7% |
| œ | 3 | 4.6% |
| ĭ | 3 | 4.6% |
| ő | 3 | 4.6% |
| ō | 2 | 3.1% |
| š | 2 | 3.1% |
| ą | 1 | 1.5% |
| Other values (6) | 6 | 9.2% |
Greek Ext
| Value | Count | Frequency (%) |
| Ἀ | 32 | |
| ῃ | 3 | 6.8% |
| ῳ | 3 | 6.8% |
| Ἱ | 2 | 4.5% |
| Ἑ | 2 | 4.5% |
| ᾳ | 2 | 4.5% |
None
| Value | Count | Frequency (%) |
| ν | 46 | |
| ι | 41 | |
| η | 37 | |
| θ | 33 | |
| α | 32 | |
| ς | 28 | |
| ο | 15 | 4.9% |
| ρ | 10 | 3.3% |
| σ | 10 | 3.3% |
| υ | 10 | 3.3% |
| Other values (16) | 43 |
Cyrillic
| Value | Count | Frequency (%) |
| е | 651 | |
| р | 632 | |
| т | 464 | 8.3% |
| а | 398 | 7.2% |
| ъ | 352 | 6.3% |
| к | 322 | 5.8% |
| г | 313 | 5.6% |
| у | 310 | 5.6% |
| б | 302 | 5.4% |
| С | 277 | 5.0% |
| Other values (37) | 1537 |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 1 | |
| ︡ | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 4 |
Diacriticals
| Value | Count | Frequency (%) |
| ̒ | 1 | |
| ̤ | 1 |
Publisher
| Distinct | 7263 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 25208 |
| Missing (%) | 47.8% |
| Memory size | 411.8 KiB |
| Macmillan | 546 |
|---|---|
| Sampson Low | 490 |
| Hurst & Blackett | 460 |
| Chatto & Windus | 426 |
| Chapman & Hall | 389 |
| Other values (7258) | 25176 |
Length
| Max length | 186 |
|---|---|
| Median length | 12 |
| Mean length | 12.93138575 |
| Min length | 4 |
Characters and Unicode
| Total characters | 355445 |
|---|---|
| Distinct characters | 133 |
| Distinct categories | 9 |
| Distinct scripts | 4 |
| Distinct blocks | 7 |
Unique
| Unique | 5144 |
|---|---|
| Unique (%) | 18.7% |
Sample
| 1st row | Richard Milliken |
|---|---|
| 2nd row | W. Cann |
| 3rd row | Elliot Stock |
| 4th row | I. C. Bose |
| 5th row | E. T. W. Dennis |
Common Values
| Value | Count | Frequency (%) |
| Macmillan | 546 | 1.0% |
| Sampson Low | 490 | 0.9% |
| Hurst & Blackett | 460 | 0.9% |
| Chatto & Windus | 426 | 0.8% |
| Chapman & Hall | 389 | 0.7% |
| Longmans | 364 | 0.7% |
| Richard Bentley | 355 | 0.7% |
| R. Bentley | 314 | 0.6% |
| John Murray | 306 | 0.6% |
| Cassell | 288 | 0.5% |
| Other values (7253) | 23549 | 44.7% |
| (Missing) | 25208 | 47.8% |
Words
| Value | Count | Frequency (%) |
| (empty) | 6055 | 9.4% |
| j | 2628 | 4.1% |
| w | 1939 | 3.0% |
| h | 1182 | 1.8% |
| a | 1130 | 1.8% |
| t | 1095 | 1.7% |
| r | 1035 | 1.6% |
| g | 1023 | 1.6% |
| john | 840 | 1.3% |
| c | 832 | 1.3% |
| Other values (4725) | 46808 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 37080 | 10.4% |
| e | 24970 | 7.0% |
| n | 23101 | 6.5% |
| a | 21560 | 6.1% |
| l | 18774 | 5.3% |
| o | 18608 | 5.2% |
| r | 17865 | 5.0% |
| i | 16390 | 4.6% |
| . | 14756 | 4.2% |
| t | 14224 | 4.0% |
| Other values (123) | 148117 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 236157 | |
| Uppercase Letter | 58386 | 16.4% |
| Space Separator | 37080 | 10.4% |
| Other Punctuation | 23603 | 6.6% |
| Dash Punctuation | 103 | < 0.1% |
| Nonspacing Mark | 62 | < 0.1% |
| Decimal Number | 48 | < 0.1% |
| Modifier Letter | 5 | < 0.1% |
| Other Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24970 | |
| n | 23101 | 9.8% |
| a | 21560 | 9.1% |
| l | 18774 | 7.9% |
| o | 18608 | 7.9% |
| r | 17865 | 7.6% |
| i | 16390 | 6.9% |
| t | 14224 | 6.0% |
| s | 13443 | 5.7% |
| h | 9042 | 3.8% |
| Other values (62) | 58180 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 5051 | 8.7% |
| S | 4933 | 8.4% |
| H | 4775 | 8.2% |
| B | 4676 | 8.0% |
| J | 4472 | 7.7% |
| C | 4222 | 7.2% |
| M | 3764 | 6.4% |
| R | 3293 | 5.6% |
| L | 3090 | 5.3% |
| G | 2594 | 4.4% |
| Other values (31) | 17516 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15 | |
| 3 | 9 | |
| 9 | 7 | |
| 2 | 5 | 10.4% |
| 5 | 4 | 8.3% |
| 7 | 3 | 6.2% |
| 0 | 2 | 4.2% |
| 6 | 2 | 4.2% |
| 4 | 1 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14756 | |
| & | 4996 | 21.2% |
| , | 2271 | 9.6% |
| ; | 1049 | 4.4% |
| ' | 531 | 2.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 31 | |
| ︡ | 31 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37080 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 103 | 100.0% |
Other Letter
| Value | Count | Frequency (%) |
| º | 1 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 5 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 294410 | |
| Common | 60839 | 17.1% |
| Cyrillic | 134 | < 0.1% |
| Inherited | 62 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24970 | 8.5% |
| n | 23101 | 7.8% |
| a | 21560 | 7.3% |
| l | 18774 | 6.4% |
| o | 18608 | 6.3% |
| r | 17865 | 6.1% |
| i | 16390 | 5.6% |
| t | 14224 | 4.8% |
| s | 13443 | 4.6% |
| h | 9042 | 3.1% |
| Other values (65) | 116433 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 18 | |
| о | 13 | 9.7% |
| р | 11 | 8.2% |
| е | 10 | 7.5% |
| и | 9 | 6.7% |
| н | 8 | 6.0% |
| в | 6 | 4.5% |
| т | 6 | 4.5% |
| г | 6 | 4.5% |
| п | 5 | 3.7% |
| Other values (29) | 42 |
Common
| Value | Count | Frequency (%) |
| (space) | 37080 | 60.9% |
| . | 14756 | 24.3% |
| & | 4996 | 8.2% |
| , | 2271 | 3.7% |
| ; | 1049 | 1.7% |
| ' | 531 | 0.9% |
| - | 103 | 0.2% |
| 1 | 15 | < 0.1% |
| 3 | 9 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (7) | 22 | < 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 31 | |
| ︡ | 31 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 354995 | |
| Latin 1 Sup | 220 | 0.1% |
| Cyrillic | 134 | < 0.1% |
| Half Marks | 62 | < 0.1% |
| Latin Ext A | 28 | < 0.1% |
| Modifier Letters | 5 | < 0.1% |
| Latin Ext Additional | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 37080 | 10.4% |
| e | 24970 | 7.0% |
| n | 23101 | 6.5% |
| a | 21560 | 6.1% |
| l | 18774 | 5.3% |
| o | 18608 | 5.2% |
| r | 17865 | 5.0% |
| i | 16390 | 4.6% |
| . | 14756 | 4.2% |
| t | 14224 | 4.0% |
| Other values (58) | 147667 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| ü | 107 | |
| á | 29 | 13.2% |
| é | 25 | 11.4% |
| è | 21 | 9.5% |
| ö | 16 | 7.3% |
| ä | 5 | 2.3% |
| ó | 5 | 2.3% |
| æ | 3 | 1.4% |
| â | 2 | 0.9% |
| ô | 2 | 0.9% |
| Other values (4) | 5 | 2.3% |
Latin Ext A
| Value | Count | Frequency (%) |
| ĭ | 10 | |
| ī | 7 | |
| ō | 3 | 10.7% |
| ł | 3 | 10.7% |
| ő | 2 | 7.1% |
| Ė | 1 | 3.6% |
| ę | 1 | 3.6% |
| Ż | 1 | 3.6% |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 31 | |
| ︡ | 31 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 18 | |
| о | 13 | 9.7% |
| р | 11 | 8.2% |
| е | 10 | 7.5% |
| и | 9 | 6.7% |
| н | 8 | 6.0% |
| в | 6 | 4.5% |
| т | 6 | 4.5% |
| г | 6 | 4.5% |
| п | 5 | 3.7% |
| Other values (29) | 42 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 5 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| Ḟ | 1 |
Date of publication
| Distinct | 458 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 178 |
| Missing (%) | 0.3% |
| Memory size | 411.8 KiB |
| 1897 | 1478 |
|---|---|
| 1896 | 1414 |
| 1895 | 1277 |
| 1893 | 1205 |
| 1890 | 1182 |
| Other values (453) | 45961 |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 4.007806996 |
| Min length | 4 |
Characters and Unicode
| Total characters | 210478 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 123 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 1786 |
|---|---|
| 2nd row | 1679 |
| 3rd row | 1816 |
| 4th row | 1868 |
| 5th row | 1888 |
Common Values
| Value | Count | Frequency (%) |
| 1897 | 1478 | 2.8% |
| 1896 | 1414 | 2.7% |
| 1895 | 1277 | 2.4% |
| 1893 | 1205 | 2.3% |
| 1890 | 1182 | 2.2% |
| 1894 | 1154 | 2.2% |
| 1891 | 1129 | 2.1% |
| 1898 | 1116 | 2.1% |
| 1892 | 1103 | 2.1% |
| 1889 | 1022 | 1.9% |
| Other values (448) | 40437 | 76.7% |
Words
| Value | Count | Frequency (%) |
| 1897 | 1479 | 2.8% |
| 1896 | 1415 | 2.7% |
| 1895 | 1277 | 2.4% |
| 1893 | 1206 | 2.3% |
| 1890 | 1182 | 2.3% |
| 1894 | 1154 | 2.2% |
| 1891 | 1129 | 2.1% |
| 1898 | 1116 | 2.1% |
| 1892 | 1103 | 2.1% |
| 1889 | 1022 | 1.9% |
| Other values (412) | 40434 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 60749 | |
| 8 | 60618 | |
| 9 | 21261 | 10.1% |
| 7 | 14294 | 6.8% |
| 6 | 11744 | 5.6% |
| 5 | 10399 | 4.9% |
| 4 | 8566 | 4.1% |
| 2 | 8074 | 3.8% |
| 3 | 7453 | 3.5% |
| 0 | 7206 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 210364 | 99.9% |
| Dash Punctuation | 114 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 60749 | |
| 8 | 60618 | |
| 9 | 21261 | 10.1% |
| 7 | 14294 | 6.8% |
| 6 | 11744 | 5.6% |
| 5 | 10399 | 4.9% |
| 4 | 8566 | 4.1% |
| 2 | 8074 | 3.8% |
| 3 | 7453 | 3.5% |
| 0 | 7206 | 3.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 114 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 210478 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 60749 | |
| 8 | 60618 | |
| 9 | 21261 | 10.1% |
| 7 | 14294 | 6.8% |
| 6 | 11744 | 5.6% |
| 5 | 10399 | 4.9% |
| 4 | 8566 | 4.1% |
| 2 | 8074 | 3.8% |
| 3 | 7453 | 3.5% |
| 0 | 7206 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 210478 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 60749 | |
| 8 | 60618 | |
| 9 | 21261 | 10.1% |
| 7 | 14294 | 6.8% |
| 6 | 11744 | 5.6% |
| 5 | 10399 | 4.9% |
| 4 | 8566 | 4.1% |
| 2 | 8074 | 3.8% |
| 3 | 7453 | 3.5% |
| 0 | 7206 | 3.4% |
Edition
| Distinct | 1559 |
|---|---|
| Distinct (%) | 37.1% |
| Missing | 48497 |
| Missing (%) | 92.0% |
| Memory size | 411.8 KiB |
| Another edition | 1027 |
|---|---|
| Second edition | 457 |
| Third edition | 214 |
| New edition | 184 |
| Fourth edition | 91 |
| Other values (1554) | 2225 |
Length
| Max length | 389 |
|---|---|
| Median length | 15 |
| Mean length | 35.27703668 |
| Min length | 10 |
Characters and Unicode
| Total characters | 148093 |
|---|---|
| Distinct characters | 132 |
| Distinct categories | 8 |
| Distinct scripts | 3 |
| Distinct blocks | 4 |
Unique
| Unique | 1437 |
|---|---|
| Unique (%) | 34.2% |
Sample
| 1st row | Fourth edition MANUSCRIPT note |
|---|---|
| 2nd row | New edition |
| 3rd row | Second edition |
| 4th row | Another edition |
| 5th row | Second edition |
Common Values
| Value | Count | Frequency (%) |
| Another edition | 1027 | 1.9% |
| Second edition | 457 | 0.9% |
| Third edition | 214 | 0.4% |
| New edition | 184 | 0.3% |
| Fourth edition | 91 | 0.2% |
| A new edition | 82 | 0.2% |
| Fifth edition | 69 | 0.1% |
| Sixth edition | 45 | 0.1% |
| Seventh edition | 36 | 0.1% |
| Second edition, enlarged | 24 | < 0.1% |
| Other values (1549) | 1969 | 3.7% |
| (Missing) | 48497 | 92.0% |
Words
| Value | Count | Frequency (%) |
| edition | 4095 | 18.1% |
| another | 1724 | 7.6% |
| the | 787 | 3.5% |
| second | 734 | 3.2% |
| with | 678 | 3.0% |
| and | 620 | 2.7% |
| of | 594 | 2.6% |
| new | 577 | 2.6% |
| by | 576 | 2.5% |
| a | 560 | 2.5% |
| Other values (3258) | 11671 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 18418 | 12.4% |
| e | 15581 | |
| i | 14568 | 9.8% |
| t | 12729 | 8.6% |
| n | 11751 | 7.9% |
| o | 11548 | 7.8% |
| d | 8801 | 5.9% |
| r | 6981 | 4.7% |
| h | 5482 | 3.7% |
| a | 5179 | 3.5% |
| Other values (122) | 37055 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114855 | |
| Space Separator | 18418 | 12.4% |
| Uppercase Letter | 9568 | 6.5% |
| Other Punctuation | 4289 | 2.9% |
| Decimal Number | 633 | 0.4% |
| Dash Punctuation | 134 | 0.1% |
| Open Punctuation | 98 | 0.1% |
| Close Punctuation | 98 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15581 | |
| i | 14568 | |
| t | 12729 | |
| n | 11751 | |
| o | 11548 | |
| d | 8801 | |
| r | 6981 | 6.1% |
| h | 5482 | 4.8% |
| a | 5179 | 4.5% |
| s | 3782 | 3.3% |
| Other values (68) | 18453 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2336 | |
| S | 1226 | |
| T | 829 | 8.7% |
| W | 615 | 6.4% |
| N | 590 | 6.2% |
| F | 456 | 4.8% |
| C | 399 | 4.2% |
| E | 323 | 3.4% |
| R | 288 | 3.0% |
| H | 281 | 2.9% |
| Other values (25) | 2225 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 8 | 104 | |
| 2 | 58 | 9.2% |
| 6 | 52 | 8.2% |
| 7 | 51 | 8.1% |
| 3 | 42 | 6.6% |
| 5 | 42 | 6.6% |
| 4 | 32 | 5.1% |
| 0 | 27 | 4.3% |
| 9 | 26 | 4.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2797 | |
| . | 1229 | |
| ' | 258 | 6.0% |
| ? | 3 | 0.1% |
| ! | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 18418 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 134 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 98 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 98 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 124284 | |
| Common | 23670 | 16.0% |
| Cyrillic | 139 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 15581 | |
| i | 14568 | |
| t | 12729 | |
| n | 11751 | |
| o | 11548 | |
| d | 8801 | 7.1% |
| r | 6981 | 5.6% |
| h | 5482 | 4.4% |
| a | 5179 | 4.2% |
| s | 3782 | 3.0% |
| Other values (69) | 27882 |
Cyrillic
| Value | Count | Frequency (%) |
| и | 13 | 9.4% |
| о | 12 | 8.6% |
| е | 10 | 7.2% |
| в | 10 | 7.2% |
| с | 9 | 6.5% |
| р | 9 | 6.5% |
| н | 9 | 6.5% |
| а | 8 | 5.8% |
| і | 7 | 5.0% |
| т | 6 | 4.3% |
| Other values (24) | 46 |
Common
| Value | Count | Frequency (%) |
| (space) | 18418 | 77.8% |
| , | 2797 | 11.8% |
| . | 1229 | 5.2% |
| ' | 258 | 1.1% |
| 1 | 199 | 0.8% |
| - | 134 | 0.6% |
| 8 | 104 | 0.4% |
| ( | 98 | 0.4% |
| ) | 98 | 0.4% |
| 2 | 58 | 0.2% |
| Other values (9) | 277 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 147503 | |
| Latin 1 Sup | 441 | 0.3% |
| Cyrillic | 139 | 0.1% |
| Latin Ext A | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 18418 | 12.5% |
| e | 15581 | |
| i | 14568 | 9.9% |
| t | 12729 | 8.6% |
| n | 11751 | 8.0% |
| o | 11548 | 7.8% |
| d | 8801 | 6.0% |
| r | 6981 | 4.7% |
| h | 5482 | 3.7% |
| a | 5179 | 3.5% |
| Other values (61) | 36465 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 282 | |
| è | 71 | 16.1% |
| ó | 14 | 3.2% |
| à | 9 | 2.0% |
| ä | 8 | 1.8% |
| É | 8 | 1.8% |
| æ | 7 | 1.6% |
| á | 7 | 1.6% |
| ü | 6 | 1.4% |
| ö | 6 | 1.4% |
| Other values (11) | 23 | 5.2% |
Latin Ext A
| Value | Count | Frequency (%) |
| ą | 3 | |
| ę | 2 | |
| ł | 2 | |
| ő | 1 | 10.0% |
| Ż | 1 | 10.0% |
| œ | 1 | 10.0% |
Cyrillic
| Value | Count | Frequency (%) |
| и | 13 | 9.4% |
| о | 12 | 8.6% |
| е | 10 | 7.2% |
| в | 10 | 7.2% |
| с | 9 | 6.5% |
| р | 9 | 6.5% |
| н | 9 | 6.5% |
| а | 8 | 5.8% |
| і | 7 | 5.0% |
| т | 6 | 4.3% |
| Other values (24) | 46 |
Physical description
| Distinct | 10735 |
|---|---|
| Distinct (%) | 26.9% |
| Missing | 12849 |
| Missing (%) | 24.4% |
| Memory size | 411.8 KiB |
| 3 volumes (8°) | 2755 |
|---|---|
| 2 volumes (8°) | 2488 |
| (12°) | 658 |
| 2 tomes (8°) | 344 |
| 2 parts (8°) | 307 |
| Other values (10730) | 33294 |
Length
| Max length | 200 |
|---|---|
| Median length | 14 |
| Mean length | 15.61062591 |
| Min length | 5 |
Characters and Unicode
| Total characters | 622021 |
|---|---|
| Distinct characters | 85 |
| Distinct categories | 11 |
| Distinct scripts | 4 |
| Distinct blocks | 6 |
Unique
| Unique | 7531 |
|---|---|
| Unique (%) | 18.9% |
Sample
| 1st row | 15 pages (4°) |
|---|---|
| 2nd row | 17 pages (8°) |
| 3rd row | 16 pages (8°) |
| 4th row | 40 pages (8°) |
| 5th row | 7 pages (4°) |
Common Values
| Value | Count | Frequency (%) |
| 3 volumes (8°) | 2755 | 5.2% |
| 2 volumes (8°) | 2488 | 4.7% |
| (12°) | 658 | 1.2% |
| 2 tomes (8°) | 344 | 0.7% |
| 2 parts (8°) | 307 | 0.6% |
| 2 volumes (12°) | 302 | 0.6% |
| 16 pages (8°) | 274 | 0.5% |
| 32 pages (8°) | 221 | 0.4% |
| 3 volumes (12°) | 200 | 0.4% |
| 24 pages (8°) | 157 | 0.3% |
| Other values (10725) | 32140 | 61.0% |
| (Missing) | 12849 | 24.4% |
Words
| Value | Count | Frequency (%) |
| 8° | 33791 | |
| pages | 29762 | |
| volumes | 6935 | 5.2% |
| 2 | 4593 | 3.4% |
| 3 | 3549 | 2.6% |
| 4° | 3081 | 2.3% |
| viii | 2378 | 1.8% |
| 12° | 1701 | 1.3% |
| vi | 1234 | 0.9% |
| xii | 898 | 0.7% |
| Other values (1537) | 46229 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 94305 | 15.2% |
| 8 | 40297 | 6.5% |
| s | 39872 | 6.4% |
| ( | 39180 | 6.3% |
| ) | 39178 | 6.3% |
| e | 39169 | 6.3% |
| ° | 38630 | 6.2% |
| a | 32100 | 5.2% |
| p | 31271 | 5.0% |
| g | 29874 | 4.8% |
| Other values (75) | 198145 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 261771 | |
| Decimal Number | 133286 | |
| Space Separator | 94305 | 15.2% |
| Open Punctuation | 39180 | 6.3% |
| Close Punctuation | 39178 | 6.3% |
| Other Symbol | 38630 | 6.2% |
| Other Punctuation | 14995 | 2.4% |
| Uppercase Letter | 454 | 0.1% |
| Dash Punctuation | 209 | < 0.1% |
| Modifier Letter | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 39872 | |
| e | 39169 | |
| a | 32100 | |
| p | 31271 | |
| g | 29874 | |
| i | 21068 | |
| v | 14924 | 5.7% |
| l | 9644 | 3.7% |
| o | 9463 | 3.6% |
| m | 8766 | 3.3% |
| Other values (51) | 25620 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 40297 | |
| 2 | 19517 | |
| 3 | 14690 | 11.0% |
| 1 | 14640 | 11.0% |
| 4 | 12741 | 9.6% |
| 6 | 7286 | 5.5% |
| 5 | 6884 | 5.2% |
| 0 | 6414 | 4.8% |
| 7 | 5543 | 4.2% |
| 9 | 5274 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 412 | |
| H | 32 | 7.0% |
| A | 9 | 2.0% |
| J | 1 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14994 | |
| ; | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 | 83.3% |
| × | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 94305 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 39180 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 38630 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 39178 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 209 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 7 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 359796 | |
| Latin | 261880 | |
| Cyrillic | 329 | 0.1% |
| Greek | 16 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 39872 | |
| e | 39169 | |
| a | 32100 | |
| p | 31271 | |
| g | 29874 | |
| i | 21068 | |
| v | 14924 | 5.7% |
| l | 9644 | 3.7% |
| o | 9463 | 3.6% |
| m | 8766 | 3.3% |
| Other values (31) | 25729 |
Common
| Value | Count | Frequency (%) |
| (space) | 94305 | 26.2% |
| 8 | 40297 | |
| ( | 39180 | |
| ) | 39178 | |
| ° | 38630 | |
| 2 | 19517 | 5.4% |
| , | 14994 | 4.2% |
| 3 | 14690 | 4.1% |
| 1 | 14640 | 4.1% |
| 4 | 12741 | 3.5% |
| Other values (10) | 31624 | 8.8% |
Cyrillic
| Value | Count | Frequency (%) |
| т | 83 | |
| ч | 52 | |
| с | 51 | |
| а | 50 | |
| о | 31 | 9.4% |
| м | 30 | 9.1% |
| к | 6 | 1.8% |
| н | 6 | 1.8% |
| ы | 4 | 1.2% |
| в | 4 | 1.2% |
| Other values (6) | 12 | 3.6% |
Greek
| Value | Count | Frequency (%) |
| ζ | 3 | |
| η | 2 | |
| θ | 2 | |
| τ | 2 | |
| ο | 2 | |
| μ | 2 | |
| ι | 2 | |
| β | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 582701 | |
| Latin 1 Sup | 38959 | 6.3% |
| Cyrillic | 329 | 0.1% |
| None | 16 | < 0.1% |
| Latin Ext A | 9 | < 0.1% |
| Modifier Letters | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 94305 | 16.2% |
| 8 | 40297 | 6.9% |
| s | 39872 | 6.8% |
| ( | 39180 | 6.7% |
| ) | 39178 | 6.7% |
| e | 39169 | 6.7% |
| a | 32100 | 5.5% |
| p | 31271 | 5.4% |
| g | 29874 | 5.1% |
| i | 21068 | 3.6% |
| Other values (36) | 176387 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| ° | 38630 | |
| ä | 262 | 0.7% |
| ö | 37 | 0.1% |
| é | 19 | < 0.1% |
| ü | 5 | < 0.1% |
| í | 3 | < 0.1% |
| æ | 1 | < 0.1% |
| ø | 1 | < 0.1% |
| × | 1 | < 0.1% |
Cyrillic
| Value | Count | Frequency (%) |
| т | 83 | |
| ч | 52 | |
| с | 51 | |
| а | 50 | |
| о | 31 | 9.4% |
| м | 30 | 9.1% |
| к | 6 | 1.8% |
| н | 6 | 1.8% |
| ы | 4 | 1.2% |
| в | 4 | 1.2% |
| Other values (6) | 12 | 3.6% |
Latin Ext A
| Value | Count | Frequency (%) |
| ę | 2 | |
| ś | 2 | |
| ć | 2 | |
| š | 2 | |
| ő | 1 |
None
| Value | Count | Frequency (%) |
| ζ | 3 | |
| η | 2 | |
| θ | 2 | |
| τ | 2 | |
| ο | 2 | |
| μ | 2 | |
| ι | 2 | |
| β | 1 | 6.2% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 7 | 100.0% |
Dewey classification
| Distinct | 67 |
|---|---|
| Distinct (%) | 85.9% |
| Missing | 52617 |
| Missing (%) | 99.9% |
| Memory size | 411.8 KiB |
| 941 | 3 |
|---|---|
| 942 | 3 |
| 823.8 | 3 |
| 915.2 | 2 |
| 915 | 2 |
| Other values (62) | 65 |
Length
| Max length | 17 |
|---|---|
| Median length | 5 |
| Mean length | 5.5 |
| Min length | 3 |
Characters and Unicode
| Total characters | 429 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 59 |
|---|---|
| Unique (%) | 75.6% |
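The summary figures for this variable are mutually consistent if Distinct (%), Unique (%) and Mean length are taken over the non-missing observations, while Missing (%) is taken over all 52695 observations. The sketch below recomputes them from the counts reported above; the variable names are illustrative, and the choice of denominators is an inference from the numbers rather than documented behaviour of the report generator.

```python
n_observations = 52695   # Number of observations (dataset statistics)
missing = 52617          # Missing
distinct = 67            # Distinct
unique = 59              # Unique
total_characters = 429   # Total characters

# Only non-missing cells contribute to distinct, unique and length figures.
present = n_observations - missing

summary = {
    "Distinct (%)": f"{distinct / present:.1%}",
    "Unique (%)": f"{unique / present:.1%}",
    "Missing (%)": f"{missing / n_observations:.1%}",
    "Mean length": round(total_characters / present, 1),
}

for name, value in summary.items():
    print(f"| {name} | {value} |")
```

With only 78 non-missing cells, the high Distinct (%) says little about the column as a whole; the 99.9% missing rate is the figure that matters.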
Sample
| 1st row | 266.96 |
|---|---|
| 2nd row | 915.7 |
| 3rd row | 456 |
| 4th row | 914.7904 |
| 5th row | 623.89296 |
Common Values
| Value | Count | Frequency (%) |
| 941 | 3 | < 0.1% |
| 942 | 3 | < 0.1% |
| 823.8 | 3 | < 0.1% |
| 915.2 | 2 | < 0.1% |
| 915 | 2 | < 0.1% |
| 942.45 | 2 | < 0.1% |
| 942.1 | 2 | < 0.1% |
| 941.1 | 2 | < 0.1% |
| 447.9 | 1 | < 0.1% |
| 944.26 | 1 | < 0.1% |
| Other values (57) | 57 | 0.1% |
| (Missing) | 52617 | 99.9% |
Words
| Value | Count | Frequency (%) |
| 942 | 3 | 3.7% |
| 823.8 | 3 | 3.7% |
| 941 | 3 | 3.7% |
| 915 | 2 | 2.4% |
| 941.1 | 2 | 2.4% |
| (empty) | 2 | 2.4% |
| 942.45 | 2 | 2.4% |
| 915.2 | 2 | 2.4% |
| 942.1 | 2 | 2.4% |
| 266.96 | 1 | 1.2% |
| Other values (60) | 60 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 81 | |
| . | 64 | |
| 2 | 49 | |
| 4 | 48 | |
| 1 | 45 | |
| 8 | 31 | 7.2% |
| 5 | 30 | 7.0% |
| 6 | 23 | 5.4% |
| 3 | 20 | 4.7% |
| 7 | 18 | 4.2% |
| Other values (3) | 20 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 359 | |
| Other Punctuation | 66 | 15.4% |
| Space Separator | 4 | 0.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 81 | |
| 2 | 49 | |
| 4 | 48 | |
| 1 | 45 | |
| 8 | 31 | 8.6% |
| 5 | 30 | 8.4% |
| 6 | 23 | 6.4% |
| 3 | 20 | 5.6% |
| 7 | 18 | 5.0% |
| 0 | 14 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 64 | |
| ; | 2 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 429 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 81 | |
| . | 64 | |
| 2 | 49 | |
| 4 | 48 | |
| 1 | 45 | |
| 8 | 31 | 7.2% |
| 5 | 30 | 7.0% |
| 6 | 23 | 5.4% |
| 3 | 20 | 4.7% |
| 7 | 18 | 4.2% |
| Other values (3) | 20 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 429 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 81 | |
| . | 64 | |
| 2 | 49 | |
| 4 | 48 | |
| 1 | 45 | |
| 8 | 31 | 7.2% |
| 5 | 30 | 7.0% |
| 6 | 23 | 5.4% |
| 3 | 20 | 4.7% |
| 7 | 18 | 4.2% |
| Other values (3) | 20 | 4.7% |
BL shelfmark
| Distinct | 52345 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 267 |
| Missing (%) | 0.5% |
| Memory size | 411.8 KiB |
| Digital Store 012626.e.8 | 6 |
|---|---|
| Digital Store 012624.l | 5 |
| Digital Store 1303.b.3 | 5 |
| Digital Store 10497.w.12 | 3 |
| Digital Store 10007.f.22 | 3 |
| Other values (52340) | 52406 |
Length
| Max length | 94 |
|---|---|
| Median length | 24 |
| Mean length | 24.95017929 |
| Min length | 14 |
Characters and Unicode
| Total characters | 1308088 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 52281 |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | Digital Store 11644.d.32 |
|---|---|
| 2nd row | Digital Store 11602.ee.10. (2.) |
| 3rd row | Digital Store 992.i.12. (3.) |
| 4th row | Digital Store 11602.ee.17. (1.) |
| 5th row | Digital Store 11602.ee.17. (7.) |
Common Values
| Value | Count | Frequency (%) |
| Digital Store 012626.e.8 | 6 | < 0.1% |
| Digital Store 012624.l | 5 | < 0.1% |
| Digital Store 1303.b.3 | 5 | < 0.1% |
| Digital Store 10497.w.12 | 3 | < 0.1% |
| Digital Store 10007.f.22 | 3 | < 0.1% |
| Digital Store 12274.m | 3 | < 0.1% |
| Digital Store 10497.w.20 | 3 | < 0.1% |
| Digital Store 9314.c.7 | 3 | < 0.1% |
| Digital Store 11609.k.5 | 3 | < 0.1% |
| Digital Store 9225.m.15 | 3 | < 0.1% |
| Other values (52335) | 52391 | 99.4% |
| (Missing) | 267 | 0.5% |
Words
| Value | Count | Frequency (%) |
| digital | 53232 | |
| store | 53232 | |
| 1 | 834 | 0.5% |
| 2 | 826 | 0.5% |
| (empty) | 809 | 0.5% |
| 3 | 625 | 0.4% |
| 4 | 523 | 0.3% |
| 5 | 437 | 0.3% |
| 6 | 355 | 0.2% |
| 7 | 307 | 0.2% |
| Other values (48963) | 54444 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 116639 | 8.9% |
| (space) | 113196 | 8.7% |
| i | 110345 | 8.4% |
| t | 106597 | 8.1% |
| 1 | 89661 | 6.9% |
| e | 67023 | 5.1% |
| g | 59130 | 4.5% |
| a | 55903 | 4.3% |
| 0 | 55217 | 4.2% |
| l | 55158 | 4.2% |
| Other values (50) | 479219 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 603537 | |
| Decimal Number | 356508 | |
| Other Punctuation | 117791 | 9.0% |
| Space Separator | 113196 | 8.7% |
| Uppercase Letter | 106581 | 8.1% |
| Open Punctuation | 5115 | 0.4% |
| Close Punctuation | 5115 | 0.4% |
| Dash Punctuation | 245 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 110345 | |
| t | 106597 | |
| e | 67023 | |
| g | 59130 | |
| a | 55903 | |
| l | 55158 | |
| r | 53343 | |
| o | 53292 | |
| b | 8891 | 1.5% |
| f | 8631 | 1.4% |
| Other values (15) | 25224 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 53232 | |
| S | 53232 | |
| J | 95 | 0.1% |
| R | 4 | < 0.1% |
| B | 3 | < 0.1% |
| K | 3 | < 0.1% |
| F | 2 | < 0.1% |
| C | 2 | < 0.1% |
| T | 1 | < 0.1% |
| M | 1 | < 0.1% |
| Other values (6) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 89661 | |
| 0 | 55217 | |
| 2 | 46173 | |
| 6 | 35769 | 10.0% |
| 3 | 28001 | 7.9% |
| 4 | 27673 | 7.8% |
| 9 | 22387 | 6.3% |
| 5 | 20772 | 5.8% |
| 7 | 18311 | 5.1% |
| 8 | 12544 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 116639 | |
| ; | 836 | 0.7% |
| / | 201 | 0.2% |
| , | 72 | 0.1% |
| * | 43 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 113196 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5115 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5115 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 245 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 710118 | |
| Common | 597970 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 110345 | |
| t | 106597 | |
| e | 67023 | |
| g | 59130 | |
| a | 55903 | |
| l | 55158 | |
| r | 53343 | |
| o | 53292 | |
| D | 53232 | |
| S | 53232 | |
| Other values (31) | 42863 | 6.0% |
Common
| Value | Count | Frequency (%) |
| . | 116639 | |
| (space) | 113196 | |
| 1 | 89661 | |
| 0 | 55217 | |
| 2 | 46173 | 7.7% |
| 6 | 35769 | 6.0% |
| 3 | 28001 | 4.7% |
| 4 | 27673 | 4.6% |
| 9 | 22387 | 3.7% |
| 5 | 20772 | 3.5% |
| Other values (9) | 42482 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1308088 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 116639 | 8.9% |
| (space) | 113196 | 8.7% |
| i | 110345 | 8.4% |
| t | 106597 | 8.1% |
| 1 | 89661 | 6.9% |
| e | 67023 | 5.1% |
| g | 59130 | 4.5% |
| a | 55903 | 4.3% |
| 0 | 55217 | 4.2% |
| l | 55158 | 4.2% |
| Other values (50) | 479219 |
| Distinct | 1208 |
|---|---|
| Distinct (%) | 38.5% |
| Missing | 49559 |
| Missing (%) | 94.0% |
| Memory size | 411.8 KiB |
| India | 216 |
|---|---|
| Canada | 157 |
| Australia | 99 |
| Revolutions | 85 |
| New Zealand | 73 |
| Other values (1203) |
Length
| Max length | 547 |
|---|---|
| Median length | 23 |
| Mean length | 31.85299745 |
| Min length | 3 |
Characters and Unicode
| Total characters | 99891 |
|---|---|
| Distinct characters | 93 |
| Distinct categories | 10 |
| Distinct scripts | 3 |
| Distinct blocks | 5 |
Unique
| Unique | 952 |
|---|---|
| Unique (%) | 30.4% |
Sample
| 1st row | Dublin (Ireland) |
|---|---|
| 2nd row | Trinidad and Tobago |
| 3rd row | Shakespeare, William, 1564-1616--Anniversaries, etc |
| 4th row | Saint Helena |
| 5th row | Nelson, Horatio Nelson, Viscount, 1758-1805 |
Common Values
| Value | Count | Frequency (%) |
| India | 216 | 0.4% |
| Canada | 157 | 0.3% |
| Australia | 99 | 0.2% |
| Revolutions | 85 | 0.2% |
| New Zealand | 73 | 0.1% |
| English fiction--19th century | 53 | 0.1% |
| Canada ; British Columbia | 41 | 0.1% |
| American Revolution (1775-1783) | 34 | 0.1% |
| Revolution (France : 1789-1799) | 32 | 0.1% |
| India ; India--Description and travel | 31 | 0.1% |
| Other values (1198) | 2315 | 4.4% |
| (Missing) | 49559 |
Length (histogram not reproduced)
Words
| Value | Count | Frequency (%) |
| (empty) | 1688 | 13.6% |
| and | 829 | 6.7% |
| india | 516 | 4.1% |
| travel | 480 | 3.9% |
| war | 240 | 1.9% |
| of | 220 | 1.8% |
| canada | 219 | 1.8% |
| south | 191 | 1.5% |
| century | 146 | 1.2% |
| new | 136 | 1.1% |
| Other values (2098) | 7771 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 9300 | 9.3% |
| a | 9119 | 9.1% |
| i | 7224 | 7.2% |
| n | 6829 | 6.8% |
| e | 5412 | 5.4% |
| r | 5248 | 5.3% |
| - | 4979 | 5.0% |
| t | 4574 | 4.6% |
| o | 4324 | 4.3% |
| s | 4105 | 4.1% |
| Other values (83) | 38777 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66996 | |
| Uppercase Letter | 9709 | 9.7% |
| Space Separator | 9300 | 9.3% |
| Dash Punctuation | 4979 | 5.0% |
| Decimal Number | 4926 | 4.9% |
| Other Punctuation | 2547 | 2.5% |
| Open Punctuation | 685 | 0.7% |
| Close Punctuation | 685 | 0.7% |
| Nonspacing Mark | 46 | < 0.1% |
| Modifier Letter | 18 | < 0.1% |
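The category labels in these tables correspond to Unicode general categories. As a minimal sketch of how such counts could be reproduced, assuming Python's standard `unicodedata` module is an acceptable stand-in for whatever the profiling tool uses internally, the helper below tallies characters per category. `CATEGORY_LABELS` and `category_counts` are hypothetical names introduced for illustration, and the mapping covers only the categories listed above.

```python
import unicodedata
from collections import Counter

# Subset of Unicode general-category codes, mapped to the labels used in the tables.
# This mapping is an assumption for illustration; unknown codes are kept as-is.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lm": "Modifier Letter",
    "Mn": "Nonspacing Mark",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across an iterable of strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


# One value taken from the Sample table for this variable.
counts = category_counts(["Dublin (Ireland)"])
print(counts.most_common())
```

Running the helper over the full column (skipping missing values) should give totals comparable to the table above, provided the tool classifies characters the same way.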
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9119 | |
| i | 7224 | |
| n | 6829 | |
| e | 5412 | |
| r | 5248 | 7.8% |
| t | 4574 | 6.8% |
| o | 4324 | 6.5% |
| s | 4105 | 6.1% |
| l | 3484 | 5.2% |
| d | 3284 | 4.9% |
| Other values (33) | 13393 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1357 | |
| C | 856 | 8.8% |
| S | 781 | 8.0% |
| D | 664 | 6.8% |
| A | 664 | 6.8% |
| B | 613 | 6.3% |
| H | 511 | 5.3% |
| R | 505 | 5.2% |
| E | 473 | 4.9% |
| G | 415 | 4.3% |
| Other values (17) | 2870 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1564 | |
| 8 | 710 | |
| 7 | 511 | 10.4% |
| 9 | 455 | 9.2% |
| 6 | 436 | 8.9% |
| 5 | 406 | 8.2% |
| 4 | 286 | 5.8% |
| 0 | 258 | 5.2% |
| 2 | 151 | 3.1% |
| 3 | 149 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1547 | |
| , | 760 | |
| : | 142 | 5.6% |
| . | 75 | 2.9% |
| ' | 18 | 0.7% |
| ? | 5 | 0.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 23 | |
| ︡ | 23 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9300 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 685 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 685 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4979 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76705 | |
| Common | 23140 | 23.2% |
| Inherited | 46 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9119 | 11.9% |
| i | 7224 | 9.4% |
| n | 6829 | 8.9% |
| e | 5412 | 7.1% |
| r | 5248 | 6.8% |
| t | 4574 | 6.0% |
| o | 4324 | 5.6% |
| s | 4105 | 5.4% |
| l | 3484 | 4.5% |
| d | 3284 | 4.3% |
| Other values (60) | 23102 |
Common
| Value | Count | Frequency (%) |
| (space) | 9300 | |
| - | 4979 | |
| 1 | 1564 | 6.8% |
| ; | 1547 | 6.7% |
| , | 760 | 3.3% |
| 8 | 710 | 3.1% |
| ( | 685 | 3.0% |
| ) | 685 | 3.0% |
| 7 | 511 | 2.2% |
| 9 | 455 | 2.0% |
| Other values (11) | 1944 | 8.4% |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 23 | |
| ︡ | 23 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99745 | |
| Latin 1 Sup | 46 | < 0.1% |
| Half Marks | 46 | < 0.1% |
| Latin Ext A | 36 | < 0.1% |
| Modifier Letters | 18 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 9300 | 9.3% |
| a | 9119 | 9.1% |
| i | 7224 | 7.2% |
| n | 6829 | 6.8% |
| e | 5412 | 5.4% |
| r | 5248 | 5.3% |
| - | 4979 | 5.0% |
| t | 4574 | 4.6% |
| o | 4324 | 4.3% |
| s | 4105 | 4.1% |
| Other values (62) | 38631 |
Latin Ext A
| Value | Count | Frequency (%) |
| ā | 16 | |
| ĭ | 8 | |
| ū | 5 | 13.9% |
| ī | 2 | 5.6% |
| ń | 2 | 5.6% |
| ł | 1 | 2.8% |
| ű | 1 | 2.8% |
| č | 1 | 2.8% |
Latin 1 Sup
| Value | Count | Frequency (%) |
| á | 13 | |
| é | 12 | |
| ó | 11 | |
| ã | 2 | 4.3% |
| ú | 2 | 4.3% |
| ö | 2 | 4.3% |
| ü | 1 | 2.2% |
| ô | 1 | 2.2% |
| Á | 1 | 2.2% |
| ï | 1 | 2.2% |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 23 | |
| ︡ | 23 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 18 |
| Distinct | 64 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 50722 |
| Missing (%) | 96.3% |
| Memory size | 411.8 KiB |
| Poetry or verse | 1002 |
|---|---|
| Drama | 461 |
| Drama ; Poetry or verse | 151 |
| Travel | 77 |
| Periodical | 39 |
| Other values (59) |
Length
| Max length | 53 |
|---|---|
| Median length | 15 |
| Mean length | 12.29042068 |
| Min length | 4 |
Characters and Unicode
| Total characters | 24249 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 30 |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Song |
|---|---|
| 2nd row | Poetry or verse |
| 3rd row | Poetry or verse |
| 4th row | Music |
| 5th row | Poetry or verse |
Common Values
| Value | Count | Frequency (%) |
| Poetry or verse | 1002 | 1.9% |
| Drama | 461 | 0.9% |
| Drama ; Poetry or verse | 151 | 0.3% |
| Travel | 77 | 0.1% |
| Periodical | 39 | 0.1% |
| Diary | 36 | 0.1% |
| Gazetteer | 22 | < 0.1% |
| Directory | 18 | < 0.1% |
| Correspondence | 18 | < 0.1% |
| Fiction | 16 | < 0.1% |
| Other values (54) | 133 | 0.3% |
| (Missing) | 50722 |
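Some values in this column combine several terms with a semicolon (for example `Drama ; Poetry or verse`), so the per-value counts above understate how often each individual term occurs. The sketch below, which assumes the semicolon is the only separator, splits such values and re-aggregates the counts. `split_terms` is a hypothetical helper, and only the first three rows of the table are used as input.

```python
from collections import Counter


def split_terms(value, sep=";"):
    """Split a multi-valued cell into trimmed terms; missing values give an empty list."""
    if value is None:
        return []
    return [term.strip() for term in value.split(sep) if term.strip()]


# Counts copied from the first three rows of the Common Values table.
common_values = {
    "Poetry or verse": 1002,
    "Drama": 461,
    "Drama ; Poetry or verse": 151,
}

per_term = Counter()
for value, count in common_values.items():
    for term in split_terms(value):
        per_term[term] += count

print(per_term)
```

On these three rows alone, "Poetry or verse" rises to 1153 and "Drama" to 612; the full column would add further combinations.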
Length (histogram not reproduced)
Words
| Value | Count | Frequency (%) |
| or | 1166 | |
| verse | 1158 | |
| poetry | 1158 | |
| drama | 622 | |
| (empty) | 190 | 4.0% |
| travel | 89 | 1.9% |
| diary | 46 | 1.0% |
| periodical | 41 | 0.9% |
| correspondence | 23 | 0.5% |
| gazetteer | 23 | 0.5% |
| Other values (56) | 205 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 4483 | |
| e | 3827 | |
| (space) | 2748 | |
| o | 2562 | |
| a | 1539 | 6.3% |
| t | 1312 | 5.4% |
| v | 1258 | 5.2% |
| y | 1252 | 5.2% |
| s | 1229 | 5.1% |
| P | 1209 | 5.0% |
| Other values (36) | 2830 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19123 | |
| Space Separator | 2748 | 11.3% |
| Uppercase Letter | 2167 | 8.9% |
| Other Punctuation | 191 | 0.8% |
| Decimal Number | 20 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4483 | |
| e | 3827 | |
| o | 2562 | |
| a | 1539 | 8.0% |
| t | 1312 | 6.9% |
| v | 1258 | 6.6% |
| y | 1252 | 6.5% |
| s | 1229 | 6.4% |
| m | 636 | 3.3% |
| i | 273 | 1.4% |
| Other values (14) | 752 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1209 | |
| D | 698 | |
| T | 95 | 4.4% |
| C | 32 | 1.5% |
| S | 31 | 1.4% |
| G | 26 | 1.2% |
| F | 16 | 0.7% |
| L | 14 | 0.6% |
| B | 12 | 0.6% |
| E | 12 | 0.6% |
| Other values (6) | 22 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10 | |
| 1 | 5 | |
| 8 | 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 190 | |
| ' | 1 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2748 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21290 | |
| Common | 2959 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 4483 | |
| e | 3827 | |
| o | 2562 | |
| a | 1539 | 7.2% |
| t | 1312 | 6.2% |
| v | 1258 | 5.9% |
| y | 1252 | 5.9% |
| s | 1229 | 5.8% |
| P | 1209 | 5.7% |
| D | 698 | 3.3% |
| Other values (30) | 1921 |
Common
| Value | Count | Frequency (%) |
| (space) | 2748 | |
| ; | 190 | 6.4% |
| 0 | 10 | 0.3% |
| 1 | 5 | 0.2% |
| 8 | 5 | 0.2% |
| ' | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24249 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 4483 | |
| e | 3827 | |
| (space) | 2748 | |
| o | 2562 | |
| a | 1539 | 6.3% |
| t | 1312 | 5.4% |
| v | 1258 | 5.2% |
| y | 1252 | 5.2% |
| s | 1229 | 5.1% |
| P | 1209 | 5.0% |
| Other values (36) | 2830 |
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 58 |
| Missing (%) | 0.1% |
| Memory size | 411.8 KiB |
| English | 41214 |
|---|---|
| French | 3855 |
| German | 3166 |
| Spanish | 768 |
| Italian | 660 |
| Other values (104) | 2974 |
Length
| Max length | 51 |
|---|---|
| Median length | 7 |
| Mean length | 6.998537151 |
| Min length | 5 |
Characters and Unicode
| Total characters | 368382 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 27 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | English |
|---|---|
| 2nd row | English |
| 3rd row | English |
| 4th row | English |
| 5th row | English |
Common Values
| Value | Count | Frequency (%) |
| English | 41214 | |
| French | 3855 | 7.3% |
| German | 3166 | 6.0% |
| Spanish | 768 | 1.5% |
| Italian | 660 | 1.3% |
| Russian | 578 | 1.1% |
| Dutch | 551 | 1.0% |
| Hungarian | 259 | 0.5% |
| Swedish | 249 | 0.5% |
| Danish | 230 | 0.4% |
| Other values (99) | 1107 | 2.1% |
Length (histogram not reproduced)
Words
| Value | Count | Frequency (%) |
| english | 41408 | |
| french | 4132 | 7.6% |
| german | 3429 | 6.3% |
| spanish | 791 | 1.5% |
| (empty) | 772 | 1.4% |
| italian | 748 | 1.4% |
| russian | 633 | 1.2% |
| dutch | 632 | 1.2% |
| latin | 298 | 0.5% |
| hungarian | 287 | 0.5% |
| Other values (29) | 1237 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 52515 | |
| h | 47778 | |
| i | 45023 | |
| s | 44308 | |
| l | 42378 | |
| g | 41801 | |
| E | 41408 | |
| e | 8349 | 2.3% |
| r | 8160 | 2.2% |
| a | 7580 | 2.1% |
| Other values (39) | 29082 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 311661 | |
| Uppercase Letter | 53502 | 14.5% |
| Space Separator | 1730 | 0.5% |
| Other Punctuation | 862 | 0.2% |
| Decimal Number | 360 | 0.1% |
| Open Punctuation | 90 | < 0.1% |
| Close Punctuation | 90 | < 0.1% |
| Dash Punctuation | 87 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 52515 | |
| h | 47778 | |
| i | 45023 | |
| s | 44308 | |
| l | 42378 | |
| g | 41801 | |
| e | 8349 | 2.7% |
| r | 8160 | 2.6% |
| a | 7580 | 2.4% |
| c | 4838 | 1.6% |
| Other values (12) | 8931 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 41408 | |
| F | 4166 | 7.8% |
| G | 3519 | 6.6% |
| S | 1077 | 2.0% |
| D | 891 | 1.7% |
| I | 761 | 1.4% |
| R | 640 | 1.2% |
| L | 301 | 0.6% |
| H | 289 | 0.5% |
| P | 269 | 0.5% |
| Other values (7) | 181 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 90 | |
| 4 | 90 | |
| 5 | 90 | |
| 3 | 90 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 772 | |
| , | 90 | 10.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1730 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 90 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 90 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 365163 | |
| Common | 3219 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 52515 | |
| h | 47778 | |
| i | 45023 | |
| s | 44308 | |
| l | 42378 | |
| g | 41801 | |
| E | 41408 | |
| e | 8349 | 2.3% |
| r | 8160 | 2.2% |
| a | 7580 | 2.1% |
| Other values (29) | 25863 |
Common
| Value | Count | Frequency (%) |
| (space) | 1730 | |
| ; | 772 | |
| , | 90 | 2.8% |
| ( | 90 | 2.8% |
| 1 | 90 | 2.8% |
| 4 | 90 | 2.8% |
| 5 | 90 | 2.8% |
| 3 | 90 | 2.8% |
| ) | 90 | 2.8% |
| - | 87 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 368382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 52515 | |
| h | 47778 | |
| i | 45023 | |
| s | 44308 | |
| l | 42378 | |
| g | 41801 | |
| E | 41408 | |
| e | 8349 | 2.3% |
| r | 8160 | 2.2% |
| a | 7580 | 2.1% |
| Other values (39) | 29082 |
| Distinct | 5667 |
|---|---|
| Distinct (%) | 86.2% |
| Missing | 46119 |
| Missing (%) | 87.5% |
| Memory size | 411.8 KiB |
| No more published | 281 |
|---|---|
| Published in part | 50 |
| Printed for private circulation | 45 |
| The titlepage is engraved | 43 |
| Privately printed | 43 |
| Other values (5662) |
Length
| Max length | 5684 |
|---|---|
| Median length | 90 |
| Mean length | 132.9221411 |
| Min length | 3 |
Characters and Unicode
| Total characters | 874096 |
|---|---|
| Distinct characters | 235 |
| Distinct categories | 13 |
| Distinct scripts | 6 |
| Distinct blocks | 12 |
Unique
| Unique | 5506 |
|---|---|
| Unique (%) | 83.7% |
Sample
| 1st row | One of an edition of 100 copies |
|---|---|
| 2nd row | Wanting the back wrapper |
| 3rd row | Other edition: The Minstrel; or the Progress of Genius ... The second book. pp. 32. E. & C. Dilly: London, 1774. 4º |
| 4th row | Other edition: The haunch of venison, a poetical epistle to Lord Clare ... With a head of the author, drawn by Henry Bunbury Esq; and etched by Bretherton. Dublin: W. Whitestone, etc, 1776. pp. 15. 8º ; Other edition: The haunch of venison, a poetical epistle to Lord Clare ... With a head of the author, drawn by Henry Bunbury Esq; and etched by Bretherton. London: J. Ridley; G. Kearsly, 1776. pp. 19: plate. 4º ; The price on the half-title is 'One shilling' |
| 5th row | Other edition: Retaliation: a poem ... Including epitaphs on the most distinguished wits of this metropolis. London: G. Kearsly, 1774. pp. 20. 4º ; With an engraved portraits on the titlepage. In this copy there is no engraved text below the portraits, and the error on p. 8 is corrected in manuscript Without pages 17-20 containing 'Explanatory notes and observations' |
Common Values
| Value | Count | Frequency (%) |
| No more published | 281 | 0.5% |
| Published in part | 50 | 0.1% |
| Printed for private circulation | 45 | 0.1% |
| The titlepage is engraved | 43 | 0.1% |
| Privately printed | 43 | 0.1% |
| With an additional titlepage, engraved | 43 | 0.1% |
| Printed on one side of the leaf only | 30 | 0.1% |
| Only 100 copies printed | 27 | 0.1% |
| Without pagination | 21 | < 0.1% |
| A novel | 13 | < 0.1% |
| Other values (5657) | 5980 | 11.3% |
| (Missing) | 46119 |
Length (histogram not reproduced)
Words
| Value | Count | Frequency (%) |
| the | 8345 | 5.7% |
| of | 6381 | 4.3% |
| (empty) | 5845 | 4.0% |
| edition | 5757 | 3.9% |
| other | 4918 | 3.3% |
| a | 3992 | 2.7% |
| 8º | 3454 | 2.3% |
| london | 3370 | 2.3% |
| and | 3270 | 2.2% |
| in | 2654 | 1.8% |
| Other values (12798) | 99543 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 140953 | |
| e | 69179 | 7.9% |
| o | 52698 | 6.0% |
| t | 51464 | 5.9% |
| i | 50585 | 5.8% |
| n | 48579 | 5.6% |
| r | 38560 | 4.4% |
| a | 38246 | 4.4% |
| . | 31932 | 3.7% |
| d | 28518 | 3.3% |
| Other values (225) | 323382 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 561459 | |
| Space Separator | 140953 | 16.1% |
| Other Punctuation | 61853 | 7.1% |
| Uppercase Letter | 56659 | 6.5% |
| Decimal Number | 40712 | 4.7% |
| Other Letter | 4394 | 0.5% |
| Close Punctuation | 3276 | 0.4% |
| Open Punctuation | 3275 | 0.4% |
| Dash Punctuation | 1441 | 0.2% |
| Nonspacing Mark | 44 | < 0.1% |
| Other values (3) | 30 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 69179 | |
| o | 52698 | |
| t | 51464 | 9.2% |
| i | 50585 | 9.0% |
| n | 48579 | 8.7% |
| r | 38560 | 6.9% |
| a | 38246 | 6.8% |
| d | 28518 | 5.1% |
| s | 28057 | 5.0% |
| h | 26613 | 4.7% |
| Other values (115) | 128960 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 5741 | 10.1% |
| L | 5352 | 9.4% |
| T | 4562 | 8.1% |
| C | 4137 | 7.3% |
| A | 4103 | 7.2% |
| S | 3648 | 6.4% |
| B | 3241 | 5.7% |
| W | 3065 | 5.4% |
| P | 2663 | 4.7% |
| M | 2539 | 4.5% |
| Other values (46) | 17608 |
Other Letter
| Value | Count | Frequency (%) |
| º | 4368 | |
| ת | 5 | 0.1% |
| י | 3 | 0.1% |
| ר | 2 | < 0.1% |
| ג | 2 | < 0.1% |
| מ | 2 | < 0.1% |
| נ | 2 | < 0.1% |
| ל | 2 | < 0.1% |
| ז | 2 | < 0.1% |
| ו | 2 | < 0.1% |
| Other values (4) | 4 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 31932 | |
| , | 11771 | 19.0% |
| : | 8523 | 13.8% |
| ' | 4282 | 6.9% |
| ; | 3353 | 5.4% |
| & | 1759 | 2.8% |
| ? | 135 | 0.2% |
| * | 47 | 0.1% |
| ! | 39 | 0.1% |
| / | 11 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 10353 | |
| 1 | 9584 | |
| 2 | 3842 | 9.4% |
| 4 | 2661 | 6.5% |
| 7 | 2644 | 6.5% |
| 6 | 2493 | 6.1% |
| 9 | 2480 | 6.1% |
| 3 | 2472 | 6.1% |
| 5 | 2164 | 5.3% |
| 0 | 2019 | 5.0% |
Other Number
| Value | Count | Frequency (%) |
| ² | 7 | |
| ⁴ | 7 | |
| ⁸ | 3 | |
| ⁰ | 1 | 5.3% |
| ¹ | 1 | 5.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2837 | |
| ) | 438 | 13.4% |
| ⁾ | 1 | < 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 5 | |
| ʾ | 1 | 14.3% |
| ʺ | 1 | 14.3% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ︠ | 21 | |
| ︡ | 21 | |
| ̈ | 2 | 4.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2837 | |
| ( | 438 | 13.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 140953 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1441 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 621588 | |
| Common | 251540 | |
| Cyrillic | 767 | 0.1% |
| Greek | 131 | < 0.1% |
| Inherited | 44 | < 0.1% |
| Hebrew | 26 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 69179 | 11.1% |
| o | 52698 | 8.5% |
| t | 51464 | 8.3% |
| i | 50585 | 8.1% |
| n | 48579 | 7.8% |
| r | 38560 | 6.2% |
| a | 38246 | 6.2% |
| d | 28518 | 4.6% |
| s | 28057 | 4.5% |
| h | 26613 | 4.3% |
| Other values (89) | 189089 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 74 | 9.6% |
| с | 62 | 8.1% |
| и | 51 | 6.6% |
| а | 50 | 6.5% |
| т | 45 | 5.9% |
| е | 38 | 5.0% |
| н | 38 | 5.0% |
| р | 32 | 4.2% |
| к | 31 | 4.0% |
| і | 27 | 3.5% |
| Other values (42) | 319 |
Common
| Value | Count | Frequency (%) |
| (space) | 140953 | |
| . | 31932 | 12.7% |
| , | 11771 | 4.7% |
| 8 | 10353 | 4.1% |
| 1 | 9584 | 3.8% |
| : | 8523 | 3.4% |
| ' | 4282 | 1.7% |
| 2 | 3842 | 1.5% |
| ; | 3353 | 1.3% |
| [ | 2837 | 1.1% |
| Other values (27) | 24110 | 9.6% |
Greek
| Value | Count | Frequency (%) |
| α | 17 | |
| ο | 16 | 12.2% |
| ν | 11 | 8.4% |
| ς | 10 | 7.6% |
| ι | 8 | 6.1% |
| τ | 8 | 6.1% |
| υ | 7 | 5.3% |
| λ | 5 | 3.8% |
| ρ | 5 | 3.8% |
| σ | 4 | 3.1% |
| Other values (21) | 40 |
Hebrew
| Value | Count | Frequency (%) |
| ת | 5 | |
| י | 3 | |
| ר | 2 | 7.7% |
| ג | 2 | 7.7% |
| מ | 2 | 7.7% |
| נ | 2 | 7.7% |
| ל | 2 | 7.7% |
| ז | 2 | 7.7% |
| ו | 2 | 7.7% |
| ע | 1 | 3.8% |
| Other values (3) | 3 |
Inherited
| Value | Count | Frequency (%) |
| ︠ | 21 | |
| ︡ | 21 | |
| ̈ | 2 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 867691 | |
| Latin 1 Sup | 5339 | 0.6% |
| Cyrillic | 767 | 0.1% |
| None | 137 | < 0.1% |
| Latin Ext A | 77 | < 0.1% |
| Half Marks | 42 | < 0.1% |
| Hebrew | 26 | < 0.1% |
| Modifier Letters | 7 | < 0.1% |
| Greek Ext | 6 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 140953 | |
| e | 69179 | 8.0% |
| o | 52698 | 6.1% |
| t | 51464 | 5.9% |
| i | 50585 | 5.8% |
| n | 48579 | 5.6% |
| r | 38560 | 4.4% |
| a | 38246 | 4.4% |
| . | 31932 | 3.7% |
| d | 28518 | 3.3% |
| Other values (69) | 316977 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| º | 4368 | |
| é | 372 | 7.0% |
| è | 151 | 2.8% |
| æ | 68 | 1.3% |
| ü | 68 | 1.3% |
| ö | 59 | 1.1% |
| ä | 53 | 1.0% |
| à | 50 | 0.9% |
| á | 23 | 0.4% |
| É | 19 | 0.4% |
| Other values (20) | 108 | 2.0% |
Hebrew
| Value | Count | Frequency (%) |
| ת | 5 | |
| י | 3 | |
| ר | 2 | 7.7% |
| ג | 2 | 7.7% |
| מ | 2 | 7.7% |
| נ | 2 | 7.7% |
| ל | 2 | 7.7% |
| ז | 2 | 7.7% |
| ו | 2 | 7.7% |
| ע | 1 | 3.8% |
| Other values (3) | 3 |
Latin Ext A
| Value | Count | Frequency (%) |
| œ | 21 | |
| ī | 11 | |
| ĭ | 8 | 10.4% |
| ę | 7 | 9.1% |
| ō | 4 | 5.2% |
| ś | 4 | 5.2% |
| ā | 3 | 3.9% |
| ą | 3 | 3.9% |
| ł | 3 | 3.9% |
| ć | 3 | 3.9% |
| Other values (8) | 10 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ṫ | 1 |
None
| Value | Count | Frequency (%) |
| α | 17 | 12.4% |
| ο | 16 | 11.7% |
| ν | 11 | 8.0% |
| ς | 10 | 7.3% |
| ι | 8 | 5.8% |
| τ | 8 | 5.8% |
| υ | 7 | 5.1% |
| ⁴ | 7 | 5.1% |
| λ | 5 | 3.6% |
| ρ | 5 | 3.6% |
| Other values (19) | 43 |
Greek Ext
| Value | Count | Frequency (%) |
| ἰ | 1 | |
| Ἱ | 1 | |
| ὐ | 1 | |
| Ὀ | 1 | |
| ἐ | 1 | |
| Ἀ | 1 |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 5 | |
| ʾ | 1 | 14.3% |
| ʺ | 1 | 14.3% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 74 | 9.6% |
| с | 62 | 8.1% |
| и | 51 | 6.6% |
| а | 50 | 6.5% |
| т | 45 | 5.9% |
| е | 38 | 5.0% |
| н | 38 | 5.0% |
| р | 32 | 4.2% |
| к | 31 | 4.0% |
| і | 27 | 3.5% |
| Other values (42) | 319 |
Half Marks
| Value | Count | Frequency (%) |
| ︠ | 21 | |
| ︡ | 21 |
Diacriticals
| Value | Count | Frequency (%) |
| ̈ | 2 |
Digitised Record Match
Real number (ℝ≥0)
| Distinct | 52689 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2031536.837 |
| Minimum | 37 |
|---|---|
| Maximum | 19138278 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 37 |
|---|---|
| 5-th percentile | 202740.8 |
| Q1 | 962667.5 |
| median | 1989238 |
| Q3 | 3065901.5 |
| 95-th percentile | 3882951 |
| Maximum | 19138278 |
| Range | 19138241 |
| Interquartile range (IQR) | 2103234 |
Descriptive statistics
| Standard deviation | 1208999.294 |
|---|---|
| Coefficient of variation (CV) | 0.595115615 |
| Kurtosis | 2.878658086 |
| Mean | 2031536.837 |
| Median Absolute Deviation (MAD) | 1056158 |
| Skewness | 0.4452978625 |
| Sum | 1.070518336 × 10^11 |
| Variance | 1.461679293 × 10^12 |
| Monotonicity | Not monotonic |
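Several of the figures above are derived from others in the same tables. As a quick consistency check, the sketch below recomputes the range, interquartile range, coefficient of variation, variance and sum from the reported minimum, maximum, quartiles, mean and standard deviation. The number of observations (52695) comes from the dataset overview; the variable names are illustrative only.

```python
# Values copied from the quantile and descriptive statistics tables.
minimum, maximum = 37, 19138278
q1, q3 = 962667.5, 3065901.5
mean, std = 2031536.837, 1208999.294
n_observations = 52695  # from the dataset overview; this column has no missing values

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance
total = mean * n_observations     # Sum

print(value_range, iqr, round(cv, 6))
print(f"{variance:.6e} {total:.6e}")
```

The recomputed values agree with the table up to rounding of the inputs, which also confirms the exponents of the sum (10^11) and variance (10^12).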
Common Values
| Value | Count | Frequency (%) |
| 1091680 | 6 | < 0.1% |
| 1091669 | 2 | < 0.1% |
| 3996603 | 1 | < 0.1% |
| 242145 | 1 | < 0.1% |
| 211441 | 1 | < 0.1% |
| 214859 | 1 | < 0.1% |
| 214961 | 1 | < 0.1% |
| 220903 | 1 | < 0.1% |
| 222151 | 1 | < 0.1% |
| 222955 | 1 | < 0.1% |
| Other values (52679) | 52679 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 37 | 1 | |
| 196 | 1 | |
| 206 | 1 | |
| 216 | 1 | |
| 218 | 1 | |
| 428 | 1 | |
| 472 | 1 | |
| 478 | 1 | |
| 480 | 1 | |
| 481 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 19138278 | 1 | |
| 15757407 | 1 | |
| 15310067 | 1 | |
| 15309676 | 1 | |
| 15309664 | 1 | |
| 14869816 | 1 | |
| 13952747 | 1 | |
| 12811400 | 1 | |
| 11844530 | 1 | |
| 11844350 | 1 |
First rows
| | BL record ID | Type of resource | Name | Dates associated with name | Type of name | Role | All names | Title | Variant titles | Series title | Number within series | Country of publication | Place of publication | Publisher | Date of publication | Edition | Physical description | Dewey classification | BL shelfmark | Topics | Genre | Languages | Notes | Digitised Record Match |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 014602826 | Monograph | Yearsley, Ann | 1753-1806 | person | NaN | More, Hannah, 1745-1833 [person] ; Yearsley, Ann, 1753-1806 [person] | Poems on several occasions [With a prefatory letter by Hannah More.] | NaN | NaN | NaN | England | London | NaN | 1786 | Fourth edition MANUSCRIPT note | NaN | NaN | Digital Store 11644.d.32 | NaN | NaN | English | NaN | 003996603 |
| 1 | 014602830 | Monograph | A, T. | NaN | person | NaN | Oldham, John, 1653-1683 [person] ; A, T. [person] | A Satyr against Vertue. (A poem: supposed to be spoken by a Town-Hector [By John Oldham. The preface signed: T. A.]) | NaN | NaN | NaN | England | London | NaN | 1679 | NaN | 15 pages (4°) | NaN | Digital Store 11602.ee.10. (2.) | NaN | NaN | English | NaN | 000001143 |
| 2 | 014602831 | Monograph | NaN | NaN | NaN | NaN | NaN | The Aeronaut, a poem; founded almost entirely, upon a statement, printed in the newspapers, of a voyage from Dublin, in October, 1812 | NaN | NaN | NaN | Ireland | Dublin | Richard Milliken | 1816 | NaN | 17 pages (8°) | NaN | Digital Store 992.i.12. (3.) | Dublin (Ireland) | NaN | English | NaN | 000022782 |
| 3 | 014602832 | Monograph | Albert, Prince Consort, consort of Victoria, Queen of Great Britain | 1819-1861 | person | NaN | Plimsoll, Joseph [person] ; Albert, Prince Consort, consort of Victoria, Queen of Great Britain, 1819-1861 [person] | The Prince Albert, a poem [By Joseph Plimsoll.] | Appendix | NaN | NaN | NaN | Plymouth | W. Cann | 1868 | NaN | 16 pages (8°) | NaN | Digital Store 11602.ee.17. (1.) | NaN | NaN | English | NaN | 000039775 |
| 4 | 014602833 | Monograph | Anslow, Robert | NaN | person | NaN | Anslow, Robert [person] | The Defeat of the Spanish Armada, A.D. 1588. A tercentenary ballad, A.D. 1888 | NaN | NaN | NaN | England | London | Elliot Stock | 1888 | NaN | 40 pages (8°) | NaN | Digital Store 11602.ee.17. (7.) | NaN | NaN | English | NaN | 000092666 |
| 5 | 014602834 | Monograph | NaN | NaN | NaN | NaN | Swift, Jonathan, 1667-1745 [person] | A Familiar Answer to a Familiar Letter [In verse, addressed to Dean Swift?] | Appendix. I. Contemporary Satires, Eulogies, etc | NaN | NaN | England | London | NaN | 1720 | NaN | 7 pages (4°) | NaN | Digital Store 11602.ee.10. (5.) | NaN | NaN | English | NaN | 000093359 |
| 6 | 014602835 | Monograph | NaN | NaN | NaN | NaN | NaN | The Irish Home Rule Bill. A poetical pamphlet, etc | NaN | NaN | NaN | NaN | Calcutta | I. C. Bose | 1893 | NaN | 4 pages (8°) | NaN | Digital Store 11601.g.28. (3.) | NaN | NaN | English | NaN | 000150273 |
| 7 | 014602836 | Monograph | NaN | NaN | NaN | NaN | NaN | Confessions of a Coquette, while staying at Scarboro', Whitby, & Bridlington. By Azucena [In verse.] | NaN | NaN | NaN | England | Scarborough | E. T. W. Dennis | 1888 | NaN | 42 pages (8°) | NaN | Digital Store 11602.ee.17. (8.) | NaN | NaN | English | NaN | 000156011 |
| 8 | 014602837 | Monograph | Bellamy, James William | NaN | person | NaN | Bellamy, James William [person] | Jonah. The Seatonian Prize Poem for the year 1815 | NaN | NaN | NaN | England | London | Taylor & Hessey | 1815 | NaN | 28 pages (8°) | NaN | Digital Store 992.i.12. (1.) | NaN | NaN | English | NaN | 000261714 |
| 9 | 014602838 | Monograph | Brabant, Henry, Sir | NaN | person | NaN | Brabant, Henry, Sir [person] | The Eve of the Revolution; in Newcastle-upon-Tyne. (The Case of Sir Henry Brabant, knt, Mayor of Newcastle upon Tyne, most humbly offered to your Majesties Royal consideration.) | NaN | Reprints of Rare Tracts & Imprints, etc | volume 4 [Reprints of Rare Tracts & Imprints, etc] | NaN | Newcastle | M. A. Richardson | 1848 | NaN | 24 pages (8°) | NaN | Digital Store 1077.f.89 | NaN | NaN | English | One of an edition of 100 copies | 000445451 |
Last rows
| | BL record ID | Type of resource | Name | Dates associated with name | Type of name | Role | All names | Title | Variant titles | Series title | Number within series | Country of publication | Place of publication | Publisher | Date of publication | Edition | Physical description | Dewey classification | BL shelfmark | Topics | Genre | Languages | Notes | Digitised Record Match |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 52685 | 016289053 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The mill on the Floss ... Illustrated by J. Barnard Davis | The Mill on the Floss | NaN | NaN | England | London | Blackie | 1908 | Another edition, Illustrated by W. M. Bowles | ix, 438 pages, plates, 20 cm | NaN | Digital Store 012618.fff.6 | NaN | NaN | English | The edition of 1904, republished, with different preliminaries and plates | 004117445 |
| 52686 | 016289054 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The Mill on the Floss ... Illustrated by T. H. Robinson | The Mill on the Floss | NaN | NaN | England ; Scotland | Edinburgh ; London | Thomas Nelson | 1928 | Another edition | 589 pages, plates, 20 cm | NaN | Digital Store 012603.c.6 | NaN | NaN | English | The edition of [1919], republished, with the addition of frontispiece | 004117454 |
| 52687 | 016289055 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The Mill on the Floss ... Illustrated by T. H. Robinson | The Mill on the Floss | NaN | NaN | England | London | Daily Express Publications | 1933 | Another edition | 511 pages, plates, portraits, 19 cm | NaN | Digital Store 12602.p.7 | NaN | NaN | English | NaN | 004117456 |
| 52688 | 016289056 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The Mill on the Floss ... Illustrated by T. H. Robinson | The Mill on the Floss | NaN | NaN | England | London | Dean | 1936 | Another edition | 377 pages, plates, 21 cm | NaN | Digital Store 012604.l.3 | NaN | NaN | English | NaN | 004117457 |
| 52689 | 016289057 | Monograph | Garstang, Walter, M.A., F.Z.S. | NaN | person | NaN | Garstang, Walter, M.A., F.Z.S. [person] ; Shepherd, J. A. (James Affleck), 1867-approximately 1931 [person] | Songs of the Birds ... With illustrations by J.A. Shepherd | NaN | NaN | NaN | England | London | John Lane | 1922 | NaN | 101 pages, illustrations (8°) | 598.259 | Digital Store 011648.g.133 | NaN | NaN | English | Poems, with and introductory essay | 004158005 |
| 52690 | 016289058 | Monograph | Dickens, Charles | 1812-1870 | person | NaN | Dickens, Charles, 1812-1870 [person] | The posthumous papers of the Pickwick Club | Pickwick papers | NaN | NaN | England | Liverpool | World's Best Library | NaN | NaN | xvi, 610 pages, illustrations, 20 cm | 823.8 | NaN | England--Social life and customs--19th century--Fiction ; Men--England--Societies and clubs--Fiction | NaN | English | Spine title: The Pickwick papers | 008594906 |
| 52691 | 016289059 | Serial | NaN | NaN | NaN | NaN | NaN | TRUE STORY CLASSICS | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | English | NaN | 011143842 |
| 52692 | 016289060 | Monograph | Wellesley, Dorothy | 1889-1956 | person | NaN | Wellesley, Dorothy, 1889-1956 [person] | Early Poems. By M. A [i.e. Dorothy Violet Wellesley, Lady Gerald Wellesley.] | NaN | NaN | NaN | England | London | Elkin Mathews | 1913 | NaN | vii, 90 pages (8°) | NaN | Digital Store 011649.eee.17 | NaN | NaN | English | NaN | 000000839 |
| 52693 | 016289061 | Monograph | A, T. H. E. | NaN | person | NaN | A, T. H. E. [person] | Of Life and Love [Poems.] By T. H. E. A, writer of 'The Message.' | NaN | NaN | NaN | England | London | J. M. Watkins | 1924 | NaN | 89 pages (8°) | NaN | Digital Store 011645.e.125 | NaN | NaN | English | NaN | 000001167 |
| 52694 | 016289062 | Monograph | Abbay, Richard | NaN | person | NaN | Abbay, Richard [person] | Life, a Mode of Motion; or, He and I, my two selves [A poem.] | NaN | NaN | NaN | England | London | Jarrold | 1919 | NaN | volumes, 58 pages (8°) | NaN | Digital Store 011649.g.81 | NaN | NaN | English | NaN | 000003140 |
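The sample rows show `Digitised Record Match` as zero-padded identifiers (for example `003996603`), whereas the statistics treat the column as a number (minimum 37). If the identifiers are meant to be fixed-width strings, which is an assumption based only on the rows shown here, a numeric value can be converted back as sketched below. `pad_record_id` and the width of nine digits are hypothetical.

```python
def pad_record_id(value, width=9):
    """Format a numeric identifier as a zero-padded string of the given width."""
    return f"{int(value):0{width}d}"


# 3996603 appears in the numeric value table and as 003996603 in the first sample row.
print(pad_record_id(3996603))
print(pad_record_id(37))
```

Reading the column as text in the first place avoids the round trip and keeps any leading zeros intact.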