Overview

Dataset statistics

Number of variables: 24
Number of observations: 52695
Missing cells: 561161
Missing cells (%): 44.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.6 MiB
Average record size in memory: 192.0 B
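
The percentage and memory figures above can be re-derived from the raw counts. The following is a minimal sketch of that arithmetic using only numbers reported in this block. The 8 bytes per cell is an assumption (a 64-bit number or an object pointer), not something the report states, although it is consistent with the 192.0 B average record size.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 24
n_observations = 52695
missing_cells = 561161

# Share of all cells (rows x columns) that are missing.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: every cell takes 8 bytes (64-bit value or object pointer).
bytes_per_cell = 8
record_size_bytes = n_variables * bytes_per_cell
total_mib = n_observations * record_size_bytes / 2**20

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {record_size_bytes} B")
print(f"Total size in memory: {total_mib:.1f} MiB")
```

Note that the memory estimate covers only the 8-byte cells, not the strings that the categorical columns point to.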

Variable types

Numeric: 2
Categorical: 22

Warnings

Name has a high cardinality: 28129 distinct values High cardinality
Dates associated with name has a high cardinality: 2757 distinct values High cardinality
All names has a high cardinality: 33026 distinct values High cardinality
Title has a high cardinality: 50029 distinct values High cardinality
Variant titles has a high cardinality: 1743 distinct values High cardinality
Series title has a high cardinality: 157 distinct values High cardinality
Number within series has a high cardinality: 110 distinct values High cardinality
Country of publication has a high cardinality: 71 distinct values High cardinality
Place of publication has a high cardinality: 3492 distinct values High cardinality
Publisher has a high cardinality: 7263 distinct values High cardinality
Date of publication has a high cardinality: 458 distinct values High cardinality
Edition has a high cardinality: 1559 distinct values High cardinality
Physical description has a high cardinality: 10735 distinct values High cardinality
Dewey classification has a high cardinality: 67 distinct values High cardinality
BL shelfmark has a high cardinality: 52345 distinct values High cardinality
Topics has a high cardinality: 1208 distinct values High cardinality
Genre has a high cardinality: 64 distinct values High cardinality
Languages has a high cardinality: 109 distinct values High cardinality
Notes has a high cardinality: 5667 distinct values High cardinality
Name has 5143 (9.8%) missing values Missing
Dates associated with name has 41870 (79.5%) missing values Missing
Type of name has 5143 (9.8%) missing values Missing
Role has 51015 (96.8%) missing values Missing
All names has 3062 (5.8%) missing values Missing
Variant titles has 46828 (88.9%) missing values Missing
Series title has 52435 (99.5%) missing values Missing
Number within series has 52584 (99.8%) missing values Missing
Country of publication has 16235 (30.8%) missing values Missing
Place of publication has 772 (1.5%) missing values Missing
Publisher has 25208 (47.8%) missing values Missing
Edition has 48497 (92.0%) missing values Missing
Physical description has 12849 (24.4%) missing values Missing
Dewey classification has 52617 (99.9%) missing values Missing
Topics has 49559 (94.0%) missing values Missing
Genre has 50722 (96.3%) missing values Missing
Notes has 46119 (87.5%) missing values Missing
Number within series is uniformly distributed Uniform
Dewey classification is uniformly distributed Uniform
BL shelfmark is uniformly distributed Uniform
BL record ID has unique values Unique

Reproduction

Analysis started: 2021-09-17 10:06:34.511002
Analysis finished: 2021-09-17 10:06:44.598969
Duration: 10.09 seconds
Software version: pandas-profiling v3.0.0
Download configuration: config.json
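
A report like this one can probably be regenerated along the following lines. This is only a sketch: the function name `build_report`, the file names in the usage note, and the report title are hypothetical, and the configuration actually used is the one recorded in config.json, which is not reproduced here.

```python
def build_report(csv_path, html_path):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imports are deferred so this snippet can be loaded even when the
    # optional dependencies are not installed.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings; the title is a placeholder, not the original one.
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(html_path)
    return profile
```

For example, `build_report("records.csv", "report.html")` would profile a hypothetical `records.csv` export and write the report next to it.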

Variables

BL record ID
Real number (ℝ≥0)

UNIQUE

Distinct: 52695
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14936555.58
Minimum: 14602826
Maximum: 16289062
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 411.8 KiB

Quantile statistics

Minimum: 14602826
5th percentile: 14635415.7
Q1: 14811722.5
Median: 14829381
Q3: 14872527.5
95th percentile: 16286365.3
Maximum: 16289062
Range: 1686236
Interquartile range (IQR): 60805

Descriptive statistics

Standard deviation: 367921.3739
Coefficient of variation (CV): 0.02463227696
Kurtosis: 8.42143983
Mean: 14936555.58
Median Absolute Deviation (MAD): 23411
Skewness: 3.08396255
Sum: 7.870817963 × 10^11
Variance: 1.353661374 × 10^11
Monotonicity: Strictly increasing
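
Several of the quantile and descriptive statistics for BL record ID are simple functions of the others. As a sanity check, the sketch below recomputes them from the reported values. Small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the BL record ID statistics in the report.
n = 52695
mean = 14936555.58
std = 367921.3739
minimum, maximum = 14602826, 16289062
q1, q3 = 14811722.5, 14872527.5

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum

print(value_range, iqr)
print(f"CV = {cv:.6f}, variance = {variance:.4e}, sum = {total:.4e}")
```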
Histogram with fixed size bins (bins=50)
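
With 50 fixed-size bins, every bin spans the same slice of the observed range. The sketch below shows how a record ID could be assigned to such a bin. Placing the maximum in the last bin follows the usual convention for equal-width histograms; that the report's plotting code does exactly this is an assumption.

```python
# Minimum and maximum of BL record ID, copied from the report.
minimum, maximum = 14602826, 16289062
n_bins = 50

bin_width = (maximum - minimum) / n_bins


def bin_index(value):
    """Return the 0-based index of the equal-width bin containing value."""
    if not minimum <= value <= maximum:
        raise ValueError("value outside the observed range")
    # The maximum is placed in the last bin rather than in a 51st one.
    return min(int((value - minimum) / bin_width), n_bins - 1)


print(bin_width)
print(bin_index(14829381))  # bin that holds the median
```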
Value                   Count    Frequency (%)
14602826                1        < 0.1%
14861274                1        < 0.1%
14861264                1        < 0.1%
14861265                1        < 0.1%
14861266                1        < 0.1%
14861267                1        < 0.1%
14861268                1        < 0.1%
14861269                1        < 0.1%
14861270                1        < 0.1%
14861271                1        < 0.1%
Other values (52685)    52685    > 99.9%
Value       Count    Frequency (%)
14602826    1        < 0.1%
14602830    1        < 0.1%
14602831    1        < 0.1%
14602832    1        < 0.1%
14602833    1        < 0.1%
14602834    1        < 0.1%
14602835    1        < 0.1%
14602836    1        < 0.1%
14602837    1        < 0.1%
14602838    1        < 0.1%

Value       Count    Frequency (%)
16289062    1        < 0.1%
16289061    1        < 0.1%
16289060    1        < 0.1%
16289059    1        < 0.1%
16289058    1        < 0.1%
16289057    1        < 0.1%
16289056    1        < 0.1%
16289055    1        < 0.1%
16289054    1        < 0.1%
16289053    1        < 0.1%

Type of resource
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 411.8 KiB

Monograph: 52556
Monographic component part: 93
Serial: 46

Length

Max length: 26
Median length: 9
Mean length: 9.027384002
Min length: 6

Characters and Unicode

Total characters: 475698
Distinct characters: 16
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
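
As an illustration of those character properties, the sketch below rebuilds the character and category counts for Type of resource from its three values and their counts, using Python's standard `unicodedata` module. The standard library exposes the General Category but not scripts or blocks, so only the category figures are reproduced here.

```python
import unicodedata
from collections import Counter

# Value counts copied from the "Type of resource" variable in the report.
value_counts = {
    "Monograph": 52556,
    "Monographic component part": 93,
    "Serial": 46,
}

# Weight every character by how often its value occurs.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

category_counts = Counter()
for ch, count in char_counts.items():
    # unicodedata.category returns the two-letter General Category,
    # e.g. "Lu" (uppercase letter), "Ll" (lowercase), "Zs" (space).
    category_counts[unicodedata.category(ch)] += count

total_characters = sum(char_counts.values())
mean_length = total_characters / sum(value_counts.values())

print(total_characters, len(char_counts), dict(category_counts))
print(round(mean_length, 9))
```

The totals agree with the report: 475698 characters, 16 distinct characters and 3 categories.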

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Monograph
2nd row: Monograph
3rd row: Monograph
4th row: Monograph
5th row: Monograph

Common Values

Value                         Count    Frequency (%)
Monograph                     52556    99.7%
Monographic component part    93       0.2%
Serial                        46       0.1%

Length

Histogram of lengths of the category

Pie chart

Value          Count    Frequency (%)
monograph      52556    99.4%
monographic    93       0.2%
component      93       0.2%
part           93       0.2%
serial         46       0.1%
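
The word counts above are consistent with lowercasing each value and splitting it on whitespace. The sketch below reproduces them under that assumption; the report's actual tokenisation rules may differ for values that contain punctuation.

```python
from collections import Counter

# Value counts copied from the "Type of resource" variable in the report.
value_counts = {
    "Monograph": 52556,
    "Monographic component part": 93,
    "Serial": 46,
}

# Assumed tokenisation: lowercase, then split on whitespace.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word:<12} {count:>6} {100 * count / total_words:5.1f}%")
```

The percentages are relative to the total number of words (52881), not the number of rows, which is why "monograph" shows 99.4% here but "Monograph" shows 99.7% under Common Values.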

Most occurring characters

Value               Count     Frequency (%)
o                   105484    22.2%
n                   52835     11.1%
p                   52835     11.1%
r                   52788     11.1%
a                   52788     11.1%
M                   52649     11.1%
g                   52649     11.1%
h                   52649     11.1%
c                   186       < 0.1%
(space)             186       < 0.1%
Other values (6)    649       0.1%

Most occurring categories

Value               Count     Frequency (%)
Lowercase Letter    422817    88.9%
Uppercase Letter    52695     11.1%
Space Separator     186       < 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o105484
24.9%
n52835
12.5%
p52835
12.5%
r52788
12.5%
a52788
12.5%
g52649
12.5%
h52649
12.5%
c186
 
< 0.1%
t186
 
< 0.1%
e139
 
< 0.1%
Other values (3)278
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
M52649
99.9%
S46
 
0.1%
Space Separator
ValueCountFrequency (%)
186
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin475512
> 99.9%
Common186
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
o105484
22.2%
n52835
11.1%
p52835
11.1%
r52788
11.1%
a52788
11.1%
M52649
11.1%
g52649
11.1%
h52649
11.1%
c186
 
< 0.1%
t186
 
< 0.1%
Other values (5)463
 
0.1%
Common
ValueCountFrequency (%)
186
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII475698
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o105484
22.2%
n52835
11.1%
p52835
11.1%
r52788
11.1%
a52788
11.1%
M52649
11.1%
g52649
11.1%
h52649
11.1%
c186
 
< 0.1%
186
 
< 0.1%
Other values (6)649
 
0.1%

Name
Categorical

HIGH CARDINALITY
MISSING

Distinct: 28129
Distinct (%): 59.2%
Missing: 5143
Missing (%): 9.8%
Memory size: 411.8 KiB
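
The Distinct (%) figure appears to be computed against the non-missing observations rather than against all 52695 rows. The sketch below checks that reading against the distinct and missing counts reported for several variables. The helper name `distinct_pct` is illustrative only.

```python
n_observations = 52695

# (distinct, missing) pairs copied from the respective variable sections.
variables = {
    "Name": (28129, 5143),
    "Dates associated with name": (2757, 41870),
    "Role": (33, 51015),
    "All names": (33026, 3062),
}


def distinct_pct(distinct, missing):
    """Distinct values as a percentage of the non-missing observations."""
    return 100 * distinct / (n_observations - missing)


for name, (distinct, missing) in variables.items():
    print(f"{name}: {distinct_pct(distinct, missing):.1f}%")
```

Dividing by all 52695 rows instead would give 53.4% for Name, which does not match the reported 59.2%.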
Great Britain, Hydrographic Department
 
159
Byron, George Gordon Byron, Baron
 
154
Scott, Walter, Sir
 
109
Wood, Henry, Mrs
 
103
Dickens, Charles
 
74
Other values (28124)
46953 

Length

Max length: 223
Median length: 20
Mean length: 22.66573435
Min length: 3

Characters and Unicode

Total characters: 1077801
Distinct characters: 159
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 21176
Unique (%): 44.5%

Sample

1st row: Yearsley, Ann
2nd row: A, T.
3rd row: Albert, Prince Consort, consort of Victoria, Queen of Great Britain
4th row: Anslow, Robert
5th row: Bellamy, James William

Common Values

ValueCountFrequency (%)
Great Britain, Hydrographic Department159
 
0.3%
Byron, George Gordon Byron, Baron154
 
0.3%
Scott, Walter, Sir109
 
0.2%
Wood, Henry, Mrs103
 
0.2%
Dickens, Charles74
 
0.1%
Oliphant, Mrs (Margaret)74
 
0.1%
Marryat, Florence58
 
0.1%
Goldsmith, Oliver55
 
0.1%
Dryden, John50
 
0.1%
Ainsworth, William Harrison47
 
0.1%
Other values (28119)46669
88.6%
(Missing)5143
 
9.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
john3376
 
2.2%
william3352
 
2.2%
of2797
 
1.8%
george1970
 
1.3%
henry1948
 
1.3%
charles1833
 
1.2%
james1824
 
1.2%
thomas1796
 
1.2%
de1626
 
1.1%
j1289
 
0.8%
Other values (22638)131290
85.8%

Most occurring characters

ValueCountFrequency (%)
105549
 
9.8%
e90721
 
8.4%
r73559
 
6.8%
a72314
 
6.7%
n60318
 
5.6%
o59621
 
5.5%
,57646
 
5.3%
i55437
 
5.1%
l50947
 
4.7%
s40060
 
3.7%
Other values (149)411629
38.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter746524
69.3%
Uppercase Letter143261
 
13.3%
Space Separator105549
 
9.8%
Other Punctuation73308
 
6.8%
Open Punctuation3775
 
0.4%
Close Punctuation3775
 
0.4%
Dash Punctuation1160
 
0.1%
Decimal Number319
 
< 0.1%
Nonspacing Mark89
 
< 0.1%
Modifier Letter41
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e90721
12.2%
r73559
9.9%
a72314
9.7%
n60318
 
8.1%
o59621
 
8.0%
i55437
 
7.4%
l50947
 
6.8%
s40060
 
5.4%
t38253
 
5.1%
h29021
 
3.9%
Other values (74)176273
23.6%
Uppercase Letter
ValueCountFrequency (%)
C11180
 
7.8%
J10877
 
7.6%
M10869
 
7.6%
H10219
 
7.1%
S9823
 
6.9%
B9735
 
6.8%
A9319
 
6.5%
W8982
 
6.3%
G8100
 
5.7%
R6940
 
4.8%
Other values (36)47217
33.0%
Decimal Number
ValueCountFrequency (%)
189
27.9%
848
15.0%
238
11.9%
329
 
9.1%
426
 
8.2%
925
 
7.8%
721
 
6.6%
519
 
6.0%
616
 
5.0%
08
 
2.5%
Other Punctuation
ValueCountFrequency (%)
,57646
78.6%
.14844
 
20.2%
'681
 
0.9%
*59
 
0.1%
?46
 
0.1%
&32
 
< 0.1%
Nonspacing Mark
ValueCountFrequency (%)
43
48.3%
43
48.3%
̡2
 
2.2%
̐1
 
1.1%
Modifier Letter
ValueCountFrequency (%)
ʹ35
85.4%
ʺ4
 
9.8%
ʿ2
 
4.9%
Open Punctuation
ValueCountFrequency (%)
(2939
77.9%
[836
 
22.1%
Close Punctuation
ValueCountFrequency (%)
)2939
77.9%
]836
 
22.1%
Space Separator
ValueCountFrequency (%)
105549
100.0%
Dash Punctuation
ValueCountFrequency (%)
-1160
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin889785
82.6%
Common187927
 
17.4%
Inherited89
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e90721
 
10.2%
r73559
 
8.3%
a72314
 
8.1%
n60318
 
6.8%
o59621
 
6.7%
i55437
 
6.2%
l50947
 
5.7%
s40060
 
4.5%
t38253
 
4.3%
h29021
 
3.3%
Other values (120)319534
35.9%
Common
ValueCountFrequency (%)
105549
56.2%
,57646
30.7%
.14844
 
7.9%
(2939
 
1.6%
)2939
 
1.6%
-1160
 
0.6%
[836
 
0.4%
]836
 
0.4%
'681
 
0.4%
189
 
< 0.1%
Other values (15)408
 
0.2%
Inherited
ValueCountFrequency (%)
43
48.3%
43
48.3%
̡2
 
2.2%
̐1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1073574
99.6%
Latin 1 Sup3653
 
0.3%
Latin Ext A430
 
< 0.1%
Half Marks86
 
< 0.1%
Modifier Letters41
 
< 0.1%
Latin Ext Additional9
 
< 0.1%
Latin Ext B5
 
< 0.1%
Diacriticals3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
105549
 
9.8%
e90721
 
8.5%
r73559
 
6.9%
a72314
 
6.7%
n60318
 
5.6%
o59621
 
5.6%
,57646
 
5.4%
i55437
 
5.2%
l50947
 
4.7%
s40060
 
3.7%
Other values (64)407402
37.9%
Latin 1 Sup
ValueCountFrequency (%)
é1569
43.0%
á344
 
9.4%
è294
 
8.0%
ç243
 
6.7%
É221
 
6.0%
ó173
 
4.7%
í169
 
4.6%
ö132
 
3.6%
ü103
 
2.8%
ë50
 
1.4%
Other values (28)355
 
9.7%
Modifier Letters
ValueCountFrequency (%)
ʹ35
85.4%
ʺ4
 
9.8%
ʿ2
 
4.9%
Latin Ext A
ValueCountFrequency (%)
ĭ104
24.2%
ł67
15.6%
ń42
9.8%
ī32
 
7.4%
ć21
 
4.9%
ē19
 
4.4%
š19
 
4.4%
č16
 
3.7%
ā15
 
3.5%
ő12
 
2.8%
Other values (20)83
19.3%
Latin Ext Additional
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Diacriticals
ValueCountFrequency (%)
̡2
66.7%
̐1
33.3%
Latin Ext B
ValueCountFrequency (%)
ǵ4
80.0%
ǎ1
 
20.0%
Half Marks
ValueCountFrequency (%)
43
50.0%
43
50.0%

Dates associated with name
Categorical

HIGH CARDINALITY
MISSING

Distinct: 2757
Distinct (%): 25.5%
Missing: 41870
Missing (%): 79.5%
Memory size: 411.8 KiB
1788-1824
 
154
1771-1832
 
109
1814-1887
 
106
1812-1870
 
74
1828-1897
 
74
Other values (2752)
10308 

Length

Max length37
Median length9
Mean length9.588637413
Min length4

Characters and Unicode

Total characters103797
Distinct characters27
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1230 ?
Unique (%)11.4%

Sample

1st row1753-1806
2nd row1819-1861
3rd row1600-1649
4th row1782-1865
5th row1772-1834

Common Values

ValueCountFrequency (%)
1788-1824154
 
0.3%
1771-1832109
 
0.2%
1814-1887106
 
0.2%
1812-187074
 
0.1%
1828-189774
 
0.1%
1833-189958
 
0.1%
approximately 1730-177455
 
0.1%
1805-188252
 
0.1%
1631-170049
 
0.1%
1865-193646
 
0.1%
Other values (2747)10048
 
19.1%
(Missing)41870
79.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
approximately424
 
3.7%
1788-1824155
 
1.3%
1771-1832109
 
0.9%
1814-1887106
 
0.9%
active95
 
0.8%
1812-187074
 
0.6%
1828-189774
 
0.6%
1833-189958
 
0.5%
1730-177455
 
0.5%
1805-188252
 
0.5%
Other values (2735)10289
89.5%

Most occurring characters

ValueCountFrequency (%)
125752
24.8%
816814
16.2%
-10795
10.4%
98057
 
7.8%
77259
 
7.0%
25009
 
4.8%
04746
 
4.6%
34605
 
4.4%
44504
 
4.3%
64375
 
4.2%
Other values (17)11881
11.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number85420
82.3%
Dash Punctuation10797
 
10.4%
Lowercase Letter6913
 
6.7%
Space Separator666
 
0.6%
Other Punctuation1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1057
15.3%
p962
13.9%
i576
8.3%
t576
8.3%
e576
8.3%
r526
7.6%
o526
7.6%
x481
7.0%
m481
7.0%
l481
7.0%
Other values (3)671
9.7%
Decimal Number
ValueCountFrequency (%)
125752
30.1%
816814
19.7%
98057
 
9.4%
77259
 
8.5%
25009
 
5.9%
04746
 
5.6%
34605
 
5.4%
44504
 
5.3%
64375
 
5.1%
54299
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
-10795
> 99.9%
2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
666
100.0%
Other Punctuation
ValueCountFrequency (%)
?1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common96884
93.3%
Latin6913
 
6.7%

Most frequent character per script

Common
ValueCountFrequency (%)
125752
26.6%
816814
17.4%
-10795
11.1%
98057
 
8.3%
77259
 
7.5%
25009
 
5.2%
04746
 
4.9%
34605
 
4.8%
44504
 
4.6%
64375
 
4.5%
Other values (4)4968
 
5.1%
Latin
ValueCountFrequency (%)
a1057
15.3%
p962
13.9%
i576
8.3%
t576
8.3%
e576
8.3%
r526
7.6%
o526
7.6%
x481
7.0%
m481
7.0%
l481
7.0%
Other values (3)671
9.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII103795
> 99.9%
Punctuation2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
125752
24.8%
816814
16.2%
-10795
10.4%
98057
 
7.8%
77259
 
7.0%
25009
 
4.8%
04746
 
4.6%
34605
 
4.4%
44504
 
4.3%
64375
 
4.2%
Other values (16)11879
11.4%
Punctuation
ValueCountFrequency (%)
2
100.0%

Type of name
Categorical

MISSING

Distinct: 3
Distinct (%): < 0.1%
Missing: 5143
Missing (%): 9.8%
Memory size: 411.8 KiB
person
45856 
organisation
 
1693
meeting/conference
 
3

Length

Max length18
Median length6
Mean length6.214375841
Min length6

Characters and Unicode

Total characters295506
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowperson
2nd rowperson
3rd rowperson
4th rowperson
5th rowperson

Common Values

ValueCountFrequency (%)
person45856
87.0%
organisation1693
 
3.2%
meeting/conference3
 
< 0.1%
(Missing)5143
 
9.8%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
person45856
96.4%
organisation1693
 
3.6%
meeting/conference3
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
n49251
16.7%
o49245
16.7%
r47552
16.1%
s47549
16.1%
e45871
15.5%
p45856
15.5%
i3389
 
1.1%
a3386
 
1.1%
g1696
 
0.6%
t1696
 
0.6%
Other values (4)15
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter295503
> 99.9%
Other Punctuation3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n49251
16.7%
o49245
16.7%
r47552
16.1%
s47549
16.1%
e45871
15.5%
p45856
15.5%
i3389
 
1.1%
a3386
 
1.1%
g1696
 
0.6%
t1696
 
0.6%
Other values (3)12
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
/3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin295503
> 99.9%
Common3
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n49251
16.7%
o49245
16.7%
r47552
16.1%
s47549
16.1%
e45871
15.5%
p45856
15.5%
i3389
 
1.1%
a3386
 
1.1%
g1696
 
0.6%
t1696
 
0.6%
Other values (3)12
 
< 0.1%
Common
ValueCountFrequency (%)
/3
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII295506
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n49251
16.7%
o49245
16.7%
r47552
16.1%
s47549
16.1%
e45871
15.5%
p45856
15.5%
i3389
 
1.1%
a3386
 
1.1%
g1696
 
0.6%
t1696
 
0.6%
Other values (4)15
 
< 0.1%

Role
Categorical

MISSING

Distinct: 33
Distinct (%): 2.0%
Missing: 51015
Missing (%): 96.8%
Memory size: 411.8 KiB
author
372 
writer
332 
novelist
292 
poet
281 
publisher
64 
Other values (28)
339 

Length

Max length22
Median length6
Mean length6.592261905
Min length4

Characters and Unicode

Total characters11075
Distinct characters26
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.4%

Sample

1st rowwriter
2nd rowpoet
3rd rowbookseller
4th rowpoet
5th rowpoet

Common Values

ValueCountFrequency (%)
author372
 
0.7%
writer332
 
0.6%
novelist292
 
0.6%
poet281
 
0.5%
publisher64
 
0.1%
editor62
 
0.1%
historian57
 
0.1%
engineer34
 
0.1%
printer31
 
0.1%
lecturer24
 
< 0.1%
Other values (23)131
 
0.2%
(Missing)51015
96.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
author374
22.0%
writer333
19.6%
novelist292
17.2%
poet287
16.9%
publisher65
 
3.8%
editor62
 
3.6%
historian57
 
3.4%
engineer34
 
2.0%
printer31
 
1.8%
lecturer24
 
1.4%
Other values (21)141
 
8.3%

Most occurring characters

ValueCountFrequency (%)
t1579
14.3%
r1546
14.0%
e1367
12.3%
o1194
10.8%
i982
8.9%
h534
 
4.8%
a517
 
4.7%
n504
 
4.6%
s493
 
4.5%
u486
 
4.4%
Other values (16)1873
16.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter11045
99.7%
Space Separator20
 
0.2%
Other Punctuation9
 
0.1%
Uppercase Letter1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t1579
14.3%
r1546
14.0%
e1367
12.4%
o1194
10.8%
i982
8.9%
h534
 
4.8%
a517
 
4.7%
n504
 
4.6%
s493
 
4.5%
u486
 
4.4%
Other values (13)1843
16.7%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Punctuation
ValueCountFrequency (%)
;9
100.0%
Uppercase Letter
ValueCountFrequency (%)
P1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin11046
99.7%
Common29
 
0.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
t1579
14.3%
r1546
14.0%
e1367
12.4%
o1194
10.8%
i982
8.9%
h534
 
4.8%
a517
 
4.7%
n504
 
4.6%
s493
 
4.5%
u486
 
4.4%
Other values (14)1844
16.7%
Common
ValueCountFrequency (%)
20
69.0%
;9
31.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII11075
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t1579
14.3%
r1546
14.0%
e1367
12.3%
o1194
10.8%
i982
8.9%
h534
 
4.8%
a517
 
4.7%
n504
 
4.6%
s493
 
4.5%
u486
 
4.4%
Other values (16)1873
16.9%

All names
Categorical

HIGH CARDINALITY
MISSING

Distinct: 33026
Distinct (%): 66.5%
Missing: 3062
Missing (%): 5.8%
Memory size: 411.8 KiB
Byron, George Gordon Byron, Baron, 1788-1824 [person]
 
126
Wood, Henry, Mrs, 1814-1887 [person]
 
103
Oliphant, Mrs (Margaret), 1828-1897 [person]
 
81
Scott, Walter, Sir, 1771-1832 [person]
 
78
Great Britain, Hydrographic Department [organisation]
 
69
Other values (33021)
49176 

Length

Max length623
Median length34
Mean length43.43241392
Min length13

Characters and Unicode

Total characters2155681
Distinct characters200
Distinct categories11 ?
Distinct scripts5 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26633 ?
Unique (%)53.7%

Sample

1st row: More, Hannah, 1745-1833 [person] ; Yearsley, Ann, 1753-1806 [person]
2nd row: Oldham, John, 1653-1683 [person] ; A, T. [person]
3rd row: Plimsoll, Joseph [person] ; Albert, Prince Consort, consort of Victoria, Queen of Great Britain, 1819-1861 [person]
4th row: Anslow, Robert [person]
5th row: Swift, Jonathan, 1667-1745 [person]
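
Judging from these sample rows, each All names cell packs several entries separated by " ; ", and each entry ends with a bracketed type and may carry a trailing date range. The sketch below splits a cell on that basis. The layout is inferred from the five samples only, and the function name `parse_all_names` and the "contains a digit" test for dates are assumptions that may not hold for every record.

```python
import re

# An entry is "<name>[, <dates>] [<type>]"; the type sits in square brackets.
ENTRY = re.compile(r"^(?P<body>.*) \[(?P<type>[^\]]+)\]$")


def parse_all_names(cell):
    """Split an 'All names' cell into dicts with name, dates and type."""
    parsed = []
    for entry in cell.split(" ; "):
        entry = entry.strip()
        match = ENTRY.match(entry)
        if match is None:
            # Entry does not follow the assumed layout; keep it verbatim.
            parsed.append({"name": entry, "dates": None, "type": None})
            continue
        body = match.group("body")
        head, sep, tail = body.rpartition(", ")
        # Heuristic: a trailing component that contains a digit is a date.
        if sep and any(ch.isdigit() for ch in tail):
            name, dates = head, tail
        else:
            name, dates = body, None
        parsed.append({"name": name, "dates": dates, "type": match.group("type")})
    return parsed


for item in parse_all_names("Oldham, John, 1653-1683 [person] ; A, T. [person]"):
    print(item)
```

The three fields correspond to the separate Name, Dates associated with name, and Type of name variables profiled earlier.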

Common Values

ValueCountFrequency (%)
Byron, George Gordon Byron, Baron, 1788-1824 [person]126
 
0.2%
Wood, Henry, Mrs, 1814-1887 [person]103
 
0.2%
Oliphant, Mrs (Margaret), 1828-1897 [person]81
 
0.2%
Scott, Walter, Sir, 1771-1832 [person]78
 
0.1%
Great Britain, Hydrographic Department [organisation]69
 
0.1%
Marryat, Florence, 1833-1899 [person]58
 
0.1%
Payn, James, 1830-1898 [person]54
 
0.1%
Dryden, John, 1631-1700 [person]42
 
0.1%
Fenn, George Manville [person]42
 
0.1%
Braddon, M. E. (Mary Elizabeth), 1835-1915 [person]42
 
0.1%
Other values (33016)48938
92.9%
(Missing)3062
 
5.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
person58866
 
20.6%
11319
 
4.0%
william4376
 
1.5%
john4361
 
1.5%
of3910
 
1.4%
george2609
 
0.9%
henry2573
 
0.9%
charles2348
 
0.8%
james2337
 
0.8%
thomas2276
 
0.8%
Other values (29674)190419
66.7%

Most occurring characters

ValueCountFrequency (%)
235761
 
10.9%
e176894
 
8.2%
r157517
 
7.3%
o141342
 
6.6%
n140514
 
6.5%
s112312
 
5.2%
a98590
 
4.6%
,90670
 
4.2%
i77015
 
3.6%
p71051
 
3.3%
Other values (190)854015
39.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1353621
62.8%
Space Separator235761
 
10.9%
Uppercase Letter182930
 
8.5%
Other Punctuation121385
 
5.6%
Decimal Number115067
 
5.3%
Open Punctuation65402
 
3.0%
Close Punctuation65402
 
3.0%
Dash Punctuation15930
 
0.7%
Nonspacing Mark123
 
< 0.1%
Modifier Letter52
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e176894
13.1%
r157517
11.6%
o141342
10.4%
n140514
10.4%
s112312
8.3%
a98590
 
7.3%
i77015
 
5.7%
p71051
 
5.2%
l66493
 
4.9%
t54074
 
4.0%
Other values (93)257819
19.0%
Uppercase Letter
ValueCountFrequency (%)
C14298
 
7.8%
J14017
 
7.7%
M13565
 
7.4%
H13003
 
7.1%
S12677
 
6.9%
B12512
 
6.8%
A11769
 
6.4%
W11578
 
6.3%
G10359
 
5.7%
R8855
 
4.8%
Other values (45)60297
33.0%
Decimal Number
ValueCountFrequency (%)
134796
30.2%
822308
19.4%
910555
 
9.2%
710283
 
8.9%
26666
 
5.8%
06217
 
5.4%
36187
 
5.4%
66129
 
5.3%
46001
 
5.2%
55925
 
5.1%
Other Punctuation
ValueCountFrequency (%)
,90670
74.7%
.18421
 
15.2%
;11276
 
9.3%
'850
 
0.7%
*69
 
0.1%
?57
 
< 0.1%
&34
 
< 0.1%
:4
 
< 0.1%
/4
 
< 0.1%
Other Letter
ValueCountFrequency (%)
ל2
25.0%
ו1
12.5%
י1
12.5%
א1
12.5%
ב1
12.5%
ר1
12.5%
ט1
12.5%
Nonspacing Mark
ValueCountFrequency (%)
59
48.0%
59
48.0%
̡2
 
1.6%
̐1
 
0.8%
̇1
 
0.8%
̢1
 
0.8%
Modifier Letter
ValueCountFrequency (%)
ʹ44
84.6%
ʺ5
 
9.6%
ʿ3
 
5.8%
Dash Punctuation
ValueCountFrequency (%)
-15928
> 99.9%
2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[61765
94.4%
(3637
 
5.6%
Close Punctuation
ValueCountFrequency (%)
]61765
94.4%
)3637
 
5.6%
Space Separator
ValueCountFrequency (%)
235761
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1536473
71.3%
Common618999
28.7%
Inherited123
 
< 0.1%
Cyrillic78
 
< 0.1%
Hebrew8
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e176894
11.5%
r157517
 
10.3%
o141342
 
9.2%
n140514
 
9.1%
s112312
 
7.3%
a98590
 
6.4%
i77015
 
5.0%
p71051
 
4.6%
l66493
 
4.3%
t54074
 
3.5%
Other values (125)440671
28.7%
Common
ValueCountFrequency (%)
235761
38.1%
,90670
 
14.6%
[61765
 
10.0%
]61765
 
10.0%
134796
 
5.6%
822308
 
3.6%
.18421
 
3.0%
-15928
 
2.6%
;11276
 
1.8%
910555
 
1.7%
Other values (19)55754
 
9.0%
Cyrillic
ValueCountFrequency (%)
и15
19.2%
л6
 
7.7%
е5
 
6.4%
в5
 
6.4%
а5
 
6.4%
к4
 
5.1%
Н4
 
5.1%
ч4
 
5.1%
м3
 
3.8%
й3
 
3.8%
Other values (13)24
30.8%
Hebrew
ValueCountFrequency (%)
ל2
25.0%
ו1
12.5%
י1
12.5%
א1
12.5%
ב1
12.5%
ר1
12.5%
ט1
12.5%
Inherited
ValueCountFrequency (%)
59
48.0%
59
48.0%
̡2
 
1.6%
̐1
 
0.8%
̇1
 
0.8%
̢1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII2150164
99.7%
Latin 1 Sup4706
 
0.2%
Latin Ext A530
 
< 0.1%
Half Marks118
 
< 0.1%
Cyrillic78
 
< 0.1%
Modifier Letters52
 
< 0.1%
Latin Ext Additional13
 
< 0.1%
Hebrew8
 
< 0.1%
Diacriticals5
 
< 0.1%
Latin Ext B5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
235761
 
11.0%
e176894
 
8.2%
r157517
 
7.3%
o141342
 
6.6%
n140514
 
6.5%
s112312
 
5.2%
a98590
 
4.6%
,90670
 
4.2%
i77015
 
3.6%
p71051
 
3.3%
Other values (67)848498
39.5%
Latin 1 Sup
ValueCountFrequency (%)
é2019
42.9%
á424
 
9.0%
è385
 
8.2%
ç331
 
7.0%
É299
 
6.4%
ó221
 
4.7%
í216
 
4.6%
ö170
 
3.6%
ü121
 
2.6%
ë67
 
1.4%
Other values (29)453
 
9.6%
Hebrew
ValueCountFrequency (%)
ל2
25.0%
ו1
12.5%
י1
12.5%
א1
12.5%
ב1
12.5%
ר1
12.5%
ט1
12.5%
Punctuation
ValueCountFrequency (%)
2
100.0%
Modifier Letters
ValueCountFrequency (%)
ʹ44
84.6%
ʺ5
 
9.6%
ʿ3
 
5.8%
Latin Ext A
ValueCountFrequency (%)
ĭ123
23.2%
ł88
16.6%
ń52
9.8%
ī41
 
7.7%
š27
 
5.1%
ć25
 
4.7%
č21
 
4.0%
ē20
 
3.8%
ā18
 
3.4%
ő15
 
2.8%
Other values (22)100
18.9%
Latin Ext Additional
ValueCountFrequency (%)
4
30.8%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Diacriticals
ValueCountFrequency (%)
̡2
40.0%
̐1
20.0%
̇1
20.0%
̢1
20.0%
Latin Ext B
ValueCountFrequency (%)
ǵ4
80.0%
ǎ1
 
20.0%
Half Marks
ValueCountFrequency (%)
59
50.0%
59
50.0%
Cyrillic
ValueCountFrequency (%)
и15
19.2%
л6
 
7.7%
е5
 
6.4%
в5
 
6.4%
а5
 
6.4%
к4
 
5.1%
Н4
 
5.1%
ч4
 
5.1%
м3
 
3.8%
й3
 
3.8%
Other values (13)24
30.8%

Title
Categorical

HIGH CARDINALITY

Distinct: 50029
Distinct (%): 94.9%
Missing: 0
Missing (%): 0.0%
Memory size: 411.8 KiB
Poems
 
240
Cook's Handbook for London. With two maps
 
17
Poems on several occasions
 
14
Poems, etc
 
13
Verses
 
13
Other values (50024)
52398 

Length

Max length1407
Median length68
Mean length84.80402315
Min length3

Characters and Unicode

Total characters4468748
Distinct characters381
Distinct categories16 ?
Distinct scripts7 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48139 ?
Unique (%)91.4%

Sample

1st rowPoems on several occasions [With a prefatory letter by Hannah More.]
2nd rowA Satyr against Vertue. (A poem: supposed to be spoken by a Town-Hector [By John Oldham. The preface signed: T. A.])
3rd rowThe Aeronaut, a poem; founded almost entirely, upon a statement, printed in the newspapers, of a voyage from Dublin, in October, 1812
4th rowThe Prince Albert, a poem [By Joseph Plimsoll.]
5th rowThe Defeat of the Spanish Armada, A.D. 1588. A tercentenary ballad, A.D. 1888

Common Values

ValueCountFrequency (%)
Poems240
 
0.5%
Cook's Handbook for London. With two maps17
 
< 0.1%
Poems on several occasions14
 
< 0.1%
Poems, etc13
 
< 0.1%
Verses13
 
< 0.1%
Miscellaneous Poems12
 
< 0.1%
Sonnets11
 
< 0.1%
Poems on various subjects10
 
< 0.1%
The Bride of Abydos. A Turkish tale9
 
< 0.1%
Childe Harold's Pilgrimage. A romaunt [Cantos I and II. With fourteen other poems.]9
 
< 0.1%
Other values (50019)52347
99.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
the49372
 
6.8%
of39302
 
5.4%
a26175
 
3.6%
and23740
 
3.3%
18542
 
2.5%
in14087
 
1.9%
by13718
 
1.9%
with11198
 
1.5%
etc9660
 
1.3%
de7584
 
1.0%
Other values (66559)516320
70.8%

Most occurring characters

ValueCountFrequency (%)
677003
15.1%
e404608
 
9.1%
t266325
 
6.0%
a252898
 
5.7%
i252203
 
5.6%
o251210
 
5.6%
n247714
 
5.5%
r229145
 
5.1%
s202728
 
4.5%
h136476
 
3.1%
Other values (371)1548438
34.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter3189838
71.4%
Space Separator677003
 
15.1%
Uppercase Letter319723
 
7.2%
Other Punctuation197421
 
4.4%
Decimal Number42642
 
1.0%
Open Punctuation16242
 
0.4%
Close Punctuation16242
 
0.4%
Dash Punctuation9209
 
0.2%
Nonspacing Mark288
 
< 0.1%
Other Letter60
 
< 0.1%
Other values (6)80
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e404608
12.7%
t266325
 
8.3%
a252898
 
7.9%
i252203
 
7.9%
o251210
 
7.9%
n247714
 
7.8%
r229145
 
7.2%
s202728
 
6.4%
h136476
 
4.3%
l136461
 
4.3%
Other values (181)810070
25.4%
Uppercase Letter
ValueCountFrequency (%)
A31191
 
9.8%
T25856
 
8.1%
S25161
 
7.9%
C21730
 
6.8%
B19026
 
6.0%
W18145
 
5.7%
M17647
 
5.5%
P16860
 
5.3%
H15006
 
4.7%
L14227
 
4.4%
Other values (103)114874
35.9%
Other Letter
ValueCountFrequency (%)
º29
48.3%
ל5
 
8.3%
ב4
 
6.7%
מ3
 
5.0%
ת3
 
5.0%
ה3
 
5.0%
ש2
 
3.3%
א2
 
3.3%
פ1
 
1.7%
ע1
 
1.7%
Other values (7)7
 
11.7%
Other Punctuation
ValueCountFrequency (%)
.118136
59.8%
,49865
25.3%
'16011
 
8.1%
:6015
 
3.0%
;5505
 
2.8%
&913
 
0.5%
?354
 
0.2%
!306
 
0.2%
*275
 
0.1%
/26
 
< 0.1%
Other values (6)15
 
< 0.1%
Nonspacing Mark
ValueCountFrequency (%)
129
44.8%
129
44.8%
̡10
 
3.5%
̂4
 
1.4%
̈3
 
1.0%
͡3
 
1.0%
̀2
 
0.7%
̒2
 
0.7%
̃1
 
0.3%
̤1
 
0.3%
Other values (4)4
 
1.4%
Decimal Number
ValueCountFrequency (%)
112669
29.7%
88218
19.3%
73398
 
8.0%
63059
 
7.2%
22746
 
6.4%
52643
 
6.2%
42592
 
6.1%
92468
 
5.8%
02451
 
5.7%
32398
 
5.6%
Modifier Letter
ValueCountFrequency (%)
ʹ27
65.9%
12
29.3%
ʿ1
 
2.4%
ʺ1
 
2.4%
Private Use
ValueCountFrequency (%)
10
45.5%
10
45.5%
1
 
4.5%
1
 
4.5%
Open Punctuation
ValueCountFrequency (%)
[13806
85.0%
(2436
 
15.0%
Close Punctuation
ValueCountFrequency (%)
]13806
85.0%
)2436
 
15.0%
Currency Symbol
ValueCountFrequency (%)
£7
87.5%
$1
 
12.5%
Other Symbol
ValueCountFrequency (%)
°3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
677003
100.0%
Dash Punctuation
ValueCountFrequency (%)
-9209
100.0%
Math Symbol
ValueCountFrequency (%)
=4
100.0%
Other Number
ValueCountFrequency (%)
¹1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin3469955
77.6%
Common958805
 
21.5%
Cyrillic36080
 
0.8%
Greek3567
 
0.1%
Inherited288
 
< 0.1%
Hebrew31
 
< 0.1%
Unknown22
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e404608
 
11.7%
t266325
 
7.7%
a252898
 
7.3%
i252203
 
7.3%
o251210
 
7.2%
n247714
 
7.1%
r229145
 
6.6%
s202728
 
5.8%
h136476
 
3.9%
l136461
 
3.9%
Other values (147)1090187
31.4%
Cyrillic
ValueCountFrequency (%)
о3329
 
9.2%
а2688
 
7.5%
и2517
 
7.0%
е2397
 
6.6%
с2349
 
6.5%
н1974
 
5.5%
р1916
 
5.3%
т1673
 
4.6%
в1476
 
4.1%
к1435
 
4.0%
Other values (66)14326
39.7%
Greek
ValueCountFrequency (%)
α402
 
11.3%
ι306
 
8.6%
ο292
 
8.2%
τ248
 
7.0%
ν232
 
6.5%
ρ195
 
5.5%
ε176
 
4.9%
η144
 
4.0%
ς144
 
4.0%
κ142
 
4.0%
Other values (63)1286
36.1%
Common
ValueCountFrequency (%)
677003
70.6%
.118136
 
12.3%
,49865
 
5.2%
'16011
 
1.7%
[13806
 
1.4%
]13806
 
1.4%
112669
 
1.3%
-9209
 
1.0%
88218
 
0.9%
:6015
 
0.6%
Other values (31)34067
 
3.6%
Hebrew
ValueCountFrequency (%)
ל5
16.1%
ב4
12.9%
מ3
9.7%
ת3
9.7%
ה3
9.7%
ש2
 
6.5%
א2
 
6.5%
פ1
 
3.2%
ע1
 
3.2%
ו1
 
3.2%
Other values (6)6
19.4%
Inherited
ValueCountFrequency (%)
129
44.8%
129
44.8%
̡10
 
3.5%
̂4
 
1.4%
̈3
 
1.0%
͡3
 
1.0%
̀2
 
0.7%
̒2
 
0.7%
̃1
 
0.3%
̤1
 
0.3%
Other values (4)4
 
1.4%
Unknown
ValueCountFrequency (%)
10
45.5%
10
45.5%
1
 
4.5%
1
 
4.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4408283 | 98.6%
Cyrillic | 36080 | 0.8%
Latin 1 Sup | 19287 | 0.4%
None | 3378 | 0.1%
Latin Ext A | 1122 | < 0.1%
Half Marks | 258 | < 0.1%
Greek Ext | 201 | < 0.1%
Hebrew | 31 | < 0.1%
Diacriticals | 30 | < 0.1%
Modifier Letters | 29 | < 0.1%
Other values (5) | 49 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
677003
15.4%
e404608
 
9.2%
t266325
 
6.0%
a252898
 
5.7%
i252203
 
5.7%
o251210
 
5.7%
n247714
 
5.6%
r229145
 
5.2%
s202728
 
4.6%
h136476
 
3.1%
Other values (70)1487973
33.8%
Latin 1 Sup
ValueCountFrequency (%)
é7561
39.2%
ü1499
 
7.8%
ä1496
 
7.8%
è1439
 
7.5%
ö1235
 
6.4%
á1062
 
5.5%
à969
 
5.0%
æ590
 
3.1%
ó565
 
2.9%
É515
 
2.7%
Other values (44)2356
 
12.2%
Latin Ext A
ValueCountFrequency (%)
ł180
16.0%
œ148
13.2%
ę87
 
7.8%
ī78
 
7.0%
ő77
 
6.9%
ĭ60
 
5.3%
ě58
 
5.2%
ń46
 
4.1%
ż43
 
3.8%
ą41
 
3.7%
Other values (34)304
27.1%
None
ValueCountFrequency (%)
α402
 
11.9%
ι306
 
9.1%
ο292
 
8.6%
τ248
 
7.3%
ν232
 
6.9%
ρ195
 
5.8%
ε176
 
5.2%
η144
 
4.3%
ς144
 
4.3%
κ142
 
4.2%
Other values (38)1097
32.5%
Latin Ext Additional
ValueCountFrequency (%)
2
15.4%
2
15.4%
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
ḿ1
7.7%
1
7.7%
1
7.7%
Modifier Letters
ValueCountFrequency (%)
ʹ27
93.1%
ʿ1
 
3.4%
ʺ1
 
3.4%
Greek Ext
ValueCountFrequency (%)
38
18.9%
25
12.4%
24
11.9%
14
 
7.0%
12
 
6.0%
11
 
5.5%
10
 
5.0%
7
 
3.5%
7
 
3.5%
7
 
3.5%
Other values (16)46
22.9%
Punctuation
ValueCountFrequency (%)
8
80.0%
2
 
20.0%
Hebrew
ValueCountFrequency (%)
ל5
16.1%
ב4
12.9%
מ3
9.7%
ת3
9.7%
ה3
9.7%
ש2
 
6.5%
א2
 
6.5%
פ1
 
3.2%
ע1
 
3.2%
ו1
 
3.2%
Other values (6)6
19.4%
Diacriticals
ValueCountFrequency (%)
̡10
33.3%
̂4
 
13.3%
̈3
 
10.0%
͡3
 
10.0%
̀2
 
6.7%
̒2
 
6.7%
̃1
 
3.3%
̤1
 
3.3%
̔1
 
3.3%
ͅ1
 
3.3%
Other values (2)2
 
6.7%
Cyrillic
ValueCountFrequency (%)
о3329
 
9.2%
а2688
 
7.5%
и2517
 
7.0%
е2397
 
6.6%
с2349
 
6.5%
н1974
 
5.5%
р1916
 
5.3%
т1673
 
4.6%
в1476
 
4.1%
к1435
 
4.0%
Other values (66)14326
39.7%
Dingbats
ValueCountFrequency (%)
1
100.0%
Latin Ext B
ValueCountFrequency (%)
ǔ1
33.3%
ǵ1
33.3%
ǒ1
33.3%
Half Marks
ValueCountFrequency (%)
129
50.0%
129
50.0%
PUA
ValueCountFrequency (%)
10
45.5%
10
45.5%
1
 
4.5%
1
 
4.5%

Variant titles
Categorical

HIGH CARDINALITY
MISSING

Distinct: 1743
Distinct (%): 29.7%
Missing: 46828
Missing (%): 88.9%
Memory size: 411.8 KiB

Most frequent values (count):
Single Works | 1206
Appendix | 899
Works | 142
Appendix. Miscellaneous | 123
Smaller Collections | 94
Other values (1738) | 3403

Length

Max length: 814
Median length: 18
Mean length: 30.2754389
Min length: 3

Characters and Unicode

Total characters: 177626
Distinct characters: 194
Distinct categories: 11
Distinct scripts: 5
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
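
As a rough illustration of how such per-category counts can be obtained, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The function name `category_counts` and the `CATEGORY_NAMES` mapping are assumptions made for this example; the input strings are taken from the Sample rows of this variable. It is not the code that produced this report, and it covers only categories, not scripts or blocks.

```python
import unicodedata
from collections import Counter

# Values copied from the "Sample" rows of the "Variant titles" variable.
values = ["Appendix", "Poetry. Selections", "Single Works. Britannia Rediviva"]

# Two-letter general-category codes mapped to the long names shown in the report.
# Only the categories that appear in this section are listed (illustrative subset).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lo": "Other Letter",
    "Lm": "Modifier Letter",
    "Mn": "Nonspacing Mark",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


counts = category_counts(values)
total = sum(counts.values())
for name, n in counts.most_common():
    print(f"{name}: {n} ({100 * n / total:.1f}%)")
```

Combining marks such as U+0301 fall under "Nonspacing Mark", which may explain why decomposed accents are counted separately from their base letters in the tables that follow.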

Unique

Unique: 1325
Unique (%): 22.6%
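
The percentages for this variable can be checked by hand. The sketch below assumes, based only on the numbers themselves, that Missing (%) is taken over all observations while Distinct (%), Unique (%) and the mean length are taken over non-missing rows; the report does not state this explicitly. The helper `pct` is a name introduced for this example.

```python
# Counts copied from the report for the "Variant titles" variable.
n_rows = 52695        # "Number of observations" in the dataset overview
n_missing = 46828
n_distinct = 1743
n_unique = 1325
total_chars = 177626

# Rows that actually hold a value.
n_present = n_rows - n_missing


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


missing_pct = pct(n_missing, n_rows)       # share of all rows that are empty
distinct_pct = pct(n_distinct, n_present)  # distinct values among non-missing rows
unique_pct = pct(n_unique, n_present)      # unique values among non-missing rows
mean_length = total_chars / n_present      # average characters per non-missing value

print(n_present, missing_pct, distinct_pct, unique_pct, round(mean_length, 4))
```

Under these assumptions the results agree with the reported 88.9%, 29.7%, 22.6% and a mean length of about 30.2754.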

Sample

1st row: Appendix
2nd row: Appendix. I. Contemporary Satires, Eulogies, etc
3rd row: Appendix. Elegies
4th row: Poetry. Selections
5th row: Single Works. Britannia Rediviva

Common Values

Value | Count | Frequency (%)
Single Works | 1206 | 2.3%
Appendix | 899 | 1.7%
Works | 142 | 0.3%
Appendix. Miscellaneous | 123 | 0.2%
Smaller Collections | 94 | 0.2%
Collections | 88 | 0.2%
Works. Selections | 45 | 0.1%
Appendix. Topography and Travels | 43 | 0.1%
Poetical Works | 42 | 0.1%
Plays. Single Plays | 35 | 0.1%
Other values (1733) | 3150 | 6.0%
(Missing) | 46828 | 88.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
works2519
 
9.6%
single1914
 
7.3%
appendix1665
 
6.4%
the838
 
3.2%
of772
 
3.0%
674
 
2.6%
and619
 
2.4%
collections373
 
1.4%
by314
 
1.2%
miscellaneous313
 
1.2%
Other values (4288)16139
61.7%

Most occurring characters

ValueCountFrequency (%)
20273
 
11.4%
e15080
 
8.5%
i12311
 
6.9%
n11193
 
6.3%
o10954
 
6.2%
s9814
 
5.5%
r9196
 
5.2%
a7951
 
4.5%
l7848
 
4.4%
t7226
 
4.1%
Other values (184)65780
37.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 131172 | 73.8%
Space Separator | 20273 | 11.4%
Uppercase Letter | 18082 | 10.2%
Other Punctuation | 6052 | 3.4%
Decimal Number | 1391 | 0.8%
Dash Punctuation | 192 | 0.1%
Open Punctuation | 144 | 0.1%
Close Punctuation | 144 | 0.1%
Nonspacing Mark | 123 | 0.1%
Other Letter | 28 | < 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e15080
11.5%
i12311
 
9.4%
n11193
 
8.5%
o10954
 
8.4%
s9814
 
7.5%
r9196
 
7.0%
a7951
 
6.1%
l7848
 
6.0%
t7226
 
5.5%
d5244
 
4.0%
Other values (84)34355
26.2%
Uppercase Letter
ValueCountFrequency (%)
S3205
17.7%
W2690
14.9%
A2258
12.5%
C1092
 
6.0%
I987
 
5.5%
P948
 
5.2%
T831
 
4.6%
M749
 
4.1%
E721
 
4.0%
B645
 
3.6%
Other values (46)3956
21.9%
Other Letter
ValueCountFrequency (%)
י6
21.4%
ו3
10.7%
ר3
10.7%
º3
10.7%
מ2
 
7.1%
ד2
 
7.1%
ת2
 
7.1%
נ1
 
3.6%
ש1
 
3.6%
ם1
 
3.6%
Other values (4)4
14.3%
Decimal Number
Value | Count | Frequency (%)
1 | 430 | 30.9%
8 | 242 | 17.4%
2 | 108 | 7.8%
7 | 104 | 7.5%
0 | 99 | 7.1%
6 | 97 | 7.0%
5 | 93 | 6.7%
4 | 87 | 6.3%
3 | 73 | 5.2%
9 | 58 | 4.2%
Other Punctuation
ValueCountFrequency (%)
.4236
70.0%
,941
 
15.5%
;514
 
8.5%
'294
 
4.9%
:49
 
0.8%
&10
 
0.2%
?7
 
0.1%
!1
 
< 0.1%
Nonspacing Mark
ValueCountFrequency (%)
59
48.0%
59
48.0%
͡4
 
3.3%
́1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
[97
67.4%
(47
32.6%
Close Punctuation
ValueCountFrequency (%)
]97
67.4%
)47
32.6%
Modifier Letter
ValueCountFrequency (%)
ʹ18
72.0%
ʿ7
 
28.0%
Space Separator
ValueCountFrequency (%)
20273
100.0%
Dash Punctuation
ValueCountFrequency (%)
-192
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 146219 | 82.3%
Common | 28221 | 15.9%
Cyrillic | 3038 | 1.7%
Inherited | 123 | 0.1%
Hebrew | 25 | < 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e15080
 
10.3%
i12311
 
8.4%
n11193
 
7.7%
o10954
 
7.5%
s9814
 
6.7%
r9196
 
6.3%
a7951
 
5.4%
l7848
 
5.4%
t7226
 
4.9%
d5244
 
3.6%
Other values (82)49402
33.8%
Cyrillic
ValueCountFrequency (%)
о271
 
8.9%
и256
 
8.4%
а201
 
6.6%
с188
 
6.2%
е176
 
5.8%
н165
 
5.4%
т154
 
5.1%
р146
 
4.8%
в129
 
4.2%
л102
 
3.4%
Other values (49)1250
41.1%
Common
ValueCountFrequency (%)
20273
71.8%
.4236
 
15.0%
,941
 
3.3%
;514
 
1.8%
1430
 
1.5%
'294
 
1.0%
8242
 
0.9%
-192
 
0.7%
2108
 
0.4%
7104
 
0.4%
Other values (16)887
 
3.1%
Hebrew
ValueCountFrequency (%)
י6
24.0%
ו3
12.0%
ר3
12.0%
מ2
 
8.0%
ד2
 
8.0%
ת2
 
8.0%
נ1
 
4.0%
ש1
 
4.0%
ם1
 
4.0%
ל1
 
4.0%
Other values (3)3
12.0%
Inherited
ValueCountFrequency (%)
59
48.0%
59
48.0%
͡4
 
3.3%
́1
 
0.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 173932 | 97.9%
Cyrillic | 3038 | 1.7%
Latin 1 Sup | 402 | 0.2%
Half Marks | 118 | 0.1%
Latin Ext A | 81 | < 0.1%
Hebrew | 25 | < 0.1%
Modifier Letters | 25 | < 0.1%
Diacriticals | 5 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20273
 
11.7%
e15080
 
8.7%
i12311
 
7.1%
n11193
 
6.4%
o10954
 
6.3%
s9814
 
5.6%
r9196
 
5.3%
a7951
 
4.6%
l7848
 
4.5%
t7226
 
4.2%
Other values (66)62086
35.7%
Latin 1 Sup
ValueCountFrequency (%)
é167
41.5%
æ39
 
9.7%
à26
 
6.5%
ö21
 
5.2%
ä20
 
5.0%
ü19
 
4.7%
è18
 
4.5%
É15
 
3.7%
ç15
 
3.7%
á15
 
3.7%
Other values (16)47
 
11.7%
Hebrew
ValueCountFrequency (%)
י6
24.0%
ו3
12.0%
ר3
12.0%
מ2
 
8.0%
ד2
 
8.0%
ת2
 
8.0%
נ1
 
4.0%
ש1
 
4.0%
ם1
 
4.0%
ל1
 
4.0%
Other values (3)3
12.0%
Latin Ext A
ValueCountFrequency (%)
ĭ33
40.7%
ā15
18.5%
ī13
 
16.0%
ń4
 
4.9%
ś4
 
4.9%
ł3
 
3.7%
ė2
 
2.5%
Œ1
 
1.2%
ź1
 
1.2%
ū1
 
1.2%
Other values (4)4
 
4.9%
Modifier Letters
ValueCountFrequency (%)
ʹ18
72.0%
ʿ7
 
28.0%
Half Marks
ValueCountFrequency (%)
59
50.0%
59
50.0%
Cyrillic
ValueCountFrequency (%)
о271
 
8.9%
и256
 
8.4%
а201
 
6.6%
с188
 
6.2%
е176
 
5.8%
н165
 
5.4%
т154
 
5.1%
р146
 
4.8%
в129
 
4.2%
л102
 
3.4%
Other values (49)1250
41.1%
Diacriticals
ValueCountFrequency (%)
͡4
80.0%
́1
 
20.0%

Series title
Categorical

HIGH CARDINALITY
MISSING

Distinct: 157
Distinct (%): 60.4%
Missing: 52435
Missing (%): 99.5%
Memory size: 411.8 KiB

Most frequent values (count):
Bell's English Classics | 19
The works of Charles Dickens | 18
Thomas Hardy's works. The Wessex novels | 15
Sailing Directions. America | 9
Routledge's sixpenny novels | 5
Other values (152) | 194

Length

Max length: 104
Median length: 30.5
Mean length: 37.33076923
Min length: 9

Characters and Unicode

Total characters: 9706
Distinct characters: 123
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 128
Unique (%): 49.2%

Sample

1st row: Reprints of Rare Tracts & Imprints, etc
2nd row: The illustrated English poems
3rd row: Duncombe's 'Minor British Drama'
4th row: Duncombe and Co.'s 'Minor British Drama'
5th row: Dicks' Standard Plays

Common Values

Value | Count | Frequency (%)
Bell's English Classics | 19 | < 0.1%
The works of Charles Dickens | 18 | < 0.1%
Thomas Hardy's works. The Wessex novels | 15 | < 0.1%
Sailing Directions. America | 9 | < 0.1%
Routledge's sixpenny novels | 5 | < 0.1%
Collection de documents relatifs à l'histoire de Paris pendant la Révolution française | 5 | < 0.1%
Way-about Series | 4 | < 0.1%
Macmillan's Illustrated standard novels | 4 | < 0.1%
The Romance of History | 4 | < 0.1%
Recueil de voyages et de documents pour servir à l'histoire de la géographie | 4 | < 0.1%
Other values (147) | 173 | 0.3%
(Missing) | 52435 | 99.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
the62
 
4.5%
of50
 
3.6%
de46
 
3.3%
works40
 
2.9%
english33
 
2.4%
classics27
 
1.9%
novels27
 
1.9%
documents24
 
1.7%
series20
 
1.4%
bell's20
 
1.4%
Other values (432)1043
74.9%

Most occurring characters

ValueCountFrequency (%)
1132
 
11.7%
e904
 
9.3%
s796
 
8.2%
i633
 
6.5%
r542
 
5.6%
a515
 
5.3%
o498
 
5.1%
n482
 
5.0%
l455
 
4.7%
t418
 
4.3%
Other values (113)3331
34.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 7490 | 77.2%
Space Separator | 1132 | 11.7%
Uppercase Letter | 755 | 7.8%
Other Punctuation | 241 | 2.5%
Decimal Number | 65 | 0.7%
Dash Punctuation | 17 | 0.2%
Open Punctuation | 3 | < 0.1%
Close Punctuation | 3 | < 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e904
12.1%
s796
10.6%
i633
 
8.5%
r542
 
7.2%
a515
 
6.9%
o498
 
6.6%
n482
 
6.4%
l455
 
6.1%
t418
 
5.6%
c275
 
3.7%
Other values (62)1972
26.3%
Uppercase Letter
ValueCountFrequency (%)
T94
12.5%
C70
 
9.3%
S68
 
9.0%
E56
 
7.4%
D56
 
7.4%
H53
 
7.0%
B47
 
6.2%
P45
 
6.0%
R41
 
5.4%
A36
 
4.8%
Other values (22)189
25.0%
Decimal Number
Value | Count | Frequency (%)
1 | 20 | 30.8%
8 | 11 | 16.9%
2 | 10 | 15.4%
7 | 6 | 9.2%
3 | 6 | 9.2%
5 | 5 | 7.7%
9 | 3 | 4.6%
4 | 2 | 3.1%
6 | 2 | 3.1%
Other Punctuation
ValueCountFrequency (%)
'106
44.0%
.99
41.1%
,30
 
12.4%
&2
 
0.8%
:2
 
0.8%
;2
 
0.8%
Space Separator
ValueCountFrequency (%)
1132
100.0%
Dash Punctuation
ValueCountFrequency (%)
-17
100.0%
Open Punctuation
ValueCountFrequency (%)
(3
100.0%
Close Punctuation
ValueCountFrequency (%)
)3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 8063 | 83.1%
Common | 1461 | 15.1%
Cyrillic | 182 | 1.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e904
 
11.2%
s796
 
9.9%
i633
 
7.9%
r542
 
6.7%
a515
 
6.4%
o498
 
6.2%
n482
 
6.0%
l455
 
5.6%
t418
 
5.2%
c275
 
3.4%
Other values (59)2545
31.6%
Cyrillic
ValueCountFrequency (%)
а17
 
9.3%
т17
 
9.3%
о16
 
8.8%
е15
 
8.2%
с14
 
7.7%
и12
 
6.6%
к11
 
6.0%
р11
 
6.0%
п6
 
3.3%
м6
 
3.3%
Other values (25)57
31.3%
Common
ValueCountFrequency (%)
1132
77.5%
'106
 
7.3%
.99
 
6.8%
,30
 
2.1%
120
 
1.4%
-17
 
1.2%
811
 
0.8%
210
 
0.7%
76
 
0.4%
36
 
0.4%
Other values (9)24
 
1.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 9383 | 96.7%
Cyrillic | 182 | 1.9%
Latin 1 Sup | 134 | 1.4%
Latin Ext A | 7 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1132
 
12.1%
e904
 
9.6%
s796
 
8.5%
i633
 
6.7%
r542
 
5.8%
a515
 
5.5%
o498
 
5.3%
n482
 
5.1%
l455
 
4.8%
t418
 
4.5%
Other values (59)3008
32.1%
Latin 1 Sup
ValueCountFrequency (%)
é59
44.0%
à19
 
14.2%
ö18
 
13.4%
ä7
 
5.2%
ñ6
 
4.5%
á6
 
4.5%
ç5
 
3.7%
ó4
 
3.0%
æ3
 
2.2%
å3
 
2.2%
Other values (3)4
 
3.0%
Latin Ext A
ValueCountFrequency (%)
ĭ2
28.6%
ż1
14.3%
ő1
14.3%
ů1
14.3%
č1
14.3%
ī1
14.3%
Cyrillic
ValueCountFrequency (%)
а17
 
9.3%
т17
 
9.3%
о16
 
8.8%
е15
 
8.2%
с14
 
7.7%
и12
 
6.6%
к11
 
6.0%
р11
 
6.0%
п6
 
3.3%
м6
 
3.3%
Other values (25)57
31.3%

Number within series
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct: 110
Distinct (%): 99.1%
Missing: 52584
Missing (%): 99.8%
Memory size: 411.8 KiB

Most frequent values (count):
number 4 [Way-about Series] | 2
volume 26, number 168 [Parliamentary Papers. House of Commons. Session 1831-32] | 1
number 1-5 [Série A. Opérations électorales de 1789] | 1
29, 2 [Bibliothek der neuesten und wichtigsten Reisebeschreibungen] | 1
Band 7 [Historische Bibliothek] | 1
Other values (105) | 105

Length

Max length: 115
Median length: 50
Mean length: 51.56756757
Min length: 23

Characters and Unicode

Total characters: 5724
Distinct characters: 100
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 109
Unique (%): 98.2%

Sample

1st row: volume 4 [Reprints of Rare Tracts & Imprints, etc]
2nd row: number 19 [Duncombe's 'Minor British Drama']
3rd row: number 7 [Duncombe and Co.'s 'Minor British Drama']
4th row: number 956 [Dicks' Standard Plays]
5th row: number 1 [Chants for Socialists]

Common Values

Value | Count | Frequency (%)
number 4 [Way-about Series] | 2 | < 0.1%
volume 26, number 168 [Parliamentary Papers. House of Commons. Session 1831-32] | 1 | < 0.1%
number 1-5 [Série A. Opérations électorales de 1789] | 1 | < 0.1%
29, 2 [Bibliothek der neuesten und wichtigsten Reisebeschreibungen] | 1 | < 0.1%
Band 7 [Historische Bibliothek] | 1 | < 0.1%
13 [Bibljoteka historyczna] | 1 | < 0.1%
volume 383 [Collection of Ancient and Modern British Authors] | 1 | < 0.1%
number 6 [Записки Императорской Академіи Наукъ. том. 8. прил] | 1 | < 0.1%
volume 1861, 1862 [Archæologia Cambrensis. Supplement] | 1 | < 0.1%
number 4 [Koninklijke Vlaamse Academie voor Taal- en Letterkunde. Publicaties. reeks 5] | 1 | < 0.1%
Other values (100) | 100 | 0.2%
(Missing) | 52584 | 99.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
volume45
 
5.2%
number44
 
5.0%
the25
 
2.9%
works21
 
2.4%
de21
 
2.4%
thomas19
 
2.2%
novels18
 
2.1%
hardy's17
 
1.9%
wessex17
 
1.9%
of17
 
1.9%
Other values (300)629
72.1%

Most occurring characters

ValueCountFrequency (%)
762
 
13.3%
e517
 
9.0%
s325
 
5.7%
r319
 
5.6%
o288
 
5.0%
n274
 
4.8%
i272
 
4.8%
a264
 
4.6%
t199
 
3.5%
l194
 
3.4%
Other values (90)2310
40.4%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 3945 | 68.9%
Space Separator | 762 | 13.3%
Uppercase Letter | 377 | 6.6%
Decimal Number | 260 | 4.5%
Other Punctuation | 140 | 2.4%
Open Punctuation | 111 | 1.9%
Close Punctuation | 111 | 1.9%
Dash Punctuation | 18 | 0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e517
13.1%
s325
 
8.2%
r319
 
8.1%
o288
 
7.3%
n274
 
6.9%
i272
 
6.9%
a264
 
6.7%
t199
 
5.0%
l194
 
4.9%
u191
 
4.8%
Other values (42)1102
27.9%
Uppercase Letter
ValueCountFrequency (%)
T53
14.1%
S46
12.2%
H39
10.3%
B28
 
7.4%
P26
 
6.9%
W23
 
6.1%
R22
 
5.8%
A17
 
4.5%
C14
 
3.7%
D13
 
3.4%
Other values (19)96
25.5%
Decimal Number
Value | Count | Frequency (%)
1 | 60 | 23.1%
3 | 42 | 16.2%
2 | 34 | 13.1%
4 | 25 | 9.6%
8 | 22 | 8.5%
5 | 21 | 8.1%
6 | 18 | 6.9%
7 | 16 | 6.2%
9 | 14 | 5.4%
0 | 8 | 3.1%
Other Punctuation
ValueCountFrequency (%)
.55
39.3%
'44
31.4%
,38
27.1%
&2
 
1.4%
:1
 
0.7%
Space Separator
ValueCountFrequency (%)
762
100.0%
Open Punctuation
ValueCountFrequency (%)
[111
100.0%
Close Punctuation
ValueCountFrequency (%)
]111
100.0%
Dash Punctuation
ValueCountFrequency (%)
-18
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 4282 | 74.8%
Common | 1402 | 24.5%
Cyrillic | 40 | 0.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e517
 
12.1%
s325
 
7.6%
r319
 
7.4%
o288
 
6.7%
n274
 
6.4%
i272
 
6.4%
a264
 
6.2%
t199
 
4.6%
l194
 
4.5%
u191
 
4.5%
Other values (51)1439
33.6%
Cyrillic
ValueCountFrequency (%)
а4
 
10.0%
и4
 
10.0%
к4
 
10.0%
п3
 
7.5%
м3
 
7.5%
р3
 
7.5%
о3
 
7.5%
с2
 
5.0%
е2
 
5.0%
т2
 
5.0%
Other values (10)10
25.0%
Common
ValueCountFrequency (%)
762
54.4%
[111
 
7.9%
]111
 
7.9%
160
 
4.3%
.55
 
3.9%
'44
 
3.1%
342
 
3.0%
,38
 
2.7%
234
 
2.4%
425
 
1.8%
Other values (9)120
 
8.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 5636 | 98.5%
Latin 1 Sup | 45 | 0.8%
Cyrillic | 40 | 0.7%
Latin Ext A | 3 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
762
 
13.5%
e517
 
9.2%
s325
 
5.8%
r319
 
5.7%
o288
 
5.1%
n274
 
4.9%
i272
 
4.8%
a264
 
4.7%
t199
 
3.5%
l194
 
3.4%
Other values (59)2222
39.4%
Latin 1 Sup
ValueCountFrequency (%)
é20
44.4%
ö9
20.0%
ä5
 
11.1%
à5
 
11.1%
á2
 
4.4%
É1
 
2.2%
å1
 
2.2%
æ1
 
2.2%
ü1
 
2.2%
Cyrillic
ValueCountFrequency (%)
а4
 
10.0%
и4
 
10.0%
к4
 
10.0%
п3
 
7.5%
м3
 
7.5%
р3
 
7.5%
о3
 
7.5%
с2
 
5.0%
е2
 
5.0%
т2
 
5.0%
Other values (10)10
25.0%
Latin Ext A
ValueCountFrequency (%)
ĭ2
66.7%
ī1
33.3%

Country of publication
Categorical

HIGH CARDINALITY
MISSING

Distinct: 71
Distinct (%): 0.2%
Missing: 16235
Missing (%): 30.8%
Memory size: 411.8 KiB

Most frequent values (count):
England | 30284
United States of America | 2298
Scotland | 1649
England ; Scotland | 621
Ireland | 434
Other values (66) | 1174

Length

Max length: 45
Median length: 7
Mean length: 8.624821722
Min length: 5

Characters and Unicode

Total characters: 314461
Distinct characters: 47
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 29
Unique (%): 0.1%

Sample

1st row: England
2nd row: England
3rd row: Ireland
4th row: England
5th row: England

Common Values

Value | Count | Frequency (%)
England | 30284 | 57.5%
United States of America | 2298 | 4.4%
Scotland | 1649 | 3.1%
England ; Scotland | 621 | 1.2%
Ireland | 434 | 0.8%
England ; United States of America | 394 | 0.7%
Italy | 119 | 0.2%
France | 92 | 0.2%
Wales | 69 | 0.1%
Russia | 58 | 0.1%
Other values (61) | 442 | 0.8%
(Missing) | 16235 | 30.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
england31380
67.0%
united2699
 
5.8%
states2698
 
5.8%
of2698
 
5.8%
america2698
 
5.8%
scotland2275
 
4.9%
1109
 
2.4%
ireland539
 
1.2%
italy119
 
0.3%
france92
 
0.2%
Other values (52)548
 
1.2%

Most occurring characters

ValueCountFrequency (%)
n68603
21.8%
a40278
12.8%
d36956
11.8%
l34485
11.0%
g31429
10.0%
E31380
10.0%
t10613
 
3.4%
10395
 
3.3%
e9058
 
2.9%
i5648
 
1.8%
Other values (37)35616
11.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 259907 | 82.7%
Uppercase Letter | 43049 | 13.7%
Space Separator | 10395 | 3.3%
Other Punctuation | 1109 | 0.4%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n68603
26.4%
a40278
15.5%
d36956
14.2%
l34485
13.3%
g31429
12.1%
t10613
 
4.1%
e9058
 
3.5%
i5648
 
2.2%
c5135
 
2.0%
o5124
 
2.0%
Other values (14)12578
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
E31380
72.9%
S5010
 
11.6%
A2733
 
6.3%
U2704
 
6.3%
I683
 
1.6%
F92
 
0.2%
W73
 
0.2%
N73
 
0.2%
G68
 
0.2%
R60
 
0.1%
Other values (10)173
 
0.4%
Space Separator
ValueCountFrequency (%)
10395
100.0%
Other Punctuation
ValueCountFrequency (%)
;1109
100.0%
Dash Punctuation
ValueCountFrequency (%)
-1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 302956 | 96.3%
Common | 11505 | 3.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
n68603
22.6%
a40278
13.3%
d36956
12.2%
l34485
11.4%
g31429
10.4%
E31380
10.4%
t10613
 
3.5%
e9058
 
3.0%
i5648
 
1.9%
c5135
 
1.7%
Other values (34)29371
9.7%
Common
ValueCountFrequency (%)
10395
90.4%
;1109
 
9.6%
-1
 
< 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 314461 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n68603
21.8%
a40278
12.8%
d36956
11.8%
l34485
11.0%
g31429
10.0%
E31380
10.0%
t10613
 
3.4%
10395
 
3.3%
e9058
 
2.9%
i5648
 
1.8%
Other values (37)35616
11.3%

Place of publication
Categorical

HIGH CARDINALITY
MISSING

Distinct: 3492
Distinct (%): 6.7%
Missing: 772
Missing (%): 1.5%
Memory size: 411.8 KiB

Most frequent values (count):
London | 26743
Paris | 2132
New York | 1179
Edinburgh | 1026
Edinburgh ; London | 524
Other values (3487) | 20319

Length

Max length: 288
Median length: 6
Mean length: 7.625272037
Min length: 4

Characters and Unicode

Total characters: 395927
Distinct characters: 193
Distinct categories: 10
Distinct scripts: 5
Distinct blocks: 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1995
Unique (%): 3.8%

Sample

1st row: London
2nd row: London
3rd row: Dublin
4th row: Plymouth
5th row: London

Common Values

Value | Count | Frequency (%)
London | 26743 | 50.8%
Paris | 2132 | 4.0%
New York | 1179 | 2.2%
Edinburgh | 1026 | 1.9%
Edinburgh ; London | 524 | 1.0%
Leipzig | 506 | 1.0%
Philadelphia | 450 | 0.9%
Berlin | 427 | 0.8%
Dublin | 424 | 0.8%
London ; New York | 356 | 0.7%
Other values (3482) | 18156 | 34.5%
(Missing) | 772 | 1.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
london29044
46.7%
2977
 
4.8%
paris2310
 
3.7%
york1797
 
2.9%
new1779
 
2.9%
edinburgh1698
 
2.7%
boston659
 
1.1%
leipzig628
 
1.0%
dublin488
 
0.8%
philadelphia477
 
0.8%
Other values (2815)20394
32.8%

Most occurring characters

ValueCountFrequency (%)
n71179
18.0%
o70489
17.8%
d35620
 
9.0%
L30903
 
7.8%
e17066
 
4.3%
r16616
 
4.2%
a15280
 
3.9%
i15003
 
3.8%
s11178
 
2.8%
10328
 
2.6%
Other values (183)102265
25.8%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 321702 | 81.3%
Uppercase Letter | 59157 | 14.9%
Space Separator | 10328 | 2.6%
Other Punctuation | 4242 | 1.1%
Dash Punctuation | 460 | 0.1%
Decimal Number | 26 | < 0.1%
Nonspacing Mark | 4 | < 0.1%
Modifier Letter | 4 | < 0.1%
Open Punctuation | 2 | < 0.1%
Close Punctuation | 2 | < 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n71179
22.1%
o70489
21.9%
d35620
11.1%
e17066
 
5.3%
r16616
 
5.2%
a15280
 
4.7%
i15003
 
4.7%
s11178
 
3.5%
t8631
 
2.7%
l8176
 
2.5%
Other values (107)52464
16.3%
Uppercase Letter
ValueCountFrequency (%)
L30903
52.2%
P3757
 
6.4%
B3271
 
5.5%
N2528
 
4.3%
M2017
 
3.4%
E1921
 
3.2%
Y1883
 
3.2%
C1879
 
3.2%
S1500
 
2.5%
D1101
 
1.9%
Other values (48)8397
 
14.2%
Decimal Number
Value | Count | Frequency (%)
1 | 9 | 34.6%
2 | 6 | 23.1%
6 | 5 | 19.2%
8 | 5 | 19.2%
3 | 1 | 3.8%
Other Punctuation
ValueCountFrequency (%)
;2956
69.7%
,1161
 
27.4%
'106
 
2.5%
&19
 
0.4%
Nonspacing Mark
ValueCountFrequency (%)
1
25.0%
1
25.0%
̒1
25.0%
̤1
25.0%
Space Separator
ValueCountFrequency (%)
10328
100.0%
Dash Punctuation
ValueCountFrequency (%)
-460
100.0%
Open Punctuation
ValueCountFrequency (%)
(2
100.0%
Close Punctuation
ValueCountFrequency (%)
)2
100.0%
Modifier Letter
ValueCountFrequency (%)
ʹ4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 374952 | 94.7%
Common | 15064 | 3.8%
Cyrillic | 5558 | 1.4%
Greek | 349 | 0.1%
Inherited | 4 | < 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n71179
19.0%
o70489
18.8%
d35620
9.5%
L30903
 
8.2%
e17066
 
4.6%
r16616
 
4.4%
a15280
 
4.1%
i15003
 
4.0%
s11178
 
3.0%
t8631
 
2.3%
Other values (86)82987
22.1%
Cyrillic
ValueCountFrequency (%)
е651
11.7%
р632
11.4%
т464
 
8.3%
а398
 
7.2%
ъ352
 
6.3%
к322
 
5.8%
г313
 
5.6%
у310
 
5.6%
б302
 
5.4%
С277
 
5.0%
Other values (37)1537
27.7%
Greek
ValueCountFrequency (%)
ν46
13.2%
ι41
11.7%
η37
10.6%
θ33
9.5%
32
9.2%
α32
9.2%
ς28
8.0%
ο15
 
4.3%
ρ10
 
2.9%
σ10
 
2.9%
Other values (22)65
18.6%
Common
ValueCountFrequency (%)
10328
68.6%
;2956
 
19.6%
,1161
 
7.7%
-460
 
3.1%
'106
 
0.7%
&19
 
0.1%
19
 
0.1%
26
 
< 0.1%
65
 
< 0.1%
85
 
< 0.1%
Other values (4)9
 
0.1%
Inherited
ValueCountFrequency (%)
1
25.0%
1
25.0%
̒1
25.0%
̤1
25.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 388753 | 98.2%
Cyrillic | 5558 | 1.4%
Latin 1 Sup | 1194 | 0.3%
None | 305 | 0.1%
Latin Ext A | 65 | < 0.1%
Greek Ext | 44 | < 0.1%
Modifier Letters | 4 | < 0.1%
Half Marks | 2 | < 0.1%
Diacriticals | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n71179
18.3%
o70489
18.1%
d35620
 
9.2%
L30903
 
7.9%
e17066
 
4.4%
r16616
 
4.3%
a15280
 
3.9%
i15003
 
3.9%
s11178
 
2.9%
10328
 
2.7%
Other values (55)95091
24.5%
Latin 1 Sup
ValueCountFrequency (%)
ü289
24.2%
é232
19.4%
ø190
15.9%
ö133
11.1%
á72
 
6.0%
è65
 
5.4%
æ38
 
3.2%
ó37
 
3.1%
â33
 
2.8%
É18
 
1.5%
Other values (18)87
 
7.3%
Latin Ext A
ValueCountFrequency (%)
ń25
38.5%
ě8
 
12.3%
ł7
 
10.8%
ż5
 
7.7%
œ3
 
4.6%
ĭ3
 
4.6%
ő3
 
4.6%
ō2
 
3.1%
š2
 
3.1%
ą1
 
1.5%
Other values (6)6
 
9.2%
Greek Ext
ValueCountFrequency (%)
32
72.7%
3
 
6.8%
3
 
6.8%
2
 
4.5%
2
 
4.5%
2
 
4.5%
None
ValueCountFrequency (%)
ν46
15.1%
ι41
13.4%
η37
12.1%
θ33
10.8%
α32
10.5%
ς28
9.2%
ο15
 
4.9%
ρ10
 
3.3%
σ10
 
3.3%
υ10
 
3.3%
Other values (16)43
14.1%
Cyrillic
ValueCountFrequency (%)
е651
11.7%
р632
11.4%
т464
 
8.3%
а398
 
7.2%
ъ352
 
6.3%
к322
 
5.8%
г313
 
5.6%
у310
 
5.6%
б302
 
5.4%
С277
 
5.0%
Other values (37)1537
27.7%
Half Marks
ValueCountFrequency (%)
1
50.0%
1
50.0%
Modifier Letters
ValueCountFrequency (%)
ʹ4
100.0%
Diacriticals
ValueCountFrequency (%)
̒1
50.0%
̤1
50.0%

Publisher
Categorical

HIGH CARDINALITY
MISSING

Distinct: 7263
Distinct (%): 26.4%
Missing: 25208
Missing (%): 47.8%
Memory size: 411.8 KiB

Most frequent values (count):
Macmillan | 546
Sampson Low | 490
Hurst & Blackett | 460
Chatto & Windus | 426
Chapman & Hall | 389
Other values (7258) | 25176

Length

Max length: 186
Median length: 12
Mean length: 12.93138575
Min length: 4

Characters and Unicode

Total characters: 355445
Distinct characters: 133
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5144
Unique (%): 18.7%

Sample

1st row: Richard Milliken
2nd row: W. Cann
3rd row: Elliot Stock
4th row: I. C. Bose
5th row: E. T. W. Dennis

Common Values

Value | Count | Frequency (%)
Macmillan | 546 | 1.0%
Sampson Low | 490 | 0.9%
Hurst & Blackett | 460 | 0.9%
Chatto & Windus | 426 | 0.8%
Chapman & Hall | 389 | 0.7%
Longmans | 364 | 0.7%
Richard Bentley | 355 | 0.7%
R. Bentley | 314 | 0.6%
John Murray | 306 | 0.6%
Cassell | 288 | 0.5%
Other values (7253) | 23549 | 44.7%
(Missing) | 25208 | 47.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
6055
 
9.4%
j2628
 
4.1%
w1939
 
3.0%
h1182
 
1.8%
a1130
 
1.8%
t1095
 
1.7%
r1035
 
1.6%
g1023
 
1.6%
john840
 
1.3%
c832
 
1.3%
Other values (4725)46808
72.5%

Most occurring characters

ValueCountFrequency (%)
37080
 
10.4%
e24970
 
7.0%
n23101
 
6.5%
a21560
 
6.1%
l18774
 
5.3%
o18608
 
5.2%
r17865
 
5.0%
i16390
 
4.6%
.14756
 
4.2%
t14224
 
4.0%
Other values (123)148117
41.7%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 236157 | 66.4%
Uppercase Letter | 58386 | 16.4%
Space Separator | 37080 | 10.4%
Other Punctuation | 23603 | 6.6%
Dash Punctuation | 103 | < 0.1%
Nonspacing Mark | 62 | < 0.1%
Decimal Number | 48 | < 0.1%
Modifier Letter | 5 | < 0.1%
Other Letter | 1 | < 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e24970
10.6%
n23101
 
9.8%
a21560
 
9.1%
l18774
 
7.9%
o18608
 
7.9%
r17865
 
7.6%
i16390
 
6.9%
t14224
 
6.0%
s13443
 
5.7%
h9042
 
3.8%
Other values (62)58180
24.6%
Uppercase Letter
ValueCountFrequency (%)
W5051
 
8.7%
S4933
 
8.4%
H4775
 
8.2%
B4676
 
8.0%
J4472
 
7.7%
C4222
 
7.2%
M3764
 
6.4%
R3293
 
5.6%
L3090
 
5.3%
G2594
 
4.4%
Other values (31)17516
30.0%
Decimal Number
Value | Count | Frequency (%)
1 | 15 | 31.2%
3 | 9 | 18.8%
9 | 7 | 14.6%
2 | 5 | 10.4%
5 | 4 | 8.3%
7 | 3 | 6.2%
0 | 2 | 4.2%
6 | 2 | 4.2%
4 | 1 | 2.1%
Other Punctuation
ValueCountFrequency (%)
.14756
62.5%
&4996
 
21.2%
,2271
 
9.6%
;1049
 
4.4%
'531
 
2.2%
Nonspacing Mark
ValueCountFrequency (%)
31
50.0%
31
50.0%
Space Separator
ValueCountFrequency (%)
37080
100.0%
Dash Punctuation
ValueCountFrequency (%)
-103
100.0%
Other Letter
ValueCountFrequency (%)
º1
100.0%
Modifier Letter
ValueCountFrequency (%)
ʹ5
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 294410 | 82.8%
Common | 60839 | 17.1%
Cyrillic | 134 | < 0.1%
Inherited | 62 | < 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e24970
 
8.5%
n23101
 
7.8%
a21560
 
7.3%
l18774
 
6.4%
o18608
 
6.3%
r17865
 
6.1%
i16390
 
5.6%
t14224
 
4.8%
s13443
 
4.6%
h9042
 
3.1%
Other values (65)116433
39.5%
Cyrillic
ValueCountFrequency (%)
а18
13.4%
о13
 
9.7%
р11
 
8.2%
е10
 
7.5%
и9
 
6.7%
н8
 
6.0%
в6
 
4.5%
т6
 
4.5%
г6
 
4.5%
п5
 
3.7%
Other values (29)42
31.3%
Common
ValueCountFrequency (%)
37080
60.9%
.14756
 
24.3%
&4996
 
8.2%
,2271
 
3.7%
;1049
 
1.7%
'531
 
0.9%
-103
 
0.2%
115
 
< 0.1%
39
 
< 0.1%
97
 
< 0.1%
Other values (7)22
 
< 0.1%
Inherited
ValueCountFrequency (%)
31
50.0%
31
50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 354995 | 99.9%
Latin 1 Sup | 220 | 0.1%
Cyrillic | 134 | < 0.1%
Half Marks | 62 | < 0.1%
Latin Ext A | 28 | < 0.1%
Modifier Letters | 5 | < 0.1%
Latin Ext Additional | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37080
 
10.4%
e24970
 
7.0%
n23101
 
6.5%
a21560
 
6.1%
l18774
 
5.3%
o18608
 
5.2%
r17865
 
5.0%
i16390
 
4.6%
.14756
 
4.2%
t14224
 
4.0%
Other values (58)147667
41.6%
Latin 1 Sup
ValueCountFrequency (%)
ü107
48.6%
á29
 
13.2%
é25
 
11.4%
è21
 
9.5%
ö16
 
7.3%
ä5
 
2.3%
ó5
 
2.3%
æ3
 
1.4%
â2
 
0.9%
ô2
 
0.9%
Other values (4)5
 
2.3%
Latin Ext A
ValueCountFrequency (%)
ĭ10
35.7%
ī7
25.0%
ō3
 
10.7%
ł3
 
10.7%
ő2
 
7.1%
Ė1
 
3.6%
ę1
 
3.6%
Ż1
 
3.6%
Half Marks
ValueCountFrequency (%)
31
50.0%
31
50.0%
Cyrillic
ValueCountFrequency (%)
а18
13.4%
о13
 
9.7%
р11
 
8.2%
е10
 
7.5%
и9
 
6.7%
н8
 
6.0%
в6
 
4.5%
т6
 
4.5%
г6
 
4.5%
п5
 
3.7%
Other values (29)42
31.3%
Modifier Letters
ValueCountFrequency (%)
ʹ5
100.0%
Latin Ext Additional
ValueCountFrequency (%)
1
100.0%

Date of publication
Categorical

HIGH CARDINALITY

Distinct: 458
Distinct (%): 0.9%
Missing: 178
Missing (%): 0.3%
Memory size: 411.8 KiB

Most frequent values (count):
1897 | 1478
1896 | 1414
1895 | 1277
1893 | 1205
1890 | 1182
Other values (453) | 45961

Length

Max length: 9
Median length: 4
Mean length: 4.007806996
Min length: 4

Characters and Unicode

Total characters: 210478
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 123
Unique (%): 0.2%

Sample

1st row: 1786
2nd row: 1679
3rd row: 1816
4th row: 1868
5th row: 1888

Common Values

Value | Count | Frequency (%)
1897 | 1478 | 2.8%
1896 | 1414 | 2.7%
1895 | 1277 | 2.4%
1893 | 1205 | 2.3%
1890 | 1182 | 2.2%
1894 | 1154 | 2.2%
1891 | 1129 | 2.1%
1898 | 1116 | 2.1%
1892 | 1103 | 2.1%
1889 | 1022 | 1.9%
Other values (448) | 40437 | 76.7%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
1897 | 1479 | 2.8%
1896 | 1415 | 2.7%
1895 | 1277 | 2.4%
1893 | 1206 | 2.3%
1890 | 1182 | 2.3%
1894 | 1154 | 2.2%
1891 | 1129 | 2.1%
1898 | 1116 | 2.1%
1892 | 1103 | 2.1%
1889 | 1022 | 1.9%
Other values (412) | 40434 | 77.0%

Most occurring characters

Value | Count | Frequency (%)
1 | 60749 | 28.9%
8 | 60618 | 28.8%
9 | 21261 | 10.1%
7 | 14294 | 6.8%
6 | 11744 | 5.6%
5 | 10399 | 4.9%
4 | 8566 | 4.1%
2 | 8074 | 3.8%
3 | 7453 | 3.5%
0 | 7206 | 3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 210364 | 99.9%
Dash Punctuation | 114 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 60749 | 28.9%
8 | 60618 | 28.8%
9 | 21261 | 10.1%
7 | 14294 | 6.8%
6 | 11744 | 5.6%
5 | 10399 | 4.9%
4 | 8566 | 4.1%
2 | 8074 | 3.8%
3 | 7453 | 3.5%
0 | 7206 | 3.4%
Dash Punctuation
ValueCountFrequency (%)
-114
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common210478
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
160749
28.9%
860618
28.8%
921261
 
10.1%
714294
 
6.8%
611744
 
5.6%
510399
 
4.9%
48566
 
4.1%
28074
 
3.8%
37453
 
3.5%
07206
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII210478
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160749
28.9%
860618
28.8%
921261
 
10.1%
714294
 
6.8%
611744
 
5.6%
510399
 
4.9%
48566
 
4.1%
28074
 
3.8%
37453
 
3.5%
07206
 
3.4%

Edition
Categorical

HIGH CARDINALITY
MISSING

Distinct: 1559
Distinct (%): 37.1%
Missing: 48497
Missing (%): 92.0%
Memory size: 411.8 KiB
Another edition
1027 
Second edition
457 
Third edition
 
214
New edition
 
184
Fourth edition
 
91
Other values (1554)
2225 

Length

Max length: 389
Median length: 15
Mean length: 35.27703668
Min length: 10

Characters and Unicode

Total characters: 148093
Distinct characters: 132
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4

Unique

Unique: 1437
Unique (%): 34.2%

Sample

1st row: Fourth edition MANUSCRIPT note
2nd row: New edition
3rd row: Second edition
4th row: Another edition
5th row: Second edition

Common Values

Value                     Count  Frequency (%)
Another edition           1027   1.9%
Second edition            457    0.9%
Third edition             214    0.4%
New edition               184    0.3%
Fourth edition            91     0.2%
A new edition             82     0.2%
Fifth edition             69     0.1%
Sixth edition             45     0.1%
Seventh edition           36     0.1%
Second edition, enlarged  24     < 0.1%
Other values (1549)       1969   3.7%
(Missing)                 48497  92.0%
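The frequencies in this table appear to be computed over all 52695 rows, missing cells included (for example 1027 / 52695 ≈ 1.9%). The sketch below illustrates that convention on a tiny hypothetical column. The function name `value_frequencies` is an assumption for illustration and is not taken from the tool that produced this report.

```python
from collections import Counter


def value_frequencies(values, top=3):
    """Return (value, count, percent of all rows) for the most common values.

    Assumption: percentages use the total row count, so missing cells
    (None here) lower every percentage and get their own entry.
    """
    total = len(values)
    counts = Counter("(Missing)" if v is None else v for v in values)
    return [(v, c, round(100 * c / total, 1)) for v, c in counts.most_common(top)]


# Hypothetical miniature column: 10 rows, 6 of them missing.
sample = ["Another edition"] * 2 + ["Second edition", "New edition"] + [None] * 6
print(value_frequencies(sample))
```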

Length

Histogram of lengths of the category
ValueCountFrequency (%)
edition4095
 
18.1%
another1724
 
7.6%
the787
 
3.5%
second734
 
3.2%
with678
 
3.0%
and620
 
2.7%
of594
 
2.6%
new577
 
2.6%
by576
 
2.5%
a560
 
2.5%
Other values (3258)11671
51.6%

Most occurring characters

ValueCountFrequency (%)
18418
12.4%
e15581
10.5%
i14568
 
9.8%
t12729
 
8.6%
n11751
 
7.9%
o11548
 
7.8%
d8801
 
5.9%
r6981
 
4.7%
h5482
 
3.7%
a5179
 
3.5%
Other values (122)37055
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter114855
77.6%
Space Separator18418
 
12.4%
Uppercase Letter9568
 
6.5%
Other Punctuation4289
 
2.9%
Decimal Number633
 
0.4%
Dash Punctuation134
 
0.1%
Open Punctuation98
 
0.1%
Close Punctuation98
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e15581
13.6%
i14568
12.7%
t12729
11.1%
n11751
10.2%
o11548
10.1%
d8801
7.7%
r6981
 
6.1%
h5482
 
4.8%
a5179
 
4.5%
s3782
 
3.3%
Other values (68)18453
16.1%
Uppercase Letter
ValueCountFrequency (%)
A2336
24.4%
S1226
12.8%
T829
 
8.7%
W615
 
6.4%
N590
 
6.2%
F456
 
4.8%
C399
 
4.2%
E323
 
3.4%
R288
 
3.0%
H281
 
2.9%
Other values (25)2225
23.3%
Decimal Number
ValueCountFrequency (%)
1199
31.4%
8104
16.4%
258
 
9.2%
652
 
8.2%
751
 
8.1%
342
 
6.6%
542
 
6.6%
432
 
5.1%
027
 
4.3%
926
 
4.1%
Other Punctuation
ValueCountFrequency (%)
,2797
65.2%
.1229
28.7%
'258
 
6.0%
?3
 
0.1%
!2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
18418
100.0%
Dash Punctuation
ValueCountFrequency (%)
-134
100.0%
Open Punctuation
ValueCountFrequency (%)
(98
100.0%
Close Punctuation
ValueCountFrequency (%)
)98
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin124284
83.9%
Common23670
 
16.0%
Cyrillic139
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e15581
12.5%
i14568
11.7%
t12729
10.2%
n11751
9.5%
o11548
9.3%
d8801
 
7.1%
r6981
 
5.6%
h5482
 
4.4%
a5179
 
4.2%
s3782
 
3.0%
Other values (69)27882
22.4%
Cyrillic
ValueCountFrequency (%)
и13
 
9.4%
о12
 
8.6%
е10
 
7.2%
в10
 
7.2%
с9
 
6.5%
р9
 
6.5%
н9
 
6.5%
а8
 
5.8%
і7
 
5.0%
т6
 
4.3%
Other values (24)46
33.1%
Common
ValueCountFrequency (%)
18418
77.8%
,2797
 
11.8%
.1229
 
5.2%
'258
 
1.1%
1199
 
0.8%
-134
 
0.6%
8104
 
0.4%
(98
 
0.4%
)98
 
0.4%
258
 
0.2%
Other values (9)277
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII147503
99.6%
Latin 1 Sup441
 
0.3%
Cyrillic139
 
0.1%
Latin Ext A10
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18418
12.5%
e15581
10.6%
i14568
 
9.9%
t12729
 
8.6%
n11751
 
8.0%
o11548
 
7.8%
d8801
 
6.0%
r6981
 
4.7%
h5482
 
3.7%
a5179
 
3.5%
Other values (61)36465
24.7%
Latin 1 Sup
ValueCountFrequency (%)
é282
63.9%
è71
 
16.1%
ó14
 
3.2%
à9
 
2.0%
ä8
 
1.8%
É8
 
1.8%
æ7
 
1.6%
á7
 
1.6%
ü6
 
1.4%
ö6
 
1.4%
Other values (11)23
 
5.2%
Latin Ext A
ValueCountFrequency (%)
ą3
30.0%
ę2
20.0%
ł2
20.0%
ő1
 
10.0%
Ż1
 
10.0%
œ1
 
10.0%
Cyrillic
ValueCountFrequency (%)
и13
 
9.4%
о12
 
8.6%
е10
 
7.2%
в10
 
7.2%
с9
 
6.5%
р9
 
6.5%
н9
 
6.5%
а8
 
5.8%
і7
 
5.0%
т6
 
4.3%
Other values (24)46
33.1%

Physical description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 10735
Distinct (%): 26.9%
Missing: 12849
Missing (%): 24.4%
Memory size: 411.8 KiB
3 volumes (8°)
 
2755
2 volumes (8°)
 
2488
(12°)
 
658
2 tomes (8°)
 
344
2 parts (8°)
 
307
Other values (10730)
33294 

Length

Max length: 200
Median length: 14
Mean length: 15.61062591
Min length: 5

Characters and Unicode

Total characters: 622021
Distinct characters: 85
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6

Unique

Unique: 7531
Unique (%): 18.9%

Sample

1st row: 15 pages (4°)
2nd row: 17 pages (8°)
3rd row: 16 pages (8°)
4th row: 40 pages (8°)
5th row: 7 pages (4°)

Common Values

Value                 Count  Frequency (%)
3 volumes (8°)        2755   5.2%
2 volumes (8°)        2488   4.7%
(12°)                 658    1.2%
2 tomes (8°)          344    0.7%
2 parts (8°)          307    0.6%
2 volumes (12°)       302    0.6%
16 pages (8°)         274    0.5%
32 pages (8°)         221    0.4%
3 volumes (12°)       200    0.4%
24 pages (8°)         157    0.3%
Other values (10725)  32140  61.0%
(Missing)             12849  24.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
33791
25.2%
pages29762
22.2%
volumes6935
 
5.2%
24593
 
3.4%
33549
 
2.6%
3081
 
2.3%
viii2378
 
1.8%
12°1701
 
1.3%
vi1234
 
0.9%
xii898
 
0.7%
Other values (1537)46229
34.5%

Most occurring characters

ValueCountFrequency (%)
94305
15.2%
840297
 
6.5%
s39872
 
6.4%
(39180
 
6.3%
)39178
 
6.3%
e39169
 
6.3%
°38630
 
6.2%
a32100
 
5.2%
p31271
 
5.0%
g29874
 
4.8%
Other values (75)198145
31.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter261771
42.1%
Decimal Number133286
21.4%
Space Separator94305
 
15.2%
Open Punctuation39180
 
6.3%
Close Punctuation39178
 
6.3%
Other Symbol38630
 
6.2%
Other Punctuation14995
 
2.4%
Uppercase Letter454
 
0.1%
Dash Punctuation209
 
< 0.1%
Modifier Letter7
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s39872
15.2%
e39169
15.0%
a32100
12.3%
p31271
11.9%
g29874
11.4%
i21068
8.0%
v14924
 
5.7%
l9644
 
3.7%
o9463
 
3.6%
m8766
 
3.3%
Other values (51)25620
9.8%
Decimal Number
ValueCountFrequency (%)
840297
30.2%
219517
14.6%
314690
 
11.0%
114640
 
11.0%
412741
 
9.6%
67286
 
5.5%
56884
 
5.2%
06414
 
4.8%
75543
 
4.2%
95274
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
B412
90.7%
H32
 
7.0%
A9
 
2.0%
J1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
,14994
> 99.9%
;1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+5
83.3%
×1
 
16.7%
Space Separator
ValueCountFrequency (%)
94305
100.0%
Open Punctuation
ValueCountFrequency (%)
(39180
100.0%
Other Symbol
ValueCountFrequency (%)
°38630
100.0%
Close Punctuation
ValueCountFrequency (%)
)39178
100.0%
Dash Punctuation
ValueCountFrequency (%)
-209
100.0%
Modifier Letter
ValueCountFrequency (%)
ʹ7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common359796
57.8%
Latin261880
42.1%
Cyrillic329
 
0.1%
Greek16
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
s39872
15.2%
e39169
15.0%
a32100
12.3%
p31271
11.9%
g29874
11.4%
i21068
8.0%
v14924
 
5.7%
l9644
 
3.7%
o9463
 
3.6%
m8766
 
3.3%
Other values (31)25729
9.8%
Common
ValueCountFrequency (%)
94305
26.2%
840297
11.2%
(39180
10.9%
)39178
10.9%
°38630
10.7%
219517
 
5.4%
,14994
 
4.2%
314690
 
4.1%
114640
 
4.1%
412741
 
3.5%
Other values (10)31624
 
8.8%
Cyrillic
ValueCountFrequency (%)
т83
25.2%
ч52
15.8%
с51
15.5%
а50
15.2%
о31
 
9.4%
м30
 
9.1%
к6
 
1.8%
н6
 
1.8%
ы4
 
1.2%
в4
 
1.2%
Other values (6)12
 
3.6%
Greek
ValueCountFrequency (%)
ζ3
18.8%
η2
12.5%
θ2
12.5%
τ2
12.5%
ο2
12.5%
μ2
12.5%
ι2
12.5%
β1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII582701
93.7%
Latin 1 Sup38959
 
6.3%
Cyrillic329
 
0.1%
None16
 
< 0.1%
Latin Ext A9
 
< 0.1%
Modifier Letters7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
94305
16.2%
840297
 
6.9%
s39872
 
6.8%
(39180
 
6.7%
)39178
 
6.7%
e39169
 
6.7%
a32100
 
5.5%
p31271
 
5.4%
g29874
 
5.1%
i21068
 
3.6%
Other values (36)176387
30.3%
Latin 1 Sup
ValueCountFrequency (%)
°38630
99.2%
ä262
 
0.7%
ö37
 
0.1%
é19
 
< 0.1%
ü5
 
< 0.1%
í3
 
< 0.1%
æ1
 
< 0.1%
ø1
 
< 0.1%
×1
 
< 0.1%
Cyrillic
ValueCountFrequency (%)
т83
25.2%
ч52
15.8%
с51
15.5%
а50
15.2%
о31
 
9.4%
м30
 
9.1%
к6
 
1.8%
н6
 
1.8%
ы4
 
1.2%
в4
 
1.2%
Other values (6)12
 
3.6%
Latin Ext A
ValueCountFrequency (%)
ę2
22.2%
ś2
22.2%
ć2
22.2%
š2
22.2%
ő1
11.1%
None
ValueCountFrequency (%)
ζ3
18.8%
η2
12.5%
θ2
12.5%
τ2
12.5%
ο2
12.5%
μ2
12.5%
ι2
12.5%
β1
 
6.2%
Modifier Letters
ValueCountFrequency (%)
ʹ7
100.0%

Dewey classification
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct: 67
Distinct (%): 85.9%
Missing: 52617
Missing (%): 99.9%
Memory size: 411.8 KiB
941
 
3
942
 
3
823.8
 
3
915.2
 
2
915
 
2
Other values (62)
65 

Length

Max length: 17
Median length: 5
Mean length: 5.5
Min length: 3

Characters and Unicode

Total characters: 429
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 59
Unique (%): 75.6%

Sample

1st row: 266.96
2nd row: 915.7
3rd row: 456
4th row: 914.7904
5th row: 623.89296

Common Values

Value              Count  Frequency (%)
941                3      < 0.1%
942                3      < 0.1%
823.8              3      < 0.1%
915.2              2      < 0.1%
915                2      < 0.1%
942.45             2      < 0.1%
942.1              2      < 0.1%
941.1              2      < 0.1%
447.9              1      < 0.1%
944.26             1      < 0.1%
Other values (57)  57     0.1%
(Missing)          52617  99.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
9423
 
3.7%
823.83
 
3.7%
9413
 
3.7%
9152
 
2.4%
941.12
 
2.4%
2
 
2.4%
942.452
 
2.4%
915.22
 
2.4%
942.12
 
2.4%
266.961
 
1.2%
Other values (60)60
73.2%

Most occurring characters

ValueCountFrequency (%)
981
18.9%
.64
14.9%
249
11.4%
448
11.2%
145
10.5%
831
 
7.2%
530
 
7.0%
623
 
5.4%
320
 
4.7%
718
 
4.2%
Other values (3)20
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number359
83.7%
Other Punctuation66
 
15.4%
Space Separator4
 
0.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
981
22.6%
249
13.6%
448
13.4%
145
12.5%
831
 
8.6%
530
 
8.4%
623
 
6.4%
320
 
5.6%
718
 
5.0%
014
 
3.9%
Other Punctuation
ValueCountFrequency (%)
.64
97.0%
;2
 
3.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common429
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
981
18.9%
.64
14.9%
249
11.4%
448
11.2%
145
10.5%
831
 
7.2%
530
 
7.0%
623
 
5.4%
320
 
4.7%
718
 
4.2%
Other values (3)20
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII429
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
981
18.9%
.64
14.9%
249
11.4%
448
11.2%
145
10.5%
831
 
7.2%
530
 
7.0%
623
 
5.4%
320
 
4.7%
718
 
4.2%
Other values (3)20
 
4.7%

BL shelfmark
Categorical

HIGH CARDINALITY
UNIFORM

Distinct: 52345
Distinct (%): 99.8%
Missing: 267
Missing (%): 0.5%
Memory size: 411.8 KiB
Digital Store 012626.e.8
 
6
Digital Store 012624.l
 
5
Digital Store 1303.b.3
 
5
Digital Store 10497.w.12
 
3
Digital Store 10007.f.22
 
3
Other values (52340)
52406 

Length

Max length: 94
Median length: 24
Mean length: 24.95017929
Min length: 14

Characters and Unicode

Total characters: 1308088
Distinct characters: 60
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 52281
Unique (%): 99.7%

Sample

1st row: Digital Store 11644.d.32
2nd row: Digital Store 11602.ee.10. (2.)
3rd row: Digital Store 992.i.12. (3.)
4th row: Digital Store 11602.ee.17. (1.)
5th row: Digital Store 11602.ee.17. (7.)

Common Values

Value                     Count  Frequency (%)
Digital Store 012626.e.8  6      < 0.1%
Digital Store 012624.l    5      < 0.1%
Digital Store 1303.b.3    5      < 0.1%
Digital Store 10497.w.12  3      < 0.1%
Digital Store 10007.f.22  3      < 0.1%
Digital Store 12274.m     3      < 0.1%
Digital Store 10497.w.20  3      < 0.1%
Digital Store 9314.c.7    3      < 0.1%
Digital Store 11609.k.5   3      < 0.1%
Digital Store 9225.m.15   3      < 0.1%
Other values (52335)      52391  99.4%
(Missing)                 267    0.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
digital53232
32.1%
store53232
32.1%
1834
 
0.5%
2826
 
0.5%
809
 
0.5%
3625
 
0.4%
4523
 
0.3%
5437
 
0.3%
6355
 
0.2%
7307
 
0.2%
Other values (48963)54444
32.9%

Most occurring characters

ValueCountFrequency (%)
.116639
 
8.9%
113196
 
8.7%
i110345
 
8.4%
t106597
 
8.1%
189661
 
6.9%
e67023
 
5.1%
g59130
 
4.5%
a55903
 
4.3%
055217
 
4.2%
l55158
 
4.2%
Other values (50)479219
36.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter603537
46.1%
Decimal Number356508
27.3%
Other Punctuation117791
 
9.0%
Space Separator113196
 
8.7%
Uppercase Letter106581
 
8.1%
Open Punctuation5115
 
0.4%
Close Punctuation5115
 
0.4%
Dash Punctuation245
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i110345
18.3%
t106597
17.7%
e67023
11.1%
g59130
9.8%
a55903
9.3%
l55158
9.1%
r53343
8.8%
o53292
8.8%
b8891
 
1.5%
f8631
 
1.4%
Other values (15)25224
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
D53232
49.9%
S53232
49.9%
J95
 
0.1%
R4
 
< 0.1%
B3
 
< 0.1%
K3
 
< 0.1%
F2
 
< 0.1%
C2
 
< 0.1%
T1
 
< 0.1%
M1
 
< 0.1%
Other values (6)6
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
189661
25.1%
055217
15.5%
246173
13.0%
635769
 
10.0%
328001
 
7.9%
427673
 
7.8%
922387
 
6.3%
520772
 
5.8%
718311
 
5.1%
812544
 
3.5%
Other Punctuation
ValueCountFrequency (%)
.116639
99.0%
;836
 
0.7%
/201
 
0.2%
,72
 
0.1%
*43
 
< 0.1%
Space Separator
ValueCountFrequency (%)
113196
100.0%
Open Punctuation
ValueCountFrequency (%)
(5115
100.0%
Close Punctuation
ValueCountFrequency (%)
)5115
100.0%
Dash Punctuation
ValueCountFrequency (%)
-245
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin710118
54.3%
Common597970
45.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
i110345
15.5%
t106597
15.0%
e67023
9.4%
g59130
8.3%
a55903
7.9%
l55158
7.8%
r53343
7.5%
o53292
7.5%
D53232
7.5%
S53232
7.5%
Other values (31)42863
 
6.0%
Common
ValueCountFrequency (%)
.116639
19.5%
113196
18.9%
189661
15.0%
055217
9.2%
246173
 
7.7%
635769
 
6.0%
328001
 
4.7%
427673
 
4.6%
922387
 
3.7%
520772
 
3.5%
Other values (9)42482
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1308088
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.116639
 
8.9%
113196
 
8.7%
i110345
 
8.4%
t106597
 
8.1%
189661
 
6.9%
e67023
 
5.1%
g59130
 
4.5%
a55903
 
4.3%
055217
 
4.2%
l55158
 
4.2%
Other values (50)479219
36.6%

Topics
Categorical

HIGH CARDINALITY
MISSING

Distinct: 1208
Distinct (%): 38.5%
Missing: 49559
Missing (%): 94.0%
Memory size: 411.8 KiB
India
 
216
Canada
 
157
Australia
 
99
Revolutions
 
85
New Zealand
 
73
Other values (1203)
2506 

Length

Max length: 547
Median length: 23
Mean length: 31.85299745
Min length: 3

Characters and Unicode

Total characters: 99891
Distinct characters: 93
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 5

Unique

Unique: 952
Unique (%): 30.4%

Sample

1st row: Dublin (Ireland)
2nd row: Trinidad and Tobago
3rd row: Shakespeare, William, 1564-1616--Anniversaries, etc
4th row: Saint Helena
5th row: Nelson, Horatio Nelson, Viscount, 1758-1805

Common Values

Value                                   Count  Frequency (%)
India                                   216    0.4%
Canada                                  157    0.3%
Australia                               99     0.2%
Revolutions                             85     0.2%
New Zealand                             73     0.1%
English fiction--19th century           53     0.1%
Canada ; British Columbia               41     0.1%
American Revolution (1775-1783)         34     0.1%
Revolution (France : 1789-1799)         32     0.1%
India ; India--Description and travel   31     0.1%
Other values (1198)                     2315   4.4%
(Missing)                               49559  94.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
1688
 
13.6%
and829
 
6.7%
india516
 
4.1%
travel480
 
3.9%
war240
 
1.9%
of220
 
1.8%
canada219
 
1.8%
south191
 
1.5%
century146
 
1.2%
new136
 
1.1%
Other values (2098)7771
62.5%

Most occurring characters

ValueCountFrequency (%)
9300
 
9.3%
a9119
 
9.1%
i7224
 
7.2%
n6829
 
6.8%
e5412
 
5.4%
r5248
 
5.3%
-4979
 
5.0%
t4574
 
4.6%
o4324
 
4.3%
s4105
 
4.1%
Other values (83)38777
38.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter66996
67.1%
Uppercase Letter9709
 
9.7%
Space Separator9300
 
9.3%
Dash Punctuation4979
 
5.0%
Decimal Number4926
 
4.9%
Other Punctuation2547
 
2.5%
Open Punctuation685
 
0.7%
Close Punctuation685
 
0.7%
Nonspacing Mark46
 
< 0.1%
Modifier Letter18
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a9119
13.6%
i7224
10.8%
n6829
10.2%
e5412
8.1%
r5248
 
7.8%
t4574
 
6.8%
o4324
 
6.5%
s4105
 
6.1%
l3484
 
5.2%
d3284
 
4.9%
Other values (33)13393
20.0%
Uppercase Letter
ValueCountFrequency (%)
I1357
14.0%
C856
 
8.8%
S781
 
8.0%
D664
 
6.8%
A664
 
6.8%
B613
 
6.3%
H511
 
5.3%
R505
 
5.2%
E473
 
4.9%
G415
 
4.3%
Other values (17)2870
29.6%
Decimal Number
ValueCountFrequency (%)
11564
31.7%
8710
14.4%
7511
 
10.4%
9455
 
9.2%
6436
 
8.9%
5406
 
8.2%
4286
 
5.8%
0258
 
5.2%
2151
 
3.1%
3149
 
3.0%
Other Punctuation
ValueCountFrequency (%)
;1547
60.7%
,760
29.8%
:142
 
5.6%
.75
 
2.9%
'18
 
0.7%
?5
 
0.2%
Nonspacing Mark
ValueCountFrequency (%)
23
50.0%
23
50.0%
Space Separator
ValueCountFrequency (%)
9300
100.0%
Open Punctuation
ValueCountFrequency (%)
(685
100.0%
Close Punctuation
ValueCountFrequency (%)
)685
100.0%
Dash Punctuation
ValueCountFrequency (%)
-4979
100.0%
Modifier Letter
ValueCountFrequency (%)
ʹ18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin76705
76.8%
Common23140
 
23.2%
Inherited46
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a9119
 
11.9%
i7224
 
9.4%
n6829
 
8.9%
e5412
 
7.1%
r5248
 
6.8%
t4574
 
6.0%
o4324
 
5.6%
s4105
 
5.4%
l3484
 
4.5%
d3284
 
4.3%
Other values (60)23102
30.1%
Common
ValueCountFrequency (%)
9300
40.2%
-4979
21.5%
11564
 
6.8%
;1547
 
6.7%
,760
 
3.3%
8710
 
3.1%
(685
 
3.0%
)685
 
3.0%
7511
 
2.2%
9455
 
2.0%
Other values (11)1944
 
8.4%
Inherited
ValueCountFrequency (%)
23
50.0%
23
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII99745
99.9%
Latin 1 Sup46
 
< 0.1%
Half Marks46
 
< 0.1%
Latin Ext A36
 
< 0.1%
Modifier Letters18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9300
 
9.3%
a9119
 
9.1%
i7224
 
7.2%
n6829
 
6.8%
e5412
 
5.4%
r5248
 
5.3%
-4979
 
5.0%
t4574
 
4.6%
o4324
 
4.3%
s4105
 
4.1%
Other values (62)38631
38.7%
Latin Ext A
ValueCountFrequency (%)
ā16
44.4%
ĭ8
22.2%
ū5
 
13.9%
ī2
 
5.6%
ń2
 
5.6%
ł1
 
2.8%
ű1
 
2.8%
č1
 
2.8%
Latin 1 Sup
ValueCountFrequency (%)
á13
28.3%
é12
26.1%
ó11
23.9%
ã2
 
4.3%
ú2
 
4.3%
ö2
 
4.3%
ü1
 
2.2%
ô1
 
2.2%
Á1
 
2.2%
ï1
 
2.2%
Half Marks
ValueCountFrequency (%)
23
50.0%
23
50.0%
Modifier Letters
ValueCountFrequency (%)
ʹ18
100.0%

Genre
Categorical

HIGH CARDINALITY
MISSING

Distinct: 64
Distinct (%): 3.2%
Missing: 50722
Missing (%): 96.3%
Memory size: 411.8 KiB
Poetry or verse
1002 
Drama
461 
Drama ; Poetry or verse
151 
Travel
 
77
Periodical
 
39
Other values (59)
243 
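Some values in this column, such as `Drama ; Poetry or verse`, appear to pack several genres into one cell separated by ` ; `. Assuming that separator convention holds, a sketch like the following would count individual genres rather than combined strings. The function name `split_counts` and the sample list are hypothetical.

```python
from collections import Counter


def split_counts(values, sep=" ; "):
    """Count individual entries in cells that may hold several values."""
    counts = Counter()
    for value in values:
        if value is None:  # skip missing cells
            continue
        counts.update(part.strip() for part in value.split(sep))
    return counts


# Hypothetical sample: two single-genre cells, one combined cell, one missing.
sample = ["Poetry or verse", "Drama", "Drama ; Poetry or verse", None]
print(split_counts(sample).most_common())
```

Counting this way treats the combined cell as one occurrence of each genre, which may or may not be the desired behaviour for a given analysis.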

Length

Max length: 53
Median length: 15
Mean length: 12.29042068
Min length: 4

Characters and Unicode

Total characters: 24249
Distinct characters: 46
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 30
Unique (%): 1.5%

Sample

1st row: Song
2nd row: Poetry or verse
3rd row: Poetry or verse
4th row: Music
5th row: Poetry or verse

Common Values

Value                    Count  Frequency (%)
Poetry or verse          1002   1.9%
Drama                    461    0.9%
Drama ; Poetry or verse  151    0.3%
Travel                   77     0.1%
Periodical               39     0.1%
Diary                    36     0.1%
Gazetteer                22     < 0.1%
Directory                18     < 0.1%
Correspondence           18     < 0.1%
Fiction                  16     < 0.1%
Other values (54)        133    0.3%
(Missing)                50722  96.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
or1166
24.7%
verse1158
24.5%
poetry1158
24.5%
drama622
13.2%
190
 
4.0%
travel89
 
1.9%
diary46
 
1.0%
periodical41
 
0.9%
correspondence23
 
0.5%
gazetteer23
 
0.5%
Other values (56)205
 
4.3%

Most occurring characters

ValueCountFrequency (%)
r4483
18.5%
e3827
15.8%
2748
11.3%
o2562
10.6%
a1539
 
6.3%
t1312
 
5.4%
v1258
 
5.2%
y1252
 
5.2%
s1229
 
5.1%
P1209
 
5.0%
Other values (36)2830
11.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter19123
78.9%
Space Separator2748
 
11.3%
Uppercase Letter2167
 
8.9%
Other Punctuation191
 
0.8%
Decimal Number20
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r4483
23.4%
e3827
20.0%
o2562
13.4%
a1539
 
8.0%
t1312
 
6.9%
v1258
 
6.6%
y1252
 
6.5%
s1229
 
6.4%
m636
 
3.3%
i273
 
1.4%
Other values (14)752
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
P1209
55.8%
D698
32.2%
T95
 
4.4%
C32
 
1.5%
S31
 
1.4%
G26
 
1.2%
F16
 
0.7%
L14
 
0.6%
B12
 
0.6%
E12
 
0.6%
Other values (6)22
 
1.0%
Decimal Number
ValueCountFrequency (%)
010
50.0%
15
25.0%
85
25.0%
Other Punctuation
ValueCountFrequency (%)
;190
99.5%
'1
 
0.5%
Space Separator
ValueCountFrequency (%)
2748
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin21290
87.8%
Common2959
 
12.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
r4483
21.1%
e3827
18.0%
o2562
12.0%
a1539
 
7.2%
t1312
 
6.2%
v1258
 
5.9%
y1252
 
5.9%
s1229
 
5.8%
P1209
 
5.7%
D698
 
3.3%
Other values (30)1921
9.0%
Common
ValueCountFrequency (%)
2748
92.9%
;190
 
6.4%
010
 
0.3%
15
 
0.2%
85
 
0.2%
'1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII24249
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r4483
18.5%
e3827
15.8%
2748
11.3%
o2562
10.6%
a1539
 
6.3%
t1312
 
5.4%
v1258
 
5.2%
y1252
 
5.2%
s1229
 
5.1%
P1209
 
5.0%
Other values (36)2830
11.7%

Languages
Categorical

HIGH CARDINALITY

Distinct: 109
Distinct (%): 0.2%
Missing: 58
Missing (%): 0.1%
Memory size: 411.8 KiB
English
41214 
French
 
3855
German
 
3166
Spanish
 
768
Italian
 
660
Other values (104)
 
2974

Length

Max length: 51
Median length: 7
Mean length: 6.998537151
Min length: 5

Characters and Unicode

Total characters: 368382
Distinct characters: 49
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 27
Unique (%): 0.1%

Sample

1st row: English
2nd row: English
3rd row: English
4th row: English
5th row: English

Common Values

Value              Count  Frequency (%)
English            41214  78.2%
French             3855   7.3%
German             3166   6.0%
Spanish            768    1.5%
Italian            660    1.3%
Russian            578    1.1%
Dutch              551    1.0%
Hungarian          259    0.5%
Swedish            249    0.5%
Danish             230    0.4%
Other values (99)  1107   2.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
english41408
76.2%
french4132
 
7.6%
german3429
 
6.3%
spanish791
 
1.5%
772
 
1.4%
italian748
 
1.4%
russian633
 
1.2%
dutch632
 
1.2%
latin298
 
0.5%
hungarian287
 
0.5%
Other values (29)1237
 
2.3%

Most occurring characters

ValueCountFrequency (%)
n52515
14.3%
h47778
13.0%
i45023
12.2%
s44308
12.0%
l42378
11.5%
g41801
11.3%
E41408
11.2%
e8349
 
2.3%
r8160
 
2.2%
a7580
 
2.1%
Other values (39)29082
7.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter311661
84.6%
Uppercase Letter53502
 
14.5%
Space Separator1730
 
0.5%
Other Punctuation862
 
0.2%
Decimal Number360
 
0.1%
Open Punctuation90
 
< 0.1%
Close Punctuation90
 
< 0.1%
Dash Punctuation87
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n52515
16.9%
h47778
15.3%
i45023
14.4%
s44308
14.2%
l42378
13.6%
g41801
13.4%
e8349
 
2.7%
r8160
 
2.6%
a7580
 
2.4%
c4838
 
1.6%
Other values (12)8931
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
E41408
77.4%
F4166
 
7.8%
G3519
 
6.6%
S1077
 
2.0%
D891
 
1.7%
I761
 
1.4%
R640
 
1.2%
L301
 
0.6%
H289
 
0.5%
P269
 
0.5%
Other values (7)181
 
0.3%
Decimal Number
ValueCountFrequency (%)
190
25.0%
490
25.0%
590
25.0%
390
25.0%
Other Punctuation
ValueCountFrequency (%)
;772
89.6%
,90
 
10.4%
Space Separator
ValueCountFrequency (%)
1730
100.0%
Open Punctuation
ValueCountFrequency (%)
(90
100.0%
Close Punctuation
ValueCountFrequency (%)
)90
100.0%
Dash Punctuation
ValueCountFrequency (%)
-87
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin365163
99.1%
Common3219
 
0.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
n52515
14.4%
h47778
13.1%
i45023
12.3%
s44308
12.1%
l42378
11.6%
g41801
11.4%
E41408
11.3%
e8349
 
2.3%
r8160
 
2.2%
a7580
 
2.1%
Other values (29)25863
7.1%
Common
ValueCountFrequency (%)
1730
53.7%
;772
24.0%
,90
 
2.8%
(90
 
2.8%
190
 
2.8%
490
 
2.8%
590
 
2.8%
390
 
2.8%
)90
 
2.8%
-87
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII368382
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n52515
14.3%
h47778
13.0%
i45023
12.2%
s44308
12.0%
l42378
11.5%
g41801
11.3%
E41408
11.2%
e8349
 
2.3%
r8160
 
2.2%
a7580
 
2.1%
Other values (39)29082
7.9%

Notes
Categorical

HIGH CARDINALITY
MISSING

Distinct: 5667
Distinct (%): 86.2%
Missing: 46119
Missing (%): 87.5%
Memory size: 411.8 KiB
No more published
 
281
Published in part
 
50
Printed for private circulation
 
45
The titlepage is engraved
 
43
Privately printed
 
43
Other values (5662)
6114 

Length

Max length: 5684
Median length: 90
Mean length: 132.9221411
Min length: 3

Characters and Unicode

Total characters: 874096
Distinct characters: 235
Distinct categories: 13
Distinct scripts: 6
Distinct blocks: 12

Unique

Unique: 5506
Unique (%): 83.7%

Sample

1st row: One of an edition of 100 copies
2nd row: Wanting the back wrapper
3rd row: Other edition: The Minstrel; or the Progress of Genius ... The second book. pp. 32. E. & C. Dilly: London, 1774. 4º
4th row: Other edition: The haunch of venison, a poetical epistle to Lord Clare ... With a head of the author, drawn by Henry Bunbury Esq; and etched by Bretherton. Dublin: W. Whitestone, etc, 1776. pp. 15. 8º ; Other edition: The haunch of venison, a poetical epistle to Lord Clare ... With a head of the author, drawn by Henry Bunbury Esq; and etched by Bretherton. London: J. Ridley; G. Kearsly, 1776. pp. 19: plate. 4º ; The price on the half-title is 'One shilling'
5th row: Other edition: Retaliation: a poem ... Including epitaphs on the most distinguished wits of this metropolis. London: G. Kearsly, 1774. pp. 20. 4º ; With an engraved portraits on the titlepage. In this copy there is no engraved text below the portraits, and the error on p. 8 is corrected in manuscript Without pages 17-20 containing 'Explanatory notes and observations'

Common Values

Value                                   Count  Frequency (%)
No more published                       281    0.5%
Published in part                       50     0.1%
Printed for private circulation         45     0.1%
The titlepage is engraved               43     0.1%
Privately printed                       43     0.1%
With an additional titlepage, engraved  43     0.1%
Printed on one side of the leaf only    30     0.1%
Only 100 copies printed                 27     0.1%
Without pagination                      21     < 0.1%
A novel                                 13     < 0.1%
Other values (5657)                     5980   11.3%
(Missing)                               46119  87.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
the8345
 
5.7%
of6381
 
4.3%
5845
 
4.0%
edition5757
 
3.9%
other4918
 
3.3%
a3992
 
2.7%
3454
 
2.3%
london3370
 
2.3%
and3270
 
2.2%
in2654
 
1.8%
Other values (12798)99543
67.5%

Most occurring characters

ValueCountFrequency (%)
140953
16.1%
e69179
 
7.9%
o52698
 
6.0%
t51464
 
5.9%
i50585
 
5.8%
n48579
 
5.6%
r38560
 
4.4%
a38246
 
4.4%
.31932
 
3.7%
d28518
 
3.3%
Other values (225)323382
37.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter561459
64.2%
Space Separator140953
 
16.1%
Other Punctuation61853
 
7.1%
Uppercase Letter56659
 
6.5%
Decimal Number40712
 
4.7%
Other Letter4394
 
0.5%
Close Punctuation3276
 
0.4%
Open Punctuation3275
 
0.4%
Dash Punctuation1441
 
0.2%
Nonspacing Mark44
 
< 0.1%
Other values (3)30
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e69179
12.3%
o52698
9.4%
t51464
 
9.2%
i50585
 
9.0%
n48579
 
8.7%
r38560
 
6.9%
a38246
 
6.8%
d28518
 
5.1%
s28057
 
5.0%
h26613
 
4.7%
Other values (115)128960
23.0%
Uppercase Letter
ValueCountFrequency (%)
O5741
 
10.1%
L5352
 
9.4%
T4562
 
8.1%
C4137
 
7.3%
A4103
 
7.2%
S3648
 
6.4%
B3241
 
5.7%
W3065
 
5.4%
P2663
 
4.7%
M2539
 
4.5%
Other values (46)17608
31.1%
Other Letter
ValueCountFrequency (%)
º4368
99.4%
ת5
 
0.1%
י3
 
0.1%
ר2
 
< 0.1%
ג2
 
< 0.1%
מ2
 
< 0.1%
נ2
 
< 0.1%
ל2
 
< 0.1%
ז2
 
< 0.1%
ו2
 
< 0.1%
Other values (4)4
 
0.1%
Other Punctuation
ValueCountFrequency (%)
.31932
51.6%
,11771
 
19.0%
:8523
 
13.8%
'4282
 
6.9%
;3353
 
5.4%
&1759
 
2.8%
?135
 
0.2%
*47
 
0.1%
!39
 
0.1%
/11
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
810353
25.4%
19584
23.5%
23842
 
9.4%
42661
 
6.5%
72644
 
6.5%
62493
 
6.1%
92480
 
6.1%
32472
 
6.1%
52164
 
5.3%
02019
 
5.0%
Other Number
ValueCountFrequency (%)
²7
36.8%
7
36.8%
3
15.8%
1
 
5.3%
¹1
 
5.3%
Close Punctuation
ValueCountFrequency (%)
]2837
86.6%
)438
 
13.4%
1
 
< 0.1%
Modifier Letter
ValueCountFrequency (%)
ʹ5
71.4%
ʾ1
 
14.3%
ʺ1
 
14.3%
Nonspacing Mark
ValueCountFrequency (%)
21
47.7%
21
47.7%
̈2
 
4.5%
Open Punctuation
ValueCountFrequency (%)
[2837
86.6%
(438
 
13.4%
Space Separator
ValueCountFrequency (%)
140953
100.0%
Dash Punctuation
ValueCountFrequency (%)
-1441
100.0%
Math Symbol
ValueCountFrequency (%)
=4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin621588
71.1%
Common251540
28.8%
Cyrillic767
 
0.1%
Greek131
 
< 0.1%
Inherited44
 
< 0.1%
Hebrew26
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e69179
 
11.1%
o52698
 
8.5%
t51464
 
8.3%
i50585
 
8.1%
n48579
 
7.8%
r38560
 
6.2%
a38246
 
6.2%
d28518
 
4.6%
s28057
 
4.5%
h26613
 
4.3%
Other values (89)189089
30.4%
Cyrillic
ValueCountFrequency (%)
о74
 
9.6%
с62
 
8.1%
и51
 
6.6%
а50
 
6.5%
т45
 
5.9%
е38
 
5.0%
н38
 
5.0%
р32
 
4.2%
к31
 
4.0%
і27
 
3.5%
Other values (42)319
41.6%
Common
ValueCountFrequency (%)
140953
56.0%
.31932
 
12.7%
,11771
 
4.7%
810353
 
4.1%
19584
 
3.8%
:8523
 
3.4%
'4282
 
1.7%
23842
 
1.5%
;3353
 
1.3%
[2837
 
1.1%
Other values (27)24110
 
9.6%
Greek
ValueCountFrequency (%)
α17
13.0%
ο16
 
12.2%
ν11
 
8.4%
ς10
 
7.6%
ι8
 
6.1%
τ8
 
6.1%
υ7
 
5.3%
λ5
 
3.8%
ρ5
 
3.8%
σ4
 
3.1%
Other values (21)40
30.5%
Hebrew
ValueCountFrequency (%)
ת5
19.2%
י3
11.5%
ר2
 
7.7%
ג2
 
7.7%
מ2
 
7.7%
נ2
 
7.7%
ל2
 
7.7%
ז2
 
7.7%
ו2
 
7.7%
ע1
 
3.8%
Other values (3)3
11.5%
Inherited
ValueCountFrequency (%)
21
47.7%
21
47.7%
̈2
 
4.5%

Most occurring blocks

Value                   Count     Frequency (%)
ASCII                   867691    99.3%
Latin 1 Sup             5339      0.6%
Cyrillic                767       0.1%
None                    137       < 0.1%
Latin Ext A             77        < 0.1%
Half Marks              42        < 0.1%
Hebrew                  26        < 0.1%
Modifier Letters        7         < 0.1%
Greek Ext               6         < 0.1%
Diacriticals            2         < 0.1%
Other values (2)        2         < 0.1%

Most frequent character per block

ASCII
Value                   Count     Frequency (%)
(space)                 140953    16.2%
e                       69179     8.0%
o                       52698     6.1%
t                       51464     5.9%
i                       50585     5.8%
n                       48579     5.6%
r                       38560     4.4%
a                       38246     4.4%
.                       31932     3.7%
d                       28518     3.3%
Other values (69)       316977    36.5%
Latin 1 Sup
Value                   Count     Frequency (%)
º                       4368      81.8%
é                       372       7.0%
è                       151       2.8%
æ                       68        1.3%
ü                       68        1.3%
ö                       59        1.1%
ä                       53        1.0%
à                       50        0.9%
á                       23        0.4%
É                       19        0.4%
Other values (20)       108       2.0%
Hebrew
Value                   Count     Frequency (%)
ת                       5         19.2%
י                       3         11.5%
ר                       2         7.7%
ג                       2         7.7%
מ                       2         7.7%
נ                       2         7.7%
ל                       2         7.7%
ז                       2         7.7%
ו                       2         7.7%
ע                       1         3.8%
Other values (3)        3         11.5%
Latin Ext A
Value                   Count     Frequency (%)
œ                       21        27.3%
ī                       11        14.3%
ĭ                       8         10.4%
ę                       7         9.1%
ō                       4         5.2%
ś                       4         5.2%
ā                       3         3.9%
ą                       3         3.9%
ł                       3         3.9%
ć                       3         3.9%
Other values (8)        10        13.0%
Latin Ext Additional
Value                   Count     Frequency (%)
(glyph not rendered)    1         100.0%
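None
Value                   Count     Frequency (%)
α                       17        12.4%
ο                       16        11.7%
ν                       11        8.0%
ς                       10        7.3%
ι                       8         5.8%
τ                       8         5.8%
υ                       7         5.1%
(glyph not rendered)    7         5.1%
λ                       5         3.6%
ρ                       5         3.6%
Other values (19)       43        31.4%
Greek Ext
Value                   Count     Frequency (%)
(glyph not rendered)    1         16.7%
(glyph not rendered)    1         16.7%
(glyph not rendered)    1         16.7%
(glyph not rendered)    1         16.7%
(glyph not rendered)    1         16.7%
(glyph not rendered)    1         16.7%
Punctuation
Value                   Count     Frequency (%)
(glyph not rendered)    1         100.0%
Modifier Letters
Value                   Count     Frequency (%)
ʹ                       5         71.4%
ʾ                       1         14.3%
ʺ                       1         14.3%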
Cyrillic
Value                   Count     Frequency (%)
о                       74        9.6%
с                       62        8.1%
и                       51        6.6%
а                       50        6.5%
т                       45        5.9%
е                       38        5.0%
н                       38        5.0%
р                       32        4.2%
к                       31        4.0%
і                       27        3.5%
Other values (42)       319       41.6%
Half Marks
Value                   Count     Frequency (%)
(glyph not rendered)    21        50.0%
(glyph not rendered)    21        50.0%
Diacriticals
Value                   Count     Frequency (%)
̈                       2         100.0%

Digitised Record Match
Real number (ℝ≥0)

Distinct                52689
Distinct (%)            > 99.9%
Missing                 0
Missing (%)             0.0%
Infinite                0
Infinite (%)            0.0%
Mean                    2031536.837
Minimum                 37
Maximum                 19138278
Zeros                   0
Zeros (%)               0.0%
Negative                0
Negative (%)            0.0%
Memory size             411.8 KiB

Quantile statistics

Minimum                      37
5-th percentile              202740.8
Q1                           962667.5
median                       1989238
Q3                           3065901.5
95-th percentile             3882951
Maximum                      19138278
Range                        19138241
Interquartile range (IQR)    2103234

Descriptive statistics

Standard deviation                 1208999.294
Coefficient of variation (CV)      0.595115615
Kurtosis                           2.878658086
Mean                               2031536.837
Median Absolute Deviation (MAD)    1056158
Skewness                           0.4452978625
Sum                                1.070518336 × 10^11
Variance                           1.461679293 × 10^12
Monotonicity                       Not monotonic
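Several of the figures above follow arithmetically from the others. The sketch below recomputes them from the reported values as a consistency check. The variable names are illustrative, and the observation count of 52695 is taken from the dataset overview, given that this variable has no missing values.

```python
# Values copied from the report for "Digitised Record Match".
minimum, maximum = 37, 19138278
q1, q3 = 962667.5, 3065901.5
mean, std = 2031536.837, 1208999.294
n = 52695  # number of observations, none missing for this variable

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance
total = mean * n                 # Sum

print(value_range, iqr)
print(f"{cv:.6f} {variance:.4e} {total:.4e}")
```

Because the inputs are rounded as displayed in the report, the recomputed variance and sum agree with the reported figures only to the precision shown.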
Histogram with fixed size bins (bins=50)
Value                   Count     Frequency (%)
1091680                 6         < 0.1%
1091669                 2         < 0.1%
3996603                 1         < 0.1%
242145                  1         < 0.1%
211441                  1         < 0.1%
214859                  1         < 0.1%
214961                  1         < 0.1%
220903                  1         < 0.1%
222151                  1         < 0.1%
222955                  1         < 0.1%
Other values (52679)    52679     > 99.9%
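A common-values table of this shape can be sketched with `collections.Counter`: list the most frequent values, then fold everything else into an "Other values" row. The `common_values` helper and the short `ids` list are hypothetical; the list reuses a few values from the table above rather than the full column.

```python
from collections import Counter


def common_values(values, top=10):
    """Return (top rows, distinct values not shown, occurrences not shown)."""
    counts = Counter(values)
    total = len(values)
    # Each row is (value, count, frequency in percent).
    top_rows = [(v, c, 100.0 * c / total) for v, c in counts.most_common(top)]
    shown = sum(c for _, c, _ in top_rows)
    other_distinct = len(counts) - len(top_rows)
    other_count = total - shown
    return top_rows, other_distinct, other_count


# Hypothetical miniature column built from values in the table above.
ids = [1091680] * 6 + [1091669] * 2 + [3996603, 242145, 211441]
rows, other_distinct, other_count = common_values(ids, top=2)
for value, count, pct in rows:
    print(value, count, f"{pct:.1f}%")
print(f"Other values ({other_distinct})", other_count)
```

The same bookkeeping explains the report's totals: the ten listed values account for 16 of the 52695 observations, which leaves 52679 occurrences spread over 52679 other distinct values.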
Minimum 10 values
Value                   Count     Frequency (%)
37                      1         < 0.1%
196                     1         < 0.1%
206                     1         < 0.1%
216                     1         < 0.1%
218                     1         < 0.1%
428                     1         < 0.1%
472                     1         < 0.1%
478                     1         < 0.1%
480                     1         < 0.1%
481                     1         < 0.1%
Maximum 10 values
Value                   Count     Frequency (%)
19138278                1         < 0.1%
15757407                1         < 0.1%
15310067                1         < 0.1%
15309676                1         < 0.1%
15309664                1         < 0.1%
14869816                1         < 0.1%
13952747                1         < 0.1%
12811400                1         < 0.1%
11844530                1         < 0.1%
11844350                1         < 0.1%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
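One common way to compute a nullity correlation is to replace each column by a 0/1 indicator of whether the value is present and take the Pearson correlation of those indicators. The sketch below does this in plain Python. The `nullity_correlation` helper is an illustrative assumption rather than the plotting library's own code, and the three short columns are transcribed from the first ten sample rows, with `None` standing in for NaN.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is always present or always missing.
        return None
    return cov / sqrt(var_a * var_b)


# Name, Type of name and Dates associated with name for sample rows 0 to 9.
name = ["Yearsley, Ann", "A, T.", None, "Albert, Prince Consort", "Anslow, Robert",
        None, None, None, "Bellamy, James William", "Brabant, Henry, Sir"]
type_of_name = ["person", "person", None, "person", "person",
                None, None, None, "person", "person"]
dates = ["1753-1806", None, None, "1819-1861", None,
         None, None, None, None, None]

print(nullity_correlation(name, type_of_name))
print(nullity_correlation(name, dates))
```

In these ten rows Name and Type of name are missing together, so their nullity correlation is 1. Dates associated with name is present only when Name is present but is often missing on its own, which gives a weaker positive value.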

Sample

First rows

(index) | BL record ID | Type of resource | Name | Dates associated with name | Type of name | Role | All names | Title | Variant titles | Series title | Number within series | Country of publication | Place of publication | Publisher | Date of publication | Edition | Physical description | Dewey classification | BL shelfmark | Topics | Genre | Languages | Notes | Digitised Record Match
0 | 014602826 | Monograph | Yearsley, Ann | 1753-1806 | person | NaN | More, Hannah, 1745-1833 [person] ; Yearsley, Ann, 1753-1806 [person] | Poems on several occasions [With a prefatory letter by Hannah More.] | NaN | NaN | NaN | England | London | NaN | 1786 | Fourth edition MANUSCRIPT note | NaN | NaN | Digital Store 11644.d.32 | NaN | NaN | English | NaN | 003996603
1 | 014602830 | Monograph | A, T. | NaN | person | NaN | Oldham, John, 1653-1683 [person] ; A, T. [person] | A Satyr against Vertue. (A poem: supposed to be spoken by a Town-Hector [By John Oldham. The preface signed: T. A.]) | NaN | NaN | NaN | England | London | NaN | 1679 | NaN | 15 pages (4°) | NaN | Digital Store 11602.ee.10. (2.) | NaN | NaN | English | NaN | 000001143
2 | 014602831 | Monograph | NaN | NaN | NaN | NaN | NaN | The Aeronaut, a poem; founded almost entirely, upon a statement, printed in the newspapers, of a voyage from Dublin, in October, 1812 | NaN | NaN | NaN | Ireland | Dublin | Richard Milliken | 1816 | NaN | 17 pages (8°) | NaN | Digital Store 992.i.12. (3.) | Dublin (Ireland) | NaN | English | NaN | 000022782
3 | 014602832 | Monograph | Albert, Prince Consort, consort of Victoria, Queen of Great Britain | 1819-1861 | person | NaN | Plimsoll, Joseph [person] ; Albert, Prince Consort, consort of Victoria, Queen of Great Britain, 1819-1861 [person] | The Prince Albert, a poem [By Joseph Plimsoll.] | Appendix | NaN | NaN | NaN | Plymouth | W. Cann | 1868 | NaN | 16 pages (8°) | NaN | Digital Store 11602.ee.17. (1.) | NaN | NaN | English | NaN | 000039775
4 | 014602833 | Monograph | Anslow, Robert | NaN | person | NaN | Anslow, Robert [person] | The Defeat of the Spanish Armada, A.D. 1588. A tercentenary ballad, A.D. 1888 | NaN | NaN | NaN | England | London | Elliot Stock | 1888 | NaN | 40 pages (8°) | NaN | Digital Store 11602.ee.17. (7.) | NaN | NaN | English | NaN | 000092666
5 | 014602834 | Monograph | NaN | NaN | NaN | NaN | Swift, Jonathan, 1667-1745 [person] | A Familiar Answer to a Familiar Letter [In verse, addressed to Dean Swift?] | Appendix. I. Contemporary Satires, Eulogies, etc | NaN | NaN | England | London | NaN | 1720 | NaN | 7 pages (4°) | NaN | Digital Store 11602.ee.10. (5.) | NaN | NaN | English | NaN | 000093359
6 | 014602835 | Monograph | NaN | NaN | NaN | NaN | NaN | The Irish Home Rule Bill. A poetical pamphlet, etc | NaN | NaN | NaN | NaN | Calcutta | I. C. Bose | 1893 | NaN | 4 pages (8°) | NaN | Digital Store 11601.g.28. (3.) | NaN | NaN | English | NaN | 000150273
7 | 014602836 | Monograph | NaN | NaN | NaN | NaN | NaN | Confessions of a Coquette, while staying at Scarboro', Whitby, & Bridlington. By Azucena [In verse.] | NaN | NaN | NaN | England | Scarborough | E. T. W. Dennis | 1888 | NaN | 42 pages (8°) | NaN | Digital Store 11602.ee.17. (8.) | NaN | NaN | English | NaN | 000156011
8 | 014602837 | Monograph | Bellamy, James William | NaN | person | NaN | Bellamy, James William [person] | Jonah. The Seatonian Prize Poem for the year 1815 | NaN | NaN | NaN | England | London | Taylor & Hessey | 1815 | NaN | 28 pages (8°) | NaN | Digital Store 992.i.12. (1.) | NaN | NaN | English | NaN | 000261714
9 | 014602838 | Monograph | Brabant, Henry, Sir | NaN | person | NaN | Brabant, Henry, Sir [person] | The Eve of the Revolution; in Newcastle-upon-Tyne. (The Case of Sir Henry Brabant, knt, Mayor of Newcastle upon Tyne, most humbly offered to your Majesties Royal consideration.) | NaN | Reprints of Rare Tracts & Imprints, etc | volume 4 [Reprints of Rare Tracts & Imprints, etc] | NaN | Newcastle | M. A. Richardson | 1848 | NaN | 24 pages (8°) | NaN | Digital Store 1077.f.89 | NaN | NaN | English | One of an edition of 100 copies | 000445451

Last rows

(index) | BL record ID | Type of resource | Name | Dates associated with name | Type of name | Role | All names | Title | Variant titles | Series title | Number within series | Country of publication | Place of publication | Publisher | Date of publication | Edition | Physical description | Dewey classification | BL shelfmark | Topics | Genre | Languages | Notes | Digitised Record Match
52685 | 016289053 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The mill on the Floss ... Illustrated by J. Barnard Davis | The Mill on the Floss | NaN | NaN | England | London | Blackie | 1908 | Another edition, Illustrated by W. M. Bowles | ix, 438 pages, plates, 20 cm | NaN | Digital Store 012618.fff.6 | NaN | NaN | English | The edition of 1904, republished, with different preliminaries and plates | 004117445
52686 | 016289054 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The Mill on the Floss ... Illustrated by T. H. Robinson | The Mill on the Floss | NaN | NaN | England ; Scotland | Edinburgh ; London | Thomas Nelson | 1928 | Another edition | 589 pages, plates, 20 cm | NaN | Digital Store 012603.c.6 | NaN | NaN | English | The edition of [1919], republished, with the addition of frontispiece | 004117454
52687 | 016289055 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The Mill on the Floss ... Illustrated by T. H. Robinson | The Mill on the Floss | NaN | NaN | England | London | Daily Express Publications | 1933 | Another edition | 511 pages, plates, portraits, 19 cm | NaN | Digital Store 12602.p.7 | NaN | NaN | English | NaN | 004117456
52688 | 016289056 | Monograph | Eliot, George | 1819-1880 | person | NaN | Eliot, George, 1819-1880 [person] | The Mill on the Floss ... Illustrated by T. H. Robinson | The Mill on the Floss | NaN | NaN | England | London | Dean | 1936 | Another edition | 377 pages, plates, 21 cm | NaN | Digital Store 012604.l.3 | NaN | NaN | English | NaN | 004117457
52689 | 016289057 | Monograph | Garstang, Walter, M.A., F.Z.S. | NaN | person | NaN | Garstang, Walter, M.A., F.Z.S. [person] ; Shepherd, J. A. (James Affleck), 1867-approximately 1931 [person] | Songs of the Birds ... With illustrations by J.A. Shepherd | NaN | NaN | NaN | England | London | John Lane | 1922 | NaN | 101 pages, illustrations (8°) | 598.259 | Digital Store 011648.g.133 | NaN | NaN | English | Poems, with and introductory essay | 004158005
52690 | 016289058 | Monograph | Dickens, Charles | 1812-1870 | person | NaN | Dickens, Charles, 1812-1870 [person] | The posthumous papers of the Pickwick Club | Pickwick papers | NaN | NaN | England | Liverpool | World's Best Library | NaN | NaN | xvi, 610 pages, illustrations, 20 cm | 823.8 | NaN | England--Social life and customs--19th century--Fiction ; Men--England--Societies and clubs--Fiction | NaN | English | Spine title: The Pickwick papers | 008594906
52691 | 016289059 | Serial | NaN | NaN | NaN | NaN | NaN | TRUE STORY CLASSICS | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | English | NaN | 011143842
52692 | 016289060 | Monograph | Wellesley, Dorothy | 1889-1956 | person | NaN | Wellesley, Dorothy, 1889-1956 [person] | Early Poems. By M. A [i.e. Dorothy Violet Wellesley, Lady Gerald Wellesley.] | NaN | NaN | NaN | England | London | Elkin Mathews | 1913 | NaN | vii, 90 pages (8°) | NaN | Digital Store 011649.eee.17 | NaN | NaN | English | NaN | 000000839
52693 | 016289061 | Monograph | A, T. H. E. | NaN | person | NaN | A, T. H. E. [person] | Of Life and Love [Poems.] By T. H. E. A, writer of 'The Message.' | NaN | NaN | NaN | England | London | J. M. Watkins | 1924 | NaN | 89 pages (8°) | NaN | Digital Store 011645.e.125 | NaN | NaN | English | NaN | 000001167
52694 | 016289062 | Monograph | Abbay, Richard | NaN | person | NaN | Abbay, Richard [person] | Life, a Mode of Motion; or, He and I, my two selves [A poem.] | NaN | NaN | NaN | England | London | Jarrold | 1919 | NaN | volumes, 58 pages (8°) | NaN | Digital Store 011649.g.81 | NaN | NaN | English | NaN | 000003140