Finn Hedefalk1,2, Patrick Svensson2, Lars Harrie1.
Abstract
This paper presents datasets that enable historical longitudinal studies of micro-level geographic factors in a rural setting. Datasets of this type are new, as historical demography studies have generally failed to properly include micro-level geographic factors. Our datasets describe the geography of five Swedish rural parishes, and by linking them to a longitudinal demographic database, we obtain a geocoded population (at the property unit level) for this area for the period 1813-1914. The population is a subset of the Scanian Economic Demographic Database (SEDD). The geographic information includes the following feature types: property units, wetlands, buildings, roads and railroads. The property units and wetlands are stored in object-lifeline time representations (information about the creation, changes and ends of objects is recorded in time), whereas the other feature types are stored as snapshots in time. Thus, the datasets present one of the first opportunities to study historical spatio-temporal patterns at the micro-level.
Year: 2017 PMID: 28398288 PMCID: PMC5387924 DOI: 10.1038/sdata.2017.46
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1. The five parishes (Hög, Kävlinge, Sireköpinge, Halmstad and Kågeröd) covered in the datasets.
Number of map documents digitized from historical maps.
| Map series | Period | No. of map documents | Scale | Digitized feature types |
|---|---|---|---|---|
| Land Survey Maps (LSMs) | 1757–1863 | 39 | 1:4,000–1:8,000 | Property units, buildings |
| Military Topographical Survey Maps (MTSMs) | 1812–1820 | 11 | 1:20,000 | Roads, buildings, streams, wetlands, lakes |
| Topographic Maps (TMs) | 1860–1865 | 2 | 1:100,000 | Roads, buildings |
| Economic Maps (EMs) | 1910–1915 | 7 | 1:20,000 | Property units, buildings, roads, railroads |
| Cadastral Dossiers (CDs) | 1757–1914 | Approximately 150 | 1:1,000–1:8,000 | Property units |
Figure 2. Four maps that show Videröra mansion at the border of Halmstad parish.
(a) LSM from 1831. (b) MTSM from 1812–1820. (c) TM from 1860. (d) EM from 1910–1915. (Source: ref. 17).
Figure 3. Property units and address units in 1871 in Halmstad parish.
Property units that share an address (identical colour in the map) constitute one address unit. For example, the two property units with the address Hasslebacken 07 constitute one address unit. Only the address units are labelled.
Figure 4. Changes of address units and property units for the period 1813–1914.
(a–d) Percentage of address units constituted by a given number of property units for the years 1813, 1850, 1880 and 1914; the values on the bars represent the number of address units with the specific number of property units. (e) Total number of property units for the years 1813, 1850, 1880 and 1914.
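The counts behind panels (a–d) can be reproduced in outline by grouping property units on their address unit identifier. The following is a minimal sketch, assuming the rows have already been restricted to the property units that exist in the year of interest and that each row carries the AuId attribute of property_units_SEDD.shp; the function name and the sample rows are hypothetical and not taken from the dataset.

```python
from collections import Counter


def address_unit_size_shares(property_units):
    """Map 'number of property units per address unit' to (count, percent)."""
    # Count how many property units belong to each address unit.
    units_per_address = Counter(pu["AuId"] for pu in property_units)
    # Count how many address units have each size.
    size_counts = Counter(units_per_address.values())
    total = sum(size_counts.values())
    return {
        size: (count, 100.0 * count / total)
        for size, count in sorted(size_counts.items())
    }


# Hypothetical rows for one year, not taken from the dataset.
sample = [
    {"PuId": "pu-1", "AuId": "au-1"},
    {"PuId": "pu-2", "AuId": "au-1"},
    {"PuId": "pu-3", "AuId": "au-2"},
    {"PuId": "pu-4", "AuId": "au-3"},
]
shares = address_unit_size_shares(sample)
```

In this sample, two address units consist of a single property unit and one consists of two, so `shares` holds one entry per size with its count and percentage.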
Figure 5. Property units and wetlands (blue) in object-lifeline representations.
(a) The parishes Sireköpinge, Halmstad and Kågeröd. (b) Enlarged area of some property units (including Bångstorp 01).
Overview of the datasets.
‘No. of objects’ represents the number of geographic objects such as building points or road segments.
| Feature type | File name | Geometry | Time representation | No. of objects |
|---|---|---|---|---|
| Property Units | property_units_SEDD.shp | Polygon | Object-lifelines | 1,165 |
| Wetlands | Wetlands.shp | Polygon | Object-lifelines | MTSM: 364; Modern: 195 |
| Buildings | Buildings_polygons.shp | Polygon | Snapshots | LSM: 385; MTSM: 438 |
| Buildings | Buildings_points.shp | Point | Snapshots | TM: 638; EM: 4,216 |
| Roads | Roads.shp | Line | Snapshots | MTSM: 839; TM: 433; EM: 2,060 |
| Railroads | Railroads.shp | Line | Snapshots | EM: 150 |
| Streams | Streams.shp | Line | Snapshots | MTSM: 99 |
| Lakes | Lakes.shp | Polygon | Snapshots | MTSM: 40 |
Property Units (property_units_SEDD.shp).
| Attribute | Data type | Description |
|---|---|---|
| FID | Integer | Shapefile automatic identifier |
| Shape | Geometry | Geometry of the object (polygon). |
| PolygonId | String | Unique identifier of the polygon. Created by the authors. |
| PuId | String | Property unit identifier created by the authors. This identifier establishes links on the property unit level between the geographic data and the individuals in the SEDD (cf. section ‘Links to the Scanian Economic Demographic Database (SEDD)’). |
| AuId | String | Address unit identifier created by the authors. This identifier establishes links on the address unit level between the geographic data and the individuals in the SEDD (cf. section ‘Links to the Scanian Economic Demographic Database (SEDD)’). |
| Parish | String | The parish that the property unit belongs to. |
| addrName | String | Address name (e.g., ‘Hög’). |
| addrCode | String | Address number (e.g., 04). |
| addrLetter | String | Address part (e.g., ‘B’ or ‘1’). |
| sDate | String | Start date of the object (usually only the year). |
| eDate | String | End date of the object (usually only the year). |
| sDateMin | String | The earliest start date (usually only the year), which represents an uncertainty interval of the object’s lifeline. That is, the object may exist as early as indicated in this attribute. |
| eDateMax | String | The latest end date (usually only the year), which represents an uncertainty interval of the object’s lifeline. That is, the object may exist as late as indicated in this attribute. |
| obsDate | String | Date of observation (=date of the map from which the object was digitized) (usually only the year). |
| mapCode | String | Code of the historical map from which the object was digitized. From the Lantmäteriet historical map archive. |
| mapName | String | Name of the historical map from which the object was digitized. From the Lantmäteriet historical map archive. |
| mapSeries | String | Name of the map series of the historical map from which the object was digitized. |
| taxValue | Float | Taxation value of the property unit (Swedish: |
| propForm | String | Type of property formation that created or altered the property unit (In Swedish; e.g., |
| propFormEn | String | English translation of the propForm attribute (e.g., subdivision). |
| ownerFNObs | String | First name of the owner of the property unit when observed in the map. |
| ownerLNObs | String | Last name of the owner of the property unit when observed in the map. |
| Subtype | String | Type of property unit (e.g., croft). |
| notePU | String | Comments made about the property unit when it was digitized (sometimes a transcription of a text part in the map document or cadastral dossier). This information is primarily relevant if the property units were to be improved or changed. |
| noteTimEst | String | Comments about the lifeline estimation of the property unit/object. This information is primarily relevant if the lifeline estimation were to be improved. |
| Shape_Leng | Float | Circumference of the property unit polygon (meters). |
| Shape_Area | Float | Area of the property unit polygon (square meters). |
| parishCode | String | National parish codes from Statistics Sweden (SCB). |
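Because the property units are stored as object-lifelines, a snapshot for a single year can be derived by keeping the objects whose lifeline covers that year. The sketch below illustrates this under several assumptions that are not stated in the source: sDate and eDate are treated as strings that begin with a four-digit year, the interval is treated as inclusive at both ends, and objects without a parsable date are skipped. The helper names and the sample rows are hypothetical.

```python
def year_of(date_str):
    """Return the leading four-digit year of a date string, or None if absent."""
    text = (date_str or "").strip()
    return int(text[:4]) if len(text) >= 4 and text[:4].isdigit() else None


def units_in_year(records, year):
    """Select the records whose lifeline [sDate, eDate] covers the given year."""
    selected = []
    for rec in records:
        start, end = year_of(rec.get("sDate")), year_of(rec.get("eDate"))
        if start is None or end is None:
            continue  # assumption: skip objects without a usable lifeline
        if start <= year <= end:  # assumption: both ends are inclusive
            selected.append(rec)
    return selected


# Hypothetical attribute rows, not taken from the dataset.
sample = [
    {"PuId": "pu-1", "sDate": "1813", "eDate": "1850"},
    {"PuId": "pu-2", "sDate": "1851", "eDate": "1914"},
    {"PuId": "pu-3", "sDate": "1830-05-01", "eDate": "1914"},
]
snapshot_1840 = [rec["PuId"] for rec in units_in_year(sample, 1840)]
```

A more cautious variant could use sDateMin and eDateMax instead of sDate and eDate, which would include every object that may have existed in the given year.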
Wetlands in object-lifelines (Wetlands.shp).
| Attribute | Data type | Description |
|---|---|---|
| FID | Integer | Shapefile automatic identifier. |
| Shape | Geometry | Geometry of the object (polygon). |
| wetlandId | String | Unique identifier of the wetland. The same identifier is assigned to those overlapping historical and modern wetland polygons that represent the same wetland (but with different geometric shapes because of, e.g., drainage). Created by the authors. |
| sDateMin | Integer | The earliest start date (usually only the year), which represents an uncertainty interval of the object’s lifeline. That is, the object may exist as early as indicated in this attribute. |
| sDate | Integer | Specific start date (usually only the year). Almost always used when a joint drainage unit explicitly determines the start date of a modern wetland. For example, if a wetland was partially drained in 1889 and an overlapping, smaller wetland can be observed in the modern data, this wetland is assigned the start date 1890. That is, we assume that at this point in time the modern wetland got its present geometric shape. The value ‘9999’ indicates that the wetland starts within the interval defined by sDateMin and sDateMax. |
| sDateAvg | Integer | An approximate start date for the wetlands observed in the modern data (usually only the year). Calculated as sDateMin+(eDateMin−sDateMin)/2. For example, if a wetland observed in the modern data has an earliest start date of 1821 and an earliest end date of 2007 (the year it was digitized), its sDateAvg=1914. |
| sDateMax | Integer | The latest possible start date. Most often the year when the wetland was observed in the map. |
| eDateMin | Integer | The earliest possible end date (usually only the year), which represents an uncertainty interval of the object’s lifeline. That is, the object exists at least until the date indicated in this attribute. This date often corresponds to the observation in the map. For example, if a wetland was digitized from the MTSM map in 1820, eDateMin is set to 1820. |
| eDate | Integer | A specific date, which most often indicates the year when a joint drainage was carried out on the wetland (usually only the year). The value ‘9999’ means that no joint drainage has been observed and that the wetland therefore ceases to exist within the interval eDateMin to eDateMax. |
| eDateAvg | Integer | Used mostly for the historical wetlands observed in the 1820 MTSM map (usually only the year). Calculated as: sDateMax+(eDateMin−sDateMax)/2. If eDate is a specific date, eDateAvg is assigned the value of eDate. |
| eDateMax | Integer | The latest possible end date (usually only the year). The value ‘9999’ is used for the wetlands observed in the modern data and means ‘and onwards’ (i.e., the wetland still exists and we do not know when it will cease to exist). |
| obsDate | Integer | Date of observation (=date of the map from which the wetland was digitized) (usually only the year). |
| Source | String | The source of the wetland object. |
| wLandType | String | Type of wetland (only for wetlands observed in the modern data). Marshy or Very marshy. |
| Note | String | Note about the wetland object. This information is mainly relevant if the lifeline estimation were to be improved. |
| drainName | String | Name of the joint drainage related to the wetland. |
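The sDateAvg formula and the ‘9999’ sentinel described above can be expressed in a few lines. This is a minimal sketch rather than the authors' procedure: the integer division is an assumption about rounding, and the fallback rule in `start_year` (use sDate when it is specific, otherwise sDateAvg) is an illustrative choice that the source does not prescribe. All function names are hypothetical.

```python
UNKNOWN = 9999  # sentinel used in Wetlands.shp when no specific date is known


def s_date_avg(s_date_min, e_date_min):
    """Approximate start date: sDateMin + (eDateMin - sDateMin) / 2."""
    # Assumption: halves are truncated to whole years.
    return s_date_min + (e_date_min - s_date_min) // 2


def start_year(s_date, s_date_avg_value):
    """Assumed rule: prefer a specific sDate, else fall back to sDateAvg."""
    return s_date_avg_value if s_date == UNKNOWN else s_date


# Worked example from the sDateAvg description: 1821 + (2007 - 1821) / 2.
example = s_date_avg(1821, 2007)
```

With the values from the attribute description, `example` evaluates to 1914, which agrees with the sDateAvg stated in the table.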
Buildings (Buildings_polygons.shp, Buildings_points.shp).
| Attribute | Data type | Description |
|---|---|---|
| FID | Integer | Shapefile automatic identifier. |
| Shape | Geometry | Geometry of the object (polygon/point). |
| Parish | String | The parish that the building is located in. |
| noteBuild | String | Note about the building. |
| addrName | String | Address name (e.g., ‘Hög’, or ‘Kågeröd church’). |
| addrCode | String | Address number (e.g., 04). |
| addrLetter | String | Address part (e.g., ‘B’ or ‘1’). |
| obsDate | String | Date of observation (=date of the map from which the object was digitized) (usually only the year). |
| mapCode | String | Code of the historical map from which the building was digitized. From the Lantmäteriet historical map archive. |
| mapName | String | Name of the historical map from which the building was digitized. From the Lantmäteriet historical map archive. |
| mapSeries | String | Name of the map series of the historical map from which the building was digitized. |
| subType | String | Type of building (e.g., church). |
| nameOnMap | String | Name or number on the historical map. |
| buildingId | String | Unique identifier of the building. Created by the authors. |
| parishCode | String | National parish codes from Statistics Sweden (SCB). |
Roads and railroads (Roads.shp, Railroads.shp).
| Attribute | Data type | Description |
|---|---|---|
| FID | Integer | Shapefile automatic identifier. |
| Shape | Geometry | Geometry of the object (line). |
| obsDate | String | Date of observation (=date of the map from which the object was digitized) (usually only the year). |
| mapCode | String | Code of the historical map from which the object was digitized. From the Lantmäteriet historical map archive. |
| mapName | String | Name of the historical map from which the object was digitized. From the Lantmäteriet historical map archive. |
| mapSeries | String | Name of the map series of the historical map from which the object was digitized. |
| subType | String | Type of road (e.g., passage way, regional way) or railroad (undefined). |
| roadId/railId | String | Unique identifier of the road/railroad segment. Created by the authors. |
| noteRoad/noteRail | String | Note about the road/railroad. |
Streams and lakes (Streams.shp, Lakes.shp).
| Attribute | Data type | Description |
|---|---|---|
| FID | Integer | Shapefile automatic identifier. |
| Shape | Geometry | Geometry of the object (line or polygon). |
| lakeId/streamId | String | Unique identifier of the lake or stream. Created by the authors. |
| obsDate | String | Date of observation (=date of the map from which the object was digitized) (usually only the year). |
| Name | String | Name of the object. |
| mapCode | String | Code of the historical map from which the object was digitized. From the Lantmäteriet historical map archive. |
| mapName | String | Name of the historical map from which the object was digitized. From the Lantmäteriet historical map archive. |
| mapSeries | String | Name of the map series from which the object was digitized. |
| subType | String | Type of lake/stream. |
| noteStream/noteLake | String | Note about the stream/lake. |
| Shape_Leng | Float | Length of the stream segment/circumference of the lake polygon (in meters). |
| Shape_Area | Float | Area of the lake polygon (in square meters) (only for Lakes.shp). |
Figure 6. Mean BOS accuracy results for the historical property units (HB) compared to the modern property units (RB).
The error bars represent the s.d.
Positional accuracy—property unit centroids.
| 16.7 | 14.4 | 10.3 | 11.4 | 1.0 | 59.4 | 40.1 |
Match rates for each geocoding level and parish.
The match rate represents the percent of geocoded person-years.
| Parish | Period | No. of individuals | No. of person-years | Match rate, property unit level (%) | Match rate, address unit level (%) |
|---|---|---|---|---|---|
| Hög | 1813–1848 | 1,398 | 9,108 | 87.8 | 97.5 |
| Hög | 1849–1914 | 3,408 | 33,035 | 83.8 | 97.9 |
| Kävlinge | 1813–1848 | 1,898 | 11,272 | 71.5 | 96.3 |
| Kävlinge | 1849–1914 | 9,507 | 66,486 | 63.2 | 91.7 |
| Sireköpinge | 1813–1848 | 2,388 | 18,337 | 0 | 86.9 |
| Sireköpinge | 1849–1914 | 10,353 | 80,500 | 62.5 | 96.4 |
| Halmstad | 1813–1848 | 2,427 | 18,833 | 58.8 | 75.21 |
| Halmstad | 1849–1914 | 6,806 | 56,397 | 54.8 | 94.9 |
| Kågeröd | 1813–1848 | 5,165 | 55,997 | 27.1 | 31.2 |
| Kågeröd | 1849–1914 | 10,850 | 103,175 | 76.9 | 94.2 |
| All parishes | 1813–1848 | 12,826 | 113,548 | 37.3 | 45.4 |
| All parishes | 1849–1914 | 39,350 | 339,595 | 66.6 | 92.2 |
*The land reforms had not been conducted in Sireköpinge for the period 1813–1848. The individuals can be geocoded on the address level with the given match rate because most individuals lived within the village and did not live in their property units.
†The total number of individuals in this dataset for the period 1813–1914 is approximately 45,000, which is less than the 53,000 individuals mentioned in ref. 7. The reason is that the latter number includes all individuals with at least one event registered, whereas the former and publicly open dataset excluded individuals with only one event registered (individuals with only a birth, a death or a migration registered).
‡These match rates are low because some of the land reforms had not been implemented in this period. Thus, people still lived within the villages and cultivated nearby scattered plots.
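Since the match rate is defined as the percent of geocoded person-years, it can be sketched as a ratio over per-individual exposure. The example below is illustrative only: the input layout (one pair of person-years and a geocoded flag per individual), the function name and the sample values are assumptions, and the handling of an empty input is a design choice rather than something taken from the source.

```python
def match_rate(person_years):
    """Percent of person-years that could be linked to a geographic unit."""
    total = sum(years for years, _ in person_years)
    if total == 0:
        return 0.0  # assumption: report 0 when there is no exposure at all
    geocoded = sum(years for years, is_geocoded in person_years if is_geocoded)
    return 100.0 * geocoded / total


# Hypothetical (person-years, geocoded?) pairs, one per individual.
sample = [(10.0, True), (5.0, False), (5.0, True)]
rate = match_rate(sample)
```

Weighting by person-years rather than by individuals means that a person observed for many years contributes more to the rate than a person observed briefly.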