Literature DB >> 30002291

The Occurrence of 275 Rare Diseases and 47 Rare Disease Groups in Italy. Results from the National Registry of Rare Diseases.

Domenica Taruscio1, Luciano Vittozzi2, Adele Rocchetti3, Paola Torreri4, Luca Ferrari5.   

Abstract

Knowledge of rare diseases (RD) is often scattered among many data collections and registries of patient cohorts. Therefore, assessing the burden of RD in the general population, developing appropriate policies and planning services for the care of RD patients is difficult. This study aimed at providing a systematic picture of RD occurrence in a population as big as 60 million. Data of diagnoses were certified and collected by a network of 247 specialized centres covering the whole Italian territory. Data received (about 200,000 records) were validated according to formal criteria and, where necessary, corrected by the data sources. Data of age at onset and sex distribution are given for about 400 diseases. Incidence and/or birth prevalence are given for 275 diseases and 47 disease groups, which, altogether, comprise a substantial part of the known rare diseases. Data quality, internal consistency, and external validity of the database have also been assessed and ways to limit the impact of some discrepancies were devised. The information provided by RNMR, cutting across such a wide range of RD, represents a unique coherent basis allowing the prioritization of relevant public health measures and research activities.

Entities:  

Keywords:  Italy; data quality; epidemiology; national registry; rare diseases

Mesh:

Year:  2018        PMID: 30002291      PMCID: PMC6068991          DOI: 10.3390/ijerph15071470

Source DB:  PubMed          Journal:  Int J Environ Res Public Health        ISSN: 1660-4601            Impact factor:   3.390


1. Introduction

Rare diseases (RD) are complex diseases with a very low prevalence (in the EU, a prevalence of less than 5/10,000 in the general population is considered). These conditions are currently estimated to be as many as 6000–8000 and the International Classification of Diseases codes a very limited number of them. Consequently, hospital health records are often not the appropriate sources to get information on RD. Therefore, knowledge on RD is often based on data collections and registries resulting from academic and commercial interests. ORPHANET censused a total of more than 700 registries and databases on RD [1] involving European scientists. They address a variety of aims and differ in their organization, quality and database structure, usually monitoring one disease or a group of related diseases [2]. Moreover, many studies are based on hospital data in regions with a higher prevalence [3]. This situation results in the difficulty of assessing the burden of RD in the general population and developing appropriate priority policies for the care of RD patients and the planning of health and social care services. To overcome these difficulties and support the development of knowledge on RD, the USA NIH-NCATS launched the Global RD Patient Registry and Data Repository—GRDR [4], and the EU Council issued a Recommendation urging Member States to support specific disease information networks, registries, and databases [5]. The EU also supported several initiatives addressing the better exploitation of RD patient data [6,7,8,9,10,11]. All these efforts envisage a system addressed to the registration of many, not to say all, RD and to the improvement of data interoperability. The need for efficient and extensive registries for RDs is also made urgent by international cooperations, such as IRDiRC [12] and the European Reference Networks [13], as well as by the establishment of National Registries [14]. The specificities of RD and the special needs of RD patients were recognized in Italy in 2001 when the National Network of Accredited Centres for the assistance and care of RD patients was established by the Ministerial Decree (MD) 279/2001 [15]. This Decree mandated the notification of new cases to the National Registry of Rare Diseases (RNMR), through Regional Registries (RR). The RNMR central database was placed in the National Institute of Health and has been managed by the National Centre for Rare Diseases since its establishment. This registry can be of interest to the current efforts of promoting data comparability and data retrieval across registries for many reasons. RNMR is organized as a devolved system of 17 regional registries, two registries of Autonomous Provinces, and one interregional registry (Piemonte and Valle d’Aosta), which cover populations spanning from about 10.0 to 0.3 million residents. They are controlled by regional or provincial authorities, which make independent decisions and have their own peculiar needs and resources regarding the delivery of care. Therefore, RNMR may represent a pilot on the feasibility and use of the cooperation among national registries, which will seemingly be developed by mainly following the needs of different countries. Moreover, the set of data collected in the central repository very closely resembles the mandatory Common Data Set proposed by EPIRARE [16], which aimed at building synergies among public health authorities, researchers, patients, and the industry for fulfilling their information needs. To this aim, the EPIRARE Common Data Set was devised to support the production of indicators useful for decision-making in public health and research on RD [6,16,17], as well as to fulfill some patients’ information needs. A recent publication by a regional registry which contributes to RNMR, provided some indicator data relevant to RD and their care [18]. After achieving the full coverage of the national territory in 2011 monitoring, individually, 284 RD and, collectively, many more RD distinguished in 47 groups, it is now possible to use the data contained in RNMR. In this paper, we aim to provide a detailed and systematic picture of the epidemiology of RD in Italy, with particular reference to patients requiring RD dedicated assistance and care. The results have been obtained from 247 accredited hospitals, which cover the whole national territory and serve a population of 60.4 million residents. This population is characterized by a sex ratio of 0.94 (males:females), while the youth (0–14 year) and 65+ people make up 13.9% and 21.4%, respectively. In 2017, non-European foreigners were mostly from Western and Northern Africa (1.70% residents), Eastern and Central-Southern Asia (1.66%), and Latin America (0.58), with all other origins accounting for 0.22%. Since this is the first time that an analysis of RNMR data has been published, special attention has been devoted to assessing and describing the validity of the data collected.

2. Materials and Methods

2.1. Set of Data Communicated to RNMR and Collection Procedures

The data communicated to RNMR by the RR and used for further elaborations were: Personal identifiers of the patient: name, surname, place and date of birth, sex, and national ID code (represented by the Fiscal Code and by an Encrypted Univocal Patient Code derived from it) Patient place of residence Disease diagnosed (defined according to the exemption codes listed in MD 279/2001 [15]) Center making the diagnosis, with geographical location (coded according to the National Health Service Information System [19]) Date of disease onset Date of diagnosis Data, which referred to the time point when the RD diagnosis is certified by an Accredited Centre, was communicated immediately to its RR by the Accredited Centre itself. This procedure was formally agreed with the regional authorities in 2007 [20] and ensures that diagnoses are notified based on the most reliable diagnostic protocols and best expert advice. Following this agreement, two slightly different record structures were developed by the centers to better serve their regional health service organization. In some regions, the record reported the centre which ascertained the RD diagnosis for the first time and the Accredited Centre notifying the case, while, in other regions, the record indicated the Accredited Centre entitled to formally certify the diagnosis and notify the case. Correspondingly, the date of diagnosis was reported as the date of the first ascertainment of the diagnosis or as the date on which the diagnosis was formally certified and notified. Piemonte, Valle d’Aosta, and Lombardia communicated both the dates of diagnosis ascertainment and certification, while Sardinia switched from the former to the latter record structure in 2012–2013. The date of diagnosis is a main reference for data selection and elaboration. Therefore, data were kept distinct: records characterized by the diagnosis ascertainment date are indicated as “AD notifications” and the regions using this record structure are referred to as “AD regions”. The records characterized by the diagnosis certification date are indicated as “CD notifications” and the related regions as “CD regions”. The coexistence of two notification practices had an important bearing on the epidemiological data. Indeed, as it will be shown in more detail in the results section, frequency data of a disease in a regional population were usually a mix of data notified by AD and CD regions, due to the migration of patients from the residence region to another region, which could confirm the RD diagnosis and notified the case. Finally, Friuli-Venezia Giulia (FVG) communicated no personal identifiers in compliance with local personal data protection regulations. Procedures of data collection and communication have been described elsewhere [21]. The collected data set and the data flow were established and made mandatory by the supporting MD 279/2001 [15]. Therefore, no patient consent nor ethical approval was required. Data were processed according to the EU and Italian data protection regulations in all participating structures. Data processing procedures were communicated to the National Authority for Personal Data Protection and their approval was obtained.

2.2. Aims, Procedures and Limitations of Data Curation

Data curation aimed at spotting erratic or inconsistent data, at discarding records that referred to cases diagnosed after the end of the reference period (31 December 2014) and at detecting and managing duplicate records. Data communicated by RR were checked upon their reception by the National Centre of Rare Diseases. Regional duplicates (i.e., records sent by the same region, referred to the same national ID code) and records showing data not passing the validation criteria (detailed in Supplementary File 1), were fed back to the experts responsible for pertaining the RR, for them to perform the necessary checks, even with the primary data sources, and to confirm or correct the data. After receiving the data sets resulting from these controls, the residual mistakes and discrepancies were recorded; then, the name, surname, place of birth, and Fiscal Code of the patient were removed by all records and the place of residence of the patient was substituted by her/his region of residence; finally, the regional data sets were merged into a single file. In this file, national duplicates (i.e., records with the same Encrypted Univocal Patient Code) were detected, classified, and managed to obtain the subsets to be used according to the scope of the analyses, as reported in Supplementary File 1 (Table S1. Control of duplicate records and curation of demographic data from Friuli-Venezia Giulia (FVG) could not be carried out. Data curation was accomplished by means of an Excel model.

2.3. Symptoms Onset Age and Distribution between Sexes

These features have been calculated from the data of Subset 1 for diseases with at least four records with valid data. For these analyses, data of individual diseases identified with their specific denomination were used, including those coded with group codes.

2.4. Birth Prevalence and Incidence of Rare Diseases

The data used for the calculation of the incidence rates refer to cases with date of diagnosis ascertainment or certification included in the period from 1 January 2012 to 31 December 2014. The data selected for the assessment of birth prevalence (BP) were limited to cases born between 1 January 2012 and 31 December 2013 and with diagnoses ascertained or certified during the first year of life of the case. In these calculations, the dates of diagnosis ascertainment were only considered in the records from Piemonte, Val d’Aosta, and Lombardia. Records were analysed by codes, so that diseases coded with group codes were not analysed individually. The calculations of regional incidence and BP rates were carried out as follows. It was assumed that differences in the frequency of each pathology, as resulting from either AD or CD notifications, were mainly due to the different local sensitivities of two notification practices rather than to changes of RD frequencies over time. Therefore, for each pathology and each region, the fraction of AD or CD notifications in their respective reference periods were expressed in percent with respect to the total AD plus CD notifications. The notification practice, which represented at least 90% of total notifications of a region over its reference period, was used to calculate the disease incidence and BP in that region. For those diseases and regions where neither notification system reached 90% total regional notifications, the calculation was not carried out. The assumption was applied considering that the datasets from Piemonte, Valle d’Aosta, and Lombardia, which recorded both dates, showed differences between dates of ascertainment and dates of certification of 4.9 years (95% Confidence Interval: 4.4–5.4 years), 8.5 years (95% CI: 6.1–10.9), and 5.3 years (95% CI: 4.9–5.7), respectively (data not shown). Mean national incidence and BP rates for the three-year period of 2012–2014 were estimated with the same assumption as above, that the frequency of new cases in the period defined by the diagnosis certification date did not differ significantly from the frequency of new cases in the period defined by the diagnosis ascertainment date. Therefore, AD and CD notifications were merged and their sum was used for national frequency estimates. Regional populations on 31 December 2012, 2013, and 2014 [22] and regional live births in the two-year period of 2012–2013 [23] were used to calculate the denominators for regional and national data.

2.5. Internal Consistency of the Incidence and Birth Prevalence Data and Comparison with Literature Data

The internal consistency of the RNMR database has been checked by comparing the medians of the regional incidence and BP rates obtained from AD and CD notifications. The comparison with literature data was limited to the diseases which were monitored by RNMR and could be traced with certainty to an ORPHA code. In addition, comparisons of BP data were limited to diseases which showed a median onset age lower than six months. Results from the RNMR database were firstly compared with literature data reported by ORPHANET [3,24]. In cases where differences of more than one order of magnitude were observed, a further, dedicated, literature search of incidence and BP estimates in the general population was carried out.

2.6. Statistical Methods

Data were reduced by means of contingency tables using RNMR individual disease and disease group codes for frequency calculations. Specific disease names were used to determine the sex ratio and the age at disease onset. Therefore, these features were determined separately for diseases coded with the same group code. The dispersion of regional notifications was described by means of medians and quartiles to minimize the effects of specific situations of selected regions, such as: unusual low sensitivity or unusually high frequencies due to the notification of cases diagnosed in previous years, which were especially likely in regions which activated a registry or adopted a CD record structure shortly before the observation period. The dispersion of the age at symptoms onset was described by means of median and extreme values due to the skewness of the dispersion of the values. Data of frequency for each disease were reported as the yearly average calculated from the total notifications in the three-year observation period. The use of yearly notifications was not considered due to the enhancement of stochastic effects for most diseases.

3. Results and Discussion

3.1. General Description of the National Database

The National Database was made up of records associated with 195,492 notifications. Its Subset 1, which refers to disease occurrences, was made up of 190,622 records, after excluding 4870 Type 2 duplicate records from the National Database. Subset 1 records regarded 275 individually-coded diseases and 47 disease groups. Records with dates of diagnosis ascertainment and certification in the period 2012–2014 were 26,870 and 36,292, respectively. Both the National Database and its Subset 1 contain 6607 Type 3 duplicate records related to cases with multiple diagnoses (3.5% total records of Subset 1). These duplicates may represent cases with multiple diseases, second opinions, and revised or refined diagnoses. Since it was not possible to distinguish among these different possibilities, no selection was applied at this time. Most of these records referred to group-coded pathologies and the differing diagnoses pertained to the same code. In 1855 records, referring to 925 cases, the diagnoses pointed at diseases belonging to different chapters of the ICD9-CM.

3.2. Data Quality: Missing, Inaccurate or Inconsistent Values

The disease onset date was missing or inaccurate in 26% of records of the National Database. The limited completeness of the onset date prevented the use of this variable to study the incidence of the monitored RD in the whole database, regardless of the notification practice. Due to the importance of this variable, a dedicated analysis was carried out to characterize the incomplete records. The study considered both the notifying region and the disease. Figure S1 shows the distribution of the notified diseases in completeness classes of the onset date per notifying region. Data presented also includes the conventional dates for unknown dates and asymptomatic cases. It can be observed that the completeness of this date can be very different, even in regions notifying comparable and large fractions of the monitored diseases. Completeness (including conventional values) for each disease or disease group code was, on average, 81.9% (SD: 17.5%; n = 317 codes) for AD regions, while it was 54.2 (SD: 23.6; n = 303 codes) for CD regions, with a statistically significant difference among the two groups (p < 0.001). Therefore, it appears that the completeness of this variable depends, to a considerable extent, on the rules for data access and communication associated with CD or AD notification practices. Since it is not expected that these rules introduce a biased selection of the data for a disease, available data were used for the determination of the disease onset age. The completeness of the variable indicating the centre making the first ascertainment of the diagnosis was only checked in AD records: this indication was missing in 5% of records. The Fiscal Code and other identifiers were missing in all FVG records, which make up about 1.1% of total records, and in an additional 0.2% of records from other regions. Remarks on the other variables affected less than 1% of records. Table S2 (Supplementary File 1) shows detailed remarks regarding the variables of the National Database.

3.3. Data Quality: Selection of the Regions and Reference Period for Frequency Rates Assessment

Table S3 (Supplementary File 1) shows the overall number of RD occurrences per region of residence of the case. The table provides summary evidence that epidemiological analyses of the available data could not rely on a single AD or CD record structure. Consequently, the analyses were carried out for individual regional populations, using the data from the record structure which dominated for each disease. The progress of notifications was studied distinguishing AD from CD notifications, to detect possible differences related to the data collection practice, and the results are summarised in Figure 1. Yearly AD notifications increased from 2001 to 2008 and then remained approximately constant afterwards. The time course of CD notifications showed an increasing trend from 2006 to 2013, when the median of notifications attained the maximum value. The high variance of CD regional notifications is explained by the adoption of the CD record structure by the CD regions at various times till 2011: before 2006, this record structure was only used by Veneto. Moreover, in some regions, the establishment of registries was followed by a peak in the yearly notifications. The variability of the absolute frequencies of regional notifications over time in the periods 2008–2014 and 2012–2014 is presented in Table S4 (Supplementary File 1). Umbria, accounting for about 1.5% of the national population and adopting the CD record structure in 2011, showed by far the highest variability of yearly regional notifications in the period 2012–2014. This period showed the lower variability of both AD and CD notifications with respect to the period 2008–2014. Therefore, it was selected as the reference period for the assessment of RD frequency rates.
Figure 1

Time course of notifications, by year of diagnosis ascertainment or certification. Dataset: Subset 1; Origin of notifications: Regions using the record structure with the Date of diagnosis Ascertainment (AD) and regions using the record structure with the Date of diagnosis Certification (CD); Selection of records: Records with valid diagnosis ascertainment or certification date; Total records: 115,575 (AD records) and 121,854 (CD records, including those from Lombardia, Piemonte, Sardegna, and Val d’Aosta). Data are expressed as percentage of the total regional notifications (by region of residence) to better compare the progress of AD or CD notifications from regional populations with different dimensions. Absolute numbers of yearly notifications per region of residence are reported in Table S4.

3.4. Symptoms Onset Age and Sex Distribution

Data of age at disease symptoms onset are reported in Table 1 for selected diseases. Table S5 reports data for all 414 diseases in which it was determined. In total, 108 diseases set on in all cases notified before 18 years of age and 15 showed all occurrences setting on after 18. The comparison with literature data could be carried out for almost three hundred diseases using data available in the ORPHANET portal, which mostly referred to the main life stages where the first symptoms set on (Table S5). There was a general agreement between RNMR and literature data, but for about 12% of compared diseases.
Table 1

Age at symptoms onset of selected rare diseases monitored by RNMR. Dataset: Subset 1; Origin of notifications: all Regions; Record selection: records with diagnosis indicating a specific disease (i.e., excluding disease groups) and at least four records with valid onset date per disease; Total records: 84,859.

RNMR CodeORPHA CodeDiseaseRecords (N)Median Onset Age (Years)Life Stages Reported by ORPHANET [24]
RNG0401791Frontofacionasal dysplasia7−0.28Neonatal
RN04903447Weaver syndrome7−0.23Neonatal, Antenatal
RNG03087Apert syndrome13−0.18Antenatal, Neonatal
RNG0402108Hallermann-Streiff syndrome9−0.17Neonatal, Infancy
RP00401915Fetal alcohol syndrome56−0.14Antenatal, Neonatal
RNG070313Lamellar ichthyosis60−0.11Neonatal
RN04001540Jackson-Weiss syndrome12−0.10Neonatal
RNG040207Crouzon disease43−0.09Infancy, Neonatal
RN11501340Cardiofaciocutaneous syndrome54−0.06Antenatal, Neonatal
RN06401114Aplasia cutis congenita14−0.06Antenatal, Neonatal
RFG010141Canavan disease6−0.06Neonatal, infancy (severe form); childhood (mild form)
RNG0401452Cleidocranial dysostosis7−0.05Neonatal
RCG060352Galactosemia58−0.05Infancy, Neonatal, Childhood
RN013035,737Morning glory syndrome10−0.05Childhood
RNG050429Hypochondroplasia7−0.05Childhood
RP0010290Congenital rubella syndrome31−0.05Antenatal, Neonatal
RN0390380Greig cephalopolysyndactyly syndrome15−0.04Neonatal
RDG020326Factor V deficiency24−0.04All ages
RN0100708Peters anomaly12−0.04Infancy, Neonatal
RCG040238,583Hyperphenylalaninemia673−0.04Neonatal, Infancy
RN1040710Pfeiffer syndrome10−0.03Antenatal, Neonatal
RN1530500LEOPARD syndrome37−0.03Childhood
RCG040407Non-ketotic hyperglycinemia6−0.03Infancy, Neonatal
RCG060348Fructose-1,6-bisphosphatase deficiency4−0.03All ages
RN0510464Incontinentia pigmenti74−0.03Neonatal
RN16401466COFS syndrome6−0.03Neonatal, Antenatal
RN1760912Zellweger syndrome13−0.02Neonatal
RNG070461Recessive X-linked ichthyosis60−0.02Neonatal
RP0060415,286Bilirubin encephalopathy7−0.02Neonatal
RCG05023Argininosuccinatelyase deficiency12−0.02Neonatal, All ages
RN0910374Goldenhar syndrome187−0.02Neonatal, Antenatal
RN0600312Epidermolytic ichthyosis32−0.02Neonatal
RDG020327Factor VII deficiency98−0.02All ages
RN0990570Moebius syndrome52−0.02Neonatal
RN03202368Gastroschisis57−0.02Neonatal, Antenatal
RDG020328Factor X deficiency9−0.02All ages
RN0700280Wolf-Hirschhorn syndrome64−0.02Neonatal, Antenatal
RN1080813Silver-Russell syndrome85−0.02Neonatal, Antenatal
RN01801203Duodenal atresia75−0.02Antenatal, Neonatal, Infancy, Childhood
RN05401556Cutis marmorata telangiectatica congenita21−0.02Neonatal
RCG040511Maple syrup urine disease26−0.02Infancy, Neonatal, Childhood
RNG070634Netherton syndrome17−0.01Infancy, Neonatal
RN1100808Seckel syndrome11−0.01Neonatal, Antenatal
RN1310739Prader-Willi syndrome388−0.01Neonatal, Antenatal
RCG04035Propionic acidemia4−0.01Infancy, Neonatal
RN00801764Familial dysautonomia6−0.01Neonatal, Antenatal
RDG020331Factor XIII deficiency5−0.01All ages
RN0850138CHARGE syndrome116−0.01Neonatal
RCG040293,355Methylmalonic acidemia11−0.01All ages
RN1170744Proteus syndrome19−0.01Infancy
RN01701201Atresia of small intestine77−0.01Neonatal
RN1140107Branchio-oto-renal syndrome35−0.01All ages
RN1250887VACTERL/VATER association80−0.01Neonatal
RN1410199Cornelia de Lange syndrome74−0.01Neonatal, Antenatal
RN0930392Holt-Oram syndrome23−0.01Neonatal
RDG02098,878Hemophilia A1209−0.01Infancy, Neonatal
RDG02098,879Hemophilia B2020.00Infancy, Neonatal
RN1740899Walker-Warburg syndrome60.00Infancy, Neonatal
RN1010648Noonan syndrome4590.00Neonatal
RN04302911Poland syndrome2990.00Infancy, Neonatal
RDG020745Protein C deficiency2330.00Childhood
RN0200388Hirschsprung disease2440.00Infancy, Neonatal
RN00602162Holoprosencephaly460.00Neonatal, Antenatal
RN1240857Townes-Brocks syndrome80.00All ages
RN11301297Branchio-oculo-facial syndrome70.00Neonatal
RNG0601522Craniometaphyseal dysplasia110.00Childhood
RN1210819Smith-Magenis syndrome450.00Infancy, Neonatal
RN08902053Freeman-Sheldon syndrome90.00Neonatal
RCG050187Citrullinemia100.00Neonatal, Adult
RN03601465Coffin-Siris syndrome130.00Neonatal
RN04102311Jarcho-Levin syndrome110.00Neonatal, Antenatal
RN0280950Acrodysostosis120.00Neonatal, Antenatal
RN166035,125Epidermal nevus syndrome150.00Childhood, Adolescent, Adult
RNG040861Treacher-Collins syndrome160.00Neonatal
RN145094,068Congenital spondyloepiphyseal dysplasia140.00Neonatal
RNG060289Ellis-Van Creveld syndrome80.00Neonatal, Antenatal
RN1590884Pallister-Killian syndrome160.00Neonatal, Antenatal
RN0670281Cri du chat syndrome460.00Neonatal
RN0790915Aarskog-Scott syndrome280.00Childhood
RN09402322Kabuki make-up syndrome1120.00Infancy, Neonatal
RN0040475Joubert syndrome960.00Neonatal, Antenatal
RN1270904Williams syndrome3160.00Neonatal, Antenatal
RCG160567Di George syndrome3590.00Neonatal
RN1620783Rubinstein-Taybi syndrome630.00All ages
RN1380110Bardet-Biedl syndrome510.00Infancy, Neonatal, Antenatal
RN0660870Down syndrome22830.00Antenatal, Neonatal
RN0680881Turner syndrome9470.00Infancy, Neonatal, Antenatal, Childhood
RN07703205Sturge-Weber syndrome1340.00Infancy, Neonatal, Childhood, Adolescent
RN1330908Fragile X syndrome2880.00Neonatal, Infancy, Childhood
RN151090,308Klippel-Trénaunay syndrome1700.00Infancy, Childhood, Adolescent
RN011077Aniridia690.00Infancy, Neonatal
RFG05083,330Werdnig-Hoffman disease130.00Infancy, Neonatal
RCG04026Methylmalonic acidemia with homocystinuria160.00All ages
RN08603157De Morsier syndrome460.00Infancy, Neonatal
RN1400191Cockayne syndrome60.01All ages
RN1430220Denys-Drash syndrome50.02Infancy, Neonatal
RN02402138True hermaphroditism170.02Infancy, Neonatal
RN11902614Nail-patella syndrome330.02Neonatal, Infancy, Childhood
RN0340974Adams-Oliver syndrome110.02Neonatal
RDG020903Von Willebrand disease8000.02All ages
RC0180205Crigler-Najjar syndrome270.04Neonatal
RN1730893WAGR syndrome80.07Neonatal
RN095098,861Primary ciliary dyskinesia, Kartagener type3450.08Neonatal, infancy

Note: This table reports pathologies ordered by increasing median age at onset. The number of records is indicated with the only aim of allowing a better assessment of the statistical data presented and cannot be used as an indication of the disease or exemption code frequency.

The sex distribution of diseases unevenly frequent between sexes is presented in Table 2. Eleven diseases occurred with a female:male ratio higher than 9:1. Twelve diseases prevailed in males with a female:male ratio lower than 1:9. Data of sex distribution could be compared with data of 65 diseases reported in the ORPHANET data sheets and further literature as having a predominance in either sex. RNMR results showed a good agreement with literature data, except for three diseases (5% comparisons). Table S6 shows the sex distribution of 441 RD reported to RNMR and the comparison with literature data, where available.
Table 2

Sex distribution of rare diseases monitored by RNMR with the highest relative frequencies in males or females. Dataset: Subset 1; Origin of notifications: all regions; Record selection: records with diagnosis indicating a specific disease (i.e., excluding disease groups) and at least four records per disease; Total records: 119,762.

ORPHA CodeDiseaseRecords (N)Females (%)Males (%)F:M Ratio and Other Literature Information Reported in ORPHANET Data Sheets [24]
1791Frontofacionasal dysplasia71000-
209,981Iron refractory iron deficiency anemia41000-
881Turner syndrome1754991Almost exclusively in F
778Rett syndrome603973Predominant in F
538Lymphangioleiomyomatosis180973Almost exclusively in F
464Incontinentia pigmenti9597320:1
37,202Interstitial cystitis11929559:1
-Storage pool deficiency11919-
816Sjögren-Larsson syndrome93909-
-Thrombocytopenic thrombotic purpura-hemolytic-uremic syndrome109010-
559Marinesco-Sjögren syndrome309010-
169,615Idiopathic central precocious puberty3686891110:1
98,973Posterior polymorphous corneal dystrophy88813-
79,473Porphyria variegata88813Predominant in F
1297Branchio-oculo-facial syndrome88813-
3143Schmidt syndrome728614-
1452Cleidocranial dysostosis78614-
429Hypochondroplasia78614-
809Mixed connective tissue disease1092861410:1
3287Takayasu arteritis3468514Predominant in F
2483Melkersson-Rosenthal syndrome258416-
317Erythrokeratodermia variabilis68317-
98,958Droplet cornea178218-
436Hypophosphatasia118218-
-Secretion deficiency thrombocytopathy118218-
125Bloom syndrome58020-
79,273Hereditary coproporphyria58020Predominant in F
98,974Fuchs endothelial dystrophy3080203–4:1
581Mucopolysaccharidosis type 358020-
2059Fryns syndrome58020-
65,684Hirayama disease520801:20
3447Weaver syndrome102080-
98,895Becker muscular dystrophy2211981Predominant in M
379Chronic granulomatous disease871882-
221,117Gerstmann syndrome61783-
97,360Robinow syndrome61783-
478Kallmann syndrome50916841:5
101,330Porphyria cutanea tarda531585Predominant in M
-Opitz syndrome201585-
1466COFS syndrome71486-
-Simpson-Golabi-Behmel syndrome71486-
2796Pachydermoperiostosis1513871:7
481Kennedy disease511288-
98,879Hemophilia B2711189Predominant in M
223Nephrogenic diabetes insipidus661189-
915Aarskog-Scott syndrome38892Predominant in M
98,878Hemophilia A1577793Predominant in M
98,896Duchenne muscular dystrophy258595Predominant in M
461Recessive X-linked ichthyosis65397Almost exclusively in M
-Klinefelter syndrome20710100-
330Factor XII deficiency40100-
425Familial hypoalphalipoproteinemia40100-
510Lesch-Nyhan disease120100Severe form predominant in M
534Lowe syndrome50100Almost exclusively in M
649Norrie disease50100Almost exclusively in M
580Mucopolysaccharidosis type 2270100Almost exclusively in M
245Nager syndrome40100-

Note: The number of records is indicated with the only aim of allowing a better assessment of the statistical data presented and cannot be used as an indication of the disease or exemption code frequency. Where the percentages in males and females sum to less than 100, the difference represents records with missing sex data.

3.5. Incidence in the General Population and Prevalence in Live Births

Table 3 and Table 4 report, respectively, the rates of diseases (and disease groups) most incident in the general population and most prevalent at birth. National and regional incidence and BP rates of all diseases and disease groups notified to RNMR in Italy are reported in Tables S7 and S8. The collection of data of such a wide range of diseases has been made possible thanks to the sustainability of the registration and, since 2011, full coverage of the population residing in the whole Italian territory. Full population coverage relies on the national network for the diagnosis and care of RD, made up of 247 accredited centres belonging to the public health service [21]. Completeness of case ascertainment relies on mandatory case notification and the possibility for the patients to access RD-specific assistance when their condition is certified by these centres. However, completeness of case ascertainment may depend on the actual need for RD-specific assistance and care. Moreover, the occurrence of diseases and malformations in still births and at termination of pregnancy for foetal anomalies is not recorded. This implies that RNMR data are actually representative of patients requiring care in specialized RD centres.
Table 3

National Incidence rates of most incident rare diseases and rare disease groups monitored by RNMR in the Italian population. Yearly average in the three-year period 2012–2014.

ORPHA CodeDisease NameNational Incidence (/Million)
156,071Keratoconus25.53
803Amyotrophic lateral sclerosis19.15
870Down syndrome7.76
169,615Idiopathic central precocious puberty7.33
-Lichen sclerosus5.92
-Bullous pemphigoid4.88
930Idiopathic achalasia4.63
117Behçet disease4.57
-Arnold-Chiari malformation4.19
2932Chronic inflammatory demyelinating polyneuropathy3.77
-Pemphigus vulgaris3.75
-Klinefelter syndrome3.75
397Giant cell arteritis3.73
399Huntington disease3.38
37,202Interstitial cystitis3.19
881Turner syndrome2.53
98,249Ehlers-Danlos syndrome2.43
774Hereditary hemorrhagic telangiectasia2.36
2331Kawasaki disease2.08
732Polymyositis1.95
733Familial adenomatous polyposis1.90
183Churg-Strauss syndrome1.88
221Dermatomyositis1.87
809Mixed connective tissue disease1.80
558Marfan syndrome1.79
900Wegener granulomatosis1.77
-Mixed cryoglobulinemia1.75
-Idiopathic torsion dystonia1.46
-Rheumatic endocarditis1.36
761Henoch-Schönlein purpura1.35
648Noonan syndrome1.30
805Tuberous sclerosis1.18
908Fragile X syndrome1.15
2911Poland syndrome1.14
727Microscopic polyangiitis1.14
171Primary sclerosing cholangitis1.03
904Williams syndrome0.98
2073Narcolepsy0.97
478Kallmann syndrome0.94
96,346Anorectal malformation0.93
388Hirschsprung disease0.88
790Retinoblastoma0.83
91,378Hereditary angioedema0.81
104Leber hereditary optic neuropathy0.79
739Prader-Willi syndrome0.78
70,590Infantile apnea0.78
116Beckwith-Wiedemann syndrome0.76
550MELAS0.73
49,041Retroperitoneal fibrosis0.72
3451West syndrome0.70
Disease Group Name
-Hereditary coagulation defects25.16
-Undifferentiated connective tissue syndromes17.75
-Hereditary retinic dystrophies17.60
-Hereditary anemias15.04
-Neurofibromatoses10.08
-Muscular dystrophies8.12
-Congenital alterations of iron metabolism8.04
-Disturbances of aminoacid transport and metabolism7.92
-Chromosomal duplication/deficiency syndromes7.82
-Hereditary neuropathies6.61
Table 4

National BP Rates of most birth-prevalent rare diseases and rare disease groups monitored by RNMR in the Italian population. Rates in live births of the two-year period 2012–2013.

ORPHA CodeDisease NameBirth Prevalence (/100,000)
870Down syndrome35.00
-Esophageal atresia and/or Isolated tracheo-esophageal fistula6.67
96,346Anorectal malformation6.37
3451West syndrome5.69
388Hirschsprung disease4.22
739Prader-Willi syndrome3.63
-Biliary atresia3.14
805Tuberous sclerosis2.84
116Beckwith-Wiedemann syndrome2.65
648Noonan syndrome2.45
881Turner syndrome1.96
1203Duodenal atresia1.86
374Goldenhar syndrome1.67
1201Atresia of small intestine1.67
290Congenital rubella syndrome1.47
904Williams syndrome1.47
-Epidermolysis bullosa1.37
2911Poland syndrome1.27
2368Gastroschisis1.18
138CHARGE syndrome1.18
-Lissencephaly1.08
98,861Primary ciliary dyskinesia, Kartagener type1.08
464Incontinentia pigmenti0.78
887VACTERL/VATER association0.78
-Congenital colobomatous optic disc0.69
3205Sturge-Weber syndrome0.69
-Alagille syndrome0.69
2162Holoprosencephaly0.59
1896EEC syndrome0.59
205Crigler-Najjar syndrome0.59
77Aniridia0.49
70,590Infantile apnea0.49
1915Fetal alcohol syndrome0.39
199Cornelia de Lange syndrome0.39
908Fragile X syndrome0.39
90,308Klippel-Trénaunay syndrome0.39
912Zellweger syndrome0.29
570Moebius syndrome0.29
199,642Microcephaly0.29
281Cri du chat syndrome0.29
783Rubinstein-Taybi syndrome0.29
380Greig cephalopolysyndactyly syndrome0.20
280Wolf-Hirschhorn syndrome0.20
813Silver-Russell syndrome0.20
475Joubert syndrome0.20
884Pallister-Killian syndrome0.20
3157De Morsier syndrome0.20
974Adams-Oliver syndrome0.20
726Alpers syndrome0.20
435Ito hypomelanosis0.20
Disease Group Name
-Disturbances of aminoacid transport and metabolism30.10
-Congenital craniofacial anomalies21.57
-Chromosomal duplication/deficiency syndromes8.73
-Disturbances of carbohydrate transport and metabolism, excluded diabetes mellitus 5.39
-Neurofibromatoses3.92
-Congenital chondrodystrophies3.53
-Congenital ichthyoses2.94
-Other congenital anomalies with intellectual disability 2.84
-Chromosomal aneuploidy syndromes2.55
-Urea cycle disturbances1.18
Median regional incidence and BP rates calculated from CD and from AD collection practices are compared in Tables S9 and S10. Incidence data from the two collection practices showed a Pearson correlation coefficient of 0.78, while the Pearson’s correlation coefficient of BP rates was 0.44. The low correlation of BP rates may be related to the limited population monitored and record selection criteria. For both incidence and BP, the rates calculated from the CD collection practice were, on average, higher than those from the AD collection practice. The reason for such a difference may be related to the different time courses shown by CD and AD notifications (Figure 1), which may involve the certification of cases diagnosed before the adoption of the CD practice. Incidence data for 34 diseases and BP data for 56 diseases could be compared with data retrieved in the ORPHANET compilations (Tables S11 and S12). Diseases with differences of more than one order of magnitude were subject to a dedicated survey of the literature, which showed that information and data justifying our results were published, except for three and four diseases, respectively, for incidence and BP rates.

4. Conclusions

This study determined basic features to assess the reliability and internal consistency of RNMR data. Data of the three-year period 2012–2014 provided information on 275 diseases and 47 disease groups. Moreover, the onset age and sex distribution have been determined for more than 400 individual diseases. This data is mostly representative of patients requiring care in specialized RD centres and is especially suitable to assess the burden of assistance and care by RD-dedicated resources of the public health service. The external validity of the data collected in the RNMR database was studied by comparing the RD national estimates derived from it with the estimates of frequency in the general population available in the literature. Differences bigger than an order of magnitude were found for 7–8% diseases for which comparisons were possible regarding national incidence and BP. A similar level of agreement with literature data was found regarding the age at disease onset and sex distribution. Therefore, RNMR data is rather consistent with information available in the literature. The remarkable consistency of RNMR data with data of some diseases, which can be traced reliably from hospital records, was also shown in a recent publication [25]. In conclusion, it appears that RNMR results provide basic epidemiological information, which was still lacking for many RD, and overcome the current typical fragmentation of data resulting from the observation of one or a few related diseases in different, selected populations with varied methodologies. In fact, RNMR results compose a unique systematic picture of the occurrence of a wide range of RD in a population as big as about 60 million residents. Although further efforts are still required to achieve the full exploitation of such a complex and extended collection system, this picture represents a novel and sound information basis to improve the assessment of the rare disease burden, and to inform public health policy planning and research prioritization on rare diseases in Italy and beyond.
  24 in total

1.  The current situation and needs of rare disease registries in Europe.

Authors:  D Taruscio; S Gainotti; E Mollo; L Vittozzi; F Bianchi; M Ensini; M Posada
Journal:  Public Health Genomics       Date:  2014-02-03       Impact factor: 2.000

2.  The Italian National Rare Diseases Registry.

Authors:  Domenica Taruscio; Yllka Kodra; Gianluca Ferrari; Luciano Vittozzi
Journal:  Blood Transfus       Date:  2014-04       Impact factor: 3.443

3.  Birth prevalence rates of skeletal dysplasias.

Authors:  C Stoll; B Dott; M P Roth; Y Alembik
Journal:  Clin Genet       Date:  1989-02       Impact factor: 4.438

4.  Chronic proximal spinal muscular atrophy of childhood and adolescence: sex influence.

Authors:  I Hausmanowa-Petrusewicz; J Zaremba; J Borkowska; W Szirkowiec
Journal:  J Med Genet       Date:  1984-12       Impact factor: 6.318

5.  Incidence and mortality of relapsing polychondritis in the UK: a population-based cohort study.

Authors:  Nisha Hazra; Alex Dregan; Judith Charlton; Martin C Gulliford; David P D'Cruz
Journal:  Rheumatology (Oxford)       Date:  2015-07-17       Impact factor: 7.580

6.  Pallister-Killian syndrome: a study of 22 British patients.

Authors:  Moira Blyth; Viv Maloney; Sarah Beal; Morag Collinson; Shuwen Huang; John Crolla; I Karen Temple; Diana Baralle
Journal:  J Med Genet       Date:  2015-04-17       Impact factor: 6.318

7.  Genotype-phenotype correlation study in 529 patients with multiple hereditary exostoses: identification of "protective" and "risk" factors.

Authors:  Elena Pedrini; Ivy Jennes; Morena Tremosini; Annamaria Milanesi; Marina Mordenti; Alessandro Parra; Federica Sgariglia; Monia Zuntini; Laura Campanacci; Nicola Fabbri; Elettra Pignotti; Wim Wuyts; Luca Sangiorgi
Journal:  J Bone Joint Surg Am       Date:  2011-12-21       Impact factor: 5.284

8.  Five years of molecular diagnosis of Fragile X syndrome (1997-2001): a collaborative study reporting 95% of the activity in France.

Authors:  Valérie Biancalana; Chérif Beldjord; Agnès Taillandier; Sylvie Szpiro-Tapia; Véronica Cusin; Fabienne Gerson; Christophe Philippe; Jean-Louis Mandel
Journal:  Am J Med Genet A       Date:  2004-09-01       Impact factor: 2.802

9.  National registries of rare diseases in Europe: an overview of the current situation and experiences.

Authors:  Domenica Taruscio; Luciano Vittozzi; Remy Choquet; Ketil Heimdal; Georgi Iskrov; Yllka Kodra; Paul Landais; Manuel Posada; Rumen Stefanov; Christiane Steinmueller; Elfriede Swinnen; Herman Van Oyen
Journal:  Public Health Genomics       Date:  2014-09-09       Impact factor: 2.000

10.  Prevalence, incidence, and age at diagnosis in Marfan Syndrome.

Authors:  Kristian A Groth; Hanne Hove; Kasper Kyhl; Lars Folkestad; Mette Gaustadnes; Niels Vejlstrup; Kirstine Stochholm; John R Østergaard; Niels H Andersen; Claus H Gravholt
Journal:  Orphanet J Rare Dis       Date:  2015-12-02       Impact factor: 4.123

View more
  4 in total

1.  Primary Sclerosing Cholangitis: Burden of Disease and Mortality Using Data from the National Rare Diseases Registry in Italy.

Authors:  Marco Carbone; Yllka Kodra; Adele Rocchetti; Valerio Manno; Giada Minelli; Alessio Gerussi; Vincenzo Ronca; Federica Malinverno; Laura Cristoferi; Annarosa Floreani; Pietro Invernizzi; Susanna Conti; Domenica Taruscio
Journal:  Int J Environ Res Public Health       Date:  2020-04-29       Impact factor: 3.390

Review 2.  Multifactorial Rare Diseases: Can Uncertainty Analysis Bring Added Value to the Search for Risk Factors and Etiopathogenesis?

Authors:  Domenica Taruscio; Alberto Mantovani
Journal:  Medicina (Kaunas)       Date:  2021-01-28       Impact factor: 2.430

3.  Prevalence of Huntington Disease in Italy: a systematic review and meta-analysis.

Authors:  Matteo Riccò; Luigi Vezzosi; Federica Balzarini; Giovanni Gualerzi; Silvia Ranzieri
Journal:  Acta Biomed       Date:  2020-04-10

4.  10 years of CEMARA database in the AnDDI-Rares network: a unique resource facilitating research and epidemiology in developmental disorders in France.

Authors:  Claude Messiaen; Caroline Racine; Arnaud Sandrin; Alain Verloes; Laurence Faivre; Ahlem Khatim; Louis Soussand; Sylvie Odent; Didier Lacombe; Sylvie Manouvrier; Patrick Edery; Sabine Sigaudy; David Geneviève; Christel Thauvin-Robinet; Laurent Pasquier; Florence Petit; Massimiliano Rossi; Marjolaine Willems; Tania Attié-Bitach; Pierre-Henry Roux-Levy; Laurent Demougeot; Lilia Ben Slama; Paul Landais; Anne-Sophie Jannot; Christine Binquet
Journal:  Orphanet J Rare Dis       Date:  2021-08-04       Impact factor: 4.123

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.