Literature DB >> 34751474

Skin measurement devices to assess skin quality: A systematic review on reliability and validity.

Mirte Langeveld1, Lara S van de Lande2, Eimear O' Sullivan2, Berend van der Lei3, Joris A van Dongen4.   

Abstract

BACKGROUND: Many treatments aim to slow down or reverse the visible signs of skin aging and thereby improve skin quality. Measurement devices are frequently employed to measure the effects of these treatments to improve skin quality, for example, skin elasticity, color, and texture. However, it remains unknown which of these devices is most reliable and valid.
MATERIALS AND METHODS: MEDLINE, Embase, Cochrane Central, Web of Science, and Google Scholar databases were searched. Instruments were scored on reporting construct validity by means of convergent validity, interobserver, intraobserver, and interinstrument reliability.
RESULTS: For the evaluation of skin color, 11 studies were included describing 16 measurement devices, analyzing 3172 subjects. The most reliable device for skin color assessment is the Minolta Chromameter CR-300 due to good interobserver, intraobserver, and interinstrument reliability. For skin elasticity, seven studies assessed nine types of devices analyzing 290 subjects in total. No intra and interobserver reliability was reported. Skin texture was assessed in two studies evaluating 72 subjects using three different types of measurement devices. The PRIMOS device reported excellent intra and interobserver reliability. None of the included reviewed devices could be determined to be valid based on construct validity.
CONCLUSION: The most reliable devices to evaluate skin color and texture in ordinary skin were, respectively, the Minolta Chromameter and PRIMOS. No reliable device is available to measure skin elasticity in ordinary skin and none of the included devices could be determined to be designated as valid.
© 2021 The Authors. Skin Research and Technology published by John Wiley & Sons Ltd.

Entities:  

Keywords:  facial surgery; measurement tools; plastic surgery; skin quality; skin rejuvenation; systematic review

Mesh:

Year:  2021        PMID: 34751474      PMCID: PMC9299221          DOI: 10.1111/srt.13113

Source DB:  PubMed          Journal:  Skin Res Technol        ISSN: 0909-752X            Impact factor:   2.240


INTRODUCTION

The skin is the largest and most visible organ of the human body with important protective and regulatory functions. The epidermal barrier forms the first line of defense against exogenic factors and pathogenic microorganisms. The skin also plays an important role in thermoregulation, metabolic processes, and sensory perception. , , Adjacent to its function, skin plays a key role in aesthetics; unfortunately, skin quality decreases over time due to aging, especially in the face. Facial aging is characterized by many changes in a broad spectrum of facial skin features, for example, pigmentation, wrinkles, and rosacea. , Aging of skin can be categorized into two types of aging: intrinsic and extrinsic aging. Intrinsic aging derives from genetic and hormonal influences, whereas extrinsic aging is caused by environmental factors, such as cigarette smoke, ultraviolet radiation, or trauma. , In the epidermis of the skin, aging of the face is characterized by loss of dermal mast cells and fibroblasts as well as by shortening of telomeres. In the dermis, lower levels of collagen, dysfunctional collagen, and a reduction of elastin fibers are observed. These cellular changes result in increased pigmentation, loss of elasticity, and formation of wrinkles over time. , Nowadays, people have become progressively concerned about their aged facial skin features. Many autologous treatments, for example, lipofilling, platelet‐rich plasma, or nanofat, aim to either slow down or reverse these visible signs of skin aging and thereby improving skin quality. Generally, skin quality and skin quality improvement is assessed merely by visual inspection by the patient and practitioner, which is accompanied by disadvantages of interperson variability and recall‐bias, making the results rather unreliable. Some clinicians determine the effectiveness of such interventions by assessing skin quality with the use of a measurement tool as, for example, tristimulus colorimetry to measure skin color, the Cutometer or Ballistometer for skin elasticity, and polarization imaging techniques to assess skin texture. , , However, it remains unknown whether these devices are accurate and dependable. Therefore, the aim of this study is to systematically search for the best‐validated medical devices to assess skin quality (i.e., skin color, texture, and elasticity) in the most reliable way.

METHODS

Protocol, information sources, and search

This systematic review was performed according to the PRISMA statement. The databases MEDLINE, Embase, Cochrane Central, Web of Science, and Google Scholar were searched on April 16, 2019. An update search was performed on December 15, 2020. The detailed search strategy is provided in the Supplementary Content (S1).

Eligibility criteria and study selection

Title and abstract were independently screened by two authors (M.L. and L.v.d.L.) using eligibility criteria. Full article studies were included if studies investigated the reliability and validity of medical devices assessing changes in human “ordinary” aged skin, that is, skin color, texture, or elasticity (Table 1). Studies were included if reported at least one of the following items regarding skin quality measurement devices: intraobserver reliability, interobserver reliability, interinstrument reliability, or construct validity. Studies evaluating content and criterion validity were not found. Studies assessing the quality of “diseased” skin, for example, melanoma, scars, or burn wounds, were excluded as well as animal studies. Reference lists of included studies were hand‐searched for relevant studies. Disagreements were discussed during a consensus meeting with the last author (J.v.D.).
TABLE 1

Inclusion and exclusion criteria

Inclusion criteriaExclusion criteria
Human skinDiseases and trauma affecting skin quality, for example, burn wounds, scars, and disease‐caused
Medical devices assessing human skin texture, color, or elasticity
Reporting of intraobserver and/or interobserver reliability and/or interinstrument observer reliability and/or validity
Prospective and retrospective studiesCase reports, conference abstracts, letter to the editor, and reviews
Inclusion and exclusion criteria

Assessment of quality of included studies and risk of bias

The included studies were graded on quality of evidence using the Oxford Center for Evidence‐Based Medicine (OCEBM) criteria. Disclosure agreements and funding status were reviewed for each study.

Data extraction

Measurement devices were scored on reporting construct validity by means of convergent validity and inter or intraobserver as well as interinstrument reliability. For construct validity, the Pearson's correlation coefficients of correlations between measurement devices were extracted and the median was depicted in a correlogram. Correlations > 0.5 or < −0.5 were considered strong. For reliability, intraclass correlation coefficients (ICCs) were reported. ICCs > 0.8 were considered good, moderate between 0.6 and 0.8, and poor < 0.6.

RESULTS

Included studies

The initial search identified 3724 publications (Figure 1). The update search yielded 621 additional publications. Hand‐searching reference lists of included publications identified two additional records. After abstract screening, 4296 were excluded. Fifty studies were read in full text and assessed on eligibility criteria. Twenty‐seven studies did not describe an outcome of interest and were excluded. Four publications were reviews and therefore excluded. One publication was excluded because of evaluating diseased skin. One study was excluded as it was a letter to the editor. Following full‐text assessment, 18 publications were included in this systematic review. , , , , , , , , , , , , , , , , ,
FIGURE 1

Flow diagram of study selection

Flow diagram of study selection

Study characteristics

Skin color

Eleven studies assessed skin color describing a total of 16 different measurement devices analyzing 3172 subjects (Table 2). , , , , , , , , , , , The largest study by Uter et al. accounted for 2287 of included subjects.9 All studies evaluated measurement devices in a predominantly Caucasian population, except for one study by Wright et al. Wright et al. researched the DRS probe and Mexameter MX 18 in a predominantly (68.5%) African American population (n = 503).
TABLE 2

Study characteristics of studies on skin color measurement

Author, yearPopulation (n)DevicePrincipleClinical parameterMeasurement regionInterventionMeasurement timingsRepetitive measurements (n)
Wright et al., 2016

503

African American

DRS Probe

Mexameter MX 18

Diffuse reflectance spectroscopy

Narrow‐band reflectance spectrophotometry

Melanin, erythema

Melanin, erythema

Inner part of upper arm

Inner part of upper arm

Baseline

Baseline

3

3

Matias et al., 201530

Antera 3D

Mexameter MX 18

Colorimeter CL‐400

Reflectance mapping with L*a*b* color system

Narrow‐band reflectance spectrophotometry

Tristimulus colorimetry with L*a*b* color system

Melanin, erythema, skin color

Melanin, erythema

Skin color

The back

The back

The back

UVB light exposure at various intensities

UVB light exposure at various intensities

UVB light exposure at various intensities

Baseline, 2, 7, 12, and 14 days

Baseline, 2, 7, 12, and 14 days

Baseline, 2, 7, 12, and 14 days

5

5

5

Baquie and Kasraee, 201412

Dermacatch

Mexameter MX 16

Visible‐spectrum reflectance colorimeter

Narrow‐band reflectance

spectrophotometry

Melanin, erythema

Melanin, erythema

Volar side of the forearm and the back

Volar side of the forearm and the back

UVB light exposure, methyl nicotine cream or dermocorticoid cream

UVB light exposure, methyl nicotine cream or dermocorticoid cream

Baseline, 2, 7, and 14 days

Baseline, 2, 7, and 14 days

10

10

Hua et al., 201420

“Soft Plus” with melanin probe

Mexameter MX 18

Double wavelength reflectance

photometry

Narrow‐band

spectrophotometry

Melanin

Melanin

Face

Face

Baseline

Baseline

3‐5

3–5

Gankande et al., 201430DermaLab ComboNarrow‐band reflectance spectrophotometryMelanin, erythemaHead, neck, chest, back, arm, legBaseline3
Uter et al., 20132287

Minolta Chromameter CR‐300

Reflektometer RM 100

Tristimulus colorimetry with Yxy color system

Remission photometry

Skin color

Skin reflectance

Inner part of upper arm

Inner part of upper arm

Baseline

Baseline

3

3

Van der Wal et al., 201350

Mexameter MX 18

Colorimeter CL‐400

DSM II ColorMeter

Narrow‐band reflectance

spectrophotometry

Tristimulus colorimetry with L*a*b* color system

Narrow‐band reflectance

spectrophotometry

and tristimulus colorimetry with L*a*b* color system

Melanin, erythema

Skin color

Skin color,

erythema melanin

Trunk, upper and lower extremities

Trunk, upper and lower extremities

Trunk, upper and lower extremities

Baseline

Baseline

Baseline

2

2

2

Bailey et al., 201288ChromometerPrinciple not mentionedPigmentationForehead, midcheek, jawline, neck, and abdomenBaseline
Barel et al., 200112

Visi‐Chroma VC‐100

Minolta Chromameter CR‐200

Tristimulus colorimetry with L*a*b* color system

Tristimulus colorimetry with L*a*b* color system

Skin color

Skin color

DHA 5% cream, methyl nicotine cream or sodium lauryl sulfate exposure

DHA 5% cream, methyl nicotine cream or sodium lauryl sulfate exposure

Baseline, 2, 4, and 24 h

Baseline, 2, 4, and 24 h

10

10

Kerckhove et al., 200160Minolta Chromameter CR‐300Tristimulus colorimetry with L*a*b* color systemSkin colorVentral side of the forearmBaseline, 7 days
Shriver et al., 200080

Photovolt ColorWalk colorimeter

DermaSpectrometer

Tristimulus colorimetry

Narrow‐band reflectance

spectrophotometry

Skin color with L*a*b*

color system

Melanin, erythema

Inner part of the upper arm, forehead

Inner part of the upper arm, forehead

Baseline

Baseline

3

3

L*a*b* = Commision International d'Eclairage (CIE) color system. Colors are represented by three variables: L*, the lightness‐darkness axis; a* the red‐green axis; and b*, the blue‐yellow axis.

Yxy = Commision International d'Eclairage (CIE) color system. Y value represents lightness‐darkness axis.

DHA = dihydroxyacetone, product used for tanning of the skin.

Study characteristics of studies on skin color measurement 503 African American DRS Probe Mexameter MX 18 Diffuse reflectance spectroscopy Narrow‐band reflectance spectrophotometry Melanin, erythema Melanin, erythema Inner part of upper arm Inner part of upper arm Baseline Baseline 3 3 Antera 3D Mexameter MX 18 Colorimeter CL‐400 Reflectance mapping with L*a*b* color system Narrow‐band reflectance spectrophotometry Tristimulus colorimetry with L*a*b* color system Melanin, erythema, skin color Melanin, erythema Skin color The back The back The back UVB light exposure at various intensities UVB light exposure at various intensities UVB light exposure at various intensities Baseline, 2, 7, 12, and 14 days Baseline, 2, 7, 12, and 14 days Baseline, 2, 7, 12, and 14 days 5 5 5 Dermacatch Mexameter MX 16 Visible‐spectrum reflectance colorimeter Narrow‐band reflectance spectrophotometry Melanin, erythema Melanin, erythema Volar side of the forearm and the back Volar side of the forearm and the back UVB light exposure, methyl nicotine cream or dermocorticoid cream UVB light exposure, methyl nicotine cream or dermocorticoid cream Baseline, 2, 7, and 14 days Baseline, 2, 7, and 14 days 10 10 “Soft Plus” with melanin probe Mexameter MX 18 Double wavelength reflectance photometry Narrow‐band spectrophotometry Melanin Melanin Face Face Baseline Baseline 3‐5 3–5 Minolta Chromameter CR‐300 Reflektometer RM 100 Tristimulus colorimetry with Yxy color system Remission photometry Skin color Skin reflectance Inner part of upper arm Inner part of upper arm Baseline Baseline 3 3 Mexameter MX 18 Colorimeter CL‐400 DSM II ColorMeter Narrow‐band reflectance spectrophotometry Tristimulus colorimetry with L*a*b* color system Narrow‐band reflectance spectrophotometry and tristimulus colorimetry with L*a*b* color system Melanin, erythema Skin color Skin color, erythema melanin Trunk, upper and lower extremities Trunk, upper and lower extremities Trunk, upper and lower extremities Baseline Baseline Baseline 2 2 2 Visi‐Chroma VC‐100 Minolta Chromameter CR‐200 Tristimulus colorimetry with L*a*b* color system Tristimulus colorimetry with L*a*b* color system Skin color Skin color DHA 5% cream, methyl nicotine cream or sodium lauryl sulfate exposure DHA 5% cream, methyl nicotine cream or sodium lauryl sulfate exposure Baseline, 2, 4, and 24 h Baseline, 2, 4, and 24 h 10 10 Photovolt ColorWalk colorimeter DermaSpectrometer Tristimulus colorimetry Narrow‐band reflectance spectrophotometry Skin color with L*a*b* color system Melanin, erythema Inner part of the upper arm, forehead Inner part of the upper arm, forehead Baseline Baseline 3 3 L*a*b* = Commision International d'Eclairage (CIE) color system. Colors are represented by three variables: L*, the lightness‐darkness axis; a* the red‐green axis; and b*, the blue‐yellow axis. Yxy = Commision International d'Eclairage (CIE) color system. Y value represents lightness‐darkness axis. DHA = dihydroxyacetone, product used for tanning of the skin. The two most frequently employed techniques were narrow‐band reflectance spectrophotometry and tristimulus colorimetry. In narrow‐band reflectance spectrophotometry, differences in red and near infrared light absorption and reflection of hemoglobin and melanin are used to measure vascularization (erythema) and pigmentation (melanin) of the skin. , Included devices using reflectance spectrophotometry to assess skin color were the Mexameter MX 16 and 18, DermaLab Combo, DSM II ColorMeter, and DermaSpectrometer. , , , , , , In Tristimulus colorimetry, white LED is scattered in all directions and the reflected light is measured by the probe. The reflected light is analyzed and expressed in the L*a*b* color system and Individual Typology Angle index values (ITA). L* expresses brightness on the black‐white axis, a* expresses erythema values on the red‐green axis, and b* gives the color position on the blue‐yellow axis. Instruments using Tristimulus colorimetry to measure skin color are the Minolta Chromameter CR‐200 and CR‐300, Colorimeter CL‐400, PhotoVolt ColorWalk Colorimeter, and Visi‐Chroma VC‐100. , , , ,

Skin elasticity

For skin elasticity, seven studies assessed nine types of measurement devices analyzing 290 subjects in total (Table 3). , , , , , The Cutometer SEM 575 and Cutometer MPA 580 were the most frequently used devices and were assessed in four studies. , , , The Cutometer MPA 580 is currently still available for purchase, while the Cutometer SEM 575 has been discontinued. The Cutometer uses a suction and optical measuring system to measure various parameters, such as skin distensibility (R0), gross elasticity (R2), and skin firmness (R7). Xu et al. assessed the 3D‐DIC, which measures the displacement of skin and minor as well as major strain of skin deformation using unidirectional force. Other measurement devices include the BTC‐2000, which measures elastic deformation of skin under subatmospheric pressure, and the Ballistometer BLS780, which uses an impact and indentation measuring system. , Peperkamp et al. evaluated the Dermalab Combo to measure skin elasticity through suction. Hua et al. assessed the Soft Plus with an elasticity probe, which also measures skin elasticity by measuring stress under suction application. Lastly, in a single study, elastography with the Toshiba iAplio 900 was used to measure skin elasticity by measuring the velocity of ultrasonic waves through skin tissue.
TABLE 3

Study characteristics of studies on skin elasticity measurement

Author, yearPopulation (n)DevicePrincipleClinical parameterMeasurement regionMeasurement timingsRepetitive measurements (n)
Peperkamp et al., 201949DermaLab ComboVertical suction

ViscoElasticity (VE), Young's elasticity modulus (E), and skin retrac‐

tion time (R)

ViscoElasticity (VE), Young's elasticity modulus (E), and skin retrac‐

tion time (R)

Viscoelasticity (VE), Young's elasticity modulus (E), skin retraction time (R)

Six locations on armBaseline, 45 min2
Xu et al., 201912

3D‐DIC

Cutometer MPA 580

Deformation of skin under unidirectional force

Suction and optical measuring system

Displacement of skin, minor strain, major strain

Net elasticity (R5),

skin firmness (R7), total recovery (R8)

Volar forearm

Volar forearm

Baseline

Baseline

3

3

Paluch et al., 202057Toshiba iAplio 900 UltrasonographShear wave elastographyTissue strain measured by velocity of ultrasonic wave propagationFaceBaseline3
Hua et al., 201420

“Soft Plus” with elasticity probe

Cutometer MPA 580

Stress/deformation of skin by suction application

Suction and optical measuring system

Elasticity

Skin distensibility (R0)

Face

Face

Baseline

Baseline

3

3

Woo et al., 201420

Cutometer MPA 580

Ballistometer BLS780

Suction and optical measuring system

Impact and indentation measuring system

Skin distensibility (R0), return to original skin (R1), gross elasticity (R2), last maximal amplitude (R3), last minimal amplitude (R4), net elasticity (R5),

viscoelasticity (R6), skin firmness (R7), total recovery (R8)

Firmness and elasticity

Forehead, cheek, and volar forearm

Forehead, cheek, and volar forearm

Baseline

Baseline

3

3

Bailey et al., 201288BTC‐2000Deformation of skin under subatmospheric pressureElastic deformation and stiffnessForehead, midcheek, jowl, neck, and abdomenBaseline
Ahn et al., 200744

Cutometer SEM 575

Moiré topography image

Suction and optical measuring system

Visual evaluation of digital contour lines (scale 1–5)

Skin distensibility (R0), gross elasticity (R2), net elasticity (R5), viscoelasticity (R6), skin firmness (R7), total recovery (R8)

Contour lines

Cheek

Cheek

Baseline

Baseline

Study characteristics of studies on skin elasticity measurement ViscoElasticity (VE), Young's elasticity modulus (E), and skin retrac‐ tion time (R) ViscoElasticity (VE), Young's elasticity modulus (E), and skin retrac‐ tion time (R) Viscoelasticity (VE), Young's elasticity modulus (E), skin retraction time (R) 3D‐DIC Cutometer MPA 580 Deformation of skin under unidirectional force Suction and optical measuring system Displacement of skin, minor strain, major strain Net elasticity (R5), skin firmness (R7), total recovery (R8) Volar forearm Volar forearm Baseline Baseline 3 3 “Soft Plus” with elasticity probe Cutometer MPA 580 Stress/deformation of skin by suction application Suction and optical measuring system Elasticity Skin distensibility (R0) Face Face Baseline Baseline 3 3 Cutometer MPA 580 Ballistometer BLS780 Suction and optical measuring system Impact and indentation measuring system Skin distensibility (R0), return to original skin (R1), gross elasticity (R2), last maximal amplitude (R3), last minimal amplitude (R4), net elasticity (R5), viscoelasticity (R6), skin firmness (R7), total recovery (R8) Firmness and elasticity Forehead, cheek, and volar forearm Forehead, cheek, and volar forearm Baseline Baseline 3 3 Cutometer SEM 575 Moiré topography image Suction and optical measuring system Visual evaluation of digital contour lines (scale 1–5) Skin distensibility (R0), gross elasticity (R2), net elasticity (R5), viscoelasticity (R6), skin firmness (R7), total recovery (R8) Contour lines Cheek Cheek Baseline Baseline

Skin texture

Skin texture was assessed in two studies evaluating 72 subjects using three different types of measurement devices: the Visioscan VC 98, the PRIMOS, and the PRIMOSlite (Table 4). , , The PRIMOS and PRIMOSlite devices use rapid in vivo evaluation of the skin (PRIMOS) to measure surface roughness. This technique is based on the deflection of projected parallel stripe patterns on the skin due to differences in skin surface profile. The Visioscan VC 98 is a UVA‐light camera that measures roughness with the Surface Evaluation for Living Skin method (SELS). The PRIMOSlite is a portable version of the PRIMOS.
TABLE 4

Study characteristics of studies on skin texture measurement

Author, yearPopulation (n)DevicePrincipleClinical parameterMeasurement regionInterventionMeasurement timingsRepetitive measurements (n)
Kottner et al., 201212

Visioscan VC 98

PRIMOSlite

Phaseshift rapid evaluation

Phaseshift rapid evaluation

Surface roughness

Surface roughness

Volar forearm

Volar forearm

Baseline

Baseline

3

3

Bloemen et al., 201160PRIMOSPhaseshift rapid evaluationSurface roughnessTrunk, arm, leg, or headBaseline2
Study characteristics of studies on skin texture measurement Visioscan VC 98 PRIMOSlite Phaseshift rapid evaluation Phaseshift rapid evaluation Surface roughness Surface roughness Volar forearm Volar forearm Baseline Baseline 3 3

Reliability

Interobserver reliability was highest for the Minolta Chromameter CR‐300 (Table 5). Van den Kerckhove et al. reported intraclass coefficients between 0.92 and 0.99 in 60 patients with measurements provided by two independent observers. Both Van den Kerckhove et al. and Uter et al. reported good intraobserver reliability for the Minolta Chromameter as well (ICC 0.98–0.99 and 0.926–0.954, respectively). , Intraobserver reliability for the Reflektometer RM 100 was good in a large cohort of 2287 subjects (ICC 0.938–0.946). In a single study of 50 participants, Van der Wal et al. assessed the interobserver reliability of the Mexameter MX 18, Colorimeter CL‐400, and DSM II ColorMeter. The Mexameter MX 18 and DSM II ColorMeter achieved good interobserver reliability (ICC 0.92–0.94 and 0.89–0.96, respectively). The Colorimeter CL‐400 achieved moderate to good interobserver reliability (ICC 0.79–0.97). Gankande et al. reported interobserver reliability of the DermaLab Combo assessing both melanin and erythema. ICCs for erythema were poor to moderate (ICC 0.54–0.73) and good for melanin (ICC 0.91–0.95). Intraobserver reliability was not tested for the Mexameter MX 18, DSM II ColorMeter, ColoriMeter CL‐400, and DermaLab Combo.
TABLE 5

Reliability of assessed devices

Reliability
Author, yearDeviceIntraobserver (ICC) rangeInterobserver (ICC) rangeInterinstrument (ICC) range
Color
Kerckhove et al., 2001Minolta Chromameter CR‐3000.98–0.990.92–0.990.99–0.999
Uter et al., 2013

Minolta Chromameter CR‐300

Reflektometer RM 100

0.926–0.954 a

0.938–0.946

Van der Wal et al., 2013

Mexameter MX 18

Colorimeter CL‐400

DSM II ColorMeter

0.92–0.94

0.79–0.97

0.89–0.96

Gankande et al., 2014DermaLab Combo0.54–0.95
Elasticity
Paluch et al., 2020Toshiba iAplio 900 Ultrasonograph0.842–0.987
Peperkamp et al., 2019DermaLab Combo0.23–0.76
Texture
Kottner et al., 2012

Visioscan VC 98

PRIMOSlite

0.95–1.00

0.35–1.00

Bloemen et al., 2011PRIMOS0.96–0.990.85–0.88

The reliability of the Minolta Chromameter CR‐300 was tested in smaller cohorts of 190, 10, and 8 patients.

Reliability of assessed devices Minolta Chromameter CR‐300 Reflektometer RM 100 0.926–0.954 0.938–0.946 Mexameter MX 18 Colorimeter CL‐400 DSM II ColorMeter 0.92–0.94 0.79–0.97 0.89–0.96 Visioscan VC 98 PRIMOSlite 0.95–1.00 0.35–1.00 The reliability of the Minolta Chromameter CR‐300 was tested in smaller cohorts of 190, 10, and 8 patients. Intraobserver reliability was tested for the Toshiba iAplio ultrasonography (Table 5). Good intraclass coefficients were reported between 0.842 and 0.987 for three repeated measurements (Table 5). Interobserver reliability was not tested. The DermaLab Combo was the only device that reported interobserver reliability, intraclass coefficients were poor to moderate and varied between 0.23 and 0.76 with measurements repeated by two different observers. The PRIMOS reported interobserver reliability of 0.85–0.88, with measurements by three observers in 60 patients (Table 5). Intraobserver reliability of the PRIMOS was 0.96–0.99. The Visioscan VC 98 achieved interobserver reliability of 0.95–1.00 in 12 subjects, with measurements repeated by three different observers. In the same study, interobserver reliability coefficients of the PRIMOSlite ranged between 0.35 and 1.00 (Table 5). Intraobserver reliability was not reported for the Visioscan VC 98 and PRIMOSlite.

Validity

Darkness and erythema measurements of devices were correlated with the Fitzpatrick skin‐type scale for four devices (Figure 2). Correlation for darkness values and Fitzpatrick skin‐type scale score was significant for the Chromometer (R = 0.78), Colorimeter CL‐400 (R = −0.68), DSM II Colormeter (R = 0.7), and Mexameter MX 18 (R = 0.72). , Three devices were assessed for correlation between erythema values and Fitzpatrick skin‐type scale score. No significant correlations were found for the Colorimeter CL‐400 (R = 0.12), DSM II ColorMeter (0.44), and Mexameter MX 18 (R = 0.44).
FIGURE 2

Median correlation between devices for melanin/darkness measurements

Median correlation between devices for melanin/darkness measurements Construct validity was tested most frequently for the Mexameter MX 18 (Figures 2 and 3). Measurements of skin darkness by the Mexameter MX 18 were significantly correlated with darkness values of the Antera 3D (R = 0.73), DRS Probe (R = 0.88), and “Soft plus” with melanin probe (R = 0.96, Figure 2). Measurements for skin erythema were significantly correlated with measurements of the Antera 3D (R = 0.77) (Figure 3). The previous model of the Mexameter, the Mexameter MX 16, reported significant correlations for both melanin and erythema values with the Chromameter (R = = –0.77, R = 0.76), the Dermacatch (R = 1, R = 1), and the DermaSpectrometer (R = 0.53, R = 0.81).
FIGURE 3

Median correlation between devices for erythema measurements

Median correlation between devices for erythema measurements The Minolta Chromameter CR‐200 showed significant correlation with darkness values of the Visi‐Chroma VC‐100 (R = 0.93). The Minolta Chromameter CR‐300 had significant correlation with the Reflektometer RM 100 (R = 0.69). However, both the Minolta Chromameter CR‐200 and CR‐300 have currently been discontinued, while the newer model Chromameter CR‐400 has not yet been evaluated in clinical research. Ahn et al. reported significant correlation between Cutometer values and the values on a digital grading scale, the so‐called Moiré topography (R = 0.67). Moiré topography is a digital program which generates contour lines on a digital photograph of a patient. An evaluator clinically rates these contour lines from 1 to 5 for decreasing skin elasticity. Moreover, the Cutometer reported significant correlations with measurements of the Soft plus with elasticity probe and the 3‐DIC (R = −0.64 and 0.57, respectively, Figure 4). No significant correlation was found between the Cutometer MPA 580 and the Ballistometer BLS780 for gross elasticity (R2), net elasticity (R5), and skin firmness (R7) parameters (R < 0.5).
FIGURE 4

Median correlation between devices for elasticity measurements

Median correlation between devices for elasticity measurements The measurements of the Visioscan VC 98 and PRIMOSlite were not significantly correlated, so no construct validity could be determined.

Conflict of interest/risk of bias

All 18 studies were level of evidence III studies. , , , , , , , , , , , , , , , , , Eight out of 18 articles did not report funding and conflict of interest. , , , , , , , , However, one of these studies was performed by a contract research organization, which developed the investigated device. Three articles reported no conflict of interest, but did not elaborate on funding. , , Seven articles disclosed funding. , , , , , , In six of these studies, no conflict of interest was apparent. In a single study, three authors were involved with the company that developed the investigated measurement device.

DISCUSSION

This review aimed to determine the most reliable and validated available medical devices for assessing skin color, texture, and elasticity. The most reliable medical device for skin color evaluation is the Minolta Chromameter CR‐300 due to good interobserver, intraobserver, and interinstrument reliability in a Caucasian population. The most reliable medical device for skin texture evaluation is the PRIMOS device with excellent intra and interobserver reliability. For the evaluation of skin elasticity, no device could be designated as superior because none of the included devices reported at least both intra and interobserver reliability. Unfortunately, none of the reviewed measurement devices for skin color, elasticity, or texture could be designated to be superior in terms of validity based on construct validity. Yet, many other critical aspects of validity that are needed to determine which measurement device is best valid, were missing in all included studies. These aspects are content and criteria validity, responsiveness, and interpretability of the included devices. Hence, none of the devices to measure skin color, texture, or elasticity could be selected as valid. Validity can be divided into construct validity, content validity, and criterion validity. Construct validity by means of convergent validity is the degree to which different devices that should theoretically measure the same construct are actually correlated. The convergent validity of the included measurement devices of skin color and elasticity demonstrated that the majority of these devices measure similar constructs, for example, the Mexameter MX 18 measurements showing significant correlation with the Antera 3D, DRS Probe, and “Soft plus” with melanin probe. When multiple devices interlink, a web of correlations or correlogram can be constructed. Theoretically, this increases the probability that what is being measured is valid. A limitation of the use of convergent validity is that a correlation between devices does not automatically mean that either of the devices actually measures the intended parameter. For example, in case of skin color, van der Wal et al. correlate measurements of the Mexameter MX 18, Colorimeter CL‐400, and DSM II ColorMeter to the Fitzpatrick skin phototyping scale. The Fitzpatrick scale is widely used to categorize skin color but was originally developed to select the correct dose of photochemotherapy in the treatment of psoriasis. The parameters assessed in the Fitzpatrick scale are, therefore, primarily focused on predicting the reaction of the skin to ultraviolet light. This scale might, therefore, not reflect all facets of the parameter “color” like undertones, redness, or evenness in color. Therefore, it is not surprising that Van der Wal et al. did not find a significant correlation between erythema values of the aforementioned included measurement devices and Fitzpatrick skin‐type scale score. This highlights the shortcomings of the use of only construct validity. Content validity specifies the degree to which assessment instruments are representative of and relevant to the targeted construct these devices are designed to measure. For content validity, a measurement device should measure all aspects of a construct, for example, skin color. By default, a measurement device cannot factor all different nuances of skin color. Criterion validity refers to the degree to which a measure relates to an outcome. Generally, this concerns comparing the instrument under assessment to a different instrument that has been considered as valid, that is, the “gold standard.” To date, there is no gold standard for the evaluation of skin color, elasticity, and texture, making criterion validity assessment impossible. Besides different validity criteria, a measurement device is only useful in clinical practice when it detects clinically meaningful changes in the measured parameter (responsiveness). Responsiveness can be evaluated by correlating changes in the values of measurement devices following intervention to clinical subjective scores. These scores could be questionnaires or scales documenting the perceived benefit of the intervention from the patients’ and clinicians’ viewpoint. The evaluation of responsiveness is critical to determine whether a device can detect clinically meaningful changes in skin quality. Moreover, a clinical understanding of what the quantitative values or changes in value of the device mean (interpretability) should also be investigated. None of these parameters were reported in any of the included studies. Our analysis focuses on devices that evaluate the outcome of interventions that influence skin quality. Therefore, in the context of evaluation of outcomes of clinical interventions like lipofilling, we propose the following criteria for a measurement device to be considered valid. First, a device should be reliable, meaning both inter and intraobserver reliability should be tested and ICC should be at least 0.8. Second, a measurement device should be able to detect a clinically relevant change of a measurable parameter. Clinical relevance can be detected by correlating changes in measurement values to clinical results assessed by practitioners or subjects following cosmetic intervention like lipofilling. These patient‐ or practitioner‐reported outcome measures could, for instance, be blinded clinical photographic analyses, or satisfaction as measured with the FACE‐Q questionnaires. Reproducibility, responsiveness, and interpretability are critical aspects for a device to be considered valid and reliable.

CONCLUSION

The most reliable devices to evaluate skin color and texture in ordinary aged skin were, respectively, the Minolta Chromameter and PRIMOS. No reliable device is available to measure skin elasticity in ordinary aged skin and none of the included devices could be determined to be designated as valid. Independent responsiveness and interpretability research of available devices is needed to determine which device measures skin quality in the most reliable and reproducible way.

CONFLICT OF INTEREST

The authors have no conflict of interest to disclose. Supporting Information Click here for additional data file.
  34 in total

1.  Reproducibility of repeated measurements on healthy skin with Minolta Chromameter CR-300.

Authors:  E Van den Kerckhove; F Staes; M Flour; K Stappaerts; W Boeckx
Journal:  Skin Res Technol       Date:  2001-02       Impact factor: 2.365

2.  Comparison of two series of non-invasive instruments used for the skin physiological properties measurements: the 'Soft Plus' from Callegari S.p.A vs. the series of detectors from Courage & Khazaka.

Authors:  W Hua; H Xie; T Chen; L Li
Journal:  Skin Res Technol       Date:  2013-06-15       Impact factor: 2.365

3.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.

Authors:  David Moher; Alessandro Liberati; Jennifer Tetzlaff; Douglas G Altman
Journal:  Ann Intern Med       Date:  2009-07-20       Impact factor: 25.391

4.  Objective color measurements: clinimetric performance of three devices on normal skin and scar tissue.

Authors:  Martijn van der Wal; Monica Bloemen; Pauline Verhaegen; Wim Tuinebreijer; Henrica de Vet; Paul van Zuijlen; Esther Middelkoop
Journal:  J Burn Care Res       Date:  2013 May-Jun       Impact factor: 1.845

5.  Assessing skin pigmentation in epidemiological studies: the reliability of measurements under different conditions.

Authors:  W Uter; M Benz; A Mayr; O Gefeller; A Pfahlberg
Journal:  Skin Res Technol       Date:  2012-12-29       Impact factor: 2.365

6.  Comparison of skin elasticity test results from the Ballistometer(®) and Cutometer(®).

Authors:  M S Woo; K J Moon; H Y Jung; S R Park; T K Moon; N S Kim; B C Lee
Journal:  Skin Res Technol       Date:  2014-02-10       Impact factor: 2.365

7.  An objective device for measuring surface roughness of skin and scars.

Authors:  Monica C T Bloemen; Maaike S van Gerven; Martijn B A van der Wal; Pauline D H M Verhaegen; Esther Middelkoop
Journal:  J Am Acad Dermatol       Date:  2011-01-08       Impact factor: 11.527

8.  Intra-rater reproducibility of shear wave elastography in the evaluation of facial skin.

Authors:  Łukasz Paluch; Piotr Pietruski; Bartłomiej Noszczyk; Bartłomiej Kwiek; Marcin Ambroziak
Journal:  Postepy Dermatol Alergol       Date:  2019-01-10       Impact factor: 1.837

Review 9.  Structural and biophysical characteristics of human skin in maintaining proper epidermal barrier function.

Authors:  Magdalena Boer; Ewa Duchnik; Romuald Maleszka; Mariola Marchlewicz
Journal:  Postepy Dermatol Alergol       Date:  2016-02-29       Impact factor: 1.837

10.  The inter-rater and test-retest reliability of skin thickness and skin elasticity measurements by the DermaLab Combo in healthy participants.

Authors:  Kirsten Peperkamp; Arico C Verhulst; Hanneke J P Tielemans; Harm Winters; Demi van Dalen; Dietmar J O Ulrich
Journal:  Skin Res Technol       Date:  2019-05-20       Impact factor: 2.365

View more
  1 in total

1.  Skin measurement devices to assess skin quality: A systematic review on reliability and validity.

Authors:  Mirte Langeveld; Lara S van de Lande; Eimear O' Sullivan; Berend van der Lei; Joris A van Dongen
Journal:  Skin Res Technol       Date:  2021-11-09       Impact factor: 2.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.