Michael Worboys1. 1. Centre for the History of Science, Technology and Medicine, University of Manchester, Manchester, UK. michael.worboys@manchester.ac.uk
Abstract
OBJECTIVES: To show why and how the Hamilton Rating Scale for Depression became the 'Gold Standard' for assessing therapies from the mid-1960s and how it was used to frame depression as a short-term and curable illness rather than a chronic one. METHODS: My approach is that of the social construction of knowledge, identifying the interests, institutional contexts and practices that produce knowledge claims and then mapping the social processes of their circulation, validation and acceptance. RESULTS: The circulation and validation of Hamilton Rating Scale for Depression was relatively slow and it became a 'Gold Standard' 'from below', from an emerging consensus amongst psychiatrists undertaking clinical trials for depression, which from the 1960s were principally with psychopharmaceuticals for short-term illness. Hamilton Rating Scale for Depression, drug trials and the construction of depression as non-chronic were mutually constituted. DISCUSSION: Hamilton Rating Scale for Depression framed depression and its sufferers in new ways, leading psychiatrists to understand illness as a treatable episode, rather than a life course condition. As such, Hamilton Rating Scale for Depression served the interests of psychiatrists and psychiatry in its new era of drug therapy outside the mental hospital. However, Hamilton Rating Scale for Depression was a strange kind of 'standard', being quite non-standard in the widely varying ways it was used and the meanings given to its findings.
OBJECTIVES: To show why and how the Hamilton Rating Scale for Depression became the 'Gold Standard' for assessing therapies from the mid-1960s and how it was used to frame depression as a short-term and curable illness rather than a chronic one. METHODS: My approach is that of the social construction of knowledge, identifying the interests, institutional contexts and practices that produce knowledge claims and then mapping the social processes of their circulation, validation and acceptance. RESULTS: The circulation and validation of Hamilton Rating Scale for Depression was relatively slow and it became a 'Gold Standard' 'from below', from an emerging consensus amongst psychiatrists undertaking clinical trials for depression, which from the 1960s were principally with psychopharmaceuticals for short-term illness. Hamilton Rating Scale for Depression, drug trials and the construction of depression as non-chronic were mutually constituted. DISCUSSION: Hamilton Rating Scale for Depression framed depression and its sufferers in new ways, leading psychiatrists to understand illness as a treatable episode, rather than a life course condition. As such, Hamilton Rating Scale for Depression served the interests of psychiatrists and psychiatry in its new era of drug therapy outside the mental hospital. However, Hamilton Rating Scale for Depression was a strange kind of 'standard', being quite non-standard in the widely varying ways it was used and the meanings given to its findings.
There has been much discussion in recent years about whether depression is a chronic
illness against the modern view that it is typically time-limited.[1] Gask dated the growing dominance of this
view to the 1980s and ‘the launch and promotion of a new group of antidepressants, the
selective serotonin reuptake inhibitors (SSRIs)’.[2] The traditional view of depressive illness,
from melancholia in the nineteenth century to Kraepelin’s characterisation of manic
depression that dominated twentieth-century psychiatry, was that the illness was recurrent,
chronic or both. Sufferers could spend years in mental hospitals, where, from the 1950s,
they might receive regular electro-convulsive therapy (ECT). The changes in the last quarter
of the twentieth century are well known and recognised as revolutionary at all levels:
definitions of ‘depression’ and the impact of DSM-III; the treatment of choice shifting from
ECT to drugs; the closure of long-stay hospitals and the development of community care where
sufferers from depression are mainly treated by general practitioners. The impact of these
changes on medical views of depression was evident in an Editorial in Psychological
Medicine in 2012, which had to remind readers of new evidence that amongst
patients diagnosed with depression, only half had a single episode and half had a recurrent
and chronic life-long illness.[3] The authors argued that more effort should now be given to identifying
recurrence, with a view to altering ‘the trajectory of depression that is so chronic, severe
and disabling’ for ‘the betterment of so very many’.
Methods
My principal research question is when and how did the view that depression was typically
time-limited and non-chronic originate? Was it in the 1980s and early 1990s with the arrival
of SSRIs? These drugs were undoubtedly important, but so too were the changes in service
provision and a host of other patient, professional and other factors. In this article, I
investigate the longer-term origins in ways that depression was framed by psychiatrists
through the impact of the Hamilton Rating Scale for Depression (HRSD), which from the 1970s
became, and to a large extent remains, the dominant tool in assessing the severity of
depression. A key feature of HRSD was that it was used to measure the outcome of treatment,
especially drugs, and was applied as a ‘before and after’ schema, leading to the view that
depression was event, thereby downplaying seriality. My argument also offers a case study of
the impact of standard scales in medicine, and the interaction of drug standards and
standard drugs.My methods are those of the social construction of knowledge, explaining how ways of
knowing and practising are formulated in specific social contexts, then circulated and
validated in contingent settings by a variety of actors. Constructivist historical methods
were applied to articles and books that discussed the application of HRSD to various patient
groups in hospital and community setting from the 1960s to the late 1970s. Sources were
identified from the standard online databases—Pubmed (keyword) and Science Direct (full
text)—and quantitative indicators were derived from Web of Science. Detailed qualitative
analysis of selected articles was also made, using close reading to identify the assumptions
and modes of analysis of the authors.
Results
The 21-item HRSD for assessing the severity of depression was developed by the English
psychiatrist Max Hamilton and presented to the psychiatric community in 1960 in the, then
somewhat obscure, Journal of Neurology, Neurosurgery and
Psychiatry.[4] Interviewed in 1982, Hamilton observed that, after completing a number of
clinical trials on new drugs,I was also interviewing people about my depression scale and trying to see if I could
get some work going on depression. I went around with my scale and it created a
tremendous wave of apathy. They all thought I was a bit mad. Eventually I got it
published in the Journal of Neurology, Neurosurgery and Psychiatry. It
was the only one that would take it.[5]He took some pleasure in adding that, ‘And now everyone tells me the scale is wonderful, I
always remember when it had a different reception. This makes sure I don't get a swollen
head.’ Whether the last point was accurate is open to debate, as Hamilton was quite a
domineering figure, but there is no doubt that his rating scale was, and still is, widely
used. It has earned the title of the ‘Gold Standard’ for the assessment of depression,
though its reign may now be limited.[6] Given its status and influence, it is surprising that it has not been
subject of historical enquiry and even authors who are critical of modern psychiatry and its
‘manufacturing of depression’ have not subjected it to scrutiny.[7]There are two explanations of its dominance, both of which have some merit but are not the
whole story. The first, which is common amongst psychiatrists, is that HRSD became the ‘Gold
Standard’ simply by being the earliest scale to enjoy widespread use. However, it was born
into a world of already competing scales, so the key question to answer is, why and how did
it see off its rivals? Interestingly, Hamilton’s Anxiety Scale, which was actually published
before HRSD and hence was more of a ‘first,’ did not endure. The second explanation is that
HRSD was ideally suited to measure the effects of drug treatments, especially tricyclics
such as imipramine, which were ‘somewhat anxiolytic and somewhat sedative in
effect.’[8] HRSD
scored for sleep and for weight gain, which were known to be affected by tricyclics. In
other words and to quote one reviewer of The Antidepressant Era, ‘The early
drugs defined the very scale that was used to measure their performance.’[9] One recent critic of the
scale wrote that Hamilton ‘fashioned his test to meet the needs of his drug company
patrons.’[7] Healy
says that there is no evidence that Hamilton used his own scale in clinical practice, but
then it was a research rather than clinical tool, designed to quantify changes in a
patient’s condition over time.[10] It is unclear whether Hamilton had direct ‘drug company patrons,’
though he was the founding President of the British Association of Psychopharmacology and an
early member of the International College of Neuro-Psychopharamcology (CINP), which since
his death in 1988 has awarded an annual prize in his name. On the other hand, Hamilton is
widely described as an iconoclast and seems to have been a socialist; he was certainly a
strong defender of the National Health Service in the 1980s when it was under threat from
Thatcher era cuts in public spending. What is clear is that in the late 1950s and early
1960s Hamilton had many motives and that his abrasive character meant that pleasing anyone
was not high on the list.In this article, I argue that the dominance of HRSD was only slowly achieved and that in
its first two decades it had many rivals and that no one was more surprised than Hamilton
himself that it proved to be so successful. Also, its dominance was largely in clinical
research, translating trial findings, quite often, into simple before-and-after scores.
There was an inherent bias to consider depression as time-limited and all the more so as a
result of drug treatment. Hamilton created the scale to enable psychiatrists to chart
changes in already diagnosed patients through particular treatment regimes, converting
qualitative judgments into quantitative data on a fine-grained 100-point scale. The scale
also allowed psychiatrists to determine what the most significant changes were in an array
of symptoms; though as I will show, most early studies used the aggregated scores rather
than disaggregated data. Indeed, studies in the 1980s demonstrated that the schema was
modified promiscuously, with psychiatrists adding and subtracting items to assess.[11] In 1990, Zitman et al.
surveyed five major journals over a year for research papers using the HAM-D and asked
authors of for a copy of the scale they used. Fewer than half the investigators referenced
the correct version of the HAM-D, and only 4 out of 51 responders used versions that were
the same as a published version.HRSD was not designed as a diagnostic schema, though many used it as such and one reason
for its success was that its approach anticipated the emphasis of symptoms and disease
entities enshrined in DSM-III in 1980.[12] Although invented well before even DSM-II
(1968), Hamilton’s scale was for a specific condition and proposed standardisation around
overt symptoms, the features that distinguished the third from the second version of the
DSM. Shaped by the assumptions of dominant psychodynamic approaches, DSM-I and -II had
‘conceived of symptoms as reflections of broad underlying dynamic conditions…. that only
became meaningful through exploring the personal history of each individual’.[12] Influenced strongly by
Karl Menninger’s assumption that all mental disorders were reducible to ‘the failure of the
suffering individual to adapt to his or her environment’, psychiatrists tended to focus on
finding underlying mental causes and to interpret these as constitutional and likely to be
chronic.[13]
DSM-III’s move towards specific diseases and to focus on symptoms rather than underlying
causes weakened these imperatives.Max Hamilton was born in Offenbach, near Frankfurt, in 1912, and his parents moved to
London in 1915.[14] He
qualified in medicine at University College Hospital London in 1934 and worked in a number
of posts before settling upon psychiatry in 1946, when he joined the Maudsley Hospital in
London. He worked at various London hospitals and began an association with Cyril Burt that
led him to develop expertise in, and an almost missionary commitment to, psychometrics,
which was fashionable in the psychological sciences in the 1950s. In 1953, he moved to the
University of Leeds as lecturer in psychiatry. He found little time for research and in 1957
resigned to take up a temporary, 2-year research position in the University. This was funded
by research grants from the Mental Health Research Trust and by a trial that his head of
department, Ronald Hargreaves, was running on chlorpromazine. In this work, Hamilton
developed a number of scales, the first in 1957 in a study with Hargreaves on the value of
Benactyzine in the treatment of anxiety, for which drugs and placebos were supplied by
Glaxo.[15] The
anxiety scale, later termed HAMA, anticipated many of the features of HRSD.We therefore classified all the symptoms likely to be found in our patients under the
following headings: (1) anxious mood; (2) tension; (3) specific fears and phobias; (4)
sleep disturbance; (5) intellectual disturbance; (6) depressive features; (7) somatic
disturbances (muscular and sensory); (8) cardiovascular disturbance; (9) respiratory
disturbances; (10) gastro-intestinal disturbances; (11) genitourinary disturbances; (12)
autonomic disturbances and (13) manifestations of anxiety in the behaviour at the
interview. A gloss was prepared listing the features to be taken into account in making
an assessment under any of these headings. At the interview we rated each of these
thirteen items on a five-point scale as follows: 0, none; 1, mild; 2, moderate; 3,
severe; 4, grossly disabling. This rating scale yields a variety of different types of
information for each patient, including a “profile” of his symptomatology and, by
summing the ratings for all headings, a gross symptom score.[15]One conclusion of this study was that ‘impressionistic global judgments of a patient's
condition alone are of little value in assessing the effect upon him of a particular
regime’. Hamilton had previously spoken on the use of scales in this work on anxiety at the
British Psychological Society in 1956.[16] In what became a feature of his
publications of scales, he devoted much of the paper to sophisticated statistical testing of
reliability and reproducibility. As noted already, HAMA was further elaborated in the 1960s
but did not have the success of HRSD, but that is a topic for another paper.The first iteration of the HRSD scale was actually published in 1959, in an article
co-authored with Jack White, a consultant psychiatrist at the Stanley Royd Hospital,
Wakefield.[17] The
famous 1960 paper was already in press and mentioned, though without a citation. The scale
in the 1959 paper offered a different and more finely grained classification of patient
symptoms, moving away from the three accepted dichotomies: Reactive – Endogenous; Agitated –
Retarded; Neurotic – Psychotic. Hamilton and White subjected patient’s scores on their
schema to factor analysis and identified four groups of patients and types of depression:
Endogenous, Doubtful Endogenous, Doubtful Reactive and Reactive. In other words, they were
using the scores for the classification of different types of depression. In conclusion,
they argued that, with the range of therapeutic options increasing as new drugs were added
to ECT and psychotherapy, it was important for psychiatrists to be better able to
differentiate forms of depression and their response to treatments. The study was of 64 male
patients at Stanley Royd and included an Appendix of case histories of 20 patients, which
showed that they had received a variety of treatments. Of the 20, 16 had received ECT, so
the origins of HRSD lie in charting the dominant therapeutic regimes of the era and were not
only developed for pharmaceutical treatment.What became known as HRSD was proposed by Hamilton in his now famous and much cited 1960
paper? His stated aim was to improve upon existing scales, which he criticised for being
inappropriate, unreliable or using ill-defined symptoms.[4] His new scale was to be used in interviews
conducted by psychiatrists and was intended for patients already diagnosed with depression.
It relied mostly on the observations of bodily (somatic) and behavioural features by
psychiatrists, which were also weighted more heavily than the few symptoms that relied on
patient’s reports of their feelings (Figure
1).
Figure
1.
Hamilton's now famous paper on rating scales for depression was
published in a little known journal.4
Hamilton's now famous paper on rating scales for depression was
published in a little known journal.4The empirical basis of the paper was drawn from 49 of the 64 patients discussed in the 1959
paper. There were 17 variables in the new scale, each rated on either a four- or two-point
range, which produced a potential maximum of 50 points for extremely severe illness. The
recommendation was that two psychiatrists interview the patient separately and their scores
be added together to give a rating out of 100 (Figure 2). The correlation between the scores of the
two scorers (presumably Hamilton and White) was found to be high and to improve with
experience.
Figure
2.
The first published iteration of what became HAM-D or
HRSD.4
The first published iteration of what became HAM-D or
HRSD.4In discussing individual patients, Hamilton did not use their overall rating score; instead
he gave their pattern of factor measures in terms of the four diagnostic groups identified
in the 1959 paper with White: Factor 1: Endogenous, Factor 2: Doubtful Endogenous, Factor 3:
Doubtful Reactive and Factor 4: Reactive.[17]
Figure 3 presents the description of
one of the patients whose profile was predominantly Factor 1 and this ends with the
classification of his illness as ‘endogenous’ and seemingly chronic and likely to relapse.
Figure 3.
An
example of the case histories and commentaries included in Hamilton's 1960
paper.4
An
example of the case histories and commentaries included in Hamilton's 1960
paper.4Hamilton made clear the importance of factor scores and their value over the classical
clinical categories. In summary, he wrote:A rating scale is described for use in assessing the symptoms of patients diagnosed as
suffering from depressive states. The first four latent vectors of the intercorrelation
matrix obtained from 49 male patients are of interest, as shown by (a) the factor
saturations, (b) the case histories of patients scoring highly in the factors and (c)
the correlation between factor scores and outcome after treatment. The general problem
of the relationship between clinical syndromes and factors extracted from the
intercorrelations of symptoms is discussed.[4]There is no evidence in the paper that ‘before and after’ treatment scores were taken, the
only link to treatment seems to be that the initial factor scores were indicative of the
outcome of (mostly ECT) treatment, hence, this first presentation of HRSD can be read as
offering a more refined diagnosis or prognosis. In another paper with Jack White, also
published in 1960, Hamilton assessed ratings as an indicator of the outcome of depression
treated with ECT.[18]The first published trial to use HRSD was a study of the use of the new drug amitriptyline
by CG Burt and colleagues at the Royal Park Hospital in Melbourne, Australia.[19] For each patient an
aggregate score out of 50 was first used to group patients; there was no factor analysis.After initial evaluation on Hamilton's (1960) scale for quantifying depressive
illnesses, patients were allocated to one of four groups delineated on the basis of two
leading prognostic criteria, age and severity of illness. “Mild young” depressives were
aged between 30 and 49 and, out of a possible maximum score of 100, had total scale
scores below 40; “young severe” depressives were between 30 and 49 and had total scale
scores above 40. Similar criteria of severity were used in the “old mild” and “old
severe”, who were aged between 50 and 70.The same overall rating score was used to assess the outcome after one and then four weeks
treatment with amitriptyline compared to imipramine; the latter being the market leader for
severe depression. The Table and Chart below show the range in individual rating scores and
aggregates for the ‘old severe’ group. In fact, this was one of the few studies in the
period that presented the symptom scores separately, typically the single aggregate score
out of 50 or 100 was used (Figure
4).
Figure
4.
Burt CG, et al. Amitriptyline in depressive states: a controlled
trial. Br J Psychiatry 1962; 108: 711–730.
Burt CG, et al. Amitriptyline in depressive states: a controlled
trial. Br J Psychiatry 1962; 108: 711–730.In their discussion, Burt et al. made two key points about the HRSD that were, and are
still, widely stated to account for its widespread use: (1) it was ‘simple to use and
rapidly completed’ and (2) it could map changes that drugs brought in specific symptoms.
Burt and his colleagues wrote of ‘target’ symptoms, which was perhaps an implicit comparison
to the blunderbuss of ECT and its impact on the whole psyche. HRSD could certainly also map
the temporal and experiential dimensions of treatments that were difficult to collect from
patients after ECT. Fritz Freyhan, Clinical Director, Director of Research, Delaware State
Hospital, Farnhurst, Delaware, explained this point in 1960, showing how drug treatments
could be combined with psychotherapy.The pharmacological treatment of depressions offers this immense psychological
advantage: the patient maintains his experiential continuity. The amnestic syndrome
associated with ECT, to which many attributed therapeutic significance, proves to be
quite superfluous as is seen in successful pharmacotherapy. The preservation of
experiential continuity has vast implications for psychotherapy. Until now,
psychotherapy either followed ECT or had to be limited to patients who seemed capable of
affective contact and of self-control over suicidal impulses. With ECT, the patient
remains physically and emotionally passive. His recovery comes, as it were, from
without. Pharmacotherapy makes him a participating partner. This offers psychotherapy
entirely new opportunities to involve the patient in the therapeutic process until
recovery is seen as coming from within.[20]The second study to use the scale, albeit casually and with crude aggregate scores, was by
AA Robin and J Harris at Runwell Hospital, Essex, in a comparison of imipramine and
ECT.[21] In this
study, as in many others at this time, ECT was found to give better outcomes.In 1963, JT Rose published a study of patient responses to ECT using HRSD.[22] In measuring the impact of
therapy, he validated HRSD by the fact ‘that a drop in the score corresponded in the great
majority of cases with improvement as recorded by overall clinical assessments and with
falling scores in the occupational therapy ratings.’ This is interesting as Hamilton
developed his scale because of his dissatisfaction with overall clinical assessments and
other scales. Cross reference to, and validation against, overall clinical assessment was
common in discussions of HRSD throughout the 1960s and 1970s, not least because the scale
was about changing qualitative judgments of clinical outcomes into quantitative values,
either in a single score or a matrix of scores.Interestingly, HRSD was not used in 1964-1965 in a major clinical trial on treatments for
depressive illness organised by the Clinical Psychiatry Committee of the Medical Research
Council (MRC), even though Hamilton played a leading role in the scheme.[22] The trial used both an
overall clinical rating of severity and its own scale of 15 symptoms: depressed mood,
psychomotor retardation, suicidal ideas, ideas of bodily change, ideas of reference,
self-reproach, anxiety, insomnia (early, middle, late) anorexia and fatigue. This scale bore
a close relation to HRSD in both the symptoms monitored and the range of scoring, giving
tacit endorsement to Hamilton’s approach if not his particular scale. In fact, the Committee
invented its own so-called ‘MRC Scale’, which was used quite widely for a number of years,
but fell away as HRSD took centre stage.That the uptake of HRSD was relatively slow is borne out by the number of publications in
which it was cited in its first 20 years, see Figure 5, which is presented with all the usual caveats
about citations and what they mean. Two sets of data are given: the number of articles each
year citing Hamilton’s 1960 paper and the number of papers cited with ‘depression’ in the
title. There is steady growth in the number of papers citing HRSD, but this is slower than
the overall growth of citations on depression, bearing in mind that both were influenced by
the increase in the number of medical journals and the drive to publish more and often.
Also, there were many publications, particularly at the end of the 1970s, in which HRSD was
used without citing the 1960 paper. Perhaps it was too well known to need citing? Perhaps
the absence of citation indicated that it was being used only casually? And, of course,
citation did not mean that authors followed Hamilton’s protocols, in fact psychiatrists used
HRSD selectively and flexibly. Writing in 2001, Jane Williams observed that over time,
Figure
5.
Number of articles each year citing Hamilton M, A rating
scale for depression, J Neurol Neurosurg Psychiatry 1960; 23:
56–62; and number of articles cited with “depression” in the title. Source: Web of
Science.
Several versions of the scale had come into use, with differences in their total number
of items, their anchor descriptions, their item interpretations and their scoring
conventions … . By 1990 there were so many versions of the HAM-D that researchers and
clinicians had lost track of what was available, and what were the characteristics of
each one. No single version of the HAM-D or single set of conventions has been
universally accepted.[11]Number of articles each year citing Hamilton M, A rating
scale for depression, J Neurol Neurosurg Psychiatry 1960; 23:
56–62; and number of articles cited with “depression” in the title. Source: Web of
Science.Williams noted that by this time, in different publications the number of items scored as
HRSD had risen from 17 to 59.[11]For much of the 1960s, HRSD was discussed as just another rating scale. For example in
1965, Gerald Klerman and Jonathan Cole’s review of imipramine and related antidepressants
mentions HRSD three times in different contexts and always in relation to other
scales.[23]Phenomenological differentiations of depressedpatients have been developed, using
symptom patterns and clusters derived by multivariate statistical techniques. Grinker
et al., Friedman et al., Hamilton and Wittenborn et al. have published promising
findings.For example, in studying hospitalized patients, especially severely depressed or
schizophrenicpatients, well validated scales, particularly by Lorr, Wittenborn,
Hamilton and others are widely used. Instruments for nursing observations and for
patients’ self-ratings also have been developed.Drug-placebo differences were revealed by global estimates of degree of depression and
by ratings of specific symptoms like anxiety, insomnia, weight gain and guilt.
Hamilton’s rating scale, Lorr’s Inpatient Multidimensional Psychiatric Scale and the
Wittenborn Psychiatric Scale were sensitive to differences in most studies in which they
were employed.[23]This illustrates Martin Roth’s statement in his brief biography of Max Hamilton that ‘It
took more than a decade before the HRSD scale was recognised as a major contribution to
knowledge and clinical practice.’[24]Healy suggests that one reason HRSD was widely used is that it gave particular weight to
anxiety symptoms, and thus was good at charting the positive effects of drugs, like
imipramine, that were anxiolytic. Alan Broadhurst, a pharmacologist, who was in the group at
Geigy that discovered imipramine told David Healy that, ‘Max Hamilton was excited about
imipramine and it certainly did fit in beautifully with his rating scale. Years later he
still referred to it as a happy coincidence’.[8] However, therapeutic regimes change for so
many reasons that it is difficult to tease out the relative importance of HRSD relative to
other factors and, although I do not have the data, it is likely that the uptake of
imipramine was more rapid than that of HRSD.[25]An alternative approach to assessing the rise of HRSD is to look at when and how it was
criticised, and why these objections did not impede its progress to becoming the ‘Gold
Standard.’ In the 1960s, HRSD had a competitor, the Inventory for Measuring Depression (then
ID and now Beck depression inventory (BDI)), proposed by Aaron T Beck at the University of
Pennsylvania.[26]
BDI has proved similarly enduring and also had the advantage of being a ‘first’ and the one
against which other scales were calibrated and validated. Beck was a pioneer of cognitive
therapy and his scale was quite different to HRSD in being based on a patient’s self-rating.
In its original form the BDI consisted of 21 questions, each with four possible answers that
the patient had to rate 0-3. This gave a theoretical maximum score of 63. A score above 30
indicated severe illness, 19–29 moderate, 10–18 mild and below 10 minimal. A common way of
contrasting BID with HRSD was to say that it was ‘subjective’: it relied upon patients’
thoughts and feelings, while HRSD was ‘objective’, because it was mainly based on clinician
observations of bodily and behavioural symptoms.In 1965, Maryse Metcalfe and Ellen Goldmann compared HRSD favourably with BDI, though they
acknowledged that it depended on the skill of the rater and their clinical bias, which, they
cautioned, ‘made it somewhat difficult to compare meaningfully results obtained in different
investigations.’[27]
In their view, the advantages of BDI were that it was simple, quick and easy to administer,
and ‘independent of doctors’ and nurses’ bias, seemingly relying on the ‘constant’ of the
patient. In 1967, John Schwab and colleagues, at the University of Florida College of
Medicine, published a comparison of HRSD and BDI amongst ordinary and, one must assume,
mostly non-depressed medical inpatients.[28,29] They found a good correlation
(rz = 0.75) in scores, but argued that the two scales were
complementary because they measured ‘different components of the depressive complex.’Hamilton assessed and offered a further elaboration of his own scale in 1967.[30] The second paper was
largely methodological, though it did consider a larger patient group and females as well as
males. He also added four extra symptoms to score. However, the article was not easy reading
for his peers. It was highly mathematical, as the Abstract illustrates.‘This is an account of further work on a rating scale for depressive states, including
a detailed discussion of the general problems of comparing successive samples from a
‘population’, the meaning of the factor scores, and the other results obtained. The
intercorrelation matrix of the times of the scale has been factor-analysed by the method
of principal components, which were then given a Varimax rotation. Weights are given for
calculating factor scores, both for rotated as well as unrotated factors.[30]The data to the end of 1990 (Figure
6) shows that, if citations in any way indicate the resources used by psychiatrists
in their work, that they stuck with the 1960 paper, for the later elaboration was cited
less, even allowing for lags.
Figure
6.
Number of article each year citing Hamilton M, A rating scale for
depression. J Neurol Neurosurg Psychiatry 1960; 23: 56–62 and
Hamilton M, Development of a rating scale for primary depressive illness. Br J
Soc Clin Psychol 1967; 6: 278–296. Source: Web of
Science.
Number of article each year citing Hamilton M, A rating scale for
depression. J Neurol Neurosurg Psychiatry 1960; 23: 56–62 and
Hamilton M, Development of a rating scale for primary depressive illness. Br J
Soc Clin Psychol 1967; 6: 278–296. Source: Web of
Science.In his 1967 paper, Hamilton noted, in a very revealing statement, that this study had been
difficult because of the time taken to accumulate a sufficient number of patients with
depression. What he actually meant was the difficulty in finding
appropriate patients, that is, those with treatable illness, as he
contrasted this difficulty with the ease of earlier studies with patients in mental
hospitals where there were large numbers of chronic cases.[30] It seems that within a decade, what
counted as depression, along with who and how they suffered, had changed.I now want to jump another ten years and consider the ways that HRSD was being used in
therapeutic trials at the end of the 1970s.[31] By this time almost all trials were with
psychopharmaceuticals, though ECT was still being used for patients diagnosed with ‘severe’
depression. In fact, prior treatment with ECT often excluded patients from participation in
drug trials. However, HRSD was still being used in assessments of ECT, as well as
psychotherapy.[32]
And in 1977, it was even used by Aaron Beck to compare ‘pharmacotherapy’ and ‘cognitive
therapy,’ see Figure 7.[33]
Figure 7.
An
example of the reporting outcomes of the use of HRSD with another scale and for
different treatments.33
An
example of the reporting outcomes of the use of HRSD with another scale and for
different treatments.33To sample the uses of HRSD, I surveyed all of the clinical trials for depression published
in the medical journals listed in Web of Science for 1979. It was impossible to produce
reliable quantitative data of the series, because of the different drugs, protocols and
citation practices, so I have chosen to discuss articles that are representative. In most
trials HRSD was used with another scale and sometimes with multiple scales, as in the report
of a controlled trial of trimipramine and monoamine oxidase inhibitors at St Thomas’s
Hospital, London, published in 1979. The authors stated:The patients completed the Beck scale for depression and the Middlesex Hospital
Questionnaire (MHQ), and were rated blindly by an independent assessor on the Hamilton
rating scale for depression, the MRC depression scales, and an overall six-point rating
of the severity of depression. A standard rating of side effects was completed by the
psychiatrist who regulated drug dosage to prevent knowledge of any such effects biasing
the clinical ratings of the other assessor.[34,35]The graphs below show how the results of the different scales were mapped for the six weeks
of the trial (Figure 8).
Figure 8.
An example
of HRSD scores reported against many other scales.34
An example
of HRSD scores reported against many other scales.34The same pattern was evident in a study of Limbitrol in California.The patients were evaluated at baseline using the Hamilton rating scale for primary
depressive illness (HDS) and the Covi anxiety scale. In addition, the patients completed
the short form of the BDI and the Hopkins symptom checklist (SCL-58). Efficacy was
assessed at follow-up visits after 1, 2, and 4 weeks of treatment by the physician,
using the HDS and a global evaluation, and by the patient using the BDI, the SCL-58, and
a global evaluation. In most instances, the BDI and the SCL-5g were completed by the
patient prior to his seeing the psychiatrist.[36]In a trial of Lithium, HRSD was set against a 5-point nurse rating scale (Figure 9).[37]
Figure 9.
An
example of reporting HRSD in comparison with a nurse rating
scale.37
An
example of reporting HRSD in comparison with a nurse rating
scale.37There are very few publications where the score was disaggregated and the different
components mapped to identify specific changes, one exception was a study comparing
amineptine and amitriptyline at Hôpital de St. Germain-en-Laye.[38] The changes in the total scores were first
presented (Figure 10) and when the
component scores were set out it was difficult to see the wood for the trees (Figure 11), and then only 14 out the 26
items scored had statistical significance.
Figure 10.
A typical use of HRSD charting the effects of two
drugs over time.38
Figure
11.
Reporting disaggregated HRSD scores, as illustrated above,
became less common.38
A typical use of HRSD charting the effects of two
drugs over time.38Reporting disaggregated HRSD scores, as illustrated above,
became less common.38
Discussion
In this paper, I have made two main claims, first that HRSD was applied by clinicians to
construct depression as a time-limited illness, and second, that this influential framing of
the condition was used alongside other scales and only rose to dominance gradually. The
assumption of the time-limited illness supports the claim of Healy and others that an
HRSD-structured characterisation of depression was suited to drug therapy and the interests
of pharmaceutical companies in the 1960s and 1970s. The view of psychiatrists in the first
half of the twentieth century was that depressive mental illness was chronic, either because
of patient susceptibilities rooted in somatic factors, such as hereditary or physical
disease, or in psychic variables influenced by upbringing, interpersonal relationships or
personality. There was however some turnover in mental hospital patients and moves to treat
many sufferers as out-patients. The patient population peaked in Britain in 1954 at 140,000,
when there were 121,000 beds, suggesting that turnover was not great and that most patients
had chronic conditions. The rundown in the number of beds and the move to community care saw
depression move out of the hospital and into the community, as an out-patient or general
practitioner managed condition. In this setting, and due to new framings and new treatments,
it was approached as a ‘mild’ and short-lived condition, at least compared to the illness
that had previously required hospitalisation.[39] HRSD was used to frame this new
‘depression’ and its sufferers, normalising it to the ways of seeing and treating illness as
a treatable episode or episodes, rather than a life course condition. As such, HRSD served
the interests of psychiatrists and psychiatry in the new era of treating specific illnesses
outside of mental hospitals.HRSD rose to dominance ‘from below.’ When it was sanctioned ‘from above’ in the 1980s, by
the World Health Organisation, Food and Drugs Administration, and other medicine licensing
agencies, this was acknowledging its widespread use, not creating it ‘top down.’
Paradoxically, the eventual dominance of HRSD was in large part due to its successful
validation against the holistic clinician assessments, the very thing Hamilton designed it
against. However, HRSD was a clinician scoring instrument and proved simple to use because
clinicians made it so, choosing overall scores rather than disaggregated or factor scores.
In many ways, the ‘S’ in HRSD stood for ‘Score’ not ‘Scale’, but either way it was a
quantitative datum on a relatively large and finely grained scale of 100, at least when
compared to the previous clinician scales. Overall, HRSD was a strange kind of ‘standard,’
being quite non-standard in the flexible and widely varying ways it was used, the number and
type of items in the scale and the meanings given to its findings.
Authors: J P Feighner; B Brauzer; A J Gelenberg; E Gomez; A Kiev; M L Kurland; B L Weiss Journal: Psychopharmacology (Berl) Date: 1979-03-22 Impact factor: 4.530
Authors: Julia W Felton; Maji Hailemariam; Fallon Richie; Madhavi K Reddy; Sophia Edukere; Caron Zlotnick; Jennifer E Johnson Journal: Psychother Res Date: 2019-03-11
Authors: Isabel M Vincent; Rónán Daly; Bertrand Courtioux; Amy M Cattanach; Sylvain Biéler; Joseph M Ndung'u; Sylvie Bisser; Michael P Barrett Journal: PLoS Negl Trop Dis Date: 2016-12-12