Background
Individuals who are both mentally ill and homeless are a major health policy focus internationally [
1,
2]. Several multi-site randomized controlled trials have been mounted, with overlapping objectives that include diversion from health and justice services and reductions in the associated costs among samples defined by chronic homelessness and serious mental illness [
3‐
5]. Self reported service involvement is commonly used as the basis for research ranging from national surveys [
6] to the aforementioned experimental designs, and the resulting data are used to determine costs of care [
7] as well as intervention outcomes.
Although some research has investigated the accuracy of self-reports among people who are either mentally ill [
8] or homeless [
9], few studies have examined the validity of self-reported service use in samples that are
both homeless and seriously mentally ill. Moreover, previous research investigating validity of self-report in this population has used key informant responses as the comparator [
10] rather than administrative sources. Research on shelter using veterans in the United States found only “moderate” concordance between self-reported and administrative data, leading the authors to recommend greater reliance on administrative records in research [
11]. An earlier study among “skid row alcoholics” concluded that the unreliability of self-reports undermined the validity of research investigating employment, arrests, and treatment outcomes in this population [
12]. More recently Clifasefi and colleagues [
13] investigated the validity of self-reported public health utilization among chronically homeless individuals with severe alcohol problems and found fair-moderate agreement with administrative data for shorter (30 day) periods but inadequate agreement for longer (3 years) timeframes. Several recent studies with homeless and mentally ill adults examine self-reported events spanning up to 5 years [
3,
4,
14].
The validity of self-reported information concerning service encounters may be compromised due to the severity of mental illness [
15], or to common correlates of homelessness such as cognitive impairment [
16] or substance use [
17]. Moreover, questions concerning frequently used services may span several sectors including health, justice, and social assistance, with potential differences in social desirability or risks associated with disclosure [
18]. For example, homeless and uninsured individuals are sometimes perceived as contributing to excessive emergency department visits [
19], creating a negative bias. Jail and criminal charges may have an adverse impact on access to healthcare [
20] as well as housing and support services. Furthermore, the validity of recall for events longer than 6 months previous may be particularly low [
16]. Rosen and colleagues [
15] compared self-reported receipt of social security income with administrative data for 7220 homeless people with mental disorders and found that 41 % of those who reported receiving benefits were unconfirmed by administrative sources. Moreover, the authors found that self-reported income assistance was more likely to be unverified among clients with psychotic disorders and longstanding substance use. These findings raise important questions regarding the use of self-reported service encounters for the purposes of calculating intervention outcomes as well as for identifying service priorities and gaps in care for members of a highly vulnerable population.
The use of administrative data as a gold standard measure of service use is often unwarranted, as these databases may be subject to inaccuracy and incompleteness [
21,
22]. However, in settings with robust and centralized health and social welfare systems, administrative data systems reflect relatively complete records of services, and can be used to assess the validity of self-reports [
22,
23].
Aims of the study
The present study examined agreement between self-reported and administrative data for healthcare, corrections, and income assistance in a sample of homeless and severely mentally ill individuals. We hypothesized that agreement regarding hospital admissions would be lower for longer periods of recall than for shorter periods and that jail and emergency department visits would be under-reported due to low social desirability.
Results
The characteristics of participants who consented to accessing administrative data and whose records could be matched (
n = 433) were compared to the full sample (
n = 497) on a number of socio-demographic characteristics (see Table
1). The overall pattern of findings indicates that the eligible sample is broadly representative of the larger cohort. Members of the eligible sample were roughly 41 years of age and were first homeless about 11 years earlier. Nearly three quarters (74 %) were male, and participants self-identified as White (54 %), Indigenous (16 %), or Other ethnicity (30 %), comprised of Asian, African, Caribbean, Latin American, Middle Eastern, Mixed, or Other categories. The majority of participants met criteria for bipolar disorder, schizophrenia or both (“Severe cluster”; 72 %) as well as Substance Dependence (58 %). Table
1 also shows the percentage of the sample that self-reported: having been in hospital for more than 6 months (10 %) and more than two times (50 %) in the past 5 years; having been hospitalized (43 %), in jail (14 %), or charged with an offence (24 %) in the last 6 months; and having received either disability or income assistance (94 %) in the past month.
Table 1
Socio-demographic characteristics of ‘At home’ participants by consent status at enrolment visit
Age at randomization (in years) | 40.8 (11.0) | 40.8 (11.0) | 41.4 (11.0) | 0.682 |
Age of first homelessness (in years) | 30.3 (13.3) | 30.1 (13.4) | 31.9 (12.6) | 0.301 |
Female gender | 134 (27) | 112 (26) | 22 (34) | 0.165 |
Ethnicity |
Aboriginals | 77 (16) | 70 (16) | 7 (11) | 0.054 |
White | 280 (56) | 235 (54) | 45 (70) | |
Other | 140 (28) | 128 (30) | 12 (19) | |
Incomplete High School | 280 (57) | 247 (57) | 33 (52) | 0.376 |
Single/Never Married | 343 (70) | 293 (68) | 50 (79) | 0.071 |
Need level (high) | 297 (60) | 255 (59) | 42 (66) | 0.305 |
Housing first interventions | 297 (60) | 257 (59) | 40 (63) | 0.632 |
Lifetime duration of homelessness (in months) | 60.2 (70.3) | 58.3 (64.8) | 72.9 (99.8) | 0.124 |
Longest episode of homelessness (in months) | 30.9 (40.1) | 30.4 (39.5) | 34.1 (44.4) | 0.498 |
Less severe cluster of mental disorders | 264 (53) | 235 (54) | 29 (45) | 0.180 |
Severe cluster of mental disorders | 363 (73) | 311 (72) | 52 (81) | 0.113 |
Suicidality (high) | 87 (17) | 79 (18) | 8 (12) | 0.259 |
Substance dependence | 288 (58) | 252 (58) | 36 (56) | 0.768 |
Daily substance use | 143 (29) | 131 (30) | 12 (19) | 0.064 |
Mental health severity/CSI score (per unit) | 37.2 (12.5) | 37.4 (12.5) | 35.8 (12.9) | 0.371 |
Chronic Medical Conditions (3 or more) | 344 (69) | 305 (70) | 39 (61) | 0.124 |
Blood-borne Infectious Disease (HIV, Hep. B or C) | 157 (32) | 139 (32) | 18 (29) | 0.603 |
Hospitalized for mental illness in last 5 yearsd
|
Over 6 months | 57 (11) | 44 (10) | 13 (20) |
0.017
|
Two times or more | 253 (51) | 215 (50) | 38 (59) | 0.146 |
Hospitalization in last 6 months | 212 (43) | 187 (43) | 25 (39) | 0.533 |
Any formal charge in last 6 months | 111 (23) | 101 (24) | 10 (16) | 0.172 |
Jail in last 6 months | 69 (14) | 62 (14) | 7 (11) | 0.465 |
Source of income (disability or income assistance) in past month | 469 (94) | 408 (94) | 61 (95) | 0.725 |
Table
2 presents the simple percent agreement and agreement statistics (Cohen’s kappa coefficient and PABAK) between the self-reports and administrative records across specific domains of public service utilization. Based on Cohen’s kappa coefficient, five of the six variables examined showed a moderate agreement (jail: kappa = 0.55, 95 % confidence interval (CI) = 0.46, 0.64; multiple psychiatric hospitalizations: kappa = 0.50, 95 % CI = 0.41, 0.59; disability or income assistance: kappa = 0.47, 95 % CI = 0.38, 0.56; ER visit: kappa = 0.44, 95 % CI = 0.34, 0.53; any hospitalization: kappa = 0.44, 95 % CI = 0.34, 0.53) and the remaining variable (6 months psychiatric hospitalization) indicated a poor agreement (kappa = 0.21, 95 % CI = 0.12, 0.30). As expected, some of these variables demonstrated a high prevalence index (6 months psychiatric hospitalization: 0.84, disability or income assistance: 0.81 and jail: 0.66) associated with the fact that these variables represented either rare (6 months psychiatric hospitalization) or very common conditions (disability or income assistance) in the sample. Bias index was minimal across all variables except for multiple psychiatric hospitalizations. When adjusted for imbalance caused by prevalence differences and bias, kappa values (PABAK) increased substantially for variables addressing 6 month psychiatric hospital separations (PABAK: 0.77; 95 % CI = 0.71, 0.83), disability or income assistance (PABAK: 0.82; 95 % CI = 0.76, 0.87) and jail (PABAK: 0.74; 95 % CI = 0.68, 0.81). Based on PABAK, these variablesshowed substantial (jail and 6 months psychiatric hospitalization) or almost perfect (disability or income assistance) agreement. For other variables, PABAK values showed the same moderate level of agreement as measured by Cohen’s kappa coefficient.
Table 2
Agreement between self-report and administrative records of public service utilization among ‘At Home’ study participants (n = 433a)
Hospitalized over six months for mental illness | 5 years | 88 % | 2 % | 86 % | 12 % | 8 % | 4 % | 0.21 (0.12, 0.30) | 0.84, 0.05 | 0.77 (0.71, 0.83) |
0.007
|
Hospitalized at least two times for mental illness | 5 years | 75 % | 30 % | 45 % | 25 % | 19 % | 6 % | 0.50 (0.41, 0.59) | 0.14, 0.13 | 0.50 (0.42, 0.58) |
<0.001
|
Any Hospitalization | 6 months | 73 % | 28 % | 45 % | 27 % | 16 % | 11 % | 0.44 (0.34, 0.53) | 0.18, 0.04 | 0.46 (0.37, 0.54) | 0.117 |
Any emergency room visitc
| 6 months | 73 % | 46 % | 27 % | 27 % | 9 % | 18 % | 0.44 (0.34, 0.53) | 0.19, 0.08 | 0.45 (0.36, 0.54) |
0.002
|
Jaild
| 6 months | 87 % | 11 % | 76 % | 13 % | 4 % | 9 % | 0.55 (0.46, 0.64) | 0.66, 0.06 | 0.74 (0.68, 0.81) |
0.001
|
Income source: disability or income assistance | Past month | 91 % | 86 % | 5 % | 9 % | 8 % | 1 % | 0.47 (0.38, 0.56) | 0.81, 0.07 | 0.82 (0.76, 0.87) |
<0.001
|
Table
2 also presents findings from the McNemar test, which was conducted to identify any systematic difference in terms of disagreement (under-reporting vs. over-reporting). Five of the six variables examined resulted in significant disagreement between sources. Participants significantly over-reported having been hospitalized for 6 months or more for a mental illness (8 % vs. 4 %,
p = 0.007) and having been hospitalized at least two times for a mental illness (19 % vs. 6 %,
p < 0.001) – both in the previous 5 years. Participants significantly under-reported having been to an emergency department (18 % vs. 9 %
p = 0.002) or to jail (4 % vs. 9 %,
p = 0.001) within the past 6 months. Finally, participants significantly over-reported having received disability or income assistance in the past month (8 % vs. 1 %,
p < 0.001). There was no significant disagreement between self-report and administrative data with respect to having been hospitalized for any reason in the past 6 months.
Discussion
Our findings reveal
moderate to
almost perfect [
33] agreement between self-reported service use and corresponding information from administrative sources. To the best of our knowledge, this is the first study to directly compare self-reported and administrative data for multiple domains of public service in a single large sample meeting criteria for chronic homelessness and serious mental illness. These results support the validity of self-report data as the basis for research examining changes in service use and related costs with this sub-population. In so doing they support the role of people who experience both mental illness and homelessness as participants in the production of knowledge.
Our results hold particular relevance for experimental trials such as those recently implemented in North America [
3] and Europe [
4], which rely on self-reports to investigate the impact of interventions on service use among people who are both homeless and mentally ill. More specifically, the aforementioned trials incorporated items concerning self-reported psychiatric hospitalization in the past 5 years as inclusion criteria - items that our results specifically corroborate.
Previous research among homeless people with chronic alcohol problems employed a similar design to the current study and reported “inadequate” recall over a 3-year period [
13]. These findings and those of earlier research among alcoholics [
12] may be attributable to the cognitive effects of chronic alcohol exposure [
36]. By contrast, our results suggest that individuals who have experienced longstanding homelessness (i.e., 10 years) alongside psychosis or bipolar disorder are nevertheless reliable reporters concerning events in both recent (past month) and distant (5 years) history.
Despite the overall level of agreement between data sources, we identified significant systematic differences in under-reporting versus over-reporting on five out of six variables. Questions related to psychiatric hospitalization during the past 5 years (whether a single long admission or multiple admissions) were associated with significant over-reporting. In contrast, questions related to emergency department visits or jail admissions during the past 6 months were associated with significant under-reporting. Finally, having received income assistance in the past month was significantly over-reported, a result that is consistent with previous research on this sub-population [
15]. We cannot specify the reasons for disagreement between sources, or why some variables were over-reported while others were under-reported. Nevertheless, the over-reporting of psychiatric hospitalizations in the preceding 5 years may have been due to the difficulty of recalling events over a lengthy period, and may have elicited positive responses corresponding to highly salient events that occurred longer than 5 years in the past. The under-reporting of jail in the preceding 6 months may reflect social desirability as well as stigma associated with incarceration. Homeless individuals may use emergency departments for primary healthcare, and may therefor under-report visits that do not involve emergency complaints. These findings are particularly relevant to research that focuses on specific services (e.g., jail) where systematic over or under-reporting could bias results. They are also relevant to clinical settings where biased responding (e.g., over-reporting of income assistance) may lead to oversights in care planning for patients.
Strengths of our study include: the use of verified administrative data; multiple categories of service; administrative records spanning a long-duration (i.e., up to 5 years); pre-testing of interview questions [
25]; and use of calendar aids to strengthen recall. Limitations of our study include: unaccounted for errors associated with administrative records (e.g., coding errors); potential use of aliases when receiving care; and unrecorded events due to not having identification.
Conclusions
Our study found that individuals who experience chronic homelessness and serious mental illness are accurate historians regarding their encounters with public services. We observed high levels of agreement between administrative records and self-report for healthcare, jail, and welfare support spanning periods of time from 1 month to 5 years. Significant over-reporting (e.g., psychiatric hospitalization) and under-reporting (e.g., jail) was specific to individual service areas and warrants caution in studies that focus on these particular domains.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (
http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
JS led the research project, jointly conceived the study, and drafted the main manuscript. AM jointly conceived the study, led the analysis of data, and drafted sections of the manuscript. LC collected data for the study and drafted sections of the manuscript. SR collected data for the study and drafted sections of the manuscript. AR collected data for the study and drafted sections of the manuscript. MP collected data for the study and drafted sections of the manuscript. All authors read and approved the final version of the manuscript.