Background
WHO guidelines are documents, whatever their title, that contain one or more recommendation(s) regarding health interventions or policies [
1]. Recommendations are statements describing what policy-makers, healthcare providers, patients and other stakeholders should or could do to achieve the best health outcomes possible. WHO guidelines aim to enable end-users to make informed decisions regarding clinical or public health uncertainties. WHO guidelines address a broad range of topics, for example, environmental exposures, health systems, nutrition, patient safety, mental health, maternal and child health, infectious disease management, and public health emergency response [
2].
In 2007, the WHO Director-General established the Guidelines Review Committee (GRC) [
3] to ensure that WHO guidelines adhere to the highest international standards and are developed through a transparent and evidence-based decision-making process [
1]. The GRC is composed of WHO staff and external members who serve 3-year terms. The committee functions as a peer-review body at two stages during the guideline development process, namely proposal and final guideline. Guideline proposals are assessed for scope, methods, appropriateness of group composition and feasibility. Final guidelines are assessed for adherence to the standards outlined in the
WHO Handbook for Guideline Development, 2
nd edition [
1], and for quality in the execution of the required development steps.
At monthly, closed meetings, the GRC makes its decisions via consensus, based only on information in the submitted documents. The assessment is one of approval, conditional approval or non-approval. Conditionally approved documents usually require revisions that can be addressed by the authors with oversight by the GRC Secretariat and the Chair, but without the need for another full GRC review. Documents that are not approved require significant revisions and the revised documents must be discussed at a subsequent GRC meeting. For both conditionally approved and not approved documents, authors must submit a revised document and provide point-by-point responses to the GRC’s comments.
The objectives of this study were to examine the rates of and reasons for conditional approval and non-approval of documents submitted for the first time to the GRC, and calculate the time intervals and numbers of submissions to achieve approval for documents conditionally approved or not approved at first submission.
Methods
Inclusion criteria
All proposals and final guidelines that were submitted for the first time for review at a GRC meeting between 1 January 2014 and 31 December 2017 were eligible for this study. All included documents were related to standard guidelines, and none was developed using abbreviated processes and methods in response to a public health emergency.
Data for this study were extracted from the GRC’s records of written comments from reviewers and from discussions during GRC meetings.
Analyses
We examined trends in initial conditional approval and non-approval of proposals and final guidelines across the years using Cochran–Armitage trend tests for proportions, using the Statistical Analysis System (SAS Institute Inc., Cary, NC, USA, version 9.4). Differences were considered significant at P < 0.05.
Reasons for conditional approval and non-approval were categorised according to their primary focus, namely introduction and guideline scope; key questions underpinning the recommendations (in the Population, Intervention, Comparator, Outcome (PICO) format); guideline contributor group composition, roles and responsibilities; declarations of interest, management of conflict of interest and funding sources; methods for literature reviews; assessment of the certainty of the evidence; and formulation of recommendations.
The time interval and number of submissions to achieve approval were calculated for documents that had been approved as of January 15, 2018. The time interval to achieve approval for documents conditionally or not approved at first submission was calculated using the period between the GRC meeting date when a document was first reviewed and the date when the GRC approved the document. The decisions of the GRC are consistently provided 2 days after monthly meetings, and between-meeting approval of conditionally approved documents occurs within 2 to 3 days of submission. The number of submissions to achieve approval was calculated by counting the initial submission plus the number of subsequent submissions until approval was granted.
Discussion
The GRC and its Secretariat provide WHO staff with technical advice and training for guideline development. Nonetheless, a significant proportion of proposals and final guidelines submitted to the GRC between 2014 and 2017 did not meet WHO’s standards and were either conditionally approved or not approved at first submission, leading to delayed approval and therefore delayed publication. Nearly all proposals had issues with the composition of the guideline contributor groups and the key (‘PICO’) questions. All final guidelines had concerns regarding the recommendations, which included an unclear rationale for the strength of recommendation, and more than three-quarters were submitted with issues regarding the reporting of methods for evidence retrieval. The majority of proposals and final guidelines also had issues with declarations of interest, management of conflicts of interest or funding sources. Half of the documents that were initially conditionally approved or not approved were subsequently approved after a second submission, and within 1 or 2 months. However, some documents required additional revisions over lengthy periods before approval was granted.
Planning proposals are generally short, concise documents that must comply with a reporting checklist provided by the GRC Secretariat. The Secretariat also provides a detailed, structured template for proposals since 2014. The GRC expects the proposal to convey that the guideline developers have clear, achievable objectives and know how to approach the guideline development steps, even if all the details are not provided. The high rate of concerns with the scope and key (‘PICO’) questions may be explained by the fact that some proposals are submitted at an early stage when the WHO steering group and the guideline development group have not finalised these decisions. Getting the scope and key questions right is critical as they represent the health issues that guidelines aim to address and form the basis of the evidence searches which underpin the recommendations. Consequently, key questions must be finalised early in the guideline development process and the GRC Secretariat is working with WHO staff to help ensure that key questions are clear, answerable and acceptable to a wide range of stakeholders.
Furthermore, other problems, such as suboptimal composition of the guideline contributor groups and concerns regarding funding sources, declarations of interest or management of conflicts of interest, also need to be addressed at the planning stage since they are irremediable at the final guideline stage. To better address this issue, the GRC instituted a policy in 2017 that requires GRC Secretariat review of proposed guideline development group members, their declarations of interest and the management plans for any conflicts of interest before the meeting at which recommendations are formulated.
With regard to final guidelines, the GRC noted particular problems with the rationale statement for recommendations. Rationale statements should be clear, concise and cogent statements that articulate the basis for the recommendation, encompassing the balance of benefits and harms as well as considerations of equity, human rights, acceptability, resource use, and feasibility, among others, as relevant. These statements are essential for a high-quality and transparent guideline. Potential solutions include more engagement of the guideline methodologist, whose main role is to assist the guideline development group in formulating recommendations based on evidence. The methodologist may have additional roles, including helping to develop key questions, reviewing the systematic review team’s assessments of the certainty of evidence, helping to draft the methods section of the final guideline, and reviewing the draft final guideline [
1]. Additional solutions include staff training on using an evidence-to-decision framework such as GRADE-DECIDE [
4], which delineates a comprehensive list of the key elements (in addition to benefits and harms of the interventions) that should underpin recommendations. This will help guide searches for relevant evidence at the beginning of the final guideline development process as well as the guideline development group’s discussions and the subsequent rationale statement for each recommendation. Problems noted with documentation and methods of the evidence reviews are being addressed with additional staff training, more attention to the terms of reference for commissioned systematic reviews and increased contact between WHO staff and contractors. In addition, WHO staff are linked to WHO information scientists who have extensive experience with systematic reviews of public health interventions, including the grey literature.
WHO’s quality assurance process is rigorous and self-evaluation efforts such as this study feed back into quality improvement efforts. The high rate of conditional approval and non-approval of both planning proposals and final guidelines is concerning, but presents clear opportunities for quality improvement such as targeted training of guideline developers, dissemination of best practice examples, question-and-answer sessions, and more individual and group consultations with the GRC Secretariat, among other possible strategies.
An important strength of this study is that we had access to all documents submitted to the GRC during the time period examined, as well as to all comments provided by the GRC and its Secretariat during the review process. Nonetheless, several limitations should be noted. First, there are many factors that influence the number and nature of the comments on documents and the decisions made by the GRC. Like any peer-review process, comments vary across reviewers and over time within each reviewer, and GRC members generally serve 3-year terms. Second, the experience and training of WHO technical units, staff and the GRC Secretariat increased over time and an updated, more detailed
WHO Handbook for Guideline Development, 2
nd edition, was published in December 2014 [
1] and widely disseminated to WHO staff developing guidelines. Additional considerations were added to the 2014 handbook such as attention to human rights, equity and social determinants of health, including gender. Third, the categories of reasons for conditional approval and non-approval vary in scope and level of detail; thus, it is not appropriate to compare the number of problematic documents across categories. The categorisation of reasons for non-approval is also a rather subjective process. Fourth, because of the small numbers of documents, the study has low statistical power to show trends in rates of approval over time. Finally, this study was conceived, implemented and written by three individuals with an affiliation to WHO, representing a potential source of bias particularly in the interpretation of the results. We tried to minimise the risk of bias by having the external author (TVP) perform all data extraction and analyses (with subsequent checking by the other authors).
Implications for other organisations that develop guidelines
The processes, methods and standards that are implemented by the GRC are exemplary and relevant to every organisation that develops guidelines. WHO has implemented and executed a transparent, efficient and sustainable quality assurance process for all of its guidelines consisting of structured peer review, actionable constructive feedback, and technical and process support and training. Guideline development groups might consider whether the model of WHO’s GRC might be adapted to their settings and needs. The reasons for non-approval of guideline proposals and final guidelines that we identified are likely relevant to documents developed by other organisations since WHO’s procedures and methods are consistent with those of the international guideline community, and WHO uses existing high-quality systematic reviews and commissions new reviews from external teams. Thus, the weaknesses identified at first review of WHO documents are also likely relevant to many other organisations.