Starting in 2011, the National Toxicology Program (NTP) Office of Health Assessment and Translation (OHAT) began developing and incorporating systematic review methods into literature evaluations to assess scientific evidence of human health effects of exposures to environmental chemicals, physical substances, or mixtures (Birnbaum et al. 2013; Rooney et al. 2014). That effort was part of a cultural change within the environmental-health field in which approaches to evaluation of scientific evidence resulted predominantly in expert-based narrative reviews. However, mounting empirical evidence that narrative reviews generally lacked the ability to evaluate evidence in a rigorous, systematic, transparent, and reproducible manner indicated that more rigorous approaches to evidence-based decision-making were needed (Reenie and Chalmers 2009; NRC 2011; Woodruff and Sutton 2011). As a result, several agencies and institutions have undertaken the development and implementation of systematic review methods to address environmental questions (EFSA 2010; Woodruff and Sutton 2011, 2014; Murray and Thayer 2014).
NTP OHAT systematic review methods are described in several documents. First, the OHAT handbook on systematic review (published in 2015 and updated in 2019) represents the “standard operating procedures” for how systematic review and evidence integration are to be conducted for OHAT literature-based assessments (NTP 2015, 2019a,b).1 Those operational guidelines are based largely on empirically tested approaches and expert input from various fields, such as the clinical sciences, including Cochrane (Higgins et al. 2019), Grading of Recommendations Assessment, Development, and Evaluation (Guyatt et al. 2008), the Navigation Guide (Woodruff and Sutton 2011, 2014), the Collaborative Approach to Meta-Analysis and Review of Animal Data from Experimental
___________________
1 Although the protocol for the fluoride monograph refers to both versions of the OHAT handbook, the committee assumes that it was based on the updated version given that it incorporates a revised figure (NTP 2017, Figure 3, p. 20) that was provided in the updated version.
Studies (CAMARADES),2 and the Systematic Review Centre for Laboratory Animal Experimentation (SYRCLE).3
For NTP OHAT literature-based assessments, such as the monograph evaluated in the present report, a protocol is developed and shared publicly before beginning the assessment. A protocol is the detailed plan that is to be followed in a specific systematic review and describes the rationale, the objectives of the review, and the conduct of each step of the review (NTP 2015; Higgins et al. 2019). The steps include problem formulation, which results in the development of the Population, Exposure, Comparator, and Outcomes (PECO) statement; development of inclusion and exclusion criteria for study selection; a search of the literature; extraction of data from included studies; critical appraisal of studies for risk of bias; synthesis of results from included studies; and hazard identification by integration of the evidence from human and animal studies and consideration of supporting evidence from mechanistic studies. Ideally, the protocol should follow guidance provided by the OHAT handbook but include details specific to the given systematic review, such as how to rate risk of bias in assessing outcomes of neurodevelopmental and cognitive effects as in the case here.
The methods section in the monograph should also describe how a specific systematic review was conducted. Ideally, the methods described in the monograph should align with the details outlined in the protocol or should transparently and explicitly document, describe, and justify any deviations from the protocol. In the present chapter, the committee provides its assessment of the methods and overall presentation.
The committee noted several strengths of the monograph. As noted, the protocol is a critical component of a systematic review and ideally minimizes reviewer bias, allows feedback at early stages of the systematic review, and transparently highlights any changes made as the systematic review process unfolds (IOM 2011). The protocol for the monograph contains descriptions of each step of the systematic review and clearly outlines several protocol revisions, including the date and justification of each change. Furthermore, changes in the protocol are clearly indicated so that both the original text and the modified text are readily
___________________
2 CAMARADES provides a supporting framework for groups involved in the systematic review and meta-analysis of data from experimental animal studies. See http://www.dcn.ed.ac.uk/camarades/.
3 SYRCLE focuses on the execution of systematic reviews of animal studies aimed at more evidence-based translational medicine. See https://www.radboudumc.nl/en/research/departments/health-evidence/systematic-review-center-for-laboratory-animal-experimentation.
apparent. NTP appears to be adhering to best practices for systematic reviews with respect to the availability and documentation of such a protocol before initiation of a review.
The committee also was impressed by the availability of systematic review data in the interactive, freely available program Health Assessment Workspace Collaborative (HAWC).4 Given the plethora of data extracted from studies and the risk-of-bias ratings and justifications, interactive programs increase the ease with which an independent reviewer can explore the data in more detail without being limited to the graphics and tables provided in the report itself. The committee appreciated that most of the tables and figures in the monograph were available with additional study details and interactive graphics in HAWC.
The committee, however, had some overarching concerns regarding the protocol, data presentation, and communication that are described in the following sections, and it provides some suggestions for improvements.
The protocol for the systematic review described in the monograph was published on NTP’s Web site in June 2017 and made available for public comment. It was reviewed several times by technical advisers selected for their expertise on this topic. In general, it describes the overall systematic-review process and clearly outlines modifications that were made during the review. The committee, however, identified several issues associated with the protocol. First, the role of the OHAT handbook in developing the protocol is unclear. Second, important details are missing from the protocol. Third, inconsistencies between the protocol and the monograph raise concerns. Those issues are discussed further below; other issues associated with execution of the protocol are discussed in Chapters 3 and 4. Given the issues raised here and in later chapters, the committee finds that there are some deficiencies in the protocol and its execution.
As discussed, the OHAT handbook outlines “standard operating procedures for systematic review and evidence integration for conducting OHAT literature-based assessments” (NTP 2019a, p. v) and is intended as a “living document” that is continually updated to reflect refinement and modifications of the OHAT approach. However, the protocol scarcely refers to the OHAT handbook and does not discuss its role in developing the protocol. Specifically, the only references to the OHAT handbook in the protocol are noted below.
___________________
Thus, the role of the OHAT handbook in developing the protocol is unclear. That ambiguity leads to concerns about the lack of detail in the protocol and about apparent conflicts between the methodologic approach in the protocol and the OHAT handbook itself. Several examples of that ambiguity and the associated concerns are provided below.
To increase transparency, NTP should clearly describe the role of the OHAT handbook in developing the systematic-review protocol primarily to set expectations for how closely the process described in the handbook will be followed in the protocol and eventually the systematic review. That would help to address concerns about information that appears to be missing from the protocol or about conflicts between the protocol and the handbook.
The presentation by NTP OHAT staff to the committee on November 6, 2019, indicated that the protocol for the monograph is intended to serve as a standalone document—that is, the protocol should contain all details relevant to the conduct of the systematic review. That position might be due partly to the fact that the OHAT handbook will change, as it states: “the procedures are a living document with the expectation that approaches will be updated as methodological practices are refined and strategies identified that improve the ease and efficiency of conducting a systematic review” (NTP 2019a, p. v). The OHAT handbook has already undergone one recent revision (NTP 2019b) and will likely undergo several more. Thus, it might be best if each systematic-review protocol could stand alone as an independent document that contains all the information necessary for understanding of the planning and conduct of the review. The committee, however, acknowledges that it would also be satisfactory to cite the appropriate OHAT handbook versions in the protocol for sections in which the details regarding the process align with the handbook to limit the need to repeat information from the handbook in the protocol.
Any details pertaining to the conduct of a review should be planned beforehand and described in the protocol. The committee found that many important details were missing from the protocol, although some of the information was contained in the OHAT handbook or the monograph. The committee recommends that the details be included in the protocol for transparency. Examples of important details missing in the protocol are as follows:
___________________
5 See https://handbook-5-1.cochrane.org/chapter_3/3_4_2_1_re_executing_the_search.htm.
6 See https://handbook-5-1.cochrane.org/chapter_6/6_4_12_updating_searches.htm.
Some details outlined in the protocol appeared inconsistent with methods ultimately implemented in the monograph. Given the critical role of the protocol in the design and implementation of a systematic review, those potential discrepancies were concerning. Changes in the protocol are common, as illustrated by several transparently documented modifications of the protocol, but inconsistencies between the protocol and the monograph raise concerns regarding the rationale for specific changes and why they were not documented in the protocol.
The monograph does discuss how the NTP (2016) report considered the two age groups for experimental animals separately and indicates that this systematic review has mirrored such an approach (NTP 2019c, p. 4). However, the protocol should indicate when the decision was made, provide justification for it, and discuss specifics of the approach, for example, defining the age ranges that constitute “adult” and “child.” The protocol also should specify how evidence from studies that evaluate mixed populations (those containing children and adults) were incorporated. In general, any planned subgroup or sensitivity analyses should be described and justified in the protocol.
As stated in Chapter 1, the committee did not conduct a data audit, but it did find some minor issues with data presented in the monograph. Some of the issues appear to be minor errors; for example, the number of references is incorrectly reported in Figure 4 “Study Selection Diagram” (NTP 2019c, p. 18). During the title and abstract screening, 9,667 references were screened and excluded; 13,023 references were not screened, on the basis of the SWIFT algorithm; and 807 references were included for full text review. Those numbers sum to 23,497—30 more studies than reported as screened in the figure (23,467).
Other presentation issues are related to enhancing the utility of the data presentation. For example, most tables and figures in the monograph are organized alphabetically by study author last name. That approach does not convey the information in a meaningful format. It might be better to organize the studies in a more informative way; for example, risk-of-bias tables could be organized by risk-of-bias ratings (that is, studies that have the most green “ratings” first and studies that have fewer such ratings thereafter) or by stratifying the studies according to critical risk-of-bias domains similar to the example provided in the OHAT handbook (NTP 2015, p. 39). Because critical domains7 might cause a
___________________
7 Key domains for humans include confounding, exposure characterization, and outcome assessment, and key domains for animals include randomization, exposure characterization, outcome assessment, and litter effects for developmental studies.
study to be excluded from the analysis, highlighting them in each table would help readers to interpret the overall risk-of-bias ratings. As another example, Table 6, “Studies on Neurodevelopmental and Cognitive Function in Humans,” also arranged studies alphabetically by study author last name (NTP 2019c, p. 22ff). An alternative suggestion would be to categorize studies by age cutoffs because the outcomes are evaluated separately for children and adults. Furthermore, NTP should include the detection limit of exposure measures for each study in Table 6. Organization of data is discussed further in Chapters 3 and 4.
The committee noted that the monograph might benefit from improved communication in several respects. First, many people are interested in the question of whether water fluoridation to prevent dental decay poses a threat to cognition and neurodevelopment. Although the monograph includes some discussion of dose–response relationships, NTP did not conduct a formal dose–response assessment that could inform a discussion on water fluoridation. NTP needs to state clearly that the monograph is not designed to be informative with respect to decisions about the concentrations of fluoride that are used for water fluoridation. That point should be reiterated at the end of the monograph with some indication that its evaluation of the literature is focused on hazard identification of fluoride and that it does not draw any conclusions regarding drinking-water fluoridation or other fluoride sources, such as toothpaste or other dental treatments. Although NTP does not explicitly claim that it has done something other than hazard identification, the context into which the monograph falls calls for much more carefully developed and articulated communication on this issue.
Second, the monograph lacks details on the process of evaluating confidence in the body of evidence. As discussed earlier, the protocol lacks sufficient definitions for level-of-evidence descriptors as they pertain to the specific study question addressed in the monograph. However, the ratings applied for each confidence factor (Table 7, NTP 2019c, p. 51) also lack any justification or discussion. For example, NTP’s discussion of the overall risk-of-bias rating (NTP 2019c, p. 28) states that the confidence rating was not downgraded because 20 studies had little or no risk-of-bias concerns. However, NTP also states that the remaining human studies had probably high or definitely high risk of bias for at least two key considerations (exposure characterization, outcome assessment, or confounding factors). Thus, the rationale for not downgrading risk of bias is not entirely clear. NTP should consider supplementing Table 7 with clear justification for each confidence factor rationale for and why upgrade and downgrade factors were not applied for any of the human evidence.
Finally, there is little discussion of the process for obtaining missing or additional information from study authors. The monograph states that authors of included studies “were queried by email to obtain missing information and responses received were used to update risk-of-bias ratings” (NTP 2019c, p. 12). The author responses and changes in risk-of-bias ratings were documented in the
HAWC program. Obtaining additional information from study authors is critical for systematic review because it can minimize the effect of reporting bias vs other aspects of bias related to the design and implementation of studies. However, the monograph lacks clear documentation of how many authors were contacted, which authors were contacted, how many responded, and how risk-of-bias ratings were updated generally. Although that is tracked in HAWC with the actual risk-of-bias ratings, it could be helpful to include overall summary statistics of this critical process for greater transparency in the monograph. At a minimum, risk-of-bias ratings or extracted data that are updated on the basis of information obtained from the study authors should be clearly indicated for the specific risk-of-bias ratings or data in all relevant tables and figures.
Birnbaum L.S., K.A. Thayer, J.R. Bucher, and M.S. Wolfe. 2013. Implementing systematic review at the National Toxicology Program: Status and next steps. Environmental Health Perspectives 124(4):a108-a9.
EFSA (European Food Safety Authority). 2010. Application of systematic review methodology to food and feed safety assessments to support decision making. EFSA Journal 8(6):1-1690.
Guyatt G.H., A.D. Oxman, G.E. Vist, R. Kunz, Y. Falck-Ytter, P. Alonso-Coello, H.J. Schünemann, and GRADE Working Group. 2008. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ 336(7650): 924-926.
Higgins J.P.T., J. Thomas, T. Li, M.J. Page, V. Welch, M. Cumpston, J. Chandler, and L. Mellor. 2019. Cochrane Handbook for Systematic Reviews of Interventions. Chichester, UK: John Wiley & Sons.
IOM (Institute of Medicine). 2011. Finding what works in health care: standards for systematic reviews. Washington, DC: National Academies Press.
Murray H.E., and K.A. Thayer. 2014. Implementing systematic review in toxicological profiles: ATSDR and NIEHS/NTP collaboration. Journal of Environmental Health 76(8):34-35.
NASEM (National Academies of Sciences, Engineering, and Medicine). 2018. Progress Toward Transforming the Integrated Risk Information System (IRIS) Program: A 2018 Evaluation. Washington, DC: National Academies Press.
NRC (National Research Council). 2011. Review of the Environmental Protection Agency’s Draft IRIS Assessment of Formaldehyde. Washington, DC: National Academies Press.
NRC. 2014. Review of EPA’s Integrated Risk Information System (IRIS) Process. Washington, DC: National Academies Press.
NTP (National Toxicology Program). 2015. New OHAT Handbook for Conducting Systematic Reviews. Available: https://ntp.niehs.nih.gov/update/2015/1/ohat-handbook/index.html.
NTP. 2016. NTP Research Report: Systematic Review of the Effects of Fluoride on Learning and Memory in Animal Studies. Office of Health Assessment and Translation, Division of the NTP, National Institute of Environmental Health Sciences, National Institutes of Health, US Department of Health and Human Services.
NTP. 2017. Protocol for Systematic Review of Effects of Fluoride Exposure on Neurodevelopment.
NTP. 2019a. Handbook for Conducting a Literature-Based Health Assessment Using OHAT Approach for Systematic Review and Evidence Integration. Research Triangle Park, NC: National Institute of Environmental Health Sciences.
NTP. 2019b. Updates and Clarification to the OHAT Approach for Systematic Review and Evidence Integration Research Triangle Park, NC: National Institutes of Health. Available: https://ntp.niehs.nih.gov/ntp/ohat/pubs/handbookclarificationmarch2019_508.pdf.
NTP. 2019c. Draft NTP Monograph on the Systematic Review of Fluoride Exposure and Neurodevelopmental and Cognitive Health Effects. Office of Health Assessment and Translation, Division of the NTP, National Institute of Environmental Health Sciences, National Institutes of Health, US Department of Health and Human Services.
Rennie D., and I. Chalmers. 2009. Assessing authority. JAMA 301(17):1819-1821.
Rooney A.A., A.L. Boyles, M.S. Wolfe, J.R. Bucher, and K.A. Thayer. 2014. Systematic review and evidence integration for literature-based environmental health science assessments. Environmental Health Perspectives 122(7):711-718.
Woodruff T.J., and P. Sutton. 2011. Navigation Guide Work Group. An evidence-based medicine methodology to bridge the gap between clinical and environmental health sciences. Health Aff (Millwood) 30(5):931-937.
Woodruff T.J. and P. Sutton. 2014. The Navigation Guide systematic review methodology: A rigorous and transparent method for translating environmental health science into better health outcomes. Environmental Health Perspectives 122(10):1007-1014.