Robert B. Wallace
Knowledge of the genetic causes of health conditions and age-related physiologic changes is growing rapidly. Much of the lore of genetics and health, in addition to basic genetic science and molecular biology, comes from the study of informative families and patient groups and, to some extent, from specifically designed population studies. Many populations have been surveyed in recent years to address general health issues, and many more are being surveyed for other important reasons, such as for testing social, economic, or political hypotheses. Among the sponsors of extensive population studies of many types, emphasizing older persons and their health and social needs, is the U.S. National Institute on Aging (NIA), part of the National Institutes of Health. With the emerging technology for conducting genetic studies, it is time to ask whether ways can be found to exploit these major population surveys to better understand the genetics of conditions important to public health. The methodologic challenges for marrying large population surveys to genetic hypotheses are complex and not easily solved, in part because each survey was thoroughly rationalized, scrutinized, and funded to address a set of important nongenetic scientific questions relating to general health, social behavior, and economics. However, given the substantial costs of these population surveys and the restricted availability of research funds, it is essential to at least explore possible intersections of genetic inquiry with existing and planned field studies. The purpose of this paper is to (1) catalog many of the important geographic surveys being supported and/or archived by
NIA, (2) describe selected, potential applications of these surveys for genetic study, (3) address the various modes of specimen collection applicable in population surveys, and (4) suggest a research agenda to realize these potential methodologic enhancements.
Many population surveys are conducted in the United States and elsewhere; there is no clear way to identify all of them. Many of the NIA-sponsored surveys are extensive in scope and themes, and several are conducted outside of the United States. Information on a selection of these surveys, the basic characteristics of which are described in Table 12-1 , was taken from the NIA document ''Databases on Aging," a summary of surveys relevant to the demography, economics, and epidemiology of aging, published in February, 1996.1 The surveys noted in Table 12-1 are not an exhaustive list of those available, nor does the table cite many of the survey data sets available in archive form for analysis. In some instances, the tabular information is simplified because of the complex, multiple sampling frames and the varied target populations and different survey intervals. On occasion, survey design and operational information were incomplete.
In summary, the survey study designs reveal the following: (1) the surveys vary dramatically in health-related content; many were intended largely to study behavioral, social, and economic issues; (2) most of the surveys are recent but inactive, and it is unclear whether participants could be located or recontacted to obtain additional information; (3) many of the surveys contain information on at least some family members, but sometimes this is limited to spouse pairs and the extent of documenting either nuclear or extended pedigrees is often limited or uncertain; (4) collection of bodily specimens—either blood or other tissues or fluids—is rare. In the few instances where specimens were collected, this was limited mostly to U.S. national samples and subsamples conducted by the U.S. National Center for Health Statistics; (5) follow-up rates for the longitudinal panels were generally quite good, including mortality follow-up when part of the protocol; and (6) the original investigators would almost always need to be contacted to explore further participant contact and any possibility of specimen collection, including the determination of ethical and administrative procedures. In general, this suggests that retrospective use of these surveys, particularly the inactive ones, would require additional resources and energy to suit them for genetic study, but nonetheless, a reasonable potential remains for exploitation of, at least, ongoing or planned surveys.
TABLE 12-1 Summary of Selected NIA-sponsored Population Surveys
|
Survey |
Currently Activea |
Country |
Age Range |
Family Data |
Specimen Collection |
|
Asset and Health Dynamics (N = 8,000) |
Yes |
USA |
70+ |
Yes |
No |
|
Australian Longitudinal Study of Ageing (N = 2,087) |
Yes |
Australia |
70+ |
Yes |
No |
|
Cross-sectional and Longitudinal Aging Study (N = 2,891) |
Yes |
Israel |
75+ |
Yes |
? |
|
Established Populations for Epidemiological Study of the Elderly (N = 14,000; four populations) |
Yes |
U.S.A. |
65+ |
Yes |
Yes |
|
German Socio-Economic Panel (N = 8,100 households) |
Yes |
Germany |
All |
Yes |
No |
|
Health and Retirement Survey (N = 12,600) |
Yes |
U.S.A. |
51-61 |
Yes |
No |
|
Longitudinal Study on Aging (N = 5,000) |
No |
U.S.A. |
70+ |
No |
No |
|
Supplement on Aging II (Planning phase) |
Yes |
U.S.A. |
70+ |
No |
No |
|
Luxembourg Income Study (Multiple databases) |
Yes |
Multiple |
All |
Yes |
No |
|
Second Malaysian Family Life Survey (N = 6.000) |
No |
Malaysia |
All |
Yes |
No |
|
National Long-Term Care Survey (N = 6,393) |
Yes |
U.S.A. |
65+ |
Yes |
No |
|
National Longitudinal Survey: Re-Survey of Older Males (N = 2,092) |
No |
U.S.A. |
69-84 |
Yes |
No |
|
National Nursing Home Survey Follow-up (N = 11,000) |
No |
U.S.A. |
Older adults |
No |
No |
|
National Survey of Families and Households Reinterview (N= 13,017) |
No |
U.S.A. |
19+ |
Yes |
No |
|
New Beneficiary Survey and New Beneficiary Follow-up (N = 16,000) |
No |
U.S.A. |
60+ |
Yes |
No |
|
Panel Study of Income Dynamics (N = 28,000) |
Yes |
U.S.A. |
All |
Yes |
No |
|
Wisconsin Longitudinal Survey (N = 10.317) |
Yes |
U.S.A. |
18-60 |
Yes |
No |
|
a Active as of December, 1995. |
|||||
It is beyond the scope of this report to review advances in basic and clinical genetics and the relation of genetic structure and function to disease occurrence and outcome. For those involved in social surveys who are not schooled in genetics, a very brief discussion emphasizing the complexity of the situation may be of value. On 23 matched (except for the one pair of sex chromosomes) pairs of chromosomes in the nuclei of each human cell, the human genome contains about 100,000 genes, discrete functional and structural sites that interact with the internal cellular and external environment to direct basic cell growth, activity, and death and to transfer this information to the next generation. Each matched gene may vary somewhat from its mate and from the respective genes at the same site in other individuals. These structural variants, called alleles, may function somewhat differently from each other. The specific genetic makeup of an individual is called the genotype. The process of change in the structure of a gene, often accompanied by changes in function, is called mutation; mutation may occur spontaneously or be accelerated by external environmental forces. Mutations may be harmful or helpful to an organism or be biologically neutral. Not all human genes have yet been identified as to structure and function, but work is progressing rapidly. Determination of the structure and function of genes and the relation of altered gene structure to disease occurrence is made more complex by several recent observations: (1) some genes are not necessarily in one physical location on a chromosome; (2) to the extent that important chronic illnesses are gene-related, there are probably multiple genes involved; (3) the mechanisms of genetic regulation and how environmental factors alter that function are incompletely understood; and, (4) some genetic material (DNA) is located outside the nuclear chromosomes in the cytoplasm of the cell and is probably of maternal origin only. Thus, the search for gene-disease associations is clearly complex and difficult, although extremely important. However, as noted below, there are other potential genetically related applications of population studies.
To find potential applications of existing population studies for addressing genetic hypotheses, it is instructive to indicate some general categories of applications, temporarily leaving aside study methods and logistical issues. These categories are possible through the rapidly expanding ability to identify and characterize many genes within individuals and large population samples. However, as in all other fields of measurement, quality control in the laboratory determination of various alleles is essential, as substantial error can occur in laboratory procedures.
Given the emerging capacity to determine alleles in population samples, the following is a selected list of general genetic research applications in population surveys, recognizing that specific studies have many scientific and methodologic contingencies:
A general survey application is to determine the distribution of various genes and alleles in defined populations. While it is an empirical question whether well-constructed and executed population samples will reveal estimates of genotype (allele) frequencies markedly different from more customary sources, such as volunteer populations, clinical populations, blood donors, and newborn screening samples, this use is probably one of the best applications for preexisting and planned general population samples. In addition to their specific sample representativeness, the NIA populations may be attractive because of their broad national coverage, multinational representation, and in some instances access to special populations such as the institutionalized elderly or the oldest old. This access would be particularly valuable as genes are discovered that are associated with late-onset diseases, given the age distribution of many NIA-sponsored survey participants. Several potential specific applications are presented, with examples from the recent scientific literature:
of diseases. For example, certain alleles predict whether young diabetic patients will acquire a certain severe form of retinal disease (Cruickshanks et al., 1992).
Many other applications exist, based on the population determination of genetic markers, but exploiting these opportunities requires dialogue and interdisciplinary cooperation to identify and answer important scientific questions.
Currently, most gene-disease associations are explored in families. The basic logic is to ascertain whether certain genes (alleles) occur in the same members of genetically related families as does a medical condition of interest, called segregation analysis. Other general methods for studying gene-disease associations involve selected parts of family units such as siblings and cousins, or identical and fraternal twins. However, selecting these families (pedigrees) for study from clinical or volunteer populations may obscure some potential associations because of chance clustering of nongenetically related common diseases in these families, possibly leading to spurious negative findings. Sampling families from existing, defined general population surveys, particularly those with information on health and disease history, might be an effective way of unbiased pedigree sampling. The main obstacle may be that occurrence rates for most medical conditions are relatively low, even for population samples numbering in the thousands, and thus not all survey samples may be fully useful for identifying representative families with the multiple occurrence of various conditions. Panel (cohort) studies, as opposed to prevalence (cross-sectional) studies may be somewhat more valuable in this regard because over time additional cases of the study disease will occur and be monitored.
It is also possible, although somewhat inefficient, to ascertain certain family structures, such as twins, multiple siblings, or multiple cousins, from population surveys for further study.
Health-related population surveys have been determining disease and risk-factor occurrence in populations for many years. This fact is restated to emphasize that accurate data on the occurrence of nonfatal diseases and physiologic measures are surprisingly difficult to acquire in many national and regional populations, particularly when inferences from mortality statistics are unavailable. Potentially, social and economic surveys can collect basic health data in certain situations and these data will often contribute to knowledge on population health, as well as help assess the promise of that population for genetic study.
Genetic data can be divided roughly into two categories. The first category is the historical information obtained at interview, including family pedigrees with their biologic relationships, and the disease experience of those families both within and across generations. Standardized techniques exist for ascertaining and recording pedigree information (Bennett et al., 1995). The second category is the bodily specimens on which the genetic studies can be run. There are several general ways to acquire such specimens:
Venopuncture. The most effective way is to obtain blood at the time of survey, if resources allow. This procedure requires specific training of interviewers and use of equipment to store and transmit blood specimens. An alternative approach is to have a smaller number of trained venopuncturists visit the survey respondents later. This protocol would be particularly helpful when a willing primary respondent can gather available family members. Another approach that has been successful is to supply the study participant with blood vials, a prepaid mailing container, and a voucher to pay a local physician or clinic to obtain, process, and transmit the specimen. A less efficient but ancillary approach is to obtain a blood specimen that was stored for some other reason.
Hair follicles. In this technique participants are asked to supply a hair specimen that includes the follicular roots. We have less experience in obtaining such samples in the survey setting, but it may be worth pursuing.
Cheek swabs. This technique is noninvasive and offers promise where venopuncture is impossible. Processing specimens is more cumbersome and expensive, and a problem exists with contamination of the specimen by oral bacteria, food particles, etc.
Surgical specimens. A common technique in molecular epidemiology, particularly in the study of cancer occurrence and prognosis, is to acquire stored tissue specimens obtained at surgery, on which many genetic markers can be determined. These specimens will, of course, vary in availability, depending in part on the time interval since the operation, but they still may be an important source of markers—one that can be accessed by mail if participant and pathologist consents are obtained. A corollary approach is to obtain stored blood or tissue specimens that were obtained at autopsy. Unfortunately, autopsy rates are low and decreasing; this option will often be unavailable.
Although it is beyond the scope of this report to comprehensively review the ethical considerations of obtaining genetic markers, there are growing concerns and evolving regulations about the acquisition and disposition of these markers.
It is necessary for each investigator to ensure that appropriate consent procedures are followed for specimen acquisition, banking, and future applications. Particularly sensitive issues include: (1) accessibility of personal genetic information by other parties, such as family members or insurance companies, (2) accessibility of the specimens to other scientific laboratories, (3) ownership of potential commercial uses of biologic findings from collected specimens, (4) application of specimens for scientific determinations unplanned at the time of collection. (5) fear of discovering previously unreported paternity; and (6) disclosure of high levels of disease risk discovered after the main study has ended, including investigator obligation to maintain contact with participants after completion of the study. These and related issues have been recently discussed in depth, although not always with full resolution (Clayton et al., 1995; Wagener, 1995).
Given the apparent potential for amalgamating population surveys with genetic study, several possible enhancements to these studies might lead to improved applicability. These suggestions are intended for further discussion and research planning:
Both from substantive, methodologic, and ethical perspectives, study of the genetic causes of disease and dysfunction is advancing rapidly. Clearly, large-scale population surveys can be extremely informative on genetic issues if appropriate forethought accompanies the inception of these surveys. There is a need for multidisciplinary approaches, at the very least among social scientists, geneticists, epidemiologists, and survey researchers, to fully use the population-survey opportunity.
Bennett, R.L., K.A. Steinhaus, S.B. Uhrich, C.K. O'Sullivan. R.G. Resta. D. Lochner–Doyle. D.S. Markel, V. Vincent, and J. Hamanishi 1995 Recommendations for standardized human pedigree research. American Journal of Human Genetics 56:745–752.
Clayton. E.W., K.K. Steinberg, M.J. Khoury, E. Thompson. L. Andrews, M.J. Ellis Kahn, .M. Kopelman. and J.O.Weiss 1995 Informed consent for genetic research on stored tissue samples. Journal of the American Medical Association 274:1786–1792.
Cruickshanks. K.J., C.M.Vadheim, S.E. Moss. M.P. Roth. W.J. Riley, N.K. Macleran. D. Langfield. R.S. Sparkes. R. Klein. and J.I. Rotter. 1992 Genetic markers associated with proliferative retinopathy in persons diagnosed with diabetes before 30 yr of age. Diabetes 41:879–885.
Eisensmith, R.C.. and S.L. Woo 1994 Population genetics of phenylketonuria. Acta Paediairica. Supplement 407:19–26.
Hyman, B.T., T. Gomez-lsla, M. Briggs, H. Chung, S. Nichols, F. Kohout, and R. Wallace In press A population-based longitudinal study of the influence of apolipoprotein E genotype on the risk of cognitive impairment in the elderly. Annals of Neurology.
Gill, P., and I. Evett 1995 Population genetics of short tandem repeat (STR) loci. Genetica 96:69-87.
Joyce, P., and S. Tavare 1995 The distribution of rare alleles. Journal of Mathematical Biology 33:602-618.
Kalnin, V.V., O.V. Kalnina, M.I. Prosniak, I.M. Khidiatova, E.K. Khusnutdinova, K.S. Raphicov, and S.A. Limborska 1995 Use of DNA fingerprinting for human population genetic studies. Molecular and General Genetics 247:488-493.
Lee, E.J. 1994 Population genetics of the angiotensin-converting enzyme in Chinese. British Journal of Clinical Pharmacology 37:212-214.
Rubinsztein, D.C., D.R. van der Westhuyzen, and G.A. Coetzee 1994 Monogenic primary hypercholesterolemia in South Africa. South African Medical Journal 84:339-344.
Schork, N.J., A.B. Weder, M. Trevisan, and M. Laurenzi 1994 The contribution of pleiotropy to blood pressure and body-mass index variation: The Gubbio Study. American Journal of Human Genetics 54:361-373.
Shivdasani, R.A., and K.C. Anderson 1994 HLA homozygosity and shared HLA haplotypes in the development of transfusion-associated graft-versus-host disease. Leukemia and Lymphoma 15:227-234.
Shickle, D., and I. Harvey 1993 "Inside out," back-to-front: A model for clinical genetic population screening. Journal of Medical Genetics 30:580-582.
Skamene, E. 1994 The Bcg gene story. Immunobiology 191:451-460.
van Oorschot, R.A., S.J. Gutowski. and S.L. Robinson 1994 HUMTHOI: amplification, species specificity, population genetics and forensic applications . International Journal of Legal Medicine 107:121-126.
Wagener, D.K. 1995 Ethical considerations in the design and execution of the National and Hispanic Health and Nutrition Examination Survey (HANES). Environmental Health Perspectives 103 (Supplement 3):75-80.