Abstract

In 1995–1996, the authors mailed a food frequency questionnaire to 3.5 million American Association of Retired Persons members who were aged 50–69 years and who resided in one of six states or two metropolitan areas with high-quality cancer registries. In establishing a cohort of 567,169 persons (340,148 men and 227,021 women), the authors were fortunate in that a less-than-anticipated baseline response rate (threatening inadequate numbers of respondents in the intake extremes) was offset by both a shifting and a widening of the intake distributions among those who provided satisfactory data. Reported median intakes for the first and fifth intake quintiles, respectively, were 20.4 and 40.1 (men) and 20.1 and 40.0 (women) percent calories from fat, 10.3 and 32.0 (men) and 8.7 and 28.7 (women) g per day of dietary fiber, 3.1 and 11.6 (men) and 2.8 and 11.3 (women) servings per day of fruits and vegetables, and 20.7 and 156.8 (men) and 10.5 and 97.0 (women) g per day of red meat. After 5 years of follow-up, the cohort is expected to yield nearly 4,000 breast cancers, more than 10,000 prostate cancers, more than 4,000 colorectal cancers, and more than 900 pancreatic cancers. The large size and wide intake range of the cohort will provide ample power for examining a number of important diet and cancer hypotheses.

In this paper, we discuss the rationale for and design of a new prospective cohort study, the National Institutes of Health–American Association of Retired Persons (NIH-AARP) Diet and Health Study. We emphasize difficulties that hampered implementation of our original design and the (partly serendipitous) resolution of those difficulties. Finally, we present the baseline characteristics, especially the dietary intake distributions, for the cohort that ultimately emerged.

MATERIALS AND METHODS

Rationale

We designed this study to address three methodological problems impeding epidemiologic investigations of diet and cancer: 1) recall bias arising from postdiagnostic dietary assessment in case-control studies; 2) attenuation of true relative risks because of dietary measurement error; and 3) relative homogeneity of dietary intake among participants in studies conducted in one area or country.

To circumvent recall bias, we opted for a prospective cohort study, with dietary assessment taking place prior to cancer diagnosis. Although it is plausible that a diagnosis of cancer could systematically alter the retrospective reporting of typical diet prior to diagnosis, it is not proven that recall bias is a substantial problem in case-control (retrospective) studies of diet and malignancy. Within the context of prospective cohort studies, a few studies have compared prospectively and retrospectively reported fat and energy intake for cancer cases and noncases (14). Although the results of these studies are inconsistent and vary according to the nutrient assessed and the cancer being investigated, they have provided some empirical evidence for the existence of systematic recall bias in case-control studies of diet and cancer. However, even if there were a large series of cohort-based studies showing strong correlations between pre- and postdiagnosis food frequency questionnaire (FFQ) results for virtually every nutrient or food of interest in relation to a wide variety of cancers (which would certainly be a costly research program), investigators could still not be certain that such strong correlations would hold in case-control studies conducted in other demographic or geographic contexts. Thus, it is unlikely that even an extensive, rigorous (and, undoubtedly, expensive) research program would definitively exclude the possibility of recall bias in case-control studies of diet and cancer. Given this essentially intractable uncertainty about recall bias in case-control studies of diet and cancer, we opted for the prospective cohort design in which diet is assessed prior to diagnosis.

The FFQ is now commonly used to assess dietary intake in large epidemiologic studies. A large body of evidence suggests that the FFQ measures true diet with considerable error (5). Such error tends to attenuate the relative risks observed in studies of dietary factors and disease (6). Using a simple errors-in-measurement model and data from the 1987 National Health Interview Survey and the Women's Health Trial Vanguard Study, we calculated the cohort sample size required to take into account the attenuation resulting from the misreporting of dietary fat (7). To detect with 90 percent power, based on a 5 percent significance level for a two-sided statistical test of a “modest” relative risk gradient (one reflecting a relative risk of 1.64 for those who consumed more than 47.5 percent calories compared with those who consumed fewer than 25 percent of calories from fat) for dietary fat and colorectal cancer, we estimated that we would need a cohort of approximately 350,000. As with many sample size calculations, this was a crude estimate based on a number of assumptions about cancer incidence, the true distribution of dietary fat in our target population, the correlation between measured and true fat intake, and the accuracy of so-called reference instruments used in determining “true” intake. Moreover, this calculation was based on only one dietary factor when, in fact, we are interested in examining the relation to cancer of multiple dietary factors, with each being measured with error and having various degrees of correlation with the other factors. What was clear, though, was that our cohort would have to be very large—on the order of several hundred thousand—to offset the attenuation produced by measurement error.

Many populations are characterized by relatively narrow intake distributions for a variety of potentially cancer-related nutrients and foods. In countries such as the United States, for example, a relatively small proportion of the population consumes less than 20 percent of calories from fat or more than several servings of vegetables per day. A study conducted in this country or similar geographic regions is constrained in its capacity to examine the cancer risk associated with more “extreme” intake levels. To circumvent this problem of dietary homogeneity, we used a two-stage cohort construction strategy that allowed us explicitly to enrich our study population with persons at the extremes of intake.

Original design

We chose to establish our cohort within the membership of the AARP for several reasons: 1) the AARP is a very large organization, with a mailing list encompassing over 30 million men and women in the United States; 2) the demographics are favorable, since the AARP comprises both men and women age 50 years and over, an age when cancer occurrence is becoming more frequent; 3) the organization focuses on many health issues, which we thought would help in our recruitment; 4) AARP maintains regular communications with members through its magazine and bulletin; and 5) AARP leadership and research staff had participated earlier in a few smaller studies, were interested in collaborating with the NIH on a large-scale health study, and were willing to provide their mailing list and letterhead to help with recruitment.

On the basis of national survey data (8), we anticipated that only relatively small percentages of respondents to our baseline questionnaire would fall into extreme dietary intake categories (i.e., less than 25 percent and more than 47.5 percent of calories from fat). We reasoned that if we could mail the baseline questionnaire to a sufficiently large pool of people, we would be able to capture enough persons in the extreme intake categories to construct a cohort permitting stable estimates of cancer risk associated with the intake extremes. Assuming a response rate of approximately one in three, we calculated that we would need to send the baseline questionnaire to 3.5 million people. We planned to construct our cohort by including all respondents in the intake extremes and a random sample of persons who reported intake between these extremes.

In 1995–1996, we sent a 16-page-questionnaire to 3.5 million AARP members in six states (California, Florida, Pennsylvania, New Jersey, North Carolina, and Louisiana) and two metropolitan areas (Atlanta, Georgia, and Detroit, Michigan). In choosing our states and metropolitan areas, we considered only those areas with cancer registries certified as having at least 90 or 95 percent completeness of case ascertainment. To make the mailing process more efficient, we attempted to minimize the total number of states for our initial mailing and, therefore, picked states with large AARP membership. Because we also wanted to maximize the minority composition of the cohort, we targeted some smaller states and metropolitan regions with large minority populations.

The baseline questionnaire was a grid-based version of the new National Cancer Institute instrument, the Diet History Questionnaire. The Diet History Questionnaire was modeled initially on the Block questionnaire, underwent extensive cognitive testing during its development, and used a nutrient composition database newly derived from national survey data (911). The baseline questionnaire included 124 food items with portion size and 21 questions on intake of low-fat, high-fiber foods and food preparation. This questionnaire does allow for variation in eating patterns in different parts of the country to the extent that this variation is reflected in the line items and questions on cooking practices contained within the instrument. Although the questionnaire was designed for “general population” use, some regional and ethnic group-specific foods (flour and corn tortillas, for example) were included. The baseline instrument also included questions on demographic characteristics and other potential cancer risk factors (smoking, physical activity, family history of cancer, medical conditions, reproductive factors, and exogenous hormone use). We mailed this questionnaire in three waves: 250,000 in October 1995, 1 million in February 1996, and 2.25 million in May 1996.

We planned a calibration substudy among approximately 2,000 respondents to the baseline questionnaire. In addition to the baseline FFQ, each calibration study participant was to complete two 24-hour dietary recalls (administered by telephone an average of 25 days apart) and a second food frequency questionnaire.

We planned to use the cancer registries as our primary means of ascertaining cases. Although some registries have mechanisms for capturing data from persons who go to contiguous states or geographic areas for medical care, we expected that a small proportion of cohort members would move outside the purview of the registries in our target states and metropolitan areas. We therefore planned to mail a follow-up questionnaire to cohort members at the end of a 5-year period of observation. This short questionnaire would ask if cancer had occurred and whether the participant continued to reside in the registry area.

RESULTS

Implementing the original design

A total of 617,119 men and women returned the baseline questionnaire, a response rate of 17.6 percent. Data from the questionnaires were entered via Optical Mark Read (National Computer Systems, Minneapolis, Minnesota) scanners, capable of reading 12,000 pages per hour with 99.99 percent accuracy. The scanners used an “ink read” technique that permitted reading of questionnaires marked in pen rather than black lead pencil. All questionnaires were manually examined for damage prior to scanning. During scanning, we checked at least one record in every thousand by comparing the marks on the original document with the output of the scanner. Less than 0.1 percent of the questionnaires could not be scanned due to form damage; data from these damaged instruments were entered manually.

We excluded respondents for whom gender was unknown and those who skipped substantial portions of the questionnaire, were proxies for the intended respondent (this was indicated by a question on the front cover of the questionnaire), had more than 10 recording errors, reported fewer than 10 foods consumed, or subsequently requested to be removed from the study (table 1). A total of 567,169 persons (340,148 men and 227,021 women) respondents remained after these exclusions.

TABLE 1.

Exclusions from the cohort,* NIH-AARP Diet and Health Study, 1995–1996

Respondents (n = 617,119)
Skipped everything1,723
Skipped facing pages25,829
Deceased or proxy respondent13,442
Unknown gender6
>10 recording errors8,028
<10 foods indicated99
Asked to be dropped from study823
Excluded49,950
Respondents (n = 617,119)
Skipped everything1,723
Skipped facing pages25,829
Deceased or proxy respondent13,442
Unknown gender6
>10 recording errors8,028
<10 foods indicated99
Asked to be dropped from study823
Excluded49,950
*

There were 567,169 persons in the cohort (340,148 males and 227,021 females).

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

TABLE 1.

Exclusions from the cohort,* NIH-AARP Diet and Health Study, 1995–1996

Respondents (n = 617,119)
Skipped everything1,723
Skipped facing pages25,829
Deceased or proxy respondent13,442
Unknown gender6
>10 recording errors8,028
<10 foods indicated99
Asked to be dropped from study823
Excluded49,950
Respondents (n = 617,119)
Skipped everything1,723
Skipped facing pages25,829
Deceased or proxy respondent13,442
Unknown gender6
>10 recording errors8,028
<10 foods indicated99
Asked to be dropped from study823
Excluded49,950
*

There were 567,169 persons in the cohort (340,148 males and 227,021 females).

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

The 17.6 percent response rate to the baseline mailing was lower than we had (overoptimistically) expected from the AARP membership. Because we had calculated the initial mailing size (3.5 million) on the basis of a higher response rate, we faced the prospect of fewer respondents with extreme intakes available for the final cohort. (Due to fiscal and administrative constraints, we were compelled to move directly to the field in 1995 and could not conduct a pilot study to estimate the response rate prior to the first mailing wave.)

It turns out, though, that the intake distributions for respondents differed from those observed in national surveys. Our AARP respondents consumed less fat and red meat and more fiber and fruits and vegetables than comparably aged adults in the general US population. Moreover, the intake distributions for these dietary factors were wider than those in the national surveys. This combination of both a shifting and a widening of the intake distributions among respondents compensated for the less-than-anticipated response rate. (The lower response rate did have one fortuitous, if unintended, consequence: The return mailing costs were nearly a million dollars lower than the initial budget projection.)

In calculating our initial cohort sample size of 350,000, we focused on a single nutrient, dietary fat. After review of the baseline data, it was clear that we had satisfactory dietary information on an additional 200,000 men and women. Other dietary factors of interest were not strongly correlated with fat, so that persons in the extreme quintiles for fat are not necessarily those in the extreme quintiles for fiber, fruits and vegetables, and red meat. By including all 567,169 people in the final cohort, we were able to achieve a reasonably wide intake range for all four dietary factors.

Baseline cohort characteristics are summarized in table 2. The mean age was approximately 62 years. The cohort is predominantly White and more educated than the general population: Forty-five percent of the men and 31 percent of the women are college graduates. Baseline body size was comparable with national averages, with a mean body mass index of 27.2 in men and 26.9 in women. Although over half of the cohort reported being former smokers, only a little more than 10 percent of the men and 14 percent of the women reported current smoking. (These percentages of current smoking are somewhat lower than national numbers (12)). Physical activity (defined as lasting at least 20 minutes and causing either increases in breathing or heart rate or, alternatively, working up a sweat) showed considerable variation: 16 percent of the men and nearly 23 percent of the women never or rarely engaged in such activity, whereas 21 percent of the men and 16 percent of the women were physically active five or more times per week.

TABLE 2.

Baseline cohort characteristics, NIH-AARP* Diet and Health Study, 1995–1996

Men (n = 340,148)Women (n = 227,021)
Age (mean)62.361.9
Mean height in cm (inches)178.3 (70.2)163.3 (64.3)
Mean weight in kg (pounds)86.6 (191.0)71.7 (158.0)
Mean BMI* (weight (kg)/height (m)2)27.226.9
Race/ethnicity (%)
 White, not Hispanic93.790.9
 Black, Hispanic, Asian, other6.49.1
Education (%)
 ≤11 years6.66.6
 12 years/high school16.526.2
 Vocational/technical9.611.0
 Some college22.525.6
 College graduate21.915.2
 Postgraduate23.015.5
Family history of cancer (participant or first-degree relative) (%)31.437.8
Currently smoking (%)10.714.4
Former smoker (%)59.240.7
Physical activity (%)
 Never/rarely15.823.2
 1–3/month13.114.4
 1–2/week21.921.1
 3–4/week27.925.1
 ≥5/week21.316.3
Currently using HRT*42.8
Median kcal/day1,895.11,470.5
Alcohol intake (≥15 g/day)28.011.3
Men (n = 340,148)Women (n = 227,021)
Age (mean)62.361.9
Mean height in cm (inches)178.3 (70.2)163.3 (64.3)
Mean weight in kg (pounds)86.6 (191.0)71.7 (158.0)
Mean BMI* (weight (kg)/height (m)2)27.226.9
Race/ethnicity (%)
 White, not Hispanic93.790.9
 Black, Hispanic, Asian, other6.49.1
Education (%)
 ≤11 years6.66.6
 12 years/high school16.526.2
 Vocational/technical9.611.0
 Some college22.525.6
 College graduate21.915.2
 Postgraduate23.015.5
Family history of cancer (participant or first-degree relative) (%)31.437.8
Currently smoking (%)10.714.4
Former smoker (%)59.240.7
Physical activity (%)
 Never/rarely15.823.2
 1–3/month13.114.4
 1–2/week21.921.1
 3–4/week27.925.1
 ≥5/week21.316.3
Currently using HRT*42.8
Median kcal/day1,895.11,470.5
Alcohol intake (≥15 g/day)28.011.3
*

NIH, National Institutes of Health; AARP, American Association of Retired Persons; BMI, body mass index; HRT, hormone replacement therapy.

1 inch = 2.54 cm; 1 pound = 4.54 kg.

These data derive from the 540,010 persons (323,412 men and 216,598 women) who remained after those whose reported energy intake was too low or too high (men, <800 or ≥4,200 kcal, n = 16,736; women, <600 or ≥3,500 kcal, n = 10,423) were excluded from the cohort.

TABLE 2.

Baseline cohort characteristics, NIH-AARP* Diet and Health Study, 1995–1996

Men (n = 340,148)Women (n = 227,021)
Age (mean)62.361.9
Mean height in cm (inches)178.3 (70.2)163.3 (64.3)
Mean weight in kg (pounds)86.6 (191.0)71.7 (158.0)
Mean BMI* (weight (kg)/height (m)2)27.226.9
Race/ethnicity (%)
 White, not Hispanic93.790.9
 Black, Hispanic, Asian, other6.49.1
Education (%)
 ≤11 years6.66.6
 12 years/high school16.526.2
 Vocational/technical9.611.0
 Some college22.525.6
 College graduate21.915.2
 Postgraduate23.015.5
Family history of cancer (participant or first-degree relative) (%)31.437.8
Currently smoking (%)10.714.4
Former smoker (%)59.240.7
Physical activity (%)
 Never/rarely15.823.2
 1–3/month13.114.4
 1–2/week21.921.1
 3–4/week27.925.1
 ≥5/week21.316.3
Currently using HRT*42.8
Median kcal/day1,895.11,470.5
Alcohol intake (≥15 g/day)28.011.3
Men (n = 340,148)Women (n = 227,021)
Age (mean)62.361.9
Mean height in cm (inches)178.3 (70.2)163.3 (64.3)
Mean weight in kg (pounds)86.6 (191.0)71.7 (158.0)
Mean BMI* (weight (kg)/height (m)2)27.226.9
Race/ethnicity (%)
 White, not Hispanic93.790.9
 Black, Hispanic, Asian, other6.49.1
Education (%)
 ≤11 years6.66.6
 12 years/high school16.526.2
 Vocational/technical9.611.0
 Some college22.525.6
 College graduate21.915.2
 Postgraduate23.015.5
Family history of cancer (participant or first-degree relative) (%)31.437.8
Currently smoking (%)10.714.4
Former smoker (%)59.240.7
Physical activity (%)
 Never/rarely15.823.2
 1–3/month13.114.4
 1–2/week21.921.1
 3–4/week27.925.1
 ≥5/week21.316.3
Currently using HRT*42.8
Median kcal/day1,895.11,470.5
Alcohol intake (≥15 g/day)28.011.3
*

NIH, National Institutes of Health; AARP, American Association of Retired Persons; BMI, body mass index; HRT, hormone replacement therapy.

1 inch = 2.54 cm; 1 pound = 4.54 kg.

These data derive from the 540,010 persons (323,412 men and 216,598 women) who remained after those whose reported energy intake was too low or too high (men, <800 or ≥4,200 kcal, n = 16,736; women, <600 or ≥3,500 kcal, n = 10,423) were excluded from the cohort.

Table 3 shows, for each quintile of intake, the median values for four dietary factors: percent kilocalories from fat, dietary fiber (g/day), fruits and vegetables (servings/day), and red meat (g/day). To derive these data, we first excluded from the baseline cohort all men who reported consuming less than 800 kcal or 4,200 or more kcal and all women who reported an intake of less than 600 kcal or 3,500 or more kcal. (This is a typical approach to excluding calorie outliers, but it is not the only one that can be used.) The range of intake was similar in both men and women. The median values for fat as a percent of total energy intake ranged from approximately 20 percent in the lowest quintile to 40 percent in the highest quintile. Dietary fiber intake in the highest quintile was about three times that in the lowest. Fruit and vegetable intake was nearly four times greater in the fifth as opposed to first quintile. Median intake of red meat among study cohort members in the top quintile was more than seven times (for men) and nine times (for women) that of members in the bottom quintile.

TABLE 3.

Dietary characteristics by quintile,* NIH-AARP Diet and Health Study, 1995–1996

Dietary factorsMedian values per quintile
Quintile 1Quintile 2Quintile 3Quintile 4Quintile 5
Men (n = 323,412)
% kcal from fat20.426.730.834.740.1
Dietary fiber (g/day)10.314.718.623.232.0
Fruits and vegetables (servings/day)3.14.76.28.011.6
Red meat (g/day)20.744.266.896.5156.8
Women (n = 216,598)
% kcal from fat20.125.830.034.140.0
Dietary fiber (g/day)8.712.716.220.528.7
Fruits and vegetables (servings/day)2.84.55.97.711.3
Red meat (g/day)10.524.338.457.697.0
Dietary factorsMedian values per quintile
Quintile 1Quintile 2Quintile 3Quintile 4Quintile 5
Men (n = 323,412)
% kcal from fat20.426.730.834.740.1
Dietary fiber (g/day)10.314.718.623.232.0
Fruits and vegetables (servings/day)3.14.76.28.011.6
Red meat (g/day)20.744.266.896.5156.8
Women (n = 216,598)
% kcal from fat20.125.830.034.140.0
Dietary fiber (g/day)8.712.716.220.528.7
Fruits and vegetables (servings/day)2.84.55.97.711.3
Red meat (g/day)10.524.338.457.697.0
*

These analyses are based on the 540,010 persons (323,412 men and 216,598 women) who remained after those whose reported energy intake was too low or too high (men, <800 or ≥4,200 kcal, n = 16,736; women, <600 or ≥3,500 kcal, n = 10,423) were excluded from the overall cohort.

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

Intakes of fruits and vegetables are expressed in terms of standardized, recommended servings per day based on Health and Human Services US Department of Agriculture dietary guidance as specified in the US Department of Agriculture Food-Guide Pyramid (16). A Pyramid serving of fruit is defined as one medium fresh fruit, 1/2 cup (120 g) of cut-up fruit, or 6 ounces (28.4 g) of juice; a Pyramid serving of vegetables is defined as 1 cup (240 g) of leafy vegetables, 1/2 cup of other vegetables, or 6 ounces of juice.

TABLE 3.

Dietary characteristics by quintile,* NIH-AARP Diet and Health Study, 1995–1996

Dietary factorsMedian values per quintile
Quintile 1Quintile 2Quintile 3Quintile 4Quintile 5
Men (n = 323,412)
% kcal from fat20.426.730.834.740.1
Dietary fiber (g/day)10.314.718.623.232.0
Fruits and vegetables (servings/day)3.14.76.28.011.6
Red meat (g/day)20.744.266.896.5156.8
Women (n = 216,598)
% kcal from fat20.125.830.034.140.0
Dietary fiber (g/day)8.712.716.220.528.7
Fruits and vegetables (servings/day)2.84.55.97.711.3
Red meat (g/day)10.524.338.457.697.0
Dietary factorsMedian values per quintile
Quintile 1Quintile 2Quintile 3Quintile 4Quintile 5
Men (n = 323,412)
% kcal from fat20.426.730.834.740.1
Dietary fiber (g/day)10.314.718.623.232.0
Fruits and vegetables (servings/day)3.14.76.28.011.6
Red meat (g/day)20.744.266.896.5156.8
Women (n = 216,598)
% kcal from fat20.125.830.034.140.0
Dietary fiber (g/day)8.712.716.220.528.7
Fruits and vegetables (servings/day)2.84.55.97.711.3
Red meat (g/day)10.524.338.457.697.0
*

These analyses are based on the 540,010 persons (323,412 men and 216,598 women) who remained after those whose reported energy intake was too low or too high (men, <800 or ≥4,200 kcal, n = 16,736; women, <600 or ≥3,500 kcal, n = 10,423) were excluded from the overall cohort.

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

Intakes of fruits and vegetables are expressed in terms of standardized, recommended servings per day based on Health and Human Services US Department of Agriculture dietary guidance as specified in the US Department of Agriculture Food-Guide Pyramid (16). A Pyramid serving of fruit is defined as one medium fresh fruit, 1/2 cup (120 g) of cut-up fruit, or 6 ounces (28.4 g) of juice; a Pyramid serving of vegetables is defined as 1 cup (240 g) of leafy vegetables, 1/2 cup of other vegetables, or 6 ounces of juice.

To ensure that our calibration study paralleled that of the main study in comprising persons with extreme intakes of four dietary factors (fat as a percent of total calories, fiber, fruits and vegetables, and red meat), we randomly selected baseline questionnaire respondents within various combinations of intake strata for these four factors. Of the 2,795 respondents initially approached, 2,055 (74 percent) participated by completing the first 24-hour dietary recall interview, and 1,415 (51 percent) completed both 24-hour dietary recall interviews and a second food frequency questionnaire. The 24-hour dietary recalls were conducted by experienced interviewers who completed a rigorous training session, which included home study, demonstration interviews, interactive lectures, and role playing. Table 4 shows the correlations, adjusted for random within-person error in the 24-hour dietary recall, between food frequency questionnaire and recall values for the four dietary factors.

TABLE 4.

Correlations between intakes from food frequency questionnaires and 24-hour dietary recalls,* NIH-AARP Diet and Health Study, 1995–1996

MaleFemale
% calories from fat0.690.64
Fiber0.480.42
Fruits and vegetables0.540.47
Red meat0.620.70
MaleFemale
% calories from fat0.690.64
Fiber0.480.42
Fruits and vegetables0.540.47
Red meat0.620.70
*

These correlations were obtained by first determining the correlation between food frequency questionnaire and 24-hour dietary recall intake, then adjusting for random within-person error in the 24-hour recall (17). These numbers are based on 1,415 persons who completed two 24-hour dietary recalls and two food frequency questionnaires.

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

TABLE 4.

Correlations between intakes from food frequency questionnaires and 24-hour dietary recalls,* NIH-AARP Diet and Health Study, 1995–1996

MaleFemale
% calories from fat0.690.64
Fiber0.480.42
Fruits and vegetables0.540.47
Red meat0.620.70
MaleFemale
% calories from fat0.690.64
Fiber0.480.42
Fruits and vegetables0.540.47
Red meat0.620.70
*

These correlations were obtained by first determining the correlation between food frequency questionnaire and 24-hour dietary recall intake, then adjusting for random within-person error in the 24-hour recall (17). These numbers are based on 1,415 persons who completed two 24-hour dietary recalls and two food frequency questionnaires.

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

Table 5 shows the number of incident, site-specific cancers expected in the cohort after 5 and 10 years of follow-up. These estimates were based on 1994–1996 data for all persons covered in the Surveillance, Epidemiology, and End Results program of the National Cancer Institute (13).

TABLE 5.

Expected numbers of selected incident cancers,* NIH-AARP Diet and Health Study, 1995–1996

Site5 years of follow-up
10 years of follow-up
MaleFemaleBothMaleFemaleBoth
Breast3,7733,7737,6357,635
Prostate10,74610,74622,75222,752
Colorectum2,9061,3284,2346,3183,0179,335
Lung4,7841,9216,70510,2464,12814,374
Pancreas6082929001,3296762,005
Bladder1,7223042,0263,7896874,476
Ovary473473974974
Endometrium8908901,8181,818
Non-Hodgkin's lymphoma9094381,3471,9279692,896
Site5 years of follow-up
10 years of follow-up
MaleFemaleBothMaleFemaleBoth
Breast3,7733,7737,6357,635
Prostate10,74610,74622,75222,752
Colorectum2,9061,3284,2346,3183,0179,335
Lung4,7841,9216,70510,2464,12814,374
Pancreas6082929001,3296762,005
Bladder1,7223042,0263,7896874,476
Ovary473473974974
Endometrium8908901,8181,818
Non-Hodgkin's lymphoma9094381,3471,9279692,896
*

Based on 1994–1996 Surveillance, Epidemiology, and End Results data (11). For invasive cancer only.

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

TABLE 5.

Expected numbers of selected incident cancers,* NIH-AARP Diet and Health Study, 1995–1996

Site5 years of follow-up
10 years of follow-up
MaleFemaleBothMaleFemaleBoth
Breast3,7733,7737,6357,635
Prostate10,74610,74622,75222,752
Colorectum2,9061,3284,2346,3183,0179,335
Lung4,7841,9216,70510,2464,12814,374
Pancreas6082929001,3296762,005
Bladder1,7223042,0263,7896874,476
Ovary473473974974
Endometrium8908901,8181,818
Non-Hodgkin's lymphoma9094381,3471,9279692,896
Site5 years of follow-up
10 years of follow-up
MaleFemaleBothMaleFemaleBoth
Breast3,7733,7737,6357,635
Prostate10,74610,74622,75222,752
Colorectum2,9061,3284,2346,3183,0179,335
Lung4,7841,9216,70510,2464,12814,374
Pancreas6082929001,3296762,005
Bladder1,7223042,0263,7896874,476
Ovary473473974974
Endometrium8908901,8181,818
Non-Hodgkin's lymphoma9094381,3471,9279692,896
*

Based on 1994–1996 Surveillance, Epidemiology, and End Results data (11). For invasive cancer only.

NIH, National Institutes of Health; AARP, American Association of Retired Persons.

In table 6, we present results of power calculations based on the distribution of dietary factors (table 3), the expected number of incident cancers (table 5), and the relative risk attenuation resulting from dietary measurement error (14). The calculations were designed to determine the power to detect “moderate” true relative risks of 1.6 and 1.8, reflecting a comparison of the cancer risks associated with the median intakes for the first and fifth quintiles for a given dietary factor. We used the calibration study data to determine λ, the attenuation factor; for estimation of λ, we used the standard model (5), which assumes that the reference instrument (our 24-hour recall) gives an unbiased estimate of true intake and has errors uncorrelated with those from the FFQ. As an example, for percentage of calories from fat in men, we calculated λ to be 0.54. Because the relation between the true and observed relative risk is given generally by [RR>T]λ = RRobs, the RRobs for percentage of calories from fat is 1.60.54 = 1.29 for the true relative risk of 1.6 and 1.80.54 = 1.37 for a true relative risk of 1.8. We applied a newer, more robust model, one that does not make the assumption that the reference instrument provides an unbiased measure of true intake, to an empirical data set that contained information on dietary protein from a food frequency questionnaire and a “reference” instrument (4-day weighed food record) and had an unbiased biomarker (urinary nitrogen) (14, 15). Our analysis indicated that the attenuation was 34 percent greater than that predicted by the standard model. Therefore, the true relative risk of 1.6 would generate an observed relative risk of 1.18 (1.6[0.54 × 0.66]); the true relative risk of 1.8 would yield an observed relative risk of 1.23. We found comparable degrees of relative risk attenuation in similar calculations carried out for women and for fiber, fruits and vegetables, and red meat.

TABLE 6.

Power to detect selected observed relative risks for fruits and vegetables and red meat for several cancer sites, NIH-AARP* Diet and Health Study, 1995–1996

SiteDietary factorObserved RR*Power (5 years of follow-up)
Power (10 years of follow-up)
MaleFemaleBothMaleFemaleBoth
BreastF/V*1.10.660.92
1.21.01.0
Red meat1.10.650.92
1.20.991.0
ProstateF/V1.10.981.0
1.21.01.0
Red meat1.10.981.0
1.21.01.0
ColorectumF/V1.10.550.290.710.870.570.96
1.20.980.771.01.00.981.0
Red meat1.10.540.280.700.860.550.96
1.20.980.761.01.00.981.0
PancreasF/V1.20.450.240.610.770.490.91
1.30.750.450.890.970.791.0
Red meat1.20.440.240.600.760.480.91
1.30.740.430.870.970.781.0
OvaryF/V1.20.360.64
1.30.640.91
Red meat1.20.350.62
1.30.630.91
NHL*F/V1.20.610.340.780.900.640.98
1.30.900.610.981.00.911.0
Red meat1.20.590.330.770.890.620.98
1.30.880.590.971.00.901.0
SiteDietary factorObserved RR*Power (5 years of follow-up)
Power (10 years of follow-up)
MaleFemaleBothMaleFemaleBoth
BreastF/V*1.10.660.92
1.21.01.0
Red meat1.10.650.92
1.20.991.0
ProstateF/V1.10.981.0
1.21.01.0
Red meat1.10.981.0
1.21.01.0
ColorectumF/V1.10.550.290.710.870.570.96
1.20.980.771.01.00.981.0
Red meat1.10.540.280.700.860.550.96
1.20.980.761.01.00.981.0
PancreasF/V1.20.450.240.610.770.490.91
1.30.750.450.890.970.791.0
Red meat1.20.440.240.600.760.480.91
1.30.740.430.870.970.781.0
OvaryF/V1.20.360.64
1.30.640.91
Red meat1.20.350.62
1.30.630.91
NHL*F/V1.20.610.340.780.900.640.98
1.30.900.610.981.00.911.0
Red meat1.20.590.330.770.890.620.98
1.30.880.590.971.00.901.0
*

NIH, National Institutes of Health; AARP, American Association of Retired Persons; RR, relative risk; F/V, fruits/vegetables; NHL, non-Hodgkin's lymphoma.

TABLE 6.

Power to detect selected observed relative risks for fruits and vegetables and red meat for several cancer sites, NIH-AARP* Diet and Health Study, 1995–1996

SiteDietary factorObserved RR*Power (5 years of follow-up)
Power (10 years of follow-up)
MaleFemaleBothMaleFemaleBoth
BreastF/V*1.10.660.92
1.21.01.0
Red meat1.10.650.92
1.20.991.0
ProstateF/V1.10.981.0
1.21.01.0
Red meat1.10.981.0
1.21.01.0
ColorectumF/V1.10.550.290.710.870.570.96
1.20.980.771.01.00.981.0
Red meat1.10.540.280.700.860.550.96
1.20.980.761.01.00.981.0
PancreasF/V1.20.450.240.610.770.490.91
1.30.750.450.890.970.791.0
Red meat1.20.440.240.600.760.480.91
1.30.740.430.870.970.781.0
OvaryF/V1.20.360.64
1.30.640.91
Red meat1.20.350.62
1.30.630.91
NHL*F/V1.20.610.340.780.900.640.98
1.30.900.610.981.00.911.0
Red meat1.20.590.330.770.890.620.98
1.30.880.590.971.00.901.0
SiteDietary factorObserved RR*Power (5 years of follow-up)
Power (10 years of follow-up)
MaleFemaleBothMaleFemaleBoth
BreastF/V*1.10.660.92
1.21.01.0
Red meat1.10.650.92
1.20.991.0
ProstateF/V1.10.981.0
1.21.01.0
Red meat1.10.981.0
1.21.01.0
ColorectumF/V1.10.550.290.710.870.570.96
1.20.980.771.01.00.981.0
Red meat1.10.540.280.700.860.550.96
1.20.980.761.01.00.981.0
PancreasF/V1.20.450.240.610.770.490.91
1.30.750.450.890.970.791.0
Red meat1.20.440.240.600.760.480.91
1.30.740.430.870.970.781.0
OvaryF/V1.20.360.64
1.30.640.91
Red meat1.20.350.62
1.30.630.91
NHL*F/V1.20.610.340.780.900.640.98
1.30.900.610.981.00.911.0
Red meat1.20.590.330.770.890.620.98
1.30.880.590.971.00.901.0
*

NIH, National Institutes of Health; AARP, American Association of Retired Persons; RR, relative risk; F/V, fruits/vegetables; NHL, non-Hodgkin's lymphoma.

We then calculated the power of the study to detect observed relative risks in the range of 1.1–1.3 for several cancers. These power calculations are presented in table 6.

DISCUSSION

Our two-stage stategy (wide screen, then inclusion of all persons at extremes and a subset of those in-between) was compromised by the lower-than-expected response rate. We ultimately did create a large cohort with wide intake distributions (table 3), but only after an unanticipated development that offset the low initial response: the shifting and widening in the intake distributions that became apparent with our first mailing wave. Even though we were not able to implement our two-stage strategy, this design may prove useful to other investigators interested in establishing cohorts with wide exposure distributions.

Our sample size calculations (table 6) show that, even allowing for attenuation due to dietary measurement error, we will have more than 90 percent power to detect moderate relative risk increases after 5 years of follow-up for four major dietary factors (fat, fiber, fruits and vegetables, and red meat) in analyses of colorectal, breast, and even pancreatic cancer outcomes. In the interest of space, we have chosen to present power calculations only for selected outcomes. Clearly, the power to detect moderate relative risk increases within 5 years of follow-up will be very high for prostate and lung cancer analyses. The power to detect moderate relative risk increases for non-Hodgkin's lymphoma after 5 years of follow-up will be even greater than that for pancreatic cancer. For ovarian cancer, we will require 10 years of follow-up to achieve 90 percent power to detect moderate relative risk increases. It should be noted, however, that should our AARP respondents turn out to be “healthier” than the Surveillance, Epidemiology, and End Results population, our expected number of cases (table 5) and power calculations (table 6) may be somewhat overoptimistic.

We are currently conducting within the cohort a pilot study of our endpoint assessment procedures. This study will determine the incidence rates for a few major cancers, the response rate to the follow-up questionnaire, the proportion of cohort members who no longer reside in the registry reporting areas (and, among these, the proportion of cancers reported on the follow-up questionnaire that can be confirmed through written requests to physicians and hospitals), and the proportion of cohort members who are lost to follow-up.

We developed a second questionnaire that was mailed in late 1996 to the baseline questionnaire respondents. This second questionnaire, derived from a series of instruments used in other National Cancer Institute studies, asked more-detailed questions on potential cancer risk than we could include in the baseline instrument because of the size of the FFQ and the constraints imposed by the mass mailing cost structure. This instrument also included questions on typical diet in adolescence and 10 years earlier. Although data on the accuracy of dietary information for adolescence obtained in later adult years are sparse, we concluded that the potential importance of adolescent diet-cancer hypotheses warranted collecting this exposure information. The response rate to this second questionnaire was approximately 63 percent. The administration of this questionnaire was a “one-shot” activity; we did not have resources available at that time for pretesting or for further mailings or telephone contacts to increase the response rate. Nevertheless, a subcohort of approximately 339,000 persons with information on past as well as current diet, in addition to more detailed information on family history, physical activity, body size, and other risk factors, may prove valuable.

Another ongoing pilot study will aid us in determining the optimum method for collecting buccal cells for DNA from the cohort. On the basis of preliminary findings from a recent buccal-cell collection pilot study within the cohort, we anticipate being able to collect DNA from about 200,000 of our cohort members. These specimens will add a molecular dimension to the study and contribute to the worldwide epidemiologic resources available for studying the interplay of genetic, dietary, and other environmental factors.

The NIH-AARP Diet and Health Study has the prospective design, large size, and wide intake range as well as a realistic possibility of acquiring a large number of biologic specimens to make a useful contribution to the nutritional epidemiology of cancer.

Reprint requests to Dr. Arthur Schatzkin, National Cancer Institute, 6120 Executive Blvd., EPS 7032, Bethesda, MD 20892–7232 (e-mail: schatzka@mail.nih.gov).

REFERENCES

1.

Friedenreich CM, Howe GR, Miller AB. An investigation of recall bias in the reporting of past food intake among breast cancer cases and controls.

Ann Epidemiol
1991
;
1
:
439
–53.

2.

Wilkins LR, Hankin JH, Yoshizawa CN, et al. Comparison of long-term dietary recall between cancer cases and noncases.

Am J Epidemiol
1992
;
136
:
825
–35.

3.

Giovannucci E, Stampfer MJ, Colditz GA, et al. Recall and selection bias in reporting past alcohol consumption among breast cancer cases.

Cancer Causes Control
1993
;
4
:
441
–8.

4.

Holmberg L, Ohlander EM, Byers T, et al. A search for recall bias in a case-control study of diet and breast cancer.

Int J Epidemiol
1996
;
25
:
235
–44.

5.

Freedman LS, Carroll RJ, Wax Y. Estimating the relation between dietary intake obtained from a food frequency questionnaire and true average intake.

Am J Epidemiol
1991
;
134
:
510
–20.

6.

Freudenheim JL, Marshall JR. The problem of profound mismeasurement and the power of epidemiologic studies of diet and cancer.

Nutr Cancer
1988
;
11
:
243
–50.

7.

Freedman LS, Schatzkin A, Wax Y. The impact of dietary measurement error on planning a sample size required in a cohort study.

Am J Epidemiol
1990
;
132
:
1185
–95.

8.

US Department of Agriculture. Agricultural Research Service. Food and nutrient intakes by individuals in the United States by sex and age. (Nationwide Food Surveys report no. 96–2). Washington, DC: US Department of Agriculture,1998:1004–96.

9.

Subar AF, Thompson FE, Smith AF, et al. Improving food frequency questionnaires: an example of the role of cognitive research.

J Am Diet Assoc
1995
;
95
:
781
–8.

10.

Subar AF, Midthune D, Kulldorff M, et al. Evaluation of alternative approaches to assign nutrient values to food groups in food frequency questionnaires.

Am J Epidemiol
2000
;
152
:
279
–86.

11.

Subar AF, Ziegler RG, Thompson FE, et al. Is shorter always better? Relative importance of questionnaire length and cognitive ease on response rates and data quality.

Am J Epidemiol
2001
;
153
:
404
–9.

12.

Centers for Disease Control and Prevention. National Center for Chronic Disease Prevention and Health Promotion, Behavioral Risk Factor Surveillance System. (http://apps.nccd.cdc.gov/brfss/sex.asp?cat = TU&yr = 1995&qkey = 631&state = US).

13.

Feuer EJ, Wun LM. DEVCAN: Probability of developing or dying of cancer software. Bethesda, MD: National Cancer Institute, 1999.

14.

Kipnis V, Midthune D, Freedman LS, et al. Empirical evidence of correlated biases in dietary assessment instruments and its implications.

Am J Epidemiol
2001
;
153
:
394
–403.

15.

Bingham SA, Gill C, Welch A, et al. Comparison of dietary assessment methods in nutritional epidemiology: weighed records v. 24 h. recalls, food frequency questionnaires and estimated-diet records.

Br J Nutr
1994
;
72
:
619
–43.

16.

US Department of Agriculture. The food guide pyramid. 2000. Hyattsville, MD: Human Nutrition Information Service, 1992.

17.

Rosner B, Willett WC. Interval estimates for correlation coefficients corrected for within-person variation: implications for study design and hypothesis testing.

Am J Epidemiol
1988
;
127
:
377
–86.