An Invaluable Descriptive Epidemiology Resource
The Surveillance, Epidemiology, and End Results (SEER) Program data are critical to understanding cancer in the U.S. SEER is a collection of cancer registries, which collect, store, and manage data on people with cancer. Before 1973, only a few registries existed in the U.S. Supported by the 1971 National Cancer Act, SEER launched with registries in five states and two metropolitan areas that encompassed less than ten percent of the U.S. population. Fifty years later, the program has expanded to 18 registries, representing nearly 50 percent of the U.S. population.
In the absence of a nationwide registry, the SEER Program provides a population-based approach to exploring patterns and trends in cancer incidence and outcomes through comparisons across groups, places, and time. These data present a big-picture view and allow investigators to generate new hypotheses and research questions.
“I think of descriptive epidemiology as the beginning and the end of our work in DCEG,” said Meredith Shiels, Ph.D., M.H.S., senior investigator in the Infections and Immunoepidemiology Branch (IIB). “At the beginning, we’re looking for clues to better understand cancer etiology, and at the end, we want to know if interventions are having an impact at the population level.” SEER enables these studies.
Investigators in the Radiation Epidemiology Branch (REB) and the Cancer Survivorship Research Unit (CSRU) use SEER data to quantify outcomes in cancer survivors as a standard part of the research process. “Leveraging the strengths of the SEER Program, particularly the large sample size, enables us to discover meaningful patterns in cancer survivors and helps identify priorities for future research,” said Lindsay M. Morton, Ph.D., Branch Director, Head of CSRU, and senior investigator in REB. “Descriptive studies can help us understand when we should launch a more detailed investigation of risk factors.” With over 18 million cancer survivors in the U.S. today, SEER data are especially critical to investigations of second cancers and survivorship.
Similar opportunities arise for rare cancers. “We’re studying a rare type of non-Hodgkin lymphoma called primary effusion lymphoma (PEL). It’s hard to get data for PEL, but using SEER, we can begin to look at patterns, and that’s been very informative,” said Eric A. Engels, M.D., M.P.H., Branch Director and senior investigator, IIB.
SEER also includes demographic data. “You can look at cancer rates by racial or ethnic group or link to county-level metrics of socioeconomic status or degree of rurality,” said Dr. Shiels. “These data can highlight differences that show a need for more detailed studies to identify the underlying causes of these health disparities.” A recent DCEG example of this approach is a study by REB Assistant Clinical Investigator Jacqueline Vo, Ph.D., R.N., M.P.H., on geographic disparities in mortality from cardiovascular disease among breast cancer survivors.
Racial and ethnic demographic data available in SEER have evolved with the U.S. census to allow individuals to self-identify with greater detail. This advance has enabled researchers to disaggregate racial and ethnic groups further, as in a recent study from Maria Constanza Camargo, Ph.D., Earl Stadtman investigator in the Metabolic Epidemiology Branch, which examined esophageal and gastric cancer mortality in more specific categories of Asian and Pacific Islander ancestry.
Evolving with Scientific Advances
Analyzing SEER Data
SEER has also evolved with technology. When the program began, computers were a relatively new tool. Today, the data are easily accessible online and can be downloaded with a simple user agreement. The software package SEER*Stat makes initial analysis of the immense data relatively straightforward; since its inception, DCEG scientists have been involved in its ongoing development and improvement.
One example of this collaboration is the NCI Second Cancers Monograph, led by Rochelle E. Curtis, M.A., staff scientist in REB. The Monograph was a critical contribution to our understanding of second cancers and the first to provide a comprehensive analysis of the risk of developing subsequent malignancies in the U.S. population. Following this analysis, she worked closely with SEER to develop a module of SEER*Stat for analysis of second cancers, called multiple primary standardized incidence ratio (MP-SIR). In this module, a cohort of cancer survivors is followed through time in order to compare their cancer incidence rate with the incidence rate for the general population.
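The comparison behind a standardized incidence ratio can be illustrated with a short Python sketch. This is not the MP-SIR module or its interface; it only shows the general observed-over-expected idea, assuming follow-up time has already been tabulated by stratum (for example, by age group). The `Stratum` class, the function name, and all numbers are hypothetical and made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Stratum:
    """One stratum (e.g., an age group) of a survivor cohort. Hypothetical structure."""
    person_years: float    # follow-up time accumulated by survivors in this stratum
    reference_rate: float  # general-population incidence per 100,000 person-years
    observed: int          # second cancers observed among survivors in this stratum


def standardized_incidence_ratio(strata):
    """Return (observed, expected, SIR) summed over all strata.

    Expected cases are the number that would occur if survivors had the
    same incidence rates as the general population.
    """
    observed = sum(s.observed for s in strata)
    expected = sum(s.person_years * s.reference_rate / 100_000 for s in strata)
    if expected <= 0:
        raise ValueError("expected case count must be positive")
    return observed, expected, observed / expected


# Illustrative, made-up numbers (not SEER data).
strata = [
    Stratum(person_years=20_000, reference_rate=50.0, observed=15),
    Stratum(person_years=10_000, reference_rate=200.0, observed=30),
]
observed, expected, sir = standardized_incidence_ratio(strata)
print(f"Observed={observed}, Expected={expected:.1f}, SIR={sir:.2f}")
```

A ratio above 1 indicates that survivors developed more cancers than would be expected at general-population rates; a full analysis would also report a confidence interval, which this sketch omits.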
However, outside of SEER*Stat, the analyses can quickly become more complicated. “Even for a single cancer, we necessarily need to carry out a multivariate analysis,” said Dr. Rosenberg. “This has spurred my interest in software tools and methodological research on foundational methods for cancer surveillance research.” One example is the Age Period Cohort (APC) Analysis Web Tool, which was created specifically to make this important type of analysis more accessible to researchers. “APC allows researchers to disentangle how changes in incidence by age vary according to factors associated with birth cohort versus calendar period,” said Dr. Rosenberg. Age effects may indicate that the etiology of the cancer is related to aging, while birth cohort effects are generational, potentially related to collective lifestyle changes. Period effects are temporal trends, likely relating to a specific event, such as the introduction of a new carcinogenic exposure.
Within the APC approach, Dr. Rosenberg invented “Local Drifts,” an analysis that quantifies how yearly changes in cancer rates vary by age. “Local Drifts helped us establish that early-onset colorectal cancer incidence in the U.S. has been increasing over time, one of our most important discoveries,” said Dr. Rosenberg.
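The idea that yearly changes in rates can differ by age can be illustrated with a much simpler calculation than the APC model itself. The sketch below is not the Local Drifts estimator and not the APC Analysis Web Tool; it fits a separate log-linear trend to each age group and reports an annual percent change, a simplified stand-in. The function name, the age groups, and the synthetic rates are all hypothetical.

```python
import math


def annual_percent_change(years, rates):
    """Least-squares slope of log(rate) on calendar year, as a percent per year."""
    n = len(years)
    if n < 2 or n != len(rates):
        raise ValueError("need at least two (year, rate) pairs of equal length")
    logs = [math.log(r) for r in rates]
    mean_x = sum(years) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, logs))
    slope = sxy / sxx
    return 100.0 * (math.exp(slope) - 1.0)


# Synthetic rates per 100,000 (not SEER data): rising 2% per year in the
# younger group and falling 1% per year in the older group.
years = [2000, 2001, 2002, 2003]
rates_by_age = {
    "40-49": [10.0 * 1.02 ** i for i in range(4)],
    "70-79": [100.0 * 0.99 ** i for i in range(4)],
}
change_by_age = {
    age: annual_percent_change(years, rates) for age, rates in rates_by_age.items()
}
for age, change in change_by_age.items():
    print(f"{age}: {change:+.2f}% per year")
```

Opposite signs across age groups, as in this synthetic example, are the kind of pattern that an age-specific trend analysis is designed to reveal; the APC framework additionally separates period and birth cohort contributions, which this sketch does not attempt.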
This work benefits the entire scientific community but is especially helpful to the epidemiologists who work just down the hall. “I think one big advantage of using SEER data in DCEG is having such a wonderful Biostatistics Branch with whom to collaborate,” said Dr. McGlynn, whose research has relied on APC analysis for critical findings.