Epidemiology of gastric adenocarcinoma suggests that intestinal-type and diffuse-type cancers develop through distinct causal pathways. To examine the differences in risk factors and molecular changes between the histological types, reliable data on histological typing are essential. We evaluated the concordance between two pathologists in assessment of 95 gastric adenocarcinomas for Laurén classification and tumor grade. Two pathologists, each blinded to the other’s assessment, reviewed H&E-stained slides of gastric tumor. The responses of the two pathologists for histological type were considered as concordant if they fell on one of the three categories(intestinal type, diffuse type, or other). Tumor grade was classified into three categories (well, moderately, or poorly differentiated). The pathologists agreed on the classification of histological type for 71 of 92 (77%) tumors. κ coefficient was 0.59 (95% confidence interval, 0.44–0.73). Concordance for tumor grade was 87%, with a κcoefficient of 0.72 (95% confidence interval, 0.57–0.87). Both observed concordance and κ coefficient for histological type and tumor grade were similar across three calendar periods of study. Interobserver agreement was virtually identical between tumors with biopsy specimens only and those with surgical specimens. Although the level of disagreement for histological type observed in this study is comparable with that in other studies, the resulting misclassification would lead to the reduction in observed differences in prevalence and odds ratio estimates between two histological types.

The most common form of stomach cancer is adenocarcinoma (1). Laurén (2) classified gastric adenocarcinoma into two histological types, intestinal and diffuse,according to morphological features of tumor. The decrease in intestinal-type tumors largely accounts for the declining trends of stomach cancer incidence (3), suggesting larger influences of environmental risk factors on intestinal-type cancer than diffuse-type cancer. Preneoplastic lesions, such as intestinal metaplasia, are often associated with intestinal-type cancer but are infrequently associated with diffuse-type cancer. Both observations suggest that the two histological types develop through distinct pathways (4).

Investigators have examined risk factors and molecular changes that might be differentially associated with various histological types. In such investigations, consistent and reliable classification of tumors is essential. As a part of our molecular epidemiological study of stomach cancer, we examined the concordance between two pathologists in assessment of Laurén classification and tumor grade. We also assessed the effects of misclassification in histological typing on the observed differences in prevalence and OR3estimates between intestinal-type and diffuse-type tumors.

Study Subjects.

Study subjects were identified for a study of Helicobacter pylori serology and molecular changes in gastric cancer from the members of the KPMCP of Northern California who had participated in the MPHC program between 1964 and 1969. Diagnoses of gastric cancer between the date of participation in the MPHC program and June 1996 were ascertained by linkage of the MPHC database with the KPMCP’s local tumor registry and computer-stored hospitalization records. The study protocol has been approved by the human subjects panels of Stanford University and the Kaiser Foundation Research Institute.

We first screened pathology reports and H&E-stained slides to confirm the eligibility of subjects. The following tumors were excluded from this study: adenocarcinoma of the gastroesophageal junction; gastric neoplasia other than adenocarcinoma; metastatic tumor to the stomach from another organ; and metastatic cancer of gastric origin without concomitant histological material from primary tumor in the stomach. The present analysis includes 95 cases of gastric adenocarcinoma, and tumor specimens of these cases were reviewed by two study pathologists before August 1999.

Histological Review of Tumor Specimens.

H&E-stained slides from endoscopic biopsies and gastrectomies were obtained from pathology departments and a storage facility of the KPMCP. Biopsy specimens alone were available for 42 of 95 (44%)subjects. For each subject, one pathologist reviewed all of the available slides (mean = 4.7 slides/subject, including slides that did not contain tumor) and selected slides containing most representative tumor sections. The selected slides (mean = 1.8 slides/subject) were forwarded to the second pathologist. The two pathologists, each of whom was blinded to the other’s assessment,recorded their assessment for histological type and grade of tumor on an evaluation form. For each tumor, pathologists were instructed to identify a type [intestinal type (tubular, papillary, or mucinous carcinoma or not otherwise specified), diffuse type (signet cell carcinoma or not otherwise specified), or other (undifferentiated or borderline)] and indicate the degree of differentiation (well,moderately, or poorly differentiated; Ref. 5).

Initial review of the 95 tumors was conducted between November 1997 and July 1999. The pathologists held joint review sessions in July 1998 and April 1999 to resolve disagreements between their assessments. Data analyzed in this study came from the initial assessments made by each pathologist before these joint reviews.

Assessment of Interobserver Agreement.

Responses of the two pathologists regarding histological type were considered as concordant if their responses both fell on one of the three categories (i.e., intestinal type, diffuse type, or other). Observed concordance was calculated as a percentage of tumors with concordant responses (i.e., tumors that fell on the diagonal cells in a 3 × 3 table, such as Table 1). κ coefficient, a measure of interobserver agreement, and its 95%CI were calculated (6). Agreement of tumor grade assessment was examined similarly, using three categories (well,moderately, and poorly differentiated).

In addition to an overall estimate of agreement among all of the cases reviewed, we examined concordance by calendar period of review (three periods separated by the two joint review meetings) and by the availability of surgical specimens. Because of the time lag between initial assessments by two pathologists, as well as a delay between the initial and joint reviews of discordant typing, there were some unavoidable overlaps in the timing of each pathologist’s review between adjacent calendar periods.

Assessment of the Effects of Misclassification.

We assessed how misclassification of tumors with respect to histological type would affect the observed differences between intestinal-type and diffuse-type tumors, using two examples:(a) prevalence of tumors stained with a p53 antibody by IHC;and (b) OR for cancer risk associated with H. pylori infection. We assumed that a certain percentage (10% or 15%) of intestinal-type tumors was misclassified as diffuse type and that the same percentage of diffuse-type tumors was misclassified as intestinal type, independent of p53 IHC and H. pyloristatus (i.e., nondifferential misclassification). The two percentages used in the examples were chosen to represent the level of disagreement between pathologists observed in this study. For the purpose of these illustrations, we assumed, although it is unrealistic,that there was no measurement error in p53 IHC and H. pyloristatus.

For the example of p53 IHC, we assumed that the true prevalence of p53-positive tumors was 60% in intestinal-type tumors and 30% in diffuse-type tumors (reasonable estimates based on work by A. S.4). We estimated the difference between the two percentages observed in a study including 200 gastric tumors (100 intestinal-type and 100 diffuse-type tumors) when a specified level of misclassification occurred.

For the example of H. pylori and gastric cancer risk, we assumed that the true ORs were 5.0 for intestinal-type tumors and 10.0 for diffuse-type tumors. Assuming that 60% of control subjects were H. pylori positive, these ORs are translated into the H. pylori prevalence of 88% and 94% in subjects with intestinal-type and diffuse-type tumors, respectively. We estimated observed ORs for intestinal-type and diffuse-type tumors if the tumors were misclassified at a specified level, in a hypothetical case-control study with 200 cases (100 intestinal-type and 100 diffuse-type tumors)and 200 controls.

Concordance of Histological Type

Initial responses on histological type were available for 92 tumors. Intestinal-type tumors were the most common, accounting for over 50% of the tumors according to either pathologist. Overall, the pathologists agreed on the major classification for 71 of 92 (77%)cases (Table 1). The κ coefficient was 0.59 (95% CI, 0.44–0.73). The observed concordance for the three calendar periods was 79%, 79%,and 69%, with κ coefficients of 0.62 (95% CI, 0.37–0.87), 0.62(95% CI, 0.43–0.82), and 0.40 (95% CI, −0.01 to 0.81). The level of agreement was almost identical, regardless of whether only biopsy specimens were available (concordance, 77%; κ, 0.58) or surgical specimens were available (concordance, 79%; κ, 0.61).

Concordance of Tumor Grade

Responses on tumor grade were available for 91 tumors. Poorly differentiated tumors were the most common (67% and 66% by two pathologists, respectively). Overall agreement for tumor grade was 87%, with a κ coefficient of 0.72 (95% CI, 0.57–0.87; Table 2). The observed concordance for the three calendar periods was 93%,83%, and 88%, with κ coefficients of 0.85 (95% CI, 0.64–1.00),0.64 (95% CI, 0.42–0.86), and 0.73 (95% CI, 0.37–1.00). Interobserver agreement was virtually identical between cases with biopsy specimens only (concordance, 88%; κ, 0.70) and those with surgical specimens (concordance, 86%; κ, 0.72).

Effects of Misclassification on Prevalence and OR Estimates

Example 1: Prevalence of p53 IHC-positive Tumors.

If 10% of 100 intestinal-type tumors are misclassified as diffuse type, and 10% of 100 diffuse-type tumors are misclassified as intestinal type, 100 tumors classified as intestinal type include 90“true” intestinal-type and 10 “true” diffuse-type tumors. If 60% of “true” intestinal-type and 30% of “true” diffuse-type tumors are p53 IHC positive (Table 3 A), the number of p53 IHC-positive tumors in 100 “nominal”intestinal-type tumors is 90 × 0.6 + 10 × 0.3 = 57. Similarly, 33 of 100 tumors classified as diffuse type are p53 IHC positive. Consequently, the difference in prevalence of p53 IHC tumors between the two types is 57% − 33% = 24%, which is smaller than the true difference of 30%. Similarly, if 15% of tumors are misclassified, the difference in p53 IHC positivity between intestinal-type and diffuse-type tumors is 21%.

Example 2: OR for Gastric Cancer Associated with H. pylori Infection.

When 10% of case subjects are misclassified, the observed OR for intestinal-type tumors increases from 5.0 to 5.4 (Table 3B), whereas the observed OR for diffuse-type tumors decreases from 10.0 to 8.9,resulting in a smaller ratio of the two ORs (1.6 rather than 2). If the misclassification occurs in 15% of tumors, the ORs for intestinal-type and diffuse-type tumors are even closer to each other.

Frequently used histological classifications for gastric adenocarcinoma (intestinal-type and diffuse-type carcinoma) are attributed to Laurén (2). Laurén himself observed a stronger male preponderance and older age at diagnosis for intestinal-type cancer than for diffuse-type cancer (2),which was corroborated by Hanai et al.(7).

Histological classification of cancers, as is the case with all measurements, is prone to error. Histological typing and grading are inherently subjective because they depend on a pathologist’s judgement in the absence of objective “gold standard” measures. Pathologists would be more likely to disagree for tumors with overlapping histological patterns, which consist of as high as 15% of all gastric cancers (1, 8). Given the absence of a “gold standard,” examining the level of agreement between two or more pathologists is a feasible alternative to estimate the reliability of histological data.

Our study showed an overall agreement for Laurén classification between two pathologists in 77% of the 92 gastric adenocarcinomas; aκ coefficient of 0.59 is considered as “fair to good” agreement (6). Both observed concordance and κ coefficient for histological type were consistent across the three calendar periods and regardless of whether surgical specimens of tumor were available in addition to biopsy specimens. The level of agreement was higher for tumor grade, which was somewhat surprising to us because we had expected that determination of tumor grade would be more subjective and thus more variable between pathologists. For example, in a study of 17 gastric cancers reviewed by 14 pathologists (8), agreement with accepted reference pathology was higher for Laurén classification (89%) than for tumor grade (63%).

Other studies also examined the reliability of histological typing and grading for gastric cancers. Palli et al.(9)observed agreement among six pathologists in 70–80% of the 100 gastric cancers reviewed for histological classification. Agreement between repeat readings with an up to 3-year interval by the same pathologist was also high (95%) (Ref. 9). In a study by Hansson et al.(10), one pathologist repeated a review of gastric cancer specimens after a 2-year interval. Overall agreement and κ coefficient, respectively, for Laurén classification was 79% and 0.63 for 67 resection materials and 83%and 0.70 for 88 biopsy materials. These results are comparable with our findings.

Distinct pathways of carcinogenesis for the two histological types of Laurén were suggested by more frequent observations of precancerous lesions in gastric mucosa adjacent to intestinal-type tumors (2, 3). Subsequently, many studies examined risk factors (e.g.,H. pylori infection) (Ref. 11) and molecular markers (e.g.,p53abnormalities4) for intestinal-type and diffuse-type tumors separately. Lack of reliability in histological assessment in these investigations would lead to misclassification of tumors, potentially resulting in inconclusive or erroneous findings. In our study, two pathologists disagreed on histological type for 24% of tumors; i.e., an average of 12% of tumors classified as intestinal type by one pathologist was classified as diffuse type by the other pathologist, and vice versa. As our hypothetical examples show, this level of misclassification would result in reduced differences in prevalence and OR estimates between histological types,such that those differences might be discounted. It would be difficult to assess the degree and direction of bias for actual studies, in which additional measurement error occurs in other variables, and histological misclassification might not be independent of other marker measurements.

Extrapolation of our findings to other settings may be limited due to some aspects of study design. First, a second pathologist reviewed only a subset of slides that a first pathologist considered as most representative, which may have resulted in overestimated κcoefficients. Second, the joint review by two pathologists may have influenced subsequent typing of tumors, although we observed no increase in concordance over the calendar periods.

In conclusion, we observed reasonably good agreement between pathologists for Laurén classification and grade of gastric cancers. A central review of pathology material and resolution of disagreements between pathologists will be practical alternatives to improve the reliability of histological data in epidemiological studies.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

        
1

Supported in part by American Cancer Society Institutional Research Grant 32-36 and NIH Grants R01CA73011 and R35CA49761.

                
3

The abbreviations used are: CI, confidence interval; IHC, immunohistochemistry; KPMCP, Kaiser Permanente Medical Care Program; MPHC, multiphasic health checkup; OR, odds ratio.

        
4

A. Shibata. Immunohistochemical analysis of p53 tumor suppressor in stomach cancer, manuscript in preparation.

Table 1

Overall concordance in histological classification between pathologistsa

Pathologist 2Total
Intestinal typeDiffuse typeOther
Pathologist 1     
Intestinal type 42 50 
Diffuse type 27 38 
Other 
Total 50 32 10 92 
Pathologist 2Total
Intestinal typeDiffuse typeOther
Pathologist 1     
Intestinal type 42 50 
Diffuse type 27 38 
Other 
Total 50 32 10 92 
a

Observed concordance =(42 + 27 + 2)/92 = 77%. κ coefficient = 0.59(95% CI, 0.44–0.73).

Table 2

Overall concordance in tumor grade assessment between pathologistsa

Pathologist 2Total
Well differentiatedModerately differentiatedPoorly differentiated
Pathologist 1     
Well differentiated 
Moderately differentiated 21 26 
Poorly differentiated 55 61 
Total 28 60 91 
Pathologist 2Total
Well differentiatedModerately differentiatedPoorly differentiated
Pathologist 1     
Well differentiated 
Moderately differentiated 21 26 
Poorly differentiated 55 61 
Total 28 60 91 
a

Observed concordance =(3 + 21 + 55)/91 = 87%. κ coefficient = 0.72(95% CI, 0.57–0.87).

Table 3

Effects of misclassification on prevalence and OR estimates in two hypothetical examples

Level of histological misclassification
0% (no error)10%15%
A. Observed prevalences of p53 immunohistochemistry-positive tumors by histological type    
Intestinal-type cancer 60% 57% 56% 
Diffuse-type cancer 30% 33% 35% 
Difference 30% 24% 21% 
B. Observed ORs for H. pylori infection and gastric cancer risk by histological type    
Intestinal-type cancer 5.0 5.4 6.0 
Diffuse-type cancer 10.0 8.9 7.7 
Ratio of the two ORs 2.0 1.6 1.3 
Level of histological misclassification
0% (no error)10%15%
A. Observed prevalences of p53 immunohistochemistry-positive tumors by histological type    
Intestinal-type cancer 60% 57% 56% 
Diffuse-type cancer 30% 33% 35% 
Difference 30% 24% 21% 
B. Observed ORs for H. pylori infection and gastric cancer risk by histological type    
Intestinal-type cancer 5.0 5.4 6.0 
Diffuse-type cancer 10.0 8.9 7.7 
Ratio of the two ORs 2.0 1.6 1.3 

We thank Maria Isabel Garcia, Lucy Bonds, Bonnie Bell, and Bill Frank for technical assistance; the pathology departments of Kaiser Permanente Medical Centers in Northern California for tumor specimens;and Dr. Gary Friedman for access to the KPMCP databases.

1
Lewin, K. J., and Appelman, H. D. Atlas of Tumor Pathology. Washington DC: Armed Forces Institute of Pathology, 1995.
2
Laurén P. The two histological main types of gastric carcinoma: diffuse and so-called intestinal-type carcinoma.
Acta Pathol. Microbiol. Scand.
,
64
:
31
-49,  
1965
.
3
Correa, P., and Chen, V. W. Gastric cancer. In: R. Doll, J. F. Fraumeni, Jr., and C. S. Muir (eds.), Trends in Cancer Incidence and Mortality, pp. 55–75. Plainview, NY: Cold Spring Harbor Laboratory Press, 1994.
4
Correa P. Human gastric carcinogenesis: a multistep and multifactorial process–First American Cancer Society Award Lecture on Cancer Epidemiology and Prevention.
Cancer Res.
,
52
:
6735
-6740,  
1992
.
5
Watanabe, H., Jass, J. R., and Sobin, L. H. Histological Typing of Oesophageal and Gastric Tumours. Berlin: Springer-Verlag, 1990.
6
Fleiss, J. L. The measurement of interrater agreement. In: Statistical Methods for Rates and Proportions, 2nd ed., pp. 212–236. New York: John Wiley & Sons, Inc., 1981.
7
Hanai, A., Fujimoto, I., and Taniguchi, H. Trends of stomach cancer incidence and histologic types in Osaka. In: K. Magnus (ed.), Trends in Cancer Incidence. Causes and Practical Implications, pp. 143–154. Washington DC: Hemisphere Publishing Corp., 1982.
8
Hermanek P., Wittekind C. Histological typing and grading of gastric carcinomas Nishi M. Ichikawa H. Nakajima T. Marayama K. eds. .
Gastric Cancer
,
:
40
-52, Springer-Verlag. Tokyo  
1993
.
9
Palli D., Bianchi S., Cipriani F., Duca P., Amorosi A., Avellini C., Russo A., Saragoni A., Todde P., Valdes E. Reproducibility of histologic classification of gastric cancer.
Br. J. Cancer
,
63
:
765
-768,  
1991
.
10
Hansson L. E., Lindgren A., Nyren O. Can endoscopic biopsy specimens be used for reliable Lauren classification of gastric cancer?.
Scand. J. Gastroenterol.
,
31
:
711
-715,  
1996
.
11
Parsonnet J., Friedman G. D., Vandersteen D. P., Chang Y., Vogelman J. H., Orentreich N., Sibley R. K. Helicobacter pylori infection and the risk of gastric carcinoma.
N. Engl. J. Med.
,
325
:
1127
-1131,  
1991
.