Introduction

The number of patients undergoing laparoscopy-assisted distal gastrectomy (LADG) has been increasing worldwide, even in countries in which it is used relatively rarely. However, its oncological efficiency can be questioned considering the biological behavior of this cancer (including frequent lymph node metastases, extranodal spread, and peritoneal dissemination), which is rather different from the behavior of colorectal cancer. Although some small-scale randomized controlled trials (RCTs) have suggested that LADG shows high efficacy [14], there has been no pivotal trial so far. In order to justify the use of a laparoscopic approach to gastric cancer, we started clinical trials targeting early-stage disease with low probabilities of nodal, extranodal, and peritoneal metastasis on the assumption that we should not apply this technique to more advanced disease if we could not show that LADG is not inferior to ODG.

We first conducted a multi-institutional phase II trial of LADG (JCOG0703) with 173 patients who had gastric cancer of clinical stage IA (T1N0) or IB (T1N1, T2N0) [5]. The primary endpoint was the incidence of postoperative anastomotic leak or pancreatic fistula, which was far lower (1.7 %) than the prespecified threshold (8 %). The overall proportion with in-hospital grade 3 or 4 adverse events was as low as 5.1 %. We concluded that LADG could be safely performed by credentialed surgeons.

We then launched this phase III trial (JCOG0912) to check for the noninferiority of LADG to ODG in terms of relapse-free survival (RFS) in all randomized patients with clinical IA/IB gastric cancer. The patient enrollment was completed, so we present the short-term surgical outcomes in this paper. We also compared them with recently published early results of a large-scale RCT with a similar design that was conducted in Korea (KLASS-1) [6].

Methods

Study design

This open-label, multi-institutional, randomized, two-arm (ODG and LADG) phase III trial is being conducted within the framework of the JCOG Stomach Cancer Study Group. The trial was designed to demonstrate that LADG is noninferior to ODG in terms of RFS. Only surgeons specializing in both procedures at 33 Japanese institutions are participating in the study. The study protocol was approved by the Protocol Review Committee of the JCOG and was approved and overseen by the institutional review board of each participating hospital. The design of the trial was reported previously [7].

Participants

Patients with a distal gastrectomy treatable, histologically proven gastric adenocarcinoma in the middle or lower third of the stomach, a c-stage IA (T1N0) or IB (T1N1, T2[MP]N0) tumor according to the 13th Japanese Classification of Gastric Carcinoma (corresponding to the 2nd English edition [8]) and no indications for endoscopic mucosal resection (EMR) or endoscopic submucosal dissection (ESD) were initially eligible for this study.

T category was evaluated by upper gastrointestinal endoscopy. Multidetector CT was used to diagnose T category and N category.

The detailed eligibility criteria for the study are shown in Table 1. All patients gave their written informed consent.

Table 1 Eligibility criteria

Randomization and masking

Randomization and data handling were performed by the JCOG Data Center. After confirming the eligibility criteria and receiving written informed consent, staff members registered patients by telephone, fax, or a web-based system with the JCOG Data Center. Patients were randomized to either the ODG arm or the LADG arm; the minimization method with a random component was used to balance the arms with respect to institution and clinical stage (IA/IB). The allocated procedure was not masked from investigators or patients.

Procedures

ODG and LADG were performed in their respective arms. All procedures were identical except for the surgical approach. The extent of nodal dissection was selected according to the surgical T and N stages, which were based on the third version of the Gastric Cancer Treatment Guidelines in Japan [9]. D1, D1+, or D2 dissection was applied for clinical stage IA tumors, while D2 dissection was performed for clinical stage IB tumors. For clinical T1 gastric cancer with margins that were 4 cm or further from the pylorus, pylorus-preserving distal gastrectomy was allowed. Bursectomy was not allowed, but preservation of the omentum and/or vagus nerve was discretionary. The reconstruction method was not specified in this study.

In the LADG arm, the mini-laparotomy incision was required to be shorter than 6 cm. If the intraoperative findings revealed a tumor stage of II or greater, the LADG was converted to open surgery. We did not make any other rules for conversion aside from this rule because the proportion of LADGs that were converted to open surgery was one of the secondary endpoints. When intraoperative difficulties were determined by credentialed surgeons as described below, the LADG was also converted to open surgery.

Postoperative analgesia, such as epidural anesthesia, was not specified. Usage of analgesics on postoperative days 5–10 was recorded. Adjuvant chemotherapy with S-1 for 1 year was recommended for patients with curative resection and a pathological stage II, IIIA, or IIIB tumor, excluding T1 disease.

Surgical quality control was facilitated by permitting only surgeons credentialed by the study chair to perform both LADG and ODG. In the ODG arm, only surgeons with experience of 60 or more open gastrectomies were credentialed. In the LADG arm, experience of 30 or more of both ODG and LADG procedures as well as certification (or its equivalent) by the Japan Society for Endoscopic Surgery were necessary. In arbitrarily selected cases, LADG procedures were centrally reviewed by photographs and by video. These photographs were evaluated by the committee for quality control and assessment of surgery, and problems regarding the surgical procedure were discussed at meetings held three times a year. To assess compliance with lymphadenectomy guidelines, the number of dissected nodes at all stations was recorded on case report forms and the results were monitored.

All enrolled patients were followed up at least every 6 months for the first 2 years and then every year for another 3 years until 5 years postoperatively. Tumor markers, chest X-ray, upper gastrointestinal endoscopy, and enhanced abdominal computed tomography were evaluated at least every year for the duration of follow-up. Operative methods and pathology results were recorded according to the 13th and 14th Japanese Classifications of Gastric Carcinoma, which correspond to the second and third English edition [8, 9]. Operating time, blood loss, and all postoperative morbidities during hospitalization were included on the case report forms for prospective data collection. Postoperative morbidity was described according to the Common Terminology Criteria for Adverse Events (CTCAE) ver. 4.0 and the JCOG postoperative complication criteria based on the Clavien–Dindo classification [10].

Outcomes

The primary endpoint of this study was RFS in all randomized patients. RFS was defined as days from randomization to relapse or death from any cause, and was censored at the most recent day when the patient was alive without any evidence of relapse. The secondary endpoints were overall survival (OS) in all randomized patients, proportion of LADGs that were completed, proportion of LADGs that were converted to open surgery, adverse events, short-term clinical outcomes, and postoperative quality of life (QOL). OS was defined as days from randomization to death from any cause, and was censored at the last day when the patient is alive. The proportion of LADGs that were completed was defined as the proportion of all operated patients in the LADG arm in whom LADG was completed without conversion to open surgery. The proportion of LADGs that were converted to open surgery was defined as the proportion of patients who underwent conversion from LADG to open surgery among those diagnosed before gastrectomy as clinical stage IA or IB. The short-term clinical outcomes consisted of: (1) the time from the end of surgery until the first episode of flatus, (2) the proportion of patients requiring an analgesic on postoperative days 5–10, (3) the highest body temperatures during the first 3 days after surgery, and (4) the highest body temperatures during hospitalization. Postoperative QOL was evaluated using EORTC QLQ-C30 and STO22. This QOL evaluation was performed only in the four principal institutions due to a lack of resources in the other institutions. The primary analysis of QOL was performed using the global health status from EORTC QLQ-C30 at the 90th postoperative day.

Statistical analysis

This randomized trial was designed to demonstrate that LADG is noninferior to ODG in terms of RFS. Some endpoints were adopted to evaluate whether LADG is less invasive than ODG, but these endpoints were all considered to be exploratory. Thus, as long as the noninferiority of LADG is confirmed, LADG may be considered an option for the standard treatment of clinical stage IA/IB gastric cancer.

The initial primary endpoint was OS. According to Schoenfeld and Richter’s method [11], the planned sample size was 920 patients, with 460 patients per arm. We anticipated 5 years of follow-up after 5 years of accrual, ensuring at least 80 % power with a one-sided alpha of 5 % and a noninferiority margin of 5 % in terms of 5-year OS (a noninferiority margin for a hazard ratio of 1.54). This assumed an expected 5-year OS of 90 % in each arm.

In April 2015, which was before the first interim analysis, the primary endpoint was amended from OS to RFS because the surrogacy of RFS was demonstrated in a meta-analysis [12] and the predicted number of events for OS was smaller than expected. After amending the primary endpoint, 68 % power was maintained with a one-sided alpha of 5 % and a noninferiority margin for a hazard ratio of 1.54, which corresponds to 3.6 % in terms of 5-year RFS.

Differences in proportions between arms were evaluated using Fisher’s exact test. Comparisons of continuous data were performed using the Wilcoxon rank sum test. All p values were two-sided at the 5 % significance level, and because of the exploratory nature of the analysis, no adjustment for multiplicity was carried out.

To investigate the association between the proportion with surgical complications and BMI, the Cochran–Armitage trend test was used in each arm.

All statistical analysis was performed using SAS 9.2 (SAS Institute, Cary, NC, USA). The data presented in this article were as of December 2015. Analysis was performed per protocol set, including patients who received protocol treatments as assigned. This trial was registered at the UMIN Clinical Trials Registry as UMIN000003319 (http://www.umin.ac.jp/ctr/index.htm).

We plan to conduct two interim analyses, taking multiplicity into account using the Lan–DeMets method with the O’Brien and Fleming-type alpha spending function [13]. The Data and Safety Monitoring Committee (DSMC) of the JCOG will independently review the interim analysis reports and will recommend early trial termination if necessary. In-house monitoring is performed every 6 months by the JCOG Data Center to evaluate and improve study progress, data integrity, and patient safety.

Results

Between March 2010 and November 2013, 921 patients (ODG 459, LADG 462) were enrolled from 33 institutions in Japan (Fig. 1). Among all of the randomized patients, 455 patients who underwent ODG and 457 patients who underwent LADG were analyzed for surgical outcome. In the ODG arm, two patients who withdrew their consent and two patients who refused the assigned surgery were excluded. In the LADG arm, five patients were excluded: one patient whose diagnosis of gastric cancer at a previous hospital was not confirmed at the participating hospital; one patient who was diagnosed preoperatively as having widespread cancer which required total gastrectomy; one patient whose surgery was canceled due to ischemic heart disease just after the operation started; and two patients who refused the assigned surgery.

Fig. 1
figure 1

Trial profile

Patient demographics are summarized in Table 2. Baseline factors were well balanced between the two arms. Surgical procedures and outcomes are summarized in Table 3. Distal gastrectomy was performed in 333 (73.2 %) patients in the ODG arm and 342 (74.8 %) patients in the LADG arm. Pylorus-preserving gastrectomy was performed in 122 (26.8 %) patients in the ODG arm and 114 (24.9 %) patients in the LADG arm. There was no difference in the distribution of operative methods between the two arms. D1+ was performed in 319 (70.1 %) patients in the ODG arm and 347 (75.9 %) patients in the LADG arm. D2 was performed in 126 (27.7 %) patients in the ODG arm and 101 (22.1 %) patients in the LADG arm. There was no difference in the distribution of the extent of nodal dissection between the two arms. Billroth I was performed in 211 (46.4 %) patients in the ODG arm and 221 (48.4 %) patients in the LADG arm. Roux-en-Y was performed in 119 (26.2 %) patients in the ODG arm and 121 (26.5 %) patients in the LADG arm. There was no difference in the distribution of reconstruction methods between the two arms.

Table 2 Baseline characteristics
Table 3 Operative details

Blood loss was smaller in LADG than in ODG [median (range), 38 (0–1920) mL vs. 115 (0–890) mL, p < 0.001]. Operative time was longer in LADG than ODG [median (range), 278 (120–577) min vs. 194 (48–445) min, p < 0.001]. Conversion to open surgery was necessary in 16 (3.5 %) patients, and these patients were included in the LADG arm.

The most common reasons for conversion were technical: bleeding [four patients (25.0 %)], difficulty with gastric transection and reconstruction [three patients (18.8 %)], torsion of the anastomosis [one patient (6.3 %)], inability to visualize the operative field [one patient (6.3 %)], and inadequate stapling of the duodenal stump [one patient (6.3 %)]. Other reasons for conversion were nontechnical: intraoperative diagnosis of stage II or III tumors [four patients (25 %)], inability to perform additional resection of the positive cut end [one patient (6.3 %)], and anomalous common hepatic artery and left gastric artery [one patient (6.3 %)].

There were no grade 3 or 4 intraoperative adverse events in either arm.

Regarding postoperative recovery, laparoscopic surgery was associated with a shorter time to first flatus [median (range), 2 (0–7) days vs. 3 (0–13) days, p < 0.001] (Table 4). There was no difference between the arms in either the highest body temperatures during the first 3 days postoperatively or the highest body temperatures during hospitalization. The use of analgesics after 5 postoperative days was more frequent in the open surgery arm than in the laparoscopic surgery arm [270 (59.3 %) patients with open surgery vs. 230 (50.3 %) patients with laparoscopic surgery, p = 0.006].

Table 4 Short-term clinical outcomes

There was no marked difference between the two approaches regarding the incidence of reoperation. In the ODG arm, reoperation was performed in two patients because of one case of delayed gastric emptying and one case of intestinal strangulation. In the LADG arm, reoperation was performed in two patients: one because of bowel obstruction and the other because of postoperative bleeding. Red blood cell transfusion was required in one patient in the ODG arm and three patients in the LADG arm. There was no obvious difference regarding the proportion of the patients who required blood transfusion. There were no operation-related deaths in either arm.

Pathological data are summarized in Table 5. There were no marked differences in pathological data between the two arms. The proximal resection margin was positive in one patient in the ODG arm and the distal resection margin was positive in one patient in the LADG arm; both of these patients received additional resection as post-protocol treatment. The number of harvested lymph nodes did not differ between the two arms.

Table 5 Histological findings

Table 6 summarizes in-hospital grade 3–4 adverse events. There was no difference in the overall proportion with in-hospital grade 3–4 surgical complications [17 (3.7 %): ODG vs. 15 (3.3 %): LADG] or the mortality rate (no patient in either arm).

Table 6 In-hospital grade 3–4 nonhematological and hematological adverse events

The proportion with grade 3–4 surgical complications was 0 % (0/84) (BMI <20), 4.6 % (123/281) (BMI≥20, <25), and 4.4 % (4/90) (BMI ≥25) in the ODG arm. The proportion with grade 3–4 surgical complications was 2.1 % (2/94) (BMI <20), 2.1 % (6/280) (BMI ≥20, <25), and 8.4 % (7/83) (BMI ≥25) in the LADG arm. In the ODG arm, the proportion with surgical complications did not increase with increasing BMI (p = 0.066). However, in the LADG arm, the proportion with surgical complications increased with increasing BMI (p = 0.012).

The proportion with grade 3 or 4 serum AST/ALT elevation was significantly higher in LADG than in ODG (16.4 vs. 5.3 %, p < 0.001).

Discussion

This study demonstrated good short-term clinical outcomes in both arms, such as no intraoperative complications, a low proportion of postoperative complications, and favorable postoperative recovery. The same surgical procedures were used in the LADG and ODG arms, such as extent of lymph node dissection and reconstruction methods, but patients undergoing LADG experienced longer operative times and less blood loss. The overall proportion with in-hospital postoperative complications was quite low in both arms, but the proportion with elevated AST/ALT levels was higher in LADG than in ODG. The most plausible reason for this difference was the prolonged liver retraction during LADG. In laparoscopic surgery, the liver was generally retracted for the full duration of the operation, whereas in open surgery, retraction was performed only when needed. Other reasons could be longer operative times, pneumoperitoneum (which may reduce portal vein circulation), and the method of liver retraction.

Surgeons should try to reduce the duration of liver retraction. Moving the position of the retractor or releasing it intermittently may be an effective method of preventing postoperative liver damage [14]. Instead of employing mechanical surgical liver retractors, applying elastic retraction (e.g., utilizing a Penrose drain) might be useful [15].

Recent studies focusing on LADG have shown that a high BMI does not increase the likelihood of surgical complications [1618]. In this study, in the LADG arm, the proportion with surgical complications increased with increasing BMI, even though only credentialed surgeons performed the surgery. However, the number of patients with complications was too small to allow us to draw any conclusions.

Regarding postoperative recovery, LADG was associated with a shorter time to pass first flatus and less use of analgesics after 5 postoperative days. This study demonstrated favorable postoperative recovery from laparoscopic surgery.

As for conversion from LADG to ODG, the conversion rate of 3.5 % was similar to that seen in JCOG0703 [2], but higher than that noted in previous reports by Kim et al. (0.2 %) [19], Lee (1.7 %) [20], and Kim (0.9 %) [6]. A potential reason for the higher conversion rate observed in this study is that our protocol required intraoperative conversion to open surgery when the tumor was intraoperatively diagnosed as stage IB (T2[SS]N0), II, IIIA, or IIIB. In addition, conversion to open surgery was carried out for other nontechnical reasons such as a secure surgical margin and an anatomic anomaly, because the surgeons wanted to achieve the same quality of surgery as obtained with ODG, as the guidelines [1] that we used clearly state that LADG is still an investigational treatment. If these 6 cases are excluded, the proportion for whom technical conversion was performed was 2.2 %, close to the proportions seen in previous studies. Among these technical conversions, conversion due to difficulty with reconstruction might have been avoided by introducing intracorporeal anastomosis, but it was still a demanding procedure during this study period. We consider that the conversion rate of our trial was acceptable because this was a multicenter trial.

In South Korea, an RCT named KLASS-01 [6] was started prior to this study (JCOG0912) to compare LADG with ODG in a noninferiority design. The Korean trial is also at the follow-up stage as of 2015. The morbidity and mortality in KLASS-01 have already been reported. There are some differences between JCOG0912 and KLASS-01. First, the number of retrieved lymph nodes was comparable for the LADG and ODG groups in JCOG0912 butwas smaller in the LADG arm than in the ODG arm in KLASS-01. Second, in JCOG0912, the proportion of postoperative complications was similarly low in the LADG and ODG arms, whereas postoperative morbidity was higher in the ODG arm than in the LADG arm in KLASS-01. Third, the number of ineligible patients and those who did not receive the assigned surgery was much smaller in JCOG0912 than in KLASS-01.

Two patients (0.2 %) withdrew their consent in JCOG0912, while 18 patients (1.3 %) withdrew their consent in KLASS-01. Among all of the eligible patients, three patients (0.3 %) switched from ODG to LADG or from LADG to ODG in JCOG0912, while 85 (6.1 %) patients switched from ODG to LADG or from LADG to ODG in KLASS-01. One (0.1 %) patient switched to total gastrectomy in JCOG0912, while 23 patients (1.7 %) switched to total gastrectomy in KLASS-01.

As a result, safety was analyzed in a higher proportion of patients in JCOG0912 (99.0 %) than in KLASS-01 (88.7 %). In addition, the higher proportion who underwent a procedural switch in KLASS-01 could have a significant influence, especially in noninferiority trials. Such contaminations generally attenuate the survival difference; the results of noninferiority trials could be positive even when LADG is actually inferior to ODG. From this statistical point of view, it should be noted that the efficacy results for KLASS-01 have some limitations, and any definitive conclusion should be drawn based on the results of both JCOG0912 and KLASS-01. There are four potential reasons for this discrepancy. First, our JCOG Stomach Cancer Group has recognized the importance of maintaining clinical trial quality and has become proficient at clinical trials since this group started in 1995. Second, to ensure the quality of our trial, in-house monitoring was performed every 6 months by the JCOG Data Center. Third, regular group meetings were held to share information on ineligible patients. Fourth, the Japanese treatment guidelines specify that ODG is the standard procedure and LADG is an investigational treatment, and the documents given to patients prior to them providing informed consent conformed strictly with these guidelines. Therefore, the patients generally understood these circumstances.

In conclusion, although the elevation of serum AST/ALT should be addressed, this trial confirmed that LADG performed by credentialed surgeons is as safe as ODG in terms of adverse events and short-term clinical outcomes. LADG may be an alternate procedure for clinical IA/IB gastric cancer if the noninferiority of LADG in RFS is confirmed by the primary analysis planned in 2018.