Official Solution:
To improve learning outcomes in high school science classes, one district replaced traditional lectures with hands-on lab activities. After one academic year, standardized science test scores in these schools rose by an average of 12 percent. Based on this outcome, the district concluded that the lab-based format is a more effective method of teaching science and has decided to implement it across all schools in the district.
Which of the following would be most useful to know in evaluating the conclusion drawn by the district?
A. Whether the teachers who implemented the lab-based format had prior experience conducting hands-on science instruction.
B. Whether students in schools that adopted the new format also showed improvement in subjects that were still taught through lectures.
C. Whether any other changes to curriculum, scheduling, or student population occurred in the schools that adopted the new teaching format.
D. Whether students who performed better under the new format reported higher levels of interest in pursuing science careers.
E. Whether schools that maintained the traditional lecture-based format showed similar improvements in test scores over the same time period.
We’re being asked to evaluate the district’s conclusion that the lab-based format caused the improvement in science test scores, and that this method is therefore better and should be expanded to all schools.
On the surface, the score improvement sounds convincing. But this is classic correlation vs causation. The real question is whether that improvement was because of the lab format or due to something else entirely. So we’re looking for the option that would help us figure that out and not just describe what happened, but test whether the core logic actually holds.
A. Whether the teachers who implemented the lab-based format had prior experience conducting hands-on science instruction.
This sounds mildly relevant, but it doesn’t test the conclusion. Even if the teachers had prior lab experience, that doesn’t prove the format itself caused the improvement. At best, this tells us about implementation quality, but we still wouldn’t know if the method was more effective or if these were just better teachers overall. It doesn’t challenge the causal link.
B. Whether students in schools that adopted the new format also showed improvement in subjects that were still taught through lectures.
Feels tempting, but this pulls us into a side topic. The argument is only about science scores. Even if students improved in other lecture-based subjects, that doesn’t prove or disprove whether the lab-based science method caused the science score bump. This is more of a distraction than a useful evaluation.
C. Whether any other changes to curriculum, scheduling, or student population occurred in the schools that adopted the new teaching format.
This is close. It touches on alternate explanations, which is good. If there were big changes beyond just the teaching format, that could challenge the causality. But the issue is that it's still too open-ended. It just invites speculation. It doesn't let us evaluate the claim directly. GMAT tends to prefer cleaner tests of the logic over vague "what if lots of other stuff happened" setups.
D. Whether students who performed better under the new format reported higher levels of interest in pursuing science careers.
This one is irrelevant. The argument is about performance, not long-term interest or career plans. Even if students found it more engaging, that doesn’t help us evaluate whether the method was more effective for learning. This is a classic “nice to know” but doesn’t test the logic.
E. Whether schools that maintained the traditional lecture-based format showed similar improvements in test scores over the same time period.
This is the correct one. It gets right to the core of the logic. If the lecture-based schools also saw similar gains, then the improvement can’t be credited to the lab format. But if only the lab-format schools improved, that supports the conclusion. This is the classic “control group” test and it’s how you’d actually evaluate whether a change in method caused the observed outcome.
Answer: E