Educational Research Thoughts     Bilingual Education   J. Cummins Web

   
  Educational Research in Bilingual Education   

       

  

Home     resources   

 Educational Research in Bilingual Education

 Jim Cummins

University of Toronto

  

 Interpretation of the voluminous research on bilingual education has been highly controversial among both academics and policy-makers for more than 25 years.  Clearly the political sensitivity of the issue has contributed to confusion about what the research is actually saying.  A more fundamental cause of this confusion, however, is the extremely limited way in which educational researchers have examined the research, and in particular the quantitative research on this issue. The dominant assumption among academic opponents and advocates of bilingual education has been that we can draw policy-relevant conclusions regarding the effectiveness of bilingual education only from “methodologically acceptable studies.” Typically, these studies are program evaluations that involve treatment and control groups compared in such a way that outcome differences can be attributed to the treatment rather than to extraneous factors.

 I argue that this approach represents an appropriate, but extremely limited, orientation to research in bilingual education. It is limited on two counts:

·         It has proved virtually impossible to apply rigorous controls to comparisons between programs due to the myriad human, administrative, and political influences that impact the implementation of programs over time;

·         Underlying this approach is the assumption that there is a direct connection between research and policy: researchers will discover which program alternatives work best and policy-makers will develop funding and implementation policies based on these findings. While this straightforward linkage between research and policy may work well in some spheres (e.g. opinion surveys), it seldom yields clearcut results in an area as multifaceted as educational research where the treatment variable of interest (e.g. proportion of first language [L1] instruction) is intertwined and interacting with hundreds of other variables that will affect program outcomes.

 

As I document below, it is not surprising that this dominant research paradigm has yielded such paltry pickings for policy. The only thing that academic opponents and advocates of bilingual education seem to agree on with respect to the policy-related research is that it is of almost universally poor quality (August & Hakuta, 1997; Greene, 1998; Rossell & Baker, 1996).

 I argue in this paper that “poor quality” is in the eye of the beholder. Viewed through the lens of “methodologically acceptable studies,” it is possible to find fault with virtually all of the research studies, including many of those that survived the “rigorous criteria” established by  Greene (1978) and Rossell and Baker (1996). However, there is an enormous amount of relevant and interpretable research, both internationally and within the United States, that speaks directly to the bilingual education policy issues. I suggest that the policy issues have remained confused and contested at least partly because the bulk of the relevant research has been virtually ignored, both by advocates and opponents of bilingual education. The relevance of this research is not apparent within the dominant paradigm because the studies do not conform to the criteria of acceptability within this paradigm.  However, when we examine this voluminous research from the perspective of an alternative paradigm, its relevance is immediately apparent.

 The alternative paradigm claims that the relevance of research for policy is mediated through theory. In complex educational and other human organizational contexts, data or “facts” become relevant for policy purposes only in the context of a coherent theory.  It is the theory rather than the individual research findings that permits the generation of predictions about program outcomes under different conditions. Research findings themselves cannot be directly applied across contexts.  For example, the fact that students in a Spanish-English bilingual program in New York City performed well academically in relation to grade norms (Beykont, 1994) tells us very little about whether a similar program might work with Mexican-American students in San Diego. Yet when certain patterns are replicated across a wide range of situations, the accumulation of consistent findings suggests that some stable underlying principle is at work.  This principle can then be stated as a theoretical proposition or hypothesis from which predictions can be derived and tested.

 In contrast to research findings, theories are almost by definition applicable across contexts.  The validity of any theoretical principle is assessed precisely by how well it can account for the research findings in a variety of contexts.  If a theory cannot account for a particular set of research findings, then it is an inadequate or incomplete theory.  Thus, while no individual research finding can “prove” a theory or confirm a hypothesis, any research finding can disconfirm or refute a theory or hypothesis. Thus, the criterion of validity for any hypothesis is extremely stringent: it must be consistent with all of the research data or at least be able to account for inconsistencies (e.g. poor implementation of a program).

 In this paper, I label the former paradigm the ”Research-Policy” paradigm and the latter the “Research-Theory-Policy” paradigm.  In order to show the strengths and limitations of each, I examine how they have been employed in the bilingual education debate and the policy-relevant findings that each yields.  I conclude that the former paradigm yields largely trivial information for policy purposes and perpetuates misconceptions regarding bilingual education that have persisted for almost 30 years.  By contrast, the latter paradigm yields considerable information that has direct relevance to policy and addresses many of the most contentious issues in the bilingual education debate.

 The Research-Policy Paradigm: Debunking the Mythology of “Methodologically Acceptable Studies”

 Both advocates and opponents of bilingual education have largely concurred on the conditions under which research in general and evaluation studies in particular can be considered “methodologically acceptable.” For example, Greene (1998), whose interpretation of research is favorable to bilingual education, and Rossell and Baker (1996) whose conclusions are highly unfavorable to bilingual education, agreed on the appropriateness of the following criteria for designation of studies as “methodologically acceptable:”

1.        Studies had to compare students in bilingual programs to a control group of similar students;

2.        The design had to ensure that initial differences between treatment and control groups were controlled statistically or through random assignment;

3.        Results were to be based on standardized test scores in English;

4.        Differences between the scores of treatment and control groups were to be determined by means of appropriate statistical tests.

 

The approaches diverged on several additional points. Greene, for example, focused only on studies that had been carried out in the United States and which measured the effects of bilingual education after at least one year of the treatment whereas Rossell and Baker included international research data in their review.  Rossell and Baker also categorized programs according to pre-defined labels (structured immersion, transitional bilingual education, submersion, English-as-a-second-language [ESL]) whereas Greene adopted a more straightforward categorization of bilingual education as a program in which all students are taught using at least some of their native language. An additional difference was that Greene (like Willig, 1985) used meta-analysis to take into account effect sizes in individual studies whereas Rossell and Baker (like Baker and de Kanter, 1983) simply counted studies that were favorable or unfavorable to different treatments.

 Using their respective criteria of “methodologically acceptable” studies, Rossell and Baker rejected 228 studies and accepted 72 while Greene could find only 11 studies whose design permitted conclusions to be drawn. 

 August and Hakuta (1997) and their colleagues reviewed both the basic and policy-oriented research on schooling for language-minority children.  Their review is comprehensive, balanced, and useful for researchers and policy-makers alike.  It also offers very appropriate suggestions for improving the quality of program evaluations. However, it suffers from the same problematic orientation to research and policy as do the reviews by Greene (1998) and Rossell and Baker (1996). In interpreting both the basic research and program evaluations relevant to language learning and bilingual education, the authors pay only lip-service to the role of theory in mediating the relevance of research for policy (Fitzgerald & Cummins, in press).  As a result, their conclusions are considerably weaker than they might have been and also considerably less useful for policy purposes.

 I critique these three literature reviews below. Space does not permit a comprehensive review of these studies and thus my focus is directed towards identifying the limitations of the research paradigm these authors have adopted.

 

Greene’s Meta-Analysis

Greene reported that participation in a bilingual program over a period of two years made a difference of about 1/5 of a standard deviation in achievement. Thus, if a student in an  English-only program  performed at the 26th percentile at the end of those two years, the bilingual student would be at the 34th percentile.  The problematic nature of this type of meta-analysis can be seen from examining some of the 11 studies included.  One of the large-scale “methodologically acceptable” studies included in the analysis (and in Rossell and Baker’s review) was the American Institutes of Research (AIR) study (Danoff, 1978).  This study aggregated a variety of programs labeled “bilingual education” and compared them to non-bilingual programs, ignoring the fact that many bilingual programs are bilingual in name only or involve minimal amounts of L1 instruction by a non-trained paraprofessional (see, for example, Gandara, 1997). In addition, as pointed out by August and Hakuta (1997) there was no clear demarcation between treatment and control groups:

Nearly three-quarters of the experimental group had been in bilingual programs for 2 or more years, and the study measured their gains in the last few months. Additionally, about two-thirds of the children in the control group had previously been in a bilingual program; these children did not represent a control group in the usual sense of the term. Thus the AIR study did not compare bilingual education with no bilingual education. (p. 140)

Krashen (1999a) has also pointed out that Greene’s analysis included several “methodologically acceptable” studies in which “bilingual education” was either not described in any detail or involved minimal use of the L1 (e.g. use of bilingual paraprofessionals)

 In short, Greene’s meta-analysis makes no attempt to test the theoretical propositions underlying bilingual education or alternative English-only programs. The apparent rigor involved in reducing the extensive corpus of bilingual education data to 11 “methodologically acceptable” studies seems destined to end up in rigor mortis for this approach as the credibility of even these 11 studies is whittled away. However, this is not because most of the research  is inadequate; the inadequacy is rather in the lens through which we are examining the research.

 

August and Hakuta’s National Research Council Report 

This report was intended to provide researchers and policy-makers with a “state-of-the-art” review of what basic and applied research could tell us about improving schooling for language-minority students.  One might have expected to find clear and coherent answers to volatile policy-relevant questions such as:

·         How long does it take English language learners (ELL) to catch up academically to their native English-speaking peers?

·         Should reading instruction be provided initially through bilingual students’ first language?

·         What outcomes can be expected from different kinds of bilingual programs?

·         How important is time-on-task, understood as the amount of instructional time spent through the target language, on second language academic development?” etc.

 

Although most of these questions are addressed to some extent in the report, the answers are at best tentative and not particularly helpful for policy-purposes. This lack of incisiveness can be attributed to the focus on research in isolation from theory. Crucial theoretical issues such as the nature of language proficiency and its relationship to academic development are barely considered in the report. Similarly, the report fails to address the validity of  the competing hypotheses that have been advanced to support either English-only programs (the “time-on-task” or maximum exposure to English hypothesis [e.g. Porter, 1990]) or certain forms of bilingual program (the “linguistic interdependence” hypothesis [Cummins, 1981]). [1]

Consequently, the first question listed above is barely considered in the report despite the fact that this has been a central issue to emerge in the context of Proposition 227 in California and the many policy debates leading up to this initiative.

 Answers are given to some of the other questions; for example, the report states that “[t]he degree of children’s native-language proficiency is a strong predictor of their English-language development” (p. 28). However, the authors fail to address what this finding means for policy.  As I illustrate below, when this finding is integrated with theory in the form of the linguistic interdependence hypothesis, it provides an empirically-testable basis for interpreting the outcomes of bilingual education programs in widely different sociolinguistic and sociopolitical contexts.

 Similarly, the authors’ treatment of  the question of learning to read through a second language correctly identifies the major pattern of  research findings but then lets the issue drop as though the findings were contradictory and few policy implications could be drawn:

With respect to reading instruction in a second language, there is remarkably little directly relevant research. Clearly, one of the major intellectual stimuli to bilingual education programs has been the belief that initial reading instruction in a language not yet mastered orally to some reasonable level is too great a cognitive challenge for most learners. The evidence that better academic outcomes characterize immigrant children who have had 2 to 3 years of initial schooling (and presumably literacy instruction) in their native countries (Collier and Thomas, 1989; Skutnabb-Kangas, 1979) is consistent with the claim that children should first learn to read in a language they already speak. However, it is clear that many children first learn to read in a second language without serious negative consequences. (pp. 59-60)

By contrast, I would argue that there is a significant amount of directly relevant research. Reviews of this research from 25 years ago were able to conclude that the language of initial literacy instruction is not, in itself, a significant determinant of academic outcomes (Cummins, 1979; Engel, 1975; Wagner, 1998), although it may play a role as part of a much broader constellation of variables related to power relations and identity negotiation (Cummins, 1996).

 Finally, overwhelming amounts of data on bilingual education programs internationally (e.g. Cummins & Corson, 1997) should have permitted the authors to state definitively that the major theoretical argument underlying the push for all-English programs is without merit.  The time-on-task (maximum exposure) hypothesis predicts that any form of bilingual education that reduces the amount of instructional time through the medium of English will result in academic difficulties in English. Or, as Rosalie Pedalino Porter (1990) has expressed it: "The evidence of direct correlation between early, intensive second-language learning and high level of competence in the second language is inescapable, as is the on-task principle - that is, the more time spent learning a language, the better you do in it, all other factors being equal" (p. 119). As a theoretical proposition, this hypothesis is refuted by the outcomes of  countless bilingual programs evaluated in countries around the globe which demonstrate that students suffer no adverse effects in their mastery of the majority language (English in North America) as a result of spending significant instructional time through the minority language (see Corson, 1993; Cummins, 1996, for reviews).

 The policy implication is not that bilingual programs are necessarily “effective,” or will necessarily succeed better than alternative programs. Outcomes of any program will depend on a variety of implementation factors; rather the data and associated theory show clearly that linguistic minority and linguistic majority students in well-implemented bilingual programs (of various types) will suffer no adverse consequences as a result of spending instructional time through both languages. 

 This type of clear policy-relevant conclusion is not forthcoming from the report, not because relevant data are lacking, but because the data are viewed through a very fuzzy lens.  The authors’ conclusion seems designed to provide policy-makers on both sides of the bilingual education issue with what they want to hear:

The beneficial effects of native-language instruction are clearly evident in programs that have been labeled “bilingual education,” but they also appear in some programs that are labeled “immersion” (Gersten and Woodward, 1995). There also appear to be benefits of programs that are labeled “structured immersion” (Baker and de Kanter, 1981; Rossell and Ross, 1986); (p. 147)

 

Theory is required to bring the data into focus.  As I illustrate below, definitive answers could have been provided to many of the most contentious policy-related questions in the area of bilingual education had the authors articulated the major theoretical positions in the literature on bilingual education and examined the consistency of these positions with the available data.

 

Rossell and Baker (1996) 

The outcomes of Rossell and Baker’s review of the literature on the educational effectiveness of bilingual education are clearly stated in the Abstract:

The research evidence indicates that, on standardized achievement tests, transitional bilingual education (TBE) is better than regular classroom instruction in only 22% of the methodologically acceptable studies when the outcome is reading, 7% of the studies when the outcome is language, and 9% of the studies when the outcome is math. TBE is never better than structured immersion, a special program for limited English proficient children  where the children are in a self-contained classroom composed solely of English learners, but the instruction is in English at a pace they can understand. (p. 1)

Furthermore, the comparisons of reading scores between TBE and structured immersion showed that  structured immersion was superior in 83% of cases and no differences were observed in 17%. 

 These conclusions, published in a reputable refereed journal, and apparently based on rigorous methodological criteria, would cause any policy-maker to question the merits of transitional bilingual education.

 Cracks appear very quickly, however, in the facade of objective rationality that this review of the literature projects. One problem is immediately obvious: When we look more closely at the research studies that supposedly demonstrated the superiority of “structured immersion” over “transitional bilingual education” it turns out that 90% of these studies are interpreted by their authors as supporting the effectiveness of bilingual and even trilingual education.

 Seven of the ten studies that Rossell and Baker claim support structured immersion over TBE were studies of French immersion programs in Canada.  Typically, in these programs English-speaking students are "immersed" in French (their second language [L2]) in kindergarten and grade 1 and English (L1) language arts are introduced in grade 2.  The proportion of English instruction increases to about 50% by grades 4 or 5.  The closest equivalent to the program in the United States is dual language immersion which has repeatedly demonstrated its effectiveness for both majority and minority language students (e.g. Christian et al., 1997; Dolson & Lindholm, 1995; Thomas & Collier, 1997).  Note that, as in the U.S. dual language programs, Canadian French immersion programs are bilingual programs, taught by bilingual teachers, and their goal is the development of bilingualism and biliteracy.

 Even at the level of face validity, it seems incongruous that Rossell and Baker use the success of the Canadian French-English bilingual programs to argue for monolingual immersion programs taught largely by monolingual teachers with the goal of developing monolingualism.  This is particularly the case since two of the seven programs they cite as evidence for monolingual structured immersion were actually trilingual programs involving instruction in French, English, and Hebrew (Genesee & Lambert, 1983; Genesee, Lambert, & Tucker, 1977).

 In addition to these seven French immersion program evaluations, one of the ten studies (Malherbe, 1946) was an extremely large-scale study of Afrikaans-English bilingual education in South Africa involving 19,000 students.  The other two were carried out in the United States (Gersten, 1985; Peña-Hughes & Solis, 1980).

 The Peña-Hughes and Solis program (labelled "structured immersion" by Rossell and Baker) involved an hour of Spanish language arts per day and was viewed as a form of bilingual education by the director of the program (Willig, 1981/82). I would see the genuine promotion of L1 literacy in this program as indicating a much more adequate model of bilingual education than the quick-exit transitional bilingual program to which it was being compared. Gersten's study involved an extremely small number of Asian-origin students (12 immersion students in the first cohort and nine bilingual program students, and 16 and seven in the second cohort) and hardly constitutes an adequate sample upon which to base national policy.

 Malherbe's study concluded that students instructed bilingually did at least as well in each language as students instructed monolingually despite much less time through each language. He argues strongly for the benefits of bilingual education (however, see Krashen, 1999b, for a critique of the design of this study).

 In short, Rossell and Baker’s conclusions are immediately suspect as a result of the fact that they use the documented success of bilingual and trilingual programs to argue against bilingual education.  There are many other problems with their literature review which bring the entire enterprise of basing policy decisions primarily on “methodologically acceptable” treatment-control group studies into question. Some of the problems are briefly outlined below (see also Cummins, 1999; Dicker, 1996;  Escamilla, 1996; Krashen, 1996)

 The criteria for deciding which studies are “methodologically acceptable” are unclear and are applied in an arbitrary manner. Krashen (1999b), for example, points out significant design problems with Malherbe’s study and, as noted above, the AIR study confounds the experimental and control treatments since both groups experienced some (unknown) forms of bilingual education.

 The labels assigned to different programs are arbitrary and applied in an inconsistent manner.  For example, Rossell and Baker claim to compare French immersion (structured immersion) programs in Canada with “transitional bilingual education.” There are no transitional bilingual education programs in Canada. The El Paso Independent School District (1987) program was labeled a “Spanish-English dual immersion” program by Baker in 1992 and a “structured English immersion” program by Baker in 1998, and a “submersion” program by Rossell and Baker (1996—Appendix C, p. 72).  This program involved a “native language cognitive development” component of 90 minutes a day at grade 1,  gradually reducing to 60 minutes a day by grade 3 and 30 minutes a day by grade 4 (Gersten & Woodward, 1995; Krashen, 1996).

 Limiting the framework of discourse to exclude bilingual programs designed to promote bilingualism and biliteracy.  An additional example of arbitrary labeling  is their treatment of Legaretta’s (1979) kindergarten study.  Originally labeled a “structured immersion” program by Baker and de Kanter (1983), this study demonstrated the superiority of a 50% Spanish, 50% English kindergarten program over both English-only and other bilingual program options with respect to students’ learning of English. Rossell and Baker (1996) list this study as showing “no difference” between TBE and submersion (English-only) treatments.  Yet the program option that was significantly better than all others was neither TBE nor submersion! The consistently positive outcomes of this kind of “Enriched Education” program (Beykont, 1994; Cloud, Genesee, & Hamayan, in press) are nowhere represented in Rossell and Baker’s review.  By limiting the framework of discourse to “transitional bilingual education” versus varieties of English-only programs, they have excluded the type of dual-language program option endorsed by virtually all applied linguists and also by some academics who have been highly critical of  bilingual education (Glenn & LaLyre, 1991; Porter, 1990). There appears, in fact, to be an emerging consensus among advocates and opponents of “bilingual education” that dual language and other programs that aspire to bilingualism and biliteracy are effective in developing English academic skills among both linguistic minority and majority students (Cummins, in press).

 Rossell and Baker’s reporting of French immersion data is blatantly inaccurate. In response to critiques from Kathy Escamilla (1996) and Susan Dicker (1996) regarding the fact that French immersion programs are fully bilingual in both goals and implementation, Rossell (1996) pointed out:

In the first two years, the program is one of total immersion, and evaluations conducted at that point are considered to be evaluations of “structured immersion.”  It is really not important that, in later years, the program becomes bilingual if the evaluation is being conducted while it is still and always has been a structured immersion program" (1996, p. 383)

 

The significance of this point is that the major empirical basis of Rossell and Baker's entire argument for structured English immersion rests, according to their own admission, on the performance in French of English-background students in the first two years of  Canadian French immersion programs.  They interpret this research as follows:

Both the middle class and working class English-speaking students who were immersed in French in kindergarten and grade one were almost the equal of native French-speaking students until the curriculum became bilingual in grade two, at which point their French ability declined and continued to decline as English was increased. (p. 22)

 

Rossell and Baker seem oblivious to the fact that at the end of grade one French immersion students are still at very early stages in their acquisition of French. Despite good progress in learning French (particularly receptive skills) during the initial two years of the program, they are still far from native-like in virtually all aspects of proficiency - speaking, listening, reading, and  writing.  Most grade 1 and 2 French immersion students are still incapable of carrying on even an elementary conversation in French without major errors and insertions of English. Similarly, it is bizarre to claim, as Baker and Rossell do without even a citation to back it up, that the French proficiency of grade 6 immersion students is more poorly developed than that of grade 1 students, and to attribute this to the fact that L1 instruction has been incorporated in the program.

 The research data show exactly the opposite pattern to that claimed by Rossell and Baker. Lambert and Tucker (1972), for example, report highly significant differences between grade 1 immersion and native French-speaking students on a variety of vocabulary, grammatical and expressive skills in French, despite the fact that no differences were found in some of the sub-skills of reading such as word discrimination. By the end of grade four, however, (after 3 years of English [L1] language arts instruction), the immersion students had caught up with the French controls in vocabulary knowledge and listening comprehension, although major differences still remained in speaking ability.  Similarly, in the United States, the one large-scale "methodologically acceptable" study that investigated this issue (Ramirez, 1992) found that early-grade students in "structured immersion" were very far from grade norms in English even after four years of English-only immersion.

 In summary, to claim that two years of immersion in French in kindergarten and grade 1 results in almost native-like proficiency in French in a context where there is virtually no French exposure in the environment or in school outside the classroom flies in the face of a massive amount of  research data. This can be verified by anyone who cares to step into any of the thousands of grade 1 French immersion classrooms across Canada.

 In conclusion, Rossell and Baker’s literature review is characterized by inaccurate and  arbitrary labeling of programs, inconsistent application of criteria for “methodological acceptability,” and highly inaccurate interpretation of the results of early French immersion programs. Ironically, the data they review (both methodologically “acceptable” and “unacceptable”) does have considerable relevance for policy purposes, if the interpretive paradigm is changed.

 The Research-Theory-Policy Paradigm: Progressive Refinement of Theory to Explain and Predict Phenomena

 In most scientific disciplines, knowledge is generated not by evaluating the effects of particular treatments under strictly controlled conditions but by observing phenomena, forming hypotheses to account for the observed phenomena, testing these hypotheses against additional data, and gradually refining the hypotheses into more comprehensive theories that have broader explanatory and predictive power.  Take just one example: meteorology or climatology–the understanding and prediction of weather patterns.  What scientists do to generate knowledge in this discipline (and many others) is to observe phenomena (e.g. the conditions under which hurricanes  appear) and build up theoretical models that attempt to predict these phenomena. With further observations they test and refine their predictive models.  There is no control group, for obvious reasons, yet theory-based predictions are constantly being tested and refined.

 In the same way, I would suggest that a much wider body of research data is both theoretically- and policy-relevant than typical reviews in the area of bilingual education have suggested.  For example, case studies of particular programs or evaluations that assess student progress in relation to grade norms are potentially theoretically-relevant. They become relevant for theory and policy when their outcomes are assessed in relation to the predictions derived from particular hypotheses or theoretical frameworks.  The process is as follows:

1.                    Establish that the phenomenon is genuine and not an artifact of measurement or observational procedures (e.g. data collection errors).

2.                    Once the phenomenon has been established as genuine, ask what theoretical constructs can potentially account for the data.

3.                    The third stage involves examining hypotheses in relation to additional data (e.g. designing research to test the hypotheses explicitly). As noted above, it takes only one contrary finding to refute a theoretical proposition or cause it to be modified.

4.                    The final stage involves continual refinement of hypotheses and increasing integration into broader theoretical frameworks capable of more comprehensive explanations and accurate predictions.

 

There is nothing new in any of this.  It reflects, for example, the process whereby we came to understand the movement of the planets and countless other scientific phenomena. Why then has this process not been applied in the recent policy-oriented literature reviews relating to bilingual education?  Had this process been applied, a much clearer picture of the research findings and their implications would have emerged. The three literature reviews outlined above would have been able to communicate to policy-makers and the general public the following answers to at least some of their central questions:

·         In response to the relatively unsophisticated question, “Does bilingual education work,” the research shows clearly that successful bilingual education programs have been implemented in countries around the world for both linguistic minority and majority students and exactly the same patterns are observed in well-implemented programs: students do not lose out in their development of  academic skills in the majority language despite spending a considerable amount of instructional time through the minority language. This pattern is demonstrated in the vast majority of the 300 studies listed by Rossell and Baker (1996) as well as in the broader reviews of literature undertaken by August and Hakuta (1997) and Cummins and Corson (1997). These data are consistent with predictions derived from the interdependence hypothesis which suggests that this theoretical construct can be used as a predictive tool by policy-makers.

·         In response to the question, “Does bilingual education work better than English-only instructional programs?” no definitive answers can be given until the term “bilingual education” is defined more precisely.  The trend in much of data is that programs that aspire to develop bilingualism and biliteracy (Enriched Education programs) show much better outcomes than English-only or quick-exit transitional bilingual programs that do not aspire to develop bilingualism and biliteracy. Specific hypotheses (e.g. regarding the positive effects of bilingualism on cognitive and linguistic functioning) and more comprehensive theoretical frameworks (e.g. Ovando & Collier,1998; Cummins, 1996; Lucas, Henze, & Donato, 1990) have been advanced that are consistent with this trend for Enriched Education programs to show highly positive outcomes. However, considerably more research is required to refine these frameworks to take account of the multiple interactions that occur among variables that contribute to bilingual students’ academic success.

·         In response to the question, “Will students suffer academically if they are introduced to reading in their second language?” the research indicates that the language of initial introduction of reading is not, in itself, a determinant of academic outcomes. The linguistic mismatch hypothesis therefore has no credibility, as was evident in the 1970s (Engle, 1975; Cummins, 1979).

·         In response to the question, “Will greater amounts of English instruction (time-on-task) result in greater English achievement?” the answer is simply, no. The research data overwhelmingly fails to show any positive relationship between the amount of English instruction in a program and student outcomes.

 

In arguing for the theoretical and policy relevance of research findings that report student outcomes in relation to grade norms without direct control group comparisons, I am not suggesting that individual studies in isolation provide any definitive information.  Rather the findings of individual evaluations or research studies represent phenomena that require explanation. Specific conditions in any particular context may have contributed to program outcomes such that similar findings are not observed in contexts where these conditions are absent. What we can state is that a particular set of findings is consistent or inconsistent with hypothesis X. However, before much credibility can be placed in the general relevance of hypothesis X, it is necessary to assess its consistency with a much wider set of findings in contexts where a variety of other unique conditions may be present.  If the predictions that derive from hypothesis X are confirmed across these diverse contexts, then the credibility of hypothesis X increases significantly despite the fact that control group comparisons may not have been carried out.

 Let us take a hypothetical example. Suppose that dual language or two-way bilingual immersion programs (which usually have between 50% and 90% minority language instruction in the early grades) were to show consistently the pattern that most of those that have been evaluated to this date apparently do show: by the end of elementary school, students from majority language backgrounds develop high levels of biliteracy skills at no cost to their English (L1) academic development; students from minority language backgrounds by grade 6 show above average L1 literacy development and come close to grade norms in English (L2) academic skills. Let us suppose, hypothetically, that we have 100 such programs demonstrating this pattern from around the United States and the few programs that do not demonstrate this pattern can be shown to have been poorly implemented or not to have followed the prescribed model in some important respects.

Do these 100 programs demonstrating a consistent pattern of achievement in relation to grade norms tell us anything that is policy relevant? Rossell and Baker would say no—these studies are not “methodologically acceptable” because control groups were not used and results are reported only in relation to grade norms.

 I have argued, by contrast, that such a pattern is directly relevant to policy because it permits us to test certain theoretical predictions against the research data. Thus, the hypothetical pattern described for both minority and majority students is clearly inconsistent with the "time-on-task" hypothesis because students instructed through the minority language for significant parts of the school day suffered no adverse effects in English language academic development.  These data would also refute the linguistic mismatch hypothesis since majority language students whose initial literacy instruction was through Spanish experienced no long-term difficulty in either English or Spanish literacy skills. This pattern of data, however, would be consistent with the interdependence hypothesis which predicts that instruction through a minority language will result in no adverse consequences for academic development in the majority language.

 In conclusion, the alternative Research-Theory-Policy paradigm conforms more closely to typical scientific procedures than the virtual elimination of theoretical considerations in the Research-Policy paradigm that has dominated the recent bilingual education policy debate.  Not surprisingly, it also yields information for policy that is much more interpretable and useful. Experimental and quasi-experimental research is an appropriate approach to inquiry  but by itself is limited in its ability to answer the major policy-related questions in the education of linguistic minority students, as the inadequacies of the three literature reviews analyzed above illustrate.

 

Footnotes

1. The interdependence principle has been stated as follows (Cummins, 1981, p. 29):

 

To the extent that instruction in Lx is effective in promoting proficiency in Lx, transfer of this proficiency to Ly will occur provided there is adequate exposure to Ly (either in school or environment) and adequate motivation to learn Ly.

 

2.        Rossell and Baker (1996) do acknowledge the existence of the interdependence hypothesis but they distort it beyond recognition, attributing to me a “facilitation theory” (a label which I have never used) which predicts that minority students taught through their L1 should always perform better in English than students taught exclusively through English regardless of the conditions or sociocultural context. This is a very different prediction than that which derives from the interdependence hypothesis which is that the transfer of conceptual and linguistic knowledge across languages can compensate for the significantly reduced instructional time through the majority language (see Cummins, 1999).

References

August, D. & Hakuta, K. (1997). Improving schooling for language minority children: A research agenda. Washington, DC: National Academy Press.

Baker, K. A.& de Kanter, A. A. (1981). Effectiveness of bilingual education: A review of the literature. Washington, DC: U.S. Department of Education.

Baker, K. (1992, Winter/Spring). Review of Forked Tongue. Bilingual Basics, pp. 6-7.

Baker, K. (1998). Structured English immersion: Breakthrough in teaching limited-English-proficient students. Phi Delta Kappan (November), 199-204.

Beykont, Z. F. (1994). Academic progress of a nondominant group:  A longitudinal study of Puerto Ricans in New York City's late-exit bilingual programs. Doctoral dissertation presented to the Graduate School of Education of Harvard University.

Christian, D., Montone, C. L., Lindholm, K. J., & Carranza, I. (1997). Profiles in two-way immersion education. Washington, DC: Center for Applied Linguistics and Delta Systems.

Collier, V. P. & Thomas, W. P. (1989). How quickly can immigrants become proficient in school English? Journal of Educational Issues of Language Minority Students, 5, 26-38.

Corson, D. (1993). Language, minority education and gender:  Linking social justice and power. Clevedon, England: Multilingual Matters.

Cummins, J. (1979). Linguistic interdependence and the educational development of bilingual children.  Review of Educational Research, 49, 222-25l.

Cummins, J.  (1981).  The role of primary language development in promoting educational success for language minority students.  In California State Department of Education (Ed.), Schooling and language minority students: A theoretical framework. (pp. 3-49). Los Angeles: National Dissemination and Assessment Center.

Cummins, J. (1996). Negotiating identities: Education for empowerment in a diverse society. Los Angeles: California Association for Bilingual Education.

Cummins, J. (1999). Beyond adversarial discourse: Searching for common ground in the education of bilingual students. In I. A. Heath & C. J. Serrano (Eds.), Annual editions: Teaching English as a second language (pp. 204-224). Guildford, CT: Dushkin/McGraw-Hill.

Cummins, J. (in press). The ethics of doublethink: Language rights and the bilingual education debate. TESOL Journal.

Cummins, J. & Corson, D. (Eds.). (1998). Bilingual education. Dordrecht, The Netherlands: Kluwer Academic Publishers.

Danoff, M. N. (1978). Evaluation of the impact of ESEA Title VII Spanish-English bilingual education programs. Technical Report. Washington, DC: American Institutes for Research.

Dicker, S. (1996). RTE Forum: Letters from readers. Research in the Teaching of English, 30(1), 373-376.

Dolson, D. & Lindholm, K. (1995). World class education for children in California:

           A comparison of the two-way bilingual immersion and European Schools model. In T. Skutnabb-Kangas (Ed.) Multilingualism for all. (pp. 69-102). Lisse: Swets & Zeitlinger.

El Paso Independent School District (1987). Interim report of the five-year bilingual education pilot 1986-87 school year. El Paso, Tx: Office for Research and Evaluation.

Engle, P. (1975). The use of the vernacular language in education. Bilingual education. (Series No. 2). Washington, DC: Center for Applied Linguistics.

Escamilla, K. (1996). RTE Forum: Letters from readers. Research in the Teaching of English, 30(1), 371-373.

Fitzgerald, J. & Cummins, J. (in press). Bridging disciplines to critique a national research agenda for language-minority children’s schooling. Reading Research Quarterly, 34(3).

Gandara, P. (1997). Review of research on instruction of limited English proficient students. Davis: University of California Linguistic Minority Research Institute Education Policy Center, Davis

Genesee, F., & Lambert, W. (1983). Trilingual education for majority-language children. Child Development, 54, 105-114.

Genesee, F.,  Lambert, W., & Tucker, G. (1977). An experiment in trilingual education.  Montreal: McGill University

Gersten, R. (1985).  Structured immersion for language minority students: Results of a longitudinal evaluation.  Educational Evaluation and Policy Analysis, 7, 187-196.

Gersten, R., & Woodward, J. (1995). A longitudinal study of transitional and immersion bingual education programs in one district. The Elementary School Journal, 95(3), 223-239.

Glenn, C. L. & LaLyre, I. (1991). Integrated bilingual education in the USA. In K. Jaspaert & S. Kroon (Eds.), Ethnic minority languages and education (pp. 37-55). Amsterdam: Swets & Zeitlinger.

Greene, J. (1998). A meta-analysis of the effectiveness of bilingual education. Claremont, CA: Tomas Rivera Policy Institute.

Krashen, S.D. (1996). Under attack: The case against bilingual education. Culver City, CA: Language Education Associates.

Krashen, S. D. (1999a). Condemned without a trial: Bogus arguments against bilingual education. Portsmouth, NH: Heinemann.

Krashen, S. D. (1999b).  Why Malherbe (1946) is NOT evidence against bilingual education. Manuscript submitted for publication.

Lambert, W.E. & Tucker, G.R. (1972). Bilingual education of children: The St. Lambert Experiment. Rowley, Mass.: Newbury House.

Legaretta, D. (1979). The effects of program models on language acquisition by Spanish speaking children. TESOL Quarterly, 13,  521-534.

Lucas, T., Henze, R., & Donato, R. (1990). Promoting the success of Latino language-minority students: an exploratory study of six high schools. Harvard Educational Review, 60, 315-340.

Malherbe, E. G. (1946). The bilingual school. Johannesburg: Bilingual School Association.

Ovando, C. J. & Collier, V. P. (1998). Bilingual and ESL classrooms:  Teaching in multicultural contexts. Boston: McGraw-Hill.

Peña-Hughes, E. & Solis, J. (1980). ABCs (unpublished report). McAllen, TX: McAllen Independent School District.

Porter, R. P. (1990). Forked tongue: The politics of bilingual education. New York: Basic Books.

Ramirez, J.D. (1992). Executive summary. Bilingual Research Journal, 16, 1-62.

           Rossell, C. H. (1996). RTE Forum: Letters from readers. Research in the Teaching of English, 30,  376-385.

Rossell, C. H. & Baker, K. (1996). The effectiveness of bilingual education. Research in the Teaching of English, 30, 7-74.

.           Rossell, C. H. & Ross, J. M. (1986). The social science evidence on bilingual education. Journal of Law and Education, 15(4), 385-419.

Skutnabb-Kangas, T. Language in the process of cultural assimilation and structural incorporation of linguistic minorities. Arlington, VA: National Clearinghouse for Bilingual Education.

Thomas, W. P. & Collier, V. P. (1997). School effectiveness for language-minority students. Washington, DC: National Clearinghouse for Bilingual Education.

Wagner, D. A. (1998). Putting second language first: Language and literacy learning in Morrocco.  In L. Verhoeven & A. Y. Durgunoglu (Eds.), Literacy development in a multilingual context: Cross-cultural perspectives. (pp. 169-184). Mahwah, NJ: Lawrence Erlbaum Associates.

Willig, A. C. (1981/82). The effectiveness of bilingual education: Review of a report. NABE Journal, 6, 1-19.

Willig, A. C. (1985). A meta-analysis of selected studies on the effectiveness of bilingual education. Review of Educational Research, 55(3), 269-317.

 

 



The Case for Bilingual Education
Why Bilingual Education? by Stephen Krashen
ERIC® Clearinghouse on Assessment and Evaluation ... read more.

How effective is bilingual education? 
Elizabeth Howard, Center for Applied Linguistics... read more.

Visit James Crawford's Language Policy Web 
Possibly the most in-depth bilingual education site on the web.


Highlighted ESL Sites and Tools
Instant Multi-Language Translator
Great tool for the ESL writing workshop classroom. 

SETTING EXPECTED GAINS
for Non and Limited English Proficient Students

Edward De Avila, Ph.D.

Mathematics For Students with Learning Disabilities from Language-Minority Backgrounds: Recommendations for Teaching Diane Torres Raborn

Rethinking Schools:  Online Urban Educational Journal.


Dr. James Cummins is a leader in second language learning and literacy development research. This website is an expanding resource for educators the world over.


Top of Page
i teach i learn.com © 1999-2003
Educators. Technology. Connected.