Blog

Segregating Suburbia: A Princeton Story

Others around me have for some time been raising concerns about the emergence of boutique, suburban charter schools. Until now, I’ve largely blown off those concerns in part as I’ve questioned just how much sorting a charter school can achieve in a relatively homogeneous suburban area.

Suburbs have their own unique portfolio of schools.  One might find in any leafy suburb near a major metropolitan area a very fine local public school district, perhaps a private catholic school in certain regions of the country and in many areas, an elite private independent day school or two – oft named “day school” or “country day school.” These portfolios have been in existence, in some cases, for centuries.  At some future point, I may discuss more extensively the public private balance issue and the role that elite, and less so, private schools play when embedded in otherwise elite communities that also have relatively elite public school systems.

Rarely would one expect to find the charter school movement trying to infiltrate this environment, adding that other element to the portfolio. And if and when this does happen, what niche do they try to fill? On the one hand, one might try to establish a charter that handles the “difficult” cases from the local school system – those that might not fit particularly well in either the public system or have access to appropriate private schooling.

But, I’m reminded… though I can’t find the link right now… of a Palo Alto, CA charter school that had basically established itself as the equivalent of a publicly subsidized elite private school. [Found! By a commenter below] It’s a rather clever financial model. If elite private school tuition is running at about $30k per year per child…and per pupil cost of a quality private education program about $32 to $35k… one could either pay that price, or gather a group of close friends, and apply for a charter, where each child might receive an allotment of $10 to $15k from the local district and then parents could quietly agree to chip in the other $15k to achieve similar quality schooling to  the private option – at half the price.

Of course, there are many additional costs of getting that ball rolling, including finding and leasing space for start up years, and running capital fundraising campaigns for future years. By establishing a charter school in this way, these parents really couldn’t officially exclude others from their school or obligate private contributions within their “club”… but they sure could make any free-rider, or other resource drain on their schooling model feel uncomfortable enough to leave.

On the one hand, it might not be considered that problematic for a group of parents with “average children” in the local district to require (via establishing a charter school) that district to subsidize their quasi-private endeavor.  I would argue that it becomes more problematic when an above average income group in the community, with relatively low need children (by usual classifications), obligates the local public school district to subsidize their segregationist preferences. That is, asking those less well off than you to subsidize your quasi-private school alternative.

But, just how much sorting can a suburban charter school achieve anyway? And can a suburban charter school establish itself as a quasi-elite-private school in a market  where there are already several private schooling options. That is, would parents of advantaged children actually seek to establish a school that taxes those less well off than them, to subsidize their charter school, instead of paying the full price of tuition at local private schools?  Evidence from Princeton, New Jersey suggests that the answer to this question may in fact be yes!

Let’s take a look.

Here’s the lay of the land… from the broad viewpoint… with district housing values in the brown shading.

Princeton GlobalPrinceton is the dark area in the middle of the picture, with very high average housing values. Princeton also is home to numerous… and I mean numerous private independent day and boarding schools, many of which (along a single road) serve a large portion of school aged children from Princeton and surrounding communities and many of which have been around for a very long time. Princeton is also known for having and exceptionally strong local public district. To the south and west is Trenton, with high poverty schools including high poverty charter schools (yellow stars). Notably, in Princeton the lowest poverty “public” school is Princeton Charter School. Princeton Public Schools each have much higher rates of children qualified for free or reduced priced lunch.

Here’s a zoom in on Princeton:

Princeton ZoomWhile many of the triangles (private schools) in other parts of the state are preschools, etc., many in  Princeton are actually relatively large elite private day and boarding schools.  Rather amazingly, Princeton Charter School appears – at least by exclusion of low income children – to be positioning itself as a publicly subsidized alternative to the elite private schools and not as a more broadly accessible charter alternative.

Here’s a breakout of the details on the Princeton Charter population compared to the district:

Slide1

And here’s the composition of the special education populations:

Slide2

That is, PCS has only the mildest, lowest cost children with disabilities.

Put bluntly, these figures show that the parent population of Princeton Charter is obligating the parents of much less advantaged children, including parents of children with special education needs, subsidize their preference to have a school more like the private day schools along Great Road.

While I’m still not entirely sure what to make of this… it does concern me.

It also ought to raise questions for leaders of private school alternatives in these communities. On balance, I’ve never seen the charter school movement as a particular competitive threat to private independent day schools, as charters have often been primarily urban, serving minority populations and employing “no excuses” strategies that most parents in leafy suburbs would not find palatable for their own children.

Urban charter schools have arguably taken their toll on urban catholic school enrollments, but that’s another story.  But, to the extent that state charter policies permit the type of school establishment and segregation going on in Princeton, more an more parents may find ways to organize quasi-private-elite schools to serve their needs – effectively seeking taxpayer charity to support their country club preferences. This indeed may pose a threat to financially less well endowed private schools.

In a twisted sort of way, it’s rather like asking your local public parks department to pay for your membership to the local private country club – thus reducing the quality of services to others who really don’t have access to the country club (even if it proclaims it’s open to all comers).

Much more to ponder here… but the numbers on Princeton Charter School certainly raise some serious red flags.

Note: In New Jersey and elsewhere, there are numerous other taxpayer subsidies that support private schooling, ranging from property tax exemptions and exemptions on charitable gifts, to textbook subsidies (loans from local districts) and transportation reimbursements. So, to an extent, all private schools and privately schooled children are receiving some level of subsidy at taxpayer expense. But, that level increases dramatically if/when the local public district is also required to hand over the full annual operating expense per child.

Friday Story Time: Deconstructing the Cycle of Reformy Awesomeness

Once upon a time, there was this totally awesome charter school in Newark, NJ. It was a charter school so awesome that its leaders and founders and all of their close friends decided they must share their miracle with the world in books on the reasons for their awesomeness, including being driven by data and teaching like a champion!

The school’s break-the-mold – beating the odds – disruptively innovative awesomeness was particularly important during this critical time of utter collapse of the American education system which had undoubtedly been caused by corrupt self-interested public school teachers (& their unions) who had been uniformly ill-trained by antiquated colleges and universities that themselves were corrupt and self-interested and generally in the business of selling worthless graduate degrees.

In fact, the undisputed awesomeness of this North Star Academy could, in theory, provide the foundation for a whole new approach to turning around the dreadful state of American education.

And thus came the Cycle of Reformy Awesomeness, which looks something like this:

Slide1

Built on the foundation of awesomeness established by THE North Star Academy, since teachers are the undisputed most important in school factor determining student outcomes, the awesomeness of North Star could be attributed primarily to the quality of the teachers and innovative practices they used in their data driven classrooms!

Thus, by extension, we must establish new institutions of teacher preparation whereby these truly exceptional teachers (of 3 to 5 years experience) not only are provided the opportunity to share their expertise on a personal collaborative level with their own colleagues, but rather, we should let these teachers be the instructors in a new graduate school of education (regardless of academic qualifications) and we should actually let them grant graduate degrees in education to their own colleagues.

This new approach of letting teachers in a school grant graduate degrees to their own work colleagues (and those in other network schools) could lead to rapid diffusion of excellence and would most certainly negate the corrupt perverse incentives pervasive throughout the current, adult oriented self-interested American higher education system! Disruptive innovation indeed!

And so their founders and disciples took their show on the road. They took their show to state departments of education to urge fast-tracked uncritical promotion of their cycle of awesomeness. They gained leverage on local boards of education in nearby school districts to promote diffusion of their awesomeness. And they set out to other state departments of education to share their insights on how to achieve awesomeness with drive by data… excuse me… being driven by data!

And driven by data they were… for example… absolutely all of the kids in their school passed that test in high school.

And there was much rejoicing.

Slide2And that one too:

Slide3

 

And there was much rejoicing.

And they were only getting better, and better and better:

Slide4

And there was much rejoicing.

And better:

Slide5The more they looked at their own data – well, really only one measure of their data – the more they patted themselves on the back, congratulated their own reformy awesomeness and shared it with the world. And the state!

Slide6

And there was much rejoicing.

Yup… 100% graduation rate… which is totally unheard of for a high poverty, urban high school in dreadful Newark, NJ! [or at least for a school that happens to be located in the high poverty city of Newark].

A true miracle it was… is… and shall be. One that must be proliferated and shared widely.

But alas, the more they shared, the more they touted their awesomeness, the more it started to become apparent that all might not be quite so rosy in North Star land.

As it turned out, those kids in North Star really didn’t look so much like those others they were apparently so handily blowing out on state tests….

Slide11

And there was complete freakin’ silence!

Somehow, this rapidly growing miracle school was managing to serve far fewer poor children than others (except a few other charter schools also claiming miracle status) around them.

And, they were serving hardly any children with disabilities and few or none with more severe disabilities.

Slide12

And again there was complete freakin’ silence!

And if that was the case, was it really reasonable to attribute their awesomeness to the awesomeness of their own teachers – their innovative strategies… and the nuanced, deep understanding of being driven by data?

Actually, it is perhaps most befuddling if not outright damning that such non-trivial data could be so persistently ignored in a school that is so driven by data?

And there was complete freakin’ silence!

But alas, these were mere minor signals that all might not be as awesome as originally assumed.

It also turned out that of all the 5th graders who entered the halls of awesomeness, only about half ever made it to senior year – year after year after year after year… after year.

Slide14

And for black boys in the school, far fewer than that:

Slide15

And there was complete freakin’ silence!

And in any given year, children were being suspended from the school at an alarming rate.

Slide13

Again… raising the question of how a school driven by data could rely so heavily on a single metric – test scores and pass rates derived from them – to proclaim their awesomeness, when in fact, things were looking somewhat less than awesome.

Could a school really be awesome  if only the fewer than half who remain (or 20% of black boys who remain) pass the test? Might it matter at least equally as much what happened to the the other half who left?

Was it perhaps possible that the “no excuses” strategies endorsed as best practices both in their school and in their training of each other really weren’t working so well…and weren’t the strategies of true teaching champions… but rather created a hostile and oppressive environment causing their high attrition rate? Well… one really can say this one way or the other…

Regardless of the cause, what possibly could such a school share with those traditional supposedly failing public schools who lacked similar ability to send the majority of their children packing? Further, what possibly could the rather novice teachers in this school charged with granting their own co-workers graduate credentials share with experienced researchers and university faculty training the larger public school teacher workforce?

Alas the miracle was (is) crumbling.

But that miracle wasn’t just any ol’ miracle. Rather, it was the entire foundation for the reformy cycle of awesomeness! And without that foundation, the entire cycle comes crumbling down.

Slide16

No miraculously awesome charter school [in fact, one might argue that any school with such attrition is an unqualified failure].

Thus no valid claim of miraculous teachers and teaching.

Thus no new secret sauce for teacher preparation.

All perpetrated with deceptive and in some cases downright fraudulent (100% graduation rate?) presentation of data.

And thus the search continues… for the next miracle… and the next great disruptive innovation to base on that miracle… whatever… wherever it may be.

 

 

 

 

 

 

 

The “Ed Schools are the Problem” Fallacy

I had the displeasure of waking up to this drivel in my in-box this morning:

“Those who can, do. Those who can’t, teach. And those who can’t teach, teach teaching.”

http://www.nytimes.com/2013/10/21/opinion/keller-an-industry-of-mediocrity.html?_r=0

yeah… and those completely lacking in critical thinking, basic research and data interpretation skills write op-eds for the Times.

I don’t really teach teachers myself, so I guess I shouldn’t take offense. But I do mainly because the core argument advanced here is so ill-informed and poorly conceived.

Allow me to start by pointing out that I have actually written detailed, quantitative research in peer reviewed journals on the very topic of who’s teaching the teachers. In fact, the article we wrote was done partly in response to the Arthur Levine report cited in the Times op-ed piece. And it’s not as if the article title really conceals its contents:

  • Wolf‐Wendel, L., Baker, B. D., Twombly, S., Tollefson, N., & Mahlios, M. (2006). Who’s teaching the teachers? Evidence from the National Survey of Postsecondary Faculty and the Survey of Earned Doctorates. American Journal of Education, 112(2), 273-300.

My apologies for the fact that this article is fire-walled. I really don’t expect all of my blog readers to go through the trouble of paying for it or finding an academic library that carries it. But any responsible journalist, pundit or author proclaiming a strong policy position on this issue ought to at least do some reading on the topic first. The above article is certainly not uncritical of teacher preparation. [UPDATE: Full version here, courtesy of the kind folks at AJE Baker2006]

And the issues of complexity and variation in teacher preparation I explore in the above research article are not the only massive omission or conflation put forth in the New York Times piece, which operates on the overly crude assumption of a uniform system of content-free instruction across any and all ed schools.

Let’s tackle the bigger and much simpler issue here – the broad notion advanced in this op-ed that Ed Schools are the problem!  Ed Schools are the primary threat to the quality of our public schooling system as a whole and by extension Ed Schools are a threat to our national security. [yeah… he didn’t really say that… but somehow it often goes there] And further, that if we can just replace ed schools – with some other unknown thing – we’ll all be better off.

A kinder, gentler variant on this argument is that it’s just the bad ed schools that are a threat and that we can weed out those bad ed schools by looking at how the students of their graduates perform. I’ve addressed this issue in a few previous blog posts. First, I’ve addressed the question of whether “ed school” is really some static, monolithic entity. Second, I’ve addressed the feasibility of rating ed schools by twice removed outcome measures.

But there’s actually a simpler logical fallacy at play here which lies at the root of many reformy arguments regarding causes and consequences – failure to acknowledge that  the U.S. has a wide range of elementary and secondary of schools that are both high performing and low performing and that the defining features differentiating higher and lower performing schools are not found primarily in their teachers or the preparation programs they attended – or whether they attended any at all – but rather in the communities they serve, the resources available to them and the backgrounds, health and economic well-being of the children and families they serve.

This is not about the poverty as excuse argument. This is about the simple point that our highest performing public schools also employ teachers from traditional public college and university preparation programs and in many cases, teachers from the same – or substantively overlapping – college and university preparation programs as teachers in our lowest performing schools in the same region.

If that’s the case, then how is it possible that teacher preparation programs are the problem?

I know… the good reformer at this point is thinking – but there are no good U.S. public schools or districts. They all suck and that’s precisely why teacher preparation is the problem. Of course, if that was the case – that all K-12 public schools suck – it would hard to, by research design – with a dependent variable that doesn’t vary – attribute that sucky-ness to a single cause.  But the dependent variable does vary… even when we rely on reformy resources like the Global Report Card I wrote about here.

First, here’s a location where you, yourself can actually download the reformy report card, which in large part was designed to shake the confidence of America’s suburban parents by taking a few statistical leaps to show them their leafy suburban schools wouldn’t stack up so well if we transported them to Finland or Singapore.

http://globalreportcard.org/docs/Global-Report-Card-Data-11.14.12.xlsx

I’ll save that argument for another day, and just select two sets of districts from this report card, from Illinois and Kansas, because I have the data readily available. Let’s look at local public school districts that are

1) Better than the Average Fin and those that are…

2) Worse than 80% of beer-swillin’ Hockey Lovin’ Canadians.

That’s quite a contrast (even though both are high performing countries – on average – setting aside demographics, etc.).

Here are the lists:

Slide1

Slide2

So, we’ve got some school districts in each state that are better than the average Finnish school and some that get trampled by the those syrup swillin’ hosers from the Great White North.

The only plausible explanation is that the teachers in the Better than Finland category are either from completely non-traditional ed schools or not ed schools at all while  the teachers in the not-so-great schools all come from your typical state ed school.

Certainly, we know from large bodies of teacher labor market research that graduates of various preparation programs, colleges and universities and alternative route programs more broadly,  sort themselves on the labor market, with those who possess stronger academic credentials often sorting into the “more desirable” jobs.

But that’s somewhat of an aside here. For the basic reformy premise of massive uniform ed school failure to be true – we would have to see little or no commonality in the ed school preparation of teachers across these settings – across totally awesome U.S. schools and totally sucky ones.

So, here’s the recent distribution of graduates of Kansas teacher preparation programs in the Kansas City metropolitan area which includes the Blue Valley School district – better than the average  Fin and Kansas City Kansas, which, well, gets its butt kicked by Canada!

Slide3

Hmmm… you can’t possibly be telling me that both KCK and BVSD have teachers who graduated from the major state teacher preparation colleges can you? If that’s the case, then their relative international rankings might not be determined by teacher preparation?

[ignore the poverty shading in the background…’cuz payin attention to poverty… well… just isn’t cool with the reformy crowd!]

There are some notable features to this map. One is that BVSD and and Olathe to its west were still significantly growing districts during this period. So it makes sense that they hired a lot of new teachers during that time. It makes less sense that KCK, more stagnant (and declining) in population hired so many new teachers – but for the relatively high turnover rate more common in such high poverty settings! There are also some distributional differences in the dots – which universities produce more teachers for which districts (or provide more credentials). Pittsburg state (blue dots) more prevalent in KCK provides a local program that feeds to KCK. I’d be hard pressed, however, to lay blame on Pitt state for KCK’s Canadian butt-whoopin’ and I’d be equally hard pressed to credit K-state in producing more teachers for Blue Valley as the cause of Blue Valley’s competitive match up with Finland! The fact is that all of these Kansas districts draw heavily on teachers produced by the public teachers colleges of that state – and some do as well as Finland while others struggle.

As such, it’s pretty darn hard to lay blame on traditional teacher preparation in Kansas for these differences in outcomes.

Now, let’s take a look at a few high performing and lower performing districts in Illinois.

First, here are the top 15 undergraduate degree producers for Chicago and Aurora East and for Naperville and Lake Forest. Rather than from the degree producers perspective, these data simply include all instructional staff in these districts, downloadable here: http://www.isbe.state.il.us/research/xls/2012-tsr-public-dataset-instr.xlsx

The data include where teachers got their undergraduate and advanced degrees.

Slide5

Wow… there’s actually quite a bit of overlap in the institutions. Sure, there are differences. Where a state name is listed, the teacher received his/her undergraduate degree from an un-named institution in that state (such are the shortcomings of state administrative data). The City of Chicago does have larger shares from some Chicago based programs. But there’s also overlap and there’s significant overlap for the state’s major public teacher preparation institutions, like Illinois State University, Northern Illinois University and the University of Illinois main campus (Champaign). How can that be? How can there possibly be school districts that compete favorably with Finland while employing graduates of traditional teachers colleges?

While the percentages of teachers in these districts who attended any one preparation institution tend to be small, the shares who attended major public preparation institutions for their bachelors degrees appears marginally larger in the high performers (Over 10% for both IL State and Northern).

That’s impossible! But… But… But… graduates of those same colleges are teaching in districts that got whooped by the Canadians? So how can we possibly place blame for systemic failure of American schools on teacher prep programs? I’m struggling with the logic here.

One more look… here are the advanced degree granting institutions for teachers in higher versus lower performing Chicago area districts. Note that “NULL” refers to those not holding (or reporting) advanced degrees and that the share holding only a bachelors degree is higher in the lower performing districts (poorer, minority districts).

Slide6

Again, these degrees – which in include both initial and additional certifications – are dominated by traditional credential granting institutions with substantial overlap across teachers between higher and lower performing schools.

This is a separable but related issue to the evaluation of ed schools by student outcome measures. I’ll continue digging more into that issue in future posts.

It is certainly hard to make a compelling case that traditional teacher preparation institutions are the primary cause of our supposed lagging national education system when our highest performing schools – those that compete favorably with Finland – also employ in large number, graduates of those preparation programs and in many cases employ significant numbers of graduates of the same programs that provide teachers for our supposed failing schools.

$500 million? No! $3 BILLION! That’s $3BILLION! Comments New York State’s Underfunding of NYC Schools

New York’s Governor Cuomo has been big on words promising NOT TO FUND New York State schools and squeeze them to the maximum extent possible with layers of cuts and caps. After all – NOT FUNDING SCHOOLS is the most noble of endeavors – that along with declaring death penalties for those underfunded, high need schools that post low average test scores.

The Governor’s most recent anti-school-funding attack comes in response to NYC Mayoral Candidate Bill de Blasio’s campaign promise to push for universal preschool for city school children. At least as characterized in a New York Daily News editorial, the noble Governor is again set to dig in his heals against any additional spending on schools:

Laying claim to big ideas, Bill de Blasio has promised to deliver universal all-day pre-kindergarten, paid for by raising taxes on wealthy New Yorkers.

The attractive concept helped boost de Blasio to a commanding lead over Republican Joe Lhota, and is key to his education program. He calls universal pre-k “how we will start to close the achievement gap” between minority and white children.

Now is the time for voters to consider whether de Blasio has a prayer of fulfilling his pledge to provide services to 70,000 4-year-olds, along with after-school programs for middle-schoolers.

He’d need a half-billion dollars a year and has staked all on convincing Albany to okay a city income tax hike on high earners.

Bill, meet Andrew:

Gov. Cuomo says no.

In an interview with the Daily News Editorial Board, Cuomo made clear that he has no intention of pressing the Legislature to give de Blasio the $500 million in tax money he’s counting on.

Read more: http://www.nydailynews.com/opinion/bill-rude-awakening-article-1.1487839#ixzz2i5NJcOJT

So, as the Daily News would characterize, the Governor is incensed that de Blasio would even consider the absurd possibility of needing an additional half a billion – that’s HALF A BILLION DOLLARS! to finance the frivolity of universal preschool.

Yeah… HALF A BILLION DOLLARS sure sounds like an obscene number. But let’s reflect for a moment on just how much money the state of New York, under the leadership of Governor Cuomo continues to come up short in financing the state school finance formula that was adopted back in 2007 in order to comply with a NY State High Court ruling that funding levels at that time for New York City were inadequate.

In 2012-13 and again in 2013-14 – New York State continues to short NYC on general state aid for schools to the tune of around 3 BILLION DOLLARS! Yeah… that’s 6 times the seemingly obscenely huge funding request for de Blasio’s pre-k proposal.

Let’s take a look. First, here’s a graph of the relationship between state aid shortfalls per pupil and the state’s own pupil need index. Larger circles represent larger (enrollment) districts. As can be seen here, in 2013-14, based on March/April 2013 adopted budget figures (state aid run worksheets), New York City – the bowling ball in the picture – is shorted by just under $3,000 per pupil in State Aid! That’s $3,000 PER PUPIL IN STATE AID!

Slide1Now, here’s how the funding formula that got the state out from under litigation is supposed to work.

A district’s target funding level per pupil is supposed to be a function of a) the foundation level of funding per pupil [appropriately inflated to represent current year costs], times b) the pupil need index for each district times c) the regional cost index for each district times d) the number of “aidable foundation pupil units” (which is an enrollment count including some adjustments for special education and other factors).

The adjusted foundation amount per pupil in NYC is $16,562 in 2013-14.

Bear in mind that even this figure is based on rigged analyses that severely underestimate actual needs and costs.

Then, the state determines the share of that figure to be covered by the district and balance to be covered by the state. The state share for NYC is supposed to be $7,006.

Take that figure times the total aidable foundation pupil units (TAFPU) and you’ve got….. $8.8 BILLION DOLLARS!

THAT’S $8.8 BILLION DOLLARS!!!!!

Slide2But alas, that’s far more than the state actually allocates to the City of New York through the foundation aid program it adopted to get out from under litigation brought by the city of New York – because funding was (and still is) inadequate!

I’m not even going to quibble (here and now) over the broader conception of “adequacy” involved, or the fact that the state concocted ways to reduce their estimated targets. The fact is that even though they set a low bar and further lowballed their funding targets, they’ve (meaning the Governor and Legislature) chosen to not even come close to funding those targets!

Instead, the state begins by freezing the underlying foundation aid level to past levels, setting NYC’s foundation aid level to just under $6.4 billion.

Yeah… sounds like a lot… but that’s already well short of what the city is supposed to get. And that’s just the first CUT.

Next, the state applies what it calls the Gap Elimination Adjustment (and then partially restores that cut, to make it seem like a gift), further reducing state aid to NYC down to just under $5.9 billion.

That’s a total state aid shortfall of nearly $3 billion! THAT’S $3 BILLION!!!!!!!!!!!  [with the figure fluctuating around $3 billion from year to year – the figure was higher for 2012-13]

So before the good Governor Cuomo decries the obscene half a billion dollar request presented by future mayor(?) de Blasio, it might be wise for him to reflect on the state’s own past and still relevant promises to New York City… promises that would rightfully (constitutionally… as per the high court decision in Campaign for Fiscal Equity v. State) drive an additional $3 Billion to NYC.

THAT’S $3 BILLION!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

 

More Thoughts on Interpreting Educational/Economic Research: DC Impact Study

Today brings us yet another opportunity to apply common sense interpretation to an otherwise seemingly complex research study – this time on the “effectiveness” of the DC Impact teacher evaluation system on improving teaching quality in the district. The study, by some of my favorite researchers (no sarcasm here, these are good, thoughtful individuals who do high quality work) is nicely described in the New York Times Economix Blog section:

To study the program’s effect, the researchers compared teachers whose evaluation scores were very close to the threshold for being considered a high performer or a low performer. This general method is common in social science. It assumes that little actual difference exists between a teacher at, say, the 16th percentile and the 15.9th percentile, even if they fall on either side of the threshold. Holding all else equal, the researchers can then assume that differences between teachers on either side of the threshold stem from the threshold itself.

The results suggest that the program had perhaps its largest effect on the rate at which low-performing teachers left the school system. About 20 percent of teachers just above the threshold for low performance left the school system at the end of a year; the probability that a teacher just below the threshold would quit was instead above 30 percent.

In addition, low-performing teachers who remained lifted their performance, according to the system’s criteria. To give a sense of scale, the researchers noted that the effect was about half as large as the substantial gains that teachers typically make in their first years of teaching combined.

http://economix.blogs.nytimes.com/2013/10/17/a-new-look-at-teacher-evaluations-and-learning/?_r=1&

The study: http://cepa.stanford.edu/sites/default/files/w19529.pdf

So, for research and stats geeks this description speaks to a design referred to as regression discontinuity analysis. It sounds complicated but it’s really not.  The idea is that whenever we stick cut-points through some distribution of ratings or scores – through messy/noisy data – some people fall just below those cut-points and others just above. But the cut-points are really arbitrary and those who fall just above the line really aren’t substantively, or even statistically significantly different from those who fall just below the line. It’s almost equivalent (assumed equivalent for research purposes) to taking a group of otherwise similar individuals and randomly assigning to some, one score (below the line) and others another score (above the line).

In one application of this approach, researchers from Harvard studied the effect of high stakes high school exit exams on student in Massachusetts. Students who barely passed the test were compared with those who barely failed the test on the first try. In reality, missing or making one or two additional questions does not validly indicate that one child knows their math better than the other. The kids were otherwise comparable, but some were labeled failures and others successes. Those labeled failures were more likely to drop out of high school and less likely to attend college.

The conclusion – that these arbitrary, non-meaningful distinctions adopted in policy are harmful.

This brings us to the present study on the DC Impact teacher evaluation system. Here, the researchers identified teachers who were really no different from one another statistically on their DC Impact ratings, but some were just a few fractions of a point low enough to be labeled as Ineffective and face threat of dismissal, and others just high enough to be out of the woods for now. That is, there really aren’t any substantive observed quality differences between these two groups. Note that the researchers studied this at the high end of the ratings distribution as well, but didn’t really find as much going on there.

Put simply, what this study says is that if we take a group of otherwise similar teachers, and randomly label some as “ok” and tell others they suck and their jobs are on the line, the latter group is more likely to seek employment elsewhere. No big revelation there and certainly no evidence that DC Impact “works.”

Rather, arbitrary, non-meaningful distinctions are still consequential. This is largely what was found in the Massachusetts high stakes testing studies.

Actually, one implication for supervisors is that if you want to get a teacher you don’t like to leave your school, find a way to give them a bad rating. But I think most supervisors and principals could already figure that one out.

Here’s an alternative experiment to try – take a group of otherwise similar teachers and randomly assign them to group 1 and group 2. We’ll treat Group 1 okay… just okay… no real pats on the back or accolades. Group 2 on the other hand will be berated and treated like crap by the principal on a daily basis and each day in passing, the principal will scowl at them and say… “your job’s on the line!.”

My thinking is that group 2 teachers will be more likely to seek employment elsewhere. Not hugely different from the DC Impact research framework and nor are the policy implications. Does this mean that teacher evaluation works – has appropriate labor market consequences. No… not at all. It means that arbitrary differential treatment matters.

Of course, this would be an unethical experiment unlikely to make through IRB approval. But heck, screwing with people’s lives via actual arbitrary and capricious employee rating schemes, adopted as policy is totally okay.

As for the second conclusion… that those who do stay appear to improve their game…it certainly makes sense that individuals would try not to continue being the whipping boy… even if they perceive their prior selection as whipping boy to be arbitrary and capricious. Notably, the bulk of the evaluations in this study were based on observed behaviors not test-based metrics, and with observations, teachers have more direct control over what their supervisors observe and can therefore respond accordingly. Whether these behavior changes have anything to do with better actual on-the-job performance – “good teaching” – is at least questionable.

 

 

The Value Added & Growth Score Train Wreck is Here

In case you hadn’t noticed evidence is mounting of a massive value-added and growth score train wreck. I’ve pointed out previously on this blog that there exist some pretty substantial differences in the models and estimates of teacher and school effectiveness being developed in practice across states for actual use in rating, ranking, tenuring and firing teachers – and rating teacher prep programs – versus the models and data that have been used in high profile research studies. This is not to suggest that the models and data used in high profile research studies are ready for prime time in high stakes personnel decisions.  They are not. They reveal numerous problems of their own. But many if not most well-estimated, carefully vetted value-added models used in research studies a) test alternative specifications including use of additional covariates at the classroom and school level, or include various “fixed effects” to better wash away potential bias and b) through this process, end up using substantially reduced samples of teachers for whom data on substantial samples of students across multiple sections/classes within year and across years are available (see, for example: http://nepc.colorado.edu/files/NEPC-RB-LAT-VAM_0.pdf ). Constraints imposed in research to achieve higher quality analyses often result in loss of large numbers of cases, and result potentially in clearer findings, which makes similar approaches infeasible where the goal is not to produce the most valid research but instead to evaluate the largest possible number of teachers or principals (where seemingly, validity should be an even greater concern).

Notably, even where these far cleaner data and  far richer models are applied, critical evaluators of the research on the usefulness of these value-added models suggest that… well… there’s just not much there.

Haertel:

My first conclusion should come as no surprise: Teacher VAM scores should emphatically not be includ­ed as a substantial factor with a fixed weight in conse­quential teacher personnel decisions. The information they provide is simply not good enough to use in that way. It is not just that the information is noisy. Much more serious is the fact that the scores may be system­atically biased for some teachers and against others… (p. 23)

https://www.ets.org/Media/Research/pdf/PICANG14.pdf

Rothstein on Gates MET:

Hence, while the report’s conclusion that teachers who perform well on one measure “tend to” do well on the other is technically correct, the tendency is shockingly weak. As discussed below (and in contrast to many media summaries of the MET study), this important result casts substantial doubt on the utility of student test score gains as a measure of teacher effectiveness.

http://nepc.colorado.edu/files/TTR-MET-Rothstein.pdf

A really, really, important point to realize is that the models that are actually being developed, estimated and potentially used by states and local public school districts for such purposes as determining which teachers get tenure, or determining teacher bonuses or salaries, who gets fired… or even which teacher preparation institutions get to keep their accreditation?…. those models increasingly appear to be complete junk!  

Let’s review what we now know about a handful of them:

New York City

I looked at New York City value-added findings when the teacher data were released a few years back.  I would argue that the New York City model is probably better than most I’ve seen thus far and its technical documentation reveals more thorough attempts to resolve common concerns about bias. Yet, the model, by my cursory analysis still fails to produce sufficiently high quality information for confidently judging teacher effectiveness.

Among other things, I found that only in the most recent year, were the year over year correlations even modest, and the numbers of teachers in the top 20% for multiple years running astoundingly low. Here’s a quick summary of a few previous posts:

Math – Likelihood of being labeled “good”

  • 15% less likely to be good in school with higher attendance rate
  • 7.3% less likely to be good for each 1 student increase in school average class size
  • 6.5% more likely to be good for each additional 1% proficient in Math

Math – Likelihood of being repeatedly labeled “good”

  • 19% less likely to be sequentially good in school with higher attendance rate (gr 4 to 8)
  • 6% less likely to be sequentially good in school with 1 additional student per class (gr 4 to 8)
  • 7.9% more likely to be sequentially good in school with 1% higher math proficiency rate.

Math [flipping the outcome measure] – Likelihood of being labeled “bad”

  • 14% more likely to be bad in school with higher attendance rate.
  • 7.9% more likely to be sequentially bad for each additional student in average class size (gr 4 to 8)

https://schoolfinance101.wordpress.com/2012/02/28/youve-been-vam-ified-thoughts-graphs-on-the-nyc-teacher-data/

New York State

Then there are the New York State conditional Growth Percentile Scores.  First, here’s what the state’s own technical report found:

 Despite the model conditioning on prior year test scores, schools and teachers with students who had higher prior year test scores, on average, had higher MGPs. Teachers of classes with higher percentages of economically disadvantaged students had lower MGPs. (p. 1) http://schoolfinance101.files.wordpress.com/2012/11/growth-model-11-12-air-technical-report.pdf

And in an astounding ethical lapse, only a few paragraphs later, the authors concluded:

The model selected to estimate growth scores for New York State provides a fair and accurate method for estimating individual teacher and principal effectiveness based on specific regulatory requirements for a “growth model” in the 2011-2012 school year. p. 40 http://engageny.org/wp-content/uploads/2012/06/growth-model-11-12-air-technical-report.pdf

Concerned about what they were seeing, Lower Hudson Valley superintendents commissioned an outside analysis of data on their teachers and schools provided by the state.  Here is a recent Lower Hudson Valley news summary of the findings of that report:

But the study found that New York did not adequately weigh factors like poverty when measuring students’ progress.

“We find it more common for teachers of higher-achieving students to be classified as ‘Effective’ than other teachers,” the study said. “Similarly, teachers with a greater number of students in poverty tend to be classified as ‘Ineffective’ or ‘Developing’ more frequently than other teachers.”

Andrew Rice, a researcher who worked on the study, said New York was dealing with common challenges that arise when trying to measure teacher impact amid political pressures.

“We have seen other states do lower-quality work,” he said.

http://www.lohud.com/article/20131015/NEWS/310150042/Study-faults-NY-s-teacher-evaluations

That’s one heck of an endorsement, eh? We’ve seen others do worse?

Perhaps most offensive is that New York State a) requires that if the teacher receives a bad growth measure rating, the teacher cannot be given a good overall rating and b) the New York State Commissioner has warned local school officials that the state will intervene “if there are unacceptably low correlation results between the student growth sub-component and any other measure of teacher and principal effectiveness.”  In other words, districts must ensure that all other measures are sufficiently correlated with the state’s own junk measure.

Ohio (school level)

In brief, in my post on Ohio Value Added scores, at the school level, I found that year over year correlations were nearly 0 – the year to year ratings of schools were barely correlated with themselves and on top of that, were actually correlated with things with which they should not be correlated.  https://schoolfinance101.wordpress.com/2011/11/06/when-vams-fail-evaluating-ohios-school-performance-measures/

New Jersey (school level)

And then there’s New Jersey, which, while taking a somewhat more measured approach to adoption and use of their measures than in New York, has adopted measures which appear to be among the most problematic I’ve seen.

Here are a few figures:

Slide9Slide10Slide11Slide12Slide13And here is a link to a comprehensive analysis of these measures and the political rhetoric around them. http://njedpolicy.files.wordpress.com/2013/05/sgp_disinformation_bakeroluwole1.pdf

Conclusions & Implications?

At this point, I’m increasingly of the opinion that even if there was a possible reasonable use of value-added and growth data for better understanding variations in schooling and classroom effects on measured learning, I no longer have any confidence that these reasonable uses can occur in the  current policy environment.

What are some of those reasonable uses and strategies?

First, understanding the fallibility of any one model of school or teacher effects is critically important, and we should NEVER, NEVER, NEVER be relying on a single set of estimates from one model specification to make determinations about teacher, or school… or teacher preparation program effectiveness. Numerous analysis using better data and richer models than those adopted by states have shown that teacher, school or other rankings and ratings vary sometimes wildly under different model specifications. It is by estimating multiple different models and seeing how the rank orders and estimates change that we can get some better feel for what’s going on (knowing what we’ve changed in our models), and whether or the extent to which our models are telling us anything useful. The political requirement of adopting a single model forces bad decision making and bad statistical interpretation.

Second, at best the data revealed by multiple alternative models might be used as exploratory tools in large systems to see where things appear to be working better or worse, with respect to producing incremental changes in test scores, where test scores exist and are perceived meaningful. That’s a pretty limited scope to begin with. But informed statistical analysis may provide guidance on where to look more closely – which classrooms or schools to observe more frequently. But, these data will never provide us definitive information that can or should be used as a determinative factor in high stakes personnel decisions.

But that’s precisely the opposite of current policy prescriptions.

Unlike a few years back, when I was speculating that such problems might lead to a flood of litigation regarding the fairness of using these measures for rating, ranking and dismissing teachers, we now have substantial information that these problems are real.

Even more so from a litigation perspective, we have substantial information that policy makers have been made aware of these problems – especially problems of bias in rating systems – and that some policymakers, most notably New York’s John King have responded with complete disregard.

Can we just make it all stop! ???

Notes on the Seniority Smokescreen

Seniority, in the modern reformy lexicon, is among the dirtiest words. Senior teachers are not only ineffective and greedy and never put interests of the children over their own, but they are in fact downright evil, a persistent drain on state and local economies and a threat to our national security! By contrast, “effectiveness” is good and since seniority and effectiveness are presumed entirely unassociated, the simple solution is to replace any reference to seniority in current education policies with measures of “effectiveness.”

If only it was so simple. This modern reformy mantra grossly misinterprets the relationship between seniority and effectiveness, presumes currently available measures of effectiveness to be more useful than they really are at sorting “good” from “bad” teachers, ignores that the proposed solutions have in many cases been found NOT to solve the supposed problem, and is oblivious to the broader literature on teacher labor markets, compensation and the quality of the teaching workforce.

Seniority and Effectiveness

Numerous studies over time have shown that as teachers reach somewhere around their 5th year, student achievement gains under those teachers begin to grow more slowly and to an extent level off.[1]  These findings, to the extent we believe that these metrics of test score gain adequately represent teaching effectiveness, do not by any stretch of the imagination mean that more experienced teachers are less effective. Rather, their effectiveness increases from year to year level off. If they have indeed reached their optimal performance then it makes sense to continue to compensate senior teachers in order to retain them. A constant cycle of replacement costs money and costs in terms of lost effectiveness during the start-up years.[2]

Seniority and Fairness to Children

One argument is that seniority preferences in teaching assignments permit senior teachers to hold on to jobs in schools against their principals preferences and that these seniority privileges often lead to the neediest children having the least experienced teachers – as the more experienced teachers get the cushiest jobs in the district. On the one hand, this assertion acknowledges that new teachers, not senior ones, may in fact be the least effective. On the other hand, it’s simply not supported by research. Two separate studies, one on Seattle schools after implementing “mutual consent” hiring, and the other exploring Florida seniority contractual provisions have found that a) implementation of mutual consent initially exacerbated inequities across schools and ultimately led to no change[3] and b) that seniority provisions in contracts had no statistical relationship to inequitable distributions of teachers across schools and children.[4] Further, mutual consent hiring policies a) assume that central office decisions are necessarily bad and principals’ decisions necessarily good, b) they ignore that principal quality itself may be inequitably distributed and c) they ignore that central office is responsible for assigning principals.

Seniority and Layoffs

Another argument is that removing seniority preferences in cases of reduction in force (RIF) will necessarily lead to an improved teaching workforce as measured by student achievement outcomes. The argument is that seniority preferences must be replaced by “effectiveness” metrics that are predictably related to future effectiveness. First, such “effectiveness” measures are typically only available for core content classroom teachers between grades 3 and 8. In those relatively rare cases where Reduction in Force is actually implemented, the 20% of teachers for whom such metrics are available are least likely to be reduced and there typically exists significant latitude in deciding which programs and positions might be reduced first. Most reductions in force, which happen infrequently to begin with, chip away at other programs and services before ever approaching core subject area teachers.  Second, these effectiveness measures are wildly erratic and often substantially biased by who the teacher is teaching.[5]  In reality, these policies propose to replace seniority with a roll of the dice, or even a roll of rigged dice. Third, when implementing a 5% across the board cut, for example, effectively eliminating teachers at random by experience rather than eliminating only the newest teachers results only in nickels and dimes savings – certainly not enough to make even the smallest dent in the degree of persistent underfunding of Philadelphia schools, where debate on these policies is ongoing.[6]

While seniority is a seemingly arbitrary and imperfect measure for retaining teachers, replacing it with a roll of the dice is likely to have serious negative consequences for retaining high quality senior teachers and recruiting teachers into high need districts.

Bigger Picture Wage Issues

Most important to recruiting and retaining a quality teacher workforce in any given school district are a) the relative compensation and working conditions a teacher can expect throughout their career when compared with other career options in the same labor market and b) the relative compensation and working conditions a teacher can expect in one school or district versus another in the same labor market.[7]

Paying teachers competitively, offering good working conditions, including smaller class sizes and other resources, and providing job security are far more likely to produce the teaching workforce our children need.


[1] Rice, J. K. (2010). The Impact of Teacher Experience: Examining the Evidence and Policy Implications. Brief No. 11. National center for analysis of longitudinal data in education research.

[2] Ronfeldt, M., Loeb, S., & Wyckoff, J. (2013). How teacher turnover harms student achievement. American Educational Research Journal, 50(1), 4-36.

[4] Cohen-Vogel, L., & Feng, L. (2013). Seniority Provisions in Collective Bargaining Agreements and the “Teacher Quality Gap”. Educational Evaluation and Policy Analysis.

[5] see for example, Haertel, E. H. (2013). Reliability and Validity of Inferences about Teachers Based on Student Test Scores.  http://atlanticresearchpartners.org/wp-content/uploads/2013/07/PICANG14.pdf

[7] Richard J. Murnane and Randall Olsen (1989) The effects of salaries and opportunity costs on length of state in teaching. Evidence from Michigan. Review of Economics and Statistics 71 (2) 347-352

David N. Figlio (1997) Teacher Salaries and Teacher Quality. Economics Letters 55 267-271. David N. Figlio (2002) Can Public Schools Buy Better-Qualified Teachers?” Industrial and Labor Relations Review 55, 686-699.

Ronald Ferguson (1991) Paying for Public Education: New Evidence on How and Why Money Matters. Harvard Journal on Legislation.  28 (2) 465-498.

Susanna Loeb and Marianne Page (2000) Examining the link between teacher wages and student outcomes: the importance of alternative labor market opportunities and non-pecuniary variation. Review of Economics and Statistics  82, 393-408.

Susanna Loeb and Marianne Page (19980 Examining the link between wages and quality in the teacher workforce. Department of Economics, University of California, Davis.

Figlio, D.N., Rueben, K. (2001) Tax Limits and the Qualifications of New Teachers. Journal of Public Economics. April, 49-71

Ondrich, J., Pas, E., Yinger, J. (2008) The Determinants of Teacher Attrition in Upstate New York. Public Finance Review 36 (1) 112-144

Pauvre, Pauvre NYC Charter Schools?

There’s nothing really new in this post. I’m just revisiting data and figures that I’ve addressed over and over in this blog – drawn from this report and this conference paper. I’m reposting this information because many seem to quickly forget or totally ignore what we already know and the current debate over whether the city of New York should charge charter schools rent is clouded by the usual mix of non-information, lack of information, disinformation and catchy (though false) statements on t-shirts.

These data are from 2008-2010 and at some point I will update these analyses. But, while downloading, parsing and analyzing NYC district school data is relatively straightforward, it  remains a more burdensome task to get a complete picture of charter school financing in NYC and most other locations for that matter (searching through non-profit filings, etc.). That, in and of itself, raises serious accountability concerns [see the extent of footnotes needed in the above report to clarify our various concerns over clarity, completeness, accuracy and precision of charter school financial reporting].

Another important note is that conditions in district schools in New York City have continued to decline… with larger and larger class sizes each year… and persistent underfunding of the state school finance system. Thus, it is quite possible that the class size and other advantages charters held over district schools between 2008 and 2010 are much greater now.

So, what do we know about NYC charter schools? [and to be clear, this is an NYC specific issue… which, if you read the above report and paper… plays out differently, for example, in Ohio and Texas]

First, NYC charter schools have historically served much less needy student populations than their same grade level district school counterparts in the same borough of the city:

Slide1

Second, New York City charter schools in many cases spend far more on a per pupil basis than do district schools serving similar student populations, at the same grade level in the same borough.

Slide2

Third, Class Sizes at the elementary and middle school level tend, on average to be smaller, and in many cases much smaller (5 to 10 students per class smaller) in charter schools than in district schools.

Slide3

Fourth, even with these resource advantages, New York City charter schools show very mixed performance outcomes compared to same grade level district schools serving similar student populations in the same borough.

Slide4

My intent here is not to argue whether the city should or should not charge $2,700 per pupil rent to charters. Clearly, the effect of such a policy would fall disparately across charter school operators – where some are far more advantaged than others.  It is important that we not simply accept the rhetoric of the pauvre, pauvre charter school that faces such awful mistreatment under possible city policy changes.

The big issue here – the overarching issue – regards the extent of inequities in access to resources that persist across the city system. Inequities exist across district schools by neighborhood and students served.

An important finding in the figures above is that huge inequities persist within the charter sector – a sector that has been selectively advantaged by the current administration’s policies over the past decade.

Whoever becomes the next Mayor of NYC must consider how the whole system fits together and how that system can generate the best distribution of opportunities for all children.

Equity is a necessary concern and one that is not resolved by providing, endorsing or expanding choices among inequitable alternatives.

Maps of NYC Charter and District School % Free Lunch 2010-11 (NCES Common Core of Data)

Mapping NYC Charter Schools

harlem charters

Bronx ChartersBkln charters

Paying Economists by Hair Color? Thoughts on Masters Degrees & Teacher Compensation

In previous posts, I’ve conveyed my distaste for the oft obsessively narrow thinking of the traditional labor economist when engaged in education policy research. I’ve picked on the assumption that greed and personal interest are necessarily the sole driving force of all human rational decision making. And I’ve picked on the obsession with narrow and circular validity tests. Yet still, sometimes, I see quotes from researchers I otherwise generally respect, that completely blow my mind.

I gotta say, this quote from Tom Kane of Harvard regarding compensation for teachers holding masters degrees is right up there with the worst of them – most notably because it conveys such an obscenely narrow perspective of compensation policies (public or private sector) and broader complexities of labor market dynamics.

The quote comes to us from the Wall Street Journal the other day:

“Paying teachers on the basis of master’s degrees is equivalent to paying them based on hair color,” said Thomas J. Kane, an economist at the Harvard Graduate School of Education and director for the Center for Education Policy Research. Mr. Kane said decades of research has shown that teachers holding master’s degrees are no more effective at raising student achievement than those with only bachelor’s, except in math. Researchers have also shown that teachers with advanced degrees in science benefit students. (Wall St. Journal)

http://online.wsj.com/article/SB10001424052702304795804579101723505111670.html

The broad implication of the quote by Kane is that immediate, measurable differentials  in teacher compensation should only ever be directly associated with characteristics, indicators, behaviors of teachers that can be directly associated with differences in measured student test score differences from time 1 to time 2. Here… and NOW! That’s it. Any and all pay differentials MUST be associated with estimated test score gains on reading and math! The extension of this logic is that if there exists no statistically estimated relationship between a teacher having a masters degree in X, Y or Z and their students test score gains in that or the next year (perhaps), then no compensation should exist for this characteristic. Apparently, the same would apply for the teacher with the additional year of experience if they could not show a marginal gain to their students test scores over what they had previously achieved with less experience.

There are two gaping holes in this logic (setting aside the huge questions of the validity of the test score outcome metric as most important in defining student success and attributing that success to the teacher).

First, the research on “masters” degrees in education has pretty much addressed questions related to whether holding something ambiguously classified as a masters degree is positively associated with test score gains. In fact, these studies have found that holding content area masters degrees in math is associated with gains in math achievement.[1]

BUT, that research has not to my knowledge asked the broader labor market question regarding whether school districts offering additional pay for holding or pursuing masters degrees achieve a recruitment advantage on the labor market for teachers, or any other benefits to their workforce and children they serve. That’s a totally different question and one that requires being able to think beyond the laughably narrow mindset that the ONLY benefit that can ever matter and should ever warrant additional public expenditure is that which contributes directly and immediately to test score gains (of the students with the specific teacher being compensated for their masters degree).

Quite honestly, it’s this same utterly ridiculous thinking that plagues Kane’s Measures of Effective Teaching studies for the Gates Foundation – the assumption pervasive at every step of the project that the one and only valid indicator of teacher quality is value-added itself and all other measures should be evaluated by their ability to predict value added. Because a teacher’s prior year value added is the best predictor of current year value added, and a better predictor than observations, student surveys, etc., therefore value-added is the best measure of teacher effectiveness!

Second, both from a practical perspective and with potential broader labor market implications, there are many, many reasons why a local public school district or private school… or other business entity for that matter might wish to provide additional compensation for their employees who chose to advance their education, either related specifically to their current job title and responsibilities, or not!

It may be entirely reasonable for local public school districts to provide additional compensation for teachers seeking graduate credentials that expand their possible involvement in district or school activities, such as achieving additional training to work with special needs populations, or additional content certifications, or for that matter additional training to engage in all of the new teacher observations Kane and others now seem to think are necessary for getting rid of bad teachers (even though his own work on MET did not support his own conclusion in this regard). That is, you might want to have the salary differential available for the utility player.

It may also be an entirely reasonable approach for school districts to view providing additional compensation for furthering one’s education as a useful tool for retaining teachers – especially those who themselves show interest in expanding their own knowledge/learning.

In both this, and the previous case, the additional degrees or credentials obtained may actually have no direct relationship to the current primary responsibilities of the teacher. Does that mean they are entirely useless? That they should not be in any way associated with differentiated compensation not only until they are used, but until they are used in such a way that we can estimate that the additional credential has led to test score gains?

That’s just freakin’ asinine.

And this reductionist thinking really needs to stop.

A few other points are in order here. As I’ve shown in previous posts, the pursuit of the education masters degree takes many forms and has drifted over years. See the following figures.

Slide4

Slide5

Indeed, more creative thinking about how and when we choose to compensate graduate degrees through salary differentials is important to consider. But it would be utterly foolish to consider only immediate contribution to student test score gain as the single valid metric for making this decision.

Also, there already exists some variation in the ways in which masters degrees tend to be compensated across local public school districts. On average, I have found in studying teacher wage data in large diverse metropolitan areas that it is in fact the more affluent suburban districts that a) tend to have larger shares of teachers holding masters degrees and b) tend to provide a bigger bump in salary associated with masters degrees. Here are the New Jersey figures from a few years back.

Slide2

Slide1

And in the Chicago metro area, based on prior work, a teacher in a majority minority (student population) district is only 69% as likely as a teacher in a non-majority minority district (in the same labor market and holding other teacher characteristics constant) to hold a master’s degree and a teacher in a district that is 100% minority is only 60% as likely to hold a master’s degree as a teacher in a district that is 0% minority.

This figure displays the salary differentials by degree level and experience:

Slide3Teachers in majority minority districts are much less likely to hold a master’s degree than teachers in other districts in the same labor market. On average, a teacher in a majority minority district – at the same degree level and experience as a teacher in a non-majority minority district – is making about $2,000 less in annual salary. Because a master’s degree “bump” in salary is worth an average of about $8,500 ($8,481) in annual salary, large shares of teachers in majority minority schools are earning over $10,000 less than teachers at comparable experience levels in non-majority minority districts in the same labor market.

So, in other words, if masters degrees are such an obscene inefficiency in our public education system, they are an inefficiency most among the affluent suburban districts and less so in major urban districts (it is actually common to find that financially less constrained districts spend less efficiently – at least in terms of direct relationship to measured outcomes).

Meanwhile, the policies endorsed by Kane (via the MET project) are at least implicitly far more focused on fixing the supposed inefficiencies of our major urban education systems.

Perhaps, just perhaps, if those suburban districts next door did not pay such large differentials for and recruit so aggressively the teachers with masters degrees, their urban neighbors might have a chance to recruit some of those same teachers. But in the current environment, pushing urban districts to remove any and all compensating differentials related to anything not tied directly and immediately to test score gains will undoubtedly do far more harm than good.


[1] Goldhaber, D. D., & Brewer, D. J. (1998). When should we reward degrees for teachers?. The Phi Delta Kappan, 80(2), 134-138.

 

Other articles related to teacher wages and quality

Richard J. Murnane and Randall Olsen (1989) The effects of salaries and opportunity costs on length of state in teaching. Evidence from Michigan. Review of Economics and Statistics 71 (2) 347-352

David N. Figlio (1997) Teacher Salaries and Teacher Quality. Economics Letters 55 267-271. David N. Figlio (2002) Can Public Schools Buy Better-Qualified Teachers?” Industrial and Labor Relations Review 55, 686-699.  Ronald Ferguson (1991) Paying for Public Education: New Evidence on How and Why Money Matters. Harvard Journal on Legislation.  28 (2) 465-498.

Susanna Loeb and Marianne Page (2000) Examining the link between teacher wages and student outcomes: the importance of alternative labor market opportunities and non-pecuniary variation. Review of Economics and Statistics  82, 393-408. Susanna Loeb and Marianne Page (19980 Examining the link between wages and quality in the teacher workforce. Department of Economics, University of California, Davis.

Figlio, D.N., Rueben, K. (2001) Tax Limits and the Qualifications of New Teachers. Journal of Public Economics. April, 49-71

Ondrich, J., Pas, E., Yinger, J. (2008) The Determinants of Teacher Attrition in Upstate New York. Public Finance Review 36 (1) 112-144