Software Won’t Fix Boeing’s ‘Faulty’ Airframe (eetimes.com)
412 points by farseer on March 28, 2019 | 401 comments



Boeing's software fix, announced today, is to compare readings from both angle-of-attack sensors and disable MCAS if they disagree significantly. The obvious question is: why didn't they do this in the first place?

One possibility is incompetence. But Boeing engineers are smart people, so I'm not convinced by this. The elephant in the room is the requirement to maintain a common type rating with older 737 models.

Suppose they did originally do what the fixed software does now, and disable MCAS if the AoA sensors disagree. The problem Boeing face is that with MCAS disabled when this occurs, the plane no longer flies like an older 737. They'd need to indicate the AoA disagree to the pilots, and tell them that MCAS was disabled. Now what? A pilot certified and trained on the older 737 would not know how the Max now differs from what they trained on. If they'd done this, they'd have needed to provide additional training, and Boeing management must have worried that this would jeopardize the common type rating. Hence it seems likely they didn't add the AoA sensor comparison for this reason, reasoning that it was unlikely to be a problem anyway. We now know that reasoning was flawed.

What does this mean going forwards? Will EASA and other CAAs refuse to certify the modified 737 Max under the same type rating as the older 737? This certainly seems possible. If they did require a separate type rating, this would likely kill 737 sales, regardless of whether the plane is now safe.
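To make the announced cross-check concrete, here's a minimal sketch of that logic in Python; the threshold value and the function name are my own assumptions, not anything Boeing has published:

    DISAGREE_THRESHOLD_DEG = 5.5   # assumed value, not an official figure

    def mcas_permitted(left_aoa_deg, right_aoa_deg):
        """Allow MCAS trim commands only while both AoA vanes roughly agree."""
        return abs(left_aoa_deg - right_aoa_deg) <= DISAGREE_THRESHOLD_DEG

    # A vane stuck at 22.5 deg against a healthy vane reading 4.0 deg:
    print(mcas_permitted(4.0, 22.5))   # False -> MCAS stays inactive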


> One possibility is incompetence. But Boeing engineers are smart people, so I'm not convinced by this.

That's still a possibility. Stupid decisions can emerge out of smart people.

Boeing is huge, and what they develop is incredibly complex. There are a lot of people with differing levels of competence, ethics, and goals.

For example (I am not saying that happened), the engineers designing MCAS didn't expect incorrect AoA data, thinking the checks were done elsewhere. At the same time, the "sensors" team thought that raw, unchecked data was expected. The integration guy didn't read the specs correctly (sometimes, it comes down to a single word), didn't catch that, and checked the OK box. His manager, focused on a more pressing issue, took that for granted and it went to production.

It is possible that the engineers did excellent work, but didn't question the specs they had. The integration guy is normally super reliable but he just had a bad day. And his manager handled the other problem beautifully and overlooked the MCAS/AoA because, normally, the integration guy is reliable. A series of small mistakes that ended in a catastrophe.

There are a lot of safeguards but the complexity is so high that sometimes, something goes through. Especially if the company is under pressure.


> For example (I am not saying that happened), the engineers designing MCAS didn't expect incorrect AoA data, thinking the checks were done elsewhere. At the same time, the "sensors" team thought that raw, unchecked data was expected. The integration guy didn't read the specs correctly (sometimes, it comes down to a single word), didn't catch that, and checked the OK box. His manager, focused on a more pressing issue, took that for granted and it went to production.

> It is possible that the engineers did excellent work, but didn't question the specs they had. The integration guy is normally super reliable but he just had a bad day. And his manager handled the other problem beautifully and overlooked the MCAS/AoA because, normally, the integration guy is reliable. A series of small mistakes that ended in a catastrophe.

What you describe here would be a major failure of systems engineering on the project.

The systems engineers are responsible for flowing top level requirements down to the individual systems. They are responsible for ensuring the specs the engineering teams receive for their systems are correct, and for handling requests to change said specs. If the spec of the output of the AoA sensors does not match the spec flowed down to other teams on the input from those sensors the systems engineers responsible did not do their jobs.

Systems engineers exist to manage complexity like this, to ensure that the various engineering teams across the various disciplines are technically coordinated by providing clear, consistent specs and interfaces for them to work to. If that didn't happen, then I would not say those engineers are competent, let alone smart. It would be especially disappointing to me (as a former systems engineer) if this were the case, as systems engineering is the last place I would expect such incompetence from a company like Boeing. It's at the core of everything they do.

I want to address one specific point in a different context.

> the engineers designing MCAS didn't expect incorrect AoA data, thinking the checks were done elsewhere

You always expect out-of-spec conditions to be a possibility and have something in place to handle those conditions appropriately. To not do so is incompetence bordering on negligence.


With their last new aircraft, this happened many times. For example, they discovered that the pieces didn't fit together: https://www.seattletimes.com/business/boeing-finds-787-piece...

I agree that someone with a title like Systems Engineer should be responsible for such issues, but (having worked at Boeing) not every group has systems engineers overseeing their work. Or they're working in a different city and you get a phone conference with them once a month.

This should be no surprise. Conway's Law in action. You can look at the issues that Boeing has and guess to a reasonable degree of accuracy what their organizational structures look like.

> You always expect out-of-spec conditions to be a possibility and have something in place to handle those conditions appropriately. To not do so is incompetence bordering on negligence.

I don't think it's possible to build an airplane if you have to expect this from every system. Any flying machine has many single points of failure. An F-15 once famously flew and landed after losing a wing. This is clearly out-of-spec but no aircraft considers this condition a realistic possibility or has contingency procedures for it.

Exercise for the reader (and maybe a good interview question for a manager): draw the organizational structure for a team to design and manufacture a new airliner. List the single points of failure on the aircraft, and point to who is responsible for them. Would you fly on this? What did you miss?


Two planes crashed in less than 5 months. So yes, there was either a systems engineering failure, or a completely inept decision in choosing an oversized engine for an antiquated airframe. Your choice.


Or a training requirement failure. Or a UX failure. Or a documentation failure. Or an unrelated failure, given that the RCAs haven't been completed on the crashes.

It's amazing the hubris software engineers have in assuming everyone else is an idiot.

I am not an expert on aircraft-scale hardware/software codesign projects, aerospace engineering, high-reliability engineering (to aerospace standards), or complex systems analysis. I strongly suspect you, and most of HN, aren't either.

Boeing has people with all those skillsets. As does Airbus. As do NTSB and the variety of national certification agencies.

So before we toss rocks of certainty around in a comment thread, maybe wait and listen to the experts?


Given that at least some of the expert companies involved have attempted coverups of previous failures before, a little bit of skepticism is healthy. https://en.m.wikipedia.org/wiki/Boeing_737_rudder_issues

Authority should not be free from skepticism, even by people who are not authorities.


We can agree that not all skepticism is valuable skepticism though?

I came across this Rand report on the NTSB [1, 1999?]. The conclusions about funding & staffing levels vs. the modern accident load were not good: partly due to more incidents, but more so due to increased systems complexity per incident.

So the NTSB, with a budget of around USD$100M, takes multiple years to deliver a report. And they're a professional worldwide standard on accident investigation. Suffice to say, we're not going to crack this case open on HN.

Which isn't to say it isn't productive to debate the relative merits of different regulatory approaches, takeaways for other disciplines, lessons learned, and all manner of things. But let's just have some humility instead of pretending we're all experts on everything [2].

[1] https://www.rand.org/content/dam/rand/pubs/monograph_reports...

[2] Though in fairness, I wouldn't be surprised if there were at least one expert on any given topic here. You folks are awesome.


How are those "rocks of certainty"? Two planes crashed, resulting in the deaths of hundreds of people. There is validity in questioning "rocks of certainty". These people were trusted to certify the plane. These people were trusted to make the correct choices after the first crash. Obviously, the "experts" did not do what they had to do.


There is nothing about a "Software Engineer" which precludes them from achieving a grasp of the finer points of physical engineering disciplines.

Finite element analysis, error propagation, systems analysis, statistical process control, mechanics of materials, and all the other equations we've come to rely upon are equally applicable by anyone with the wherewithal to learn them, and are in fact more likely to be picked up by someone who spends 80% of their time learning "the next tool".

In short, while I agree many software engineers may be in possession of an inflated sense of competence, there are many who are legitimate polymaths. This makes appeals of the sort you assert less than useful in the pursuit of substantive conversation.


Time.


Time is the fire in which we burn, understood. That's why we learn things. So we can predict things, and save time. There are 75+ years in your lifetime. Even autodidactically you should be able to sift through everything in

https://www.engineeringtoolbox.com

and use that as the starting point to build up some deep dives into authoritative literature.

When you can start to accurately predict outcomes, you're on the right track.

You'll know you're in the right neighborhood when you have an increasingly hard time tolerating poor models of reality, and start drifting off thinking about how easy it would be to make that thing if you only had the tools.

Happy adventures in the wonderful world of Math!

(Don't forget to read the Ethics handbook, and if you want to do it professionally, take the FE, PE, and never forget the iron ring!)


Or test failure?


If you look at plane crash reports, the disaster is almost always the result of a chain of events. Taken individually, each event is almost insignificant. It's the combination that makes the disaster.

In the case of the Boeing 737 Max, the sequence would be something like:

- Boeing decided to fit an oversized engine on an older airframe, which can cause a dangerous stall in some unusual configurations. It is not that much of a problem; they just reduced the flight envelope to exclude these configurations. It is not an unusual thing at all.

- In order to make sure that dangerous situation is never encountered and to limit training expenses for the pilots, they used software. Then again, not unusual.

- In order to avoid writing entirely new software, they extended the functionality of existing software (MCAS). Again, fine.

- First problem: the MCAS, now a critical piece of software wasn't properly requalified.

- Second problem: The AoA sensor, now critical, isn't properly managed.

- Third problem: The pilots weren't properly trained on what to do when MCAS starts crapping out

- Fourth problem: The pilots didn't have enough skill to deal with the unexpected situation.

- Fifth problem: Maybe some pilots dealt with the problem correctly but the report wasn't properly made and accounted for.

Any one of these steps done right would have averted the disaster. We can't really blame a single one.


Some investigative journalism indicates that this is not what happened.

The FAA has different levels of redundancy and uptime requirements based on the outcome of system failure. A system categorized as “catastrophic” failing would lose the plane and all passengers, while “hazardous” might hurt some and kill a few, and so on. The FAA requires that catastrophic systems must have a backup, while hazardous can go without one if it’s reliable enough.

Boeing classified the MCAS as “hazardous” only under certain flight characteristics, and they categorized it as a lower level than hazardous during level flight. This means that the crews designing the specification made the decision to depend upon only a single input, and it appears to have been built to spec[0].
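For context, the probability budgets behind those categories are commonly summarized roughly as follows (guideline figures; the single-sensor check below is my own simplification, not the FAA's method):

    # Commonly cited AC 25.1309-style probability budgets per flight hour (guideline figures).
    FAILURE_BUDGET_PER_FLIGHT_HOUR = {
        "catastrophic": 1e-9,   # "extremely improbable"
        "hazardous":    1e-7,   # "extremely remote"
        "major":        1e-5,   # "remote"
    }

    def single_sensor_acceptable(classification, sensor_failure_rate_per_hour):
        """Crude check: can a lone sensor meet the budget for this classification?"""
        return sensor_failure_rate_per_hour <= FAILURE_BUDGET_PER_FLIGHT_HOUR[classification]

    # A vane failing around 1e-5 per hour squeaks past a "major" budget
    # but not a "hazardous" one -- which is why the classification mattered so much.
    print(single_sensor_acceptable("major", 1e-5))      # True
    print(single_sensor_acceptable("hazardous", 1e-5))  # False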

It also appears that the FAA was experiencing internal pressure to delegate more certification authority to Boeing, with disastrous results.

0: https://www.seattletimes.com/business/boeing-aerospace/faile...


> Boeing is huge, and what they develop is incredibly complex. There are a lot of people with differing levels of competence, ethics, and goals.

That is true... however it's also true that both of the big aerospace companies have an abundance of very smart engineers who by their very nature would have been responsible enough and inquisitive enough to notice this defect even if it was not their direct responsibility (good engineers will step out of their box).

Knowing how vital that sensor is (regardless of the stupid reason for it being vital) and not building in redundancy is such an obvious flaw to any engineer. The reason, almost certainly, will have been that multiple observations of this safety flaw were squashed by some big execs due to business requirements, economics, and a severe lack of morals.


>good engineers will step out of their box

Malignant cultures can do a lot to reduce this. The platonic ideal engineer will cross any number of organizational boundaries to deliver a needed piece of information, but there can be a lot of pressure to leave those boundaries un-crossed. Depending on how political the organization is, a strange engineer (or their manager) dropping in from nowhere to tell you a mistake was made may even be seen as a bad thing.


In addition aerospace, like most other traditional engineering fields, is heavily biased towards seniority. Years of experience actually matter[1], and more junior engineers often have an uphill battle trying to convince more senior engineers of errors. In aerospace in particular it's a side effect of having systems so complex that first impressions of a subsystem can give you intuitions that are wrong in the context of the larger system, even though they may seem right in isolation.

[1] More than they should, in my opinion.


The Challenger disaster is a good example of an engineer who did warn the right people but they ignored him; I would bet bottom dollar something similar happened with these Boeing tragedies. The pressure to launch is great and who wants to be the person who stalls "progress".


Every single human on earth would agree that this is what should be done, but this is naive. Of course it should be done. How do we actually accomplish it?

    good engineers will step out of their box
Good engineers are often prevented or discouraged from stepping out of their boxes by organizational structures.

Engineers also typically have intense workloads.

Getting a solid week's worth of engineering done is hard enough as it is.

Heroically crossing boundaries and leaping outside of organizational boxes to solve problems carries, at the very least, a high risk of falling behind on other work assigned to you.

In defense of organizations (and those who create them), coordinating hundreds or thousands of people is really hard. How do you create a strong enough structure to avoid chaos, while simultaneously allowing people to step outside/across that structure when the need arises? It ain't easy.

Also, and I don't know anything about how Boeing operates, but I suspect they already are successful at this to a large degree. Their record is not spotless, including some cover-ups they're guilty of, but it is very good. They would not be a world leader in aviation with highly competitive planes if they weren't pretty good at this stuff.


This problem does not take heroic leaps outside of boundaries and risk to careers... it's so obviously wrong that simply pointing it out should suffice. This isn't another Challenger problem; this is staring-anyone-in-the-face, blindingly fucking obvious.

My point is that this _would_ have happened, people will have pointed it out, because it takes so little effort and because it's such an obviously wrong flaw. But some exec will have overruled them multiple times because money > safety.


Engineering had a perfectly good solution - a warning system that lets the crew know when the AoA sensors were out of whack.

Unfortunately, somebody else decided to make it optional equipment. And some airlines decided not to pay for it.

So uh, tell us how this was an engineering problem, exactly?


... I'm saying it's NOT an engineering problem, it's a problem of people NOT listening to engineers. I don't think I could be much more clear.


My mistake! Sorry! I think I must have conflated a few different comments in my mind.


That mistake of one team not talking to the other about AoA would have been caught at system-level hazard analysis, which is a requirement for any half-decent engineering organization churning out safety-critical systems. Heck, it is part of the required submission to the FDA for medical devices that are considered even a moderate safety risk, let alone an airplane.


>> It is possible that the engineers did excellent work, but didn't question the specs they had.

The reason I find that scenario hard to believe is that MCAS's very existence is to correct a design problem inherent in the plane. A safety problem: the plane is not safe without MCAS, or at least not without new training, which they wanted to avoid.


Supposedly the history is that the single sensor input was decided when the MCAS could only exert a fairly small amount of change to the stabilizer. Later, the amount of control given to MCAS was drastically increased, apparently without considering the ramifications of a single sensor feed.


The explanation I’ve read (sorry, don’t recall the source) is that this system repurposed a purely advisory one that existed before, for informing the pilot about AoA. Being advisory in nature, it didn’t need duplicate sensors.

Next up: you need to implement MCAS and everything is conveniently there in existing code.

That, plus a toxic culture for the internal people doing FAA certification work (yes, really), plus what you write, plus business pressure on certification...

Edit: source https://syonyk.blogspot.com/2019/03/boeing-airbus-tesla-and-...


What about integration/subsystem-level testing? It seems MCAS wasn't thoroughly tested with all boundary values. What's the use of systems like IAHM [1] if they can't predict what faults can occur or how safety is ensured/guaranteed in various scenarios?

[1] https://www.phmsociety.org/sites/phmsociety.org/files/Fielde...


I agree with you that this was ALL about keeping the type rating. I wish the government would offer a whistleblower award to anyone inside Boeing who could prove that this was indeed true, especially since it seems that that is how the software originally operated. Companies will do whatever it takes to drive sales and revenue and stock price. Employees don't want to raise their hand and get fired, as they have families to support. A true whistleblower program with WITSEC level provisions for protection and monetary support would help cut this down. Once it happens once or twice, companies are very disincentivized to continue down this road.


The current generation of executives seem to be willing to gamble in this way. They know that as long as they have money, they won't see much (if any) prison and the chances of even getting to a trial will be minimal.

I don't think that good whistleblower protection would help any. The justice system needs an overhaul to be blind to someone's wealth and color.


I bet this would be virtually impossible to prove, and career suicide for anyone who tried it.

These aren't mustache-twirling villains who distribute memos that say "Let's ignore safety issues to get this approved faster". They really believe they're doing what's best for all involved. We'll save everyone time and money, and make it easier for pilots. How is that not a good thing? We have no reason to believe safety will be compromised.

Predicting the safety implications of design decisions, years in the future, is not an easy task. If the AOA sensors (I think?) were a tiny bit more reliable, we'd never have seen a problem, and the MAX program would be considered a great success in efficiency.

I'm sure we've all worked for managers who made decisions we disagreed with, but couldn't prove they were making the wrong one.


However, they did actively choose not to put in redundancy and the status systems in the cockpits, which should have been there out of the box following any common sense and failure mode analysis procedures. This decision itself is enough to bring this to court, and as a result the internal communications on these decisions will be explored.


I am not in aviation, but if they added redundant sensors and a new indicator, with an accompanying change to the flight manual for how to react to said indicator, wouldn't that have run contrary to the goal of "no new training/certification required"?


I agree. That and the resulting sales impacts would seem to be the key motives to these decisions.


There are likely document retention policies that limit how long the relevant emails and text messages may be retained. There could be Word files or other documentation that captures discussions still floating around though.

Hopefully whoever is investigating this is acting fast to acquire the emails before they automatically get wiped.


From what I've read, the assessed severity of the failure of aviation systems is rated. The rating for this system was not severe enough to require redundancy based on the assigned rating. I'll update my comment if I can find a reference.


>These aren't mustache-twirling villains who distribute memos that say "Let's ignore safety issues to get this approved faster". They really believe they're doing what's best for all involved. We'll save everyone time and money, and make it easier for pilots. How is that not a good thing? We have no reason to believe safety will be compromised.

That is an incredibly naive reading of the situation. For starters, there is no such rationale as 'reason to believe'; this is a highly regulated process for good reason, one which requires testing and verification.

Sure it could be true, but the far more likely motivation is along the lines of Dieselgate. And yes, it can be proven that managers make bad decisions that incur legal liability.


Naive? I worked at Boeing for a couple years, and was on a software team where I was regularly asked to do things which flew in the face of the known best practices of the industry. (My team is not to blame for this. It wasn't for the 737MAX, it wasn't avionics, and the project was cancelled long before it was at all usable.)

It's not as "highly regulated" as you might want to believe. They talk a good game about CMMI but if you try to improve something they remind you that CMMI is only about process, not quality. Hurry up and ship something (deadlines!), and if it's not perfect we'll find it in test.

Given that the company has this culture, I find it much more plausible that this is to blame for their product issues. I don't need to hypothesize a big evil conspiracy to explain bad software.

In fact, that's true of almost every software organization. The James Bond joke [1] fell flat precisely because you don't need a James Bond villain to get buggy software. It's what you get by default.

[1] https://www.youtube.com/watch?v=jm4Rll9axkQ


    career suicide for anyone who tried it.
I think that's what the parent poster meant by "a true whistleblower program with WITSEC level provisions for protection and monetary support would help cut this down."

A lot of people in a lot of industries might come forward if they didn't have to commit career suicide to do it.


Sure, but you can't guess who might be able to prove it. Are you going to offer WITSEC to every engineer who happens to disagree with their manager? Half the programmers I've ever worked with were annoyed by management and thought they were being asked to implement terrible decisions.


I agree with you that this was ALL about keeping the type rating. I wish the government would offer a whistleblower award to anyone inside Boeing who could prove that this was indeed true, especially since it seems that that is how the software originally operated.

Why would a whistleblower award be necessary? Boeing has been very open about this being the reason for the MCAS system; it's been discussed in a number of articles from NYT and WaPo.


The thing they lied about was that the plane flew the same as the old 737s. So maybe I need to be more specific and say I want someone inside Boeing who could prove they knew it flew differently and lied about it to keep the type rating. Keeping the type rating is fine as a goal as long as it is true. Two crashes and hundreds dead prove it is absolutely false.


A whistleblower wouldn't change anything. The reasoning behind the choices is obvious, and it isn't illegal.


It is just one idea. The hope would be that someone could blow the whistle before planes crashed and people died.


Last time I looked up the meaning of the word, whistleblowing wasn't reserved only for illegal practices.


As I understand it, a simple software fix is not possible according to regulation.

The problem is as follows, as you partly described: two sensors are not enough. If MCAS is an important part of flight safety, a simple redundant safety system is not enough, because an airplane is not about functional safety but mission-critical safety. In functional safety, if there is an error, the safety function is triggered and the system is transferred into a safe state. But there is no safe state here. If the system is mission critical, then it is not safe to assume it can simply be switched off in case of an error. That means that for a mission-critical system we need at least 3 readings, so that a vote can decide which reading is most likely the correct one.
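A minimal sketch in Python of the kind of 2-out-of-3 voting I mean, with an assumed agreement tolerance (nothing like how a real flight computer is written):

    AGREEMENT_TOLERANCE_DEG = 2.0  # assumed tolerance, purely illustrative

    def vote_aoa(a, b, c):
        """Return a voted AoA value, or None if no two sensors agree (system unavailable)."""
        pairs = [(a, b), (a, c), (b, c)]
        agreeing = [(x, y) for x, y in pairs if abs(x - y) <= AGREEMENT_TOLERANCE_DEG]
        if not agreeing:
            return None  # no quorum: fail safe and flag the fault to the crew
        x, y = agreeing[0]
        return (x + y) / 2.0

    print(vote_aoa(4.1, 3.8, 22.5))   # ~3.95: the outlier sensor is voted out
    print(vote_aoa(4.1, 12.7, 22.5))  # None: no two sensors agree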

If MCAS were not part of the mission-critical path, then we could ask why it is there in the first place. There must be a reason why it was introduced.

I assume it cannot be done with a simple software update if there are only 2 sensors. The system will have to be partly redesigned to fit the requirements and regulations. But of course, this will not be publicly announced. Think about the share price. They will maintain a message that this is an easy (and cheap) fix, a software update.


This question -- if it is a safety feature, why is it acceptable that it can be disabled -- comes up a lot, so I think we should recognize that, in general, there is an answer, though whether it was the right answer for MCAS goes a step beyond.

Whenever you have something that would usually improve safety, but which presents a risk if it fails, then the rational response is to ask whether it demonstrably improves safety overall. This calculus depends on how much it improves safety when working, how much harm it does when failing, and how likely it is to fail. These considerations can be modified by limitations on operations both when things are working and when they fail, as is the case of twin-engined aircraft use on trans-oceanic flights.

If, in addition, this thing can be disabled, the principle is the same, but there are more cases: what are the chances it will be disabled even though it is working, or (as in this case) not be disabled when it is failing?

Disabling MCAS does not itself put an airplane in a dangerous situation; it merely increases the risk somewhat. The part of the analysis that seems to have been flawed is the part covering the risk introduced when it fails but is left engaged.
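The trade-off can be made concrete with a toy expected-risk comparison; every number below is a placeholder chosen to show the shape of the calculation, not an estimate for MCAS:

    # Toy net-risk comparison for a protection that can itself fail (all numbers made up).
    def accident_risk(base_risk, effectiveness, failure_prob, harm_when_failed):
        working = (1 - failure_prob) * base_risk * (1 - effectiveness)  # feature doing its job
        failed = failure_prob * (base_risk + harm_when_failed)          # feature misbehaving
        return working + failed

    without_feature = 1e-6
    with_feature = accident_risk(base_risk=1e-6, effectiveness=0.9,
                                 failure_prob=1e-4, harm_when_failed=1e-2)
    print(with_feature < without_feature)  # False here: a failure-prone protection is net-negative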


MCAS isn't needed to fly the 737 MAX; you can perform a perfectly fine flight without it — the autopilot trims directly, pilots could fly via trimming themselves using the control on the stick, and there's a manual wheel for emergencies when the jackscrew motor fails (or is disabled to override MCAS).

Rather, MCAS is needed to fly the 737 MAX like other 737 type aircraft.

MCAS has the capacity to override pilot control via continued trimming; MCAS is not a safety requirement, but can trigger a critical failure; MCAS was not built to critical failure specs; pilots were not trained on MCAS; MCAS is required to maintain the type rating, because otherwise the plane would handle differently during a stall. (This is the only purpose — tipping the nose to fake handling characteristics during a stall.)

It sounds like a reckless design, made out of political rather than safety considerations.


That's a bit of a misunderstanding. MCAS is needed to satisfy the certification requirements for an aircraft under part 25. It only makes the 737 Max more like the 737NG in that it also makes it more like any other certified aircraft.

To put it another way, the alternative to MCAS was not more training for the pilots. It was the 737 Max not passing certification.


Do you have a source?

My information suggests that while all modern aircraft require trim control for stabilization, particularly near stall, the way that MCAS operates isn't required. Rather, pilots would simply have to fly slightly differently to control the 737 MAX compared to the 737 NG. However, that difference in stall prevention would require the 737 MAX to be certified fresh, as opposed to the same type as the 737 NG.

In short, that MCAS trimming is only required if you’re not going to train pilots to trim correctly on the new airframe, because you’re trying to assert type compatibility.


This is a good source: https://www.satcom.guru/2019/03/regulations-around-augmentat...

FAR 25 has a number of specific requirements around pitch stability, specifically, FAR 25.255(b)(1) and FAR 25.203(a), and some others. FAR 25.203(a) says "No abnormal nose-up pitching may occur. The longitudinal control force must be positive up to and throughout the stall"

My understanding is that on the MAX, without MCAS, once you've pitched up beyond some AoA beyond 12 degrees or so, you can let go of the yoke, and the plane will continue to pitch up further until it stalls. That does not comply with the regs, and so you have MCAS which dials in some nose down trim in this situation to counteract the aircraft's natural tendency to pitch up further.


If the chart presented in the article below [1] is fairly accurate and not merely representational, I think that the 737 MAX, without MCAS, does not actually become statically unstable in pitch at least until after the stall, but the response was still unacceptable, and that is the reason MCAS was needed for certification.

There is a complication here in that the stick forces are generated by an elevator Feel and Centering Unit, which is fed by a dedicated pitot tube and the stabilizer position.

Pitch instability after the stall was part of the rear-engined jet deep stall problem, and the reason for stick pushers [2].

[1] https://leehamnews.com/2019/02/15/bjorns-corner-pitch-stabil...

[2] https://leehamnews.com/2018/12/07/bjorns-corner-pitch-stabil... Figure 3.


I admit it's not as authoritative as I'd like but this is the best source I've found:

> MCAS is a longitudinal stability enhancement. It is not for stall prevention or to make the MAX handle like the NG; it was introduced to counteract the non-linear lift of the LEAP-1B engine nacelles and give a steady increase in stick force as AoA increases. The LEAP engines are both larger and relocated slightly up and forward from the previous NG CFM56-7 engines to accommodate their larger fan diameter. This new location and size of the nacelle cause the vortex flow off the nacelle body to produce lift at high AoA; as the nacelle is ahead of the CofG this lift causes a slight pitch-up effect (ie a reducing stick force) which could lead the pilot to further increase the back pressure on the yoke and send the aircraft closer towards the stall. This non-linear/reducing stick force is not allowable under FAR §25.173 "Static longitudinal stability". MCAS was therefore introduced to give an automatic nose down stabilizer input during steep turns with elevated load factors (high AoA) and during flaps up flight at airspeeds approaching stall.

http://www.b737.org.uk/mcas.htm

However, on Boeing's own website they give the 'makes it fly just like the NG' explanation. On balance of probabilities I think that's unlikely to be the engineering justification.

My reasoning being that I don't believe there is any formal requirement for an aircraft to exhibit the same handling behaviours to be counted on the same type rating. For example, the 757 and the 767 shared a common type rating. The 757 was a pilots' favourite precisely because it was sporty in comparison to the 767.


This is still a bit unclear to me. It's possible that it was originally intended to provide equivalent handling characteristics to the NG, but after flight testing, the magnitude of the MCAS correction increased by over 4X. It's possible that based on the predicted performance, the MAX would have met FAR 25 but not had equivalent handling to the NG, but after flight testing, MCAS took on a more important role in making the MAX certifiable at all.


The answer is FAR 25.672 (c)(2). "It must be shown that after any single failure of the stability augmentation system or any other automatic or power-operated system - ... The controllability and maneuverability requirements of this part are met within a practical operational flight envelope (for example, speed, altitude, normal acceleration, and airplane configurations) which is described in the Airplane Flight Manual"

MCAS is only required in high AoA, flaps up, manual flight. To get high AoA, you generally need some combination of slow speed, a steeply banked turn, and abrupt maneuvering. If you tell the pilots to keep the speed up, keep the bank angles down, and fly gently in the event of an MCAS failure, it meets the requirements.

From a practical standpoint, the aircraft should never see an AoA that would trip MCAS in normal flight. The only time a jetliner would typically see an AoA over ten degrees is on takeoff and landing, when the flaps are down (and MCAS is inhibited), or momentarily in turbulence.

From a risk control standpoint, you're multiplying probabilities together (of course assuming uncorrelated failures). Spitballing the failure probability chain, flights that would trigger MCAS are probably about 10^-4 to 10^-5 probability, and the lack of MCAS might cause an unrecoverable stall in perhaps 10^-1 to 10^-2 flights, so figure 10^-6 probability per flight of the airframe's poor stability leading to an accident. This is too high to be certified. If you have MCAS, but it's disabled when an AoA vane fails, figure the probability of an AoA vane failure is 10^-4, you get an accident probability of 10^-10, which is acceptable.
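Spelled out, the chain is just a product of those per-flight figures:

    # Reproducing the spitball chain above (rough guesses, not certified numbers).
    p_flight_enters_mcas_regime = 1e-5       # high AoA, flaps up, manual flight
    p_unrecoverable_stall_without_mcas = 1e-1
    p_aoa_vane_failure = 1e-4                # per flight; inhibits MCAS under the fix

    # No MCAS at all: too high to certify.
    print(p_flight_enters_mcas_regime * p_unrecoverable_stall_without_mcas)   # ~1e-06

    # MCAS fitted, but inhibited whenever an AoA vane fails: acceptable.
    print(p_aoa_vane_failure * p_flight_enters_mcas_regime
          * p_unrecoverable_stall_without_mcas)                               # ~1e-10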


I don't understand how your post squares with what happened in reality.


Boeing has delivered 376 737MAXes since May 2017. About as many were delivered in 2019 as 2017, so figure their median age is 9 months. Figure a month from delivery to going into full service. That makes for 90,578 flight days of service. Figure 6 flights per day, you get about half a million flights. We know that the AoA vane had a failure in at least 3 flights, and we should probably assume that there were another three failures on the alternate side. That gives a failure rate of 6/500,000 = 1.2*10^-5. That's about 8x better than my spitball 10^-4.

The reason two planes have gone down is that the MCAS cure is worse than the instability disease. MCAS incorrectly activates at 10^-5, and empirically this failure is 66% fatal. If you can get MCAS to only activate when it's needed, the fact that it won't always be there to protect you is not so crucial.

Think about it like this: Would it bother you if the airbag in your car only went off 99.9% of the time when you got in an accident? It shouldn't, and it would hardly make any difference in auto fatalities. On the other hand, if the airbag in your car randomly went off on 0.1% of drives you took, you'd have that happen to you once a year or so, and there's a good chance that it would eventually cause you lose control of your car and crash.
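Spelling the estimate out:

    # Spelling out the rough fleet estimate (all inputs are guesses).
    aircraft_delivered = 376
    median_service_days = 9 * 30 - 30   # ~9 months median age, minus ~1 month to enter service
    flights_per_day = 6

    fleet_flights = aircraft_delivered * median_service_days * flights_per_day
    observed_vane_failures = 6          # 3 known, plus an assumed 3 on the opposite side

    print(fleet_flights)                           # roughly half a million flights
    print(observed_vane_failures / fleet_flights)  # on the order of 1e-5 per flight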


> If MCAS were not part of the mission-critical path, then we could ask why it is there in the first place. There must be a reason why it was introduced.

It was added to meet certification requirements for handling at high angles of attack. Specifically, to counter an increased pitch up tendency due to the larger, more forward engine nacelles.

As to how mission-critical it is I don't know, but as I understand it the angles of attack where it would become active are not encountered in normal commercial flight. It's more of a system that is there to meet requirements for handling near the "edge of the envelope."


I read this thread earlier and became frustrated by the number of times people repeated the same old misunderstandings. Thanks for taking the time to try to correct them.

Having a critical system, whether mechanical, electric or hydraulic, is an unavoidable fact of aviation. For example, on the Airbus, if there is a complete failure of the fly-by-wire system, the plan is to fly the aircraft using only the trim for pitch control and the rudder for lateral control. I'm sure pilots routinely give this a go in the simulator, but I'd be very surprised if the expected outcome was much above a controlled crash. It's a case of managing the risk of a problem against the hazards.

An AoA disagree warning will probably be written up as a fault requiring written authorization to depart from a maintenance base. The pilot would be expected to be extra vigilant in avoiding flight close to the stall until the error was corrected. Perhaps settling for easier approaches, longer runways etc.

All pretty normal aviation practice.


Yes! But even if you have a 2-out-of-3 vote, you can still have two sensors fail, say due to a bird strike or something else, and MCAS would have to be disabled, which means pilots need to learn to fly the plane w/o MCAS.

The one part of the narrative that Boeing has tried to Jedi Mind Trick away is that even if the pilots knew to disable MCAS by pulling the fuse, they're still left with having to fly a plane which no longer flies like the previous generation that they were trained on -- which is what MCAS was designed to do. So you still have a safety issue.


In normal flight, MCAS is not active. The plane handles pretty much like any other 737. MCAS becomes active at high angles of attack, near the stall, to meet handling requirements for certification.


Doesn't matter. The flight characteristics are still drastically different toward the extremes, and you never assume away the extremes. You now have to guarantee that every crew, when faced with an MCAS-disabled situation that requires piloting in a regime where MCAS would normally kick in, can successfully mimic the action of MCAS with only the manual trim wheels.

I'm not saying that's impossible, but you'd want your aircrew to have sufficient training to be a stand in for the automation.


Fully agree that Boeing should have disclosed MCAS, why it's there, and details of its operation to the folks flying the planes. If I were a pilot I would certainly want to know about such a system, even if its behavior is not normally something I have to deal with. Sort of like a stick shaker. It's there for safety near the extremes, but it's not something a pilot ever encounters during line flying.


I think "drastically different" is probably overstating the truth. It certainly doesn't jibe with anything I've read.

It's worth bearing in mind that, from what we know, neither of the two crashes had anything to do with handling difficulties in the new aircraft near the stall.


I say drastically different because no one pays the complexity tax of increased automation to deal with something that isn't important. In this case, to the certifiability of the airframe.

I understand your caution against overstating things, but I'm not trying to be hyperbolic. If you think about the consequences of doing basic maneuvers in adverse conditions (weather/load/engine power conditions), you rapidly begin to get into areas where what seems minor would get into uncharted territory without a working MCAS.

It is a different plane in terms of manual flying, the automation notwithstanding.


I get where you are coming from, and it's true that "the FARs are written in blood" but I'm not convinced that a non-linear stick force would have caused any real world issues. For a start, all of the critical phases of flight (takeoff, landing and go-around) usually involve some amount of flaps, for which MCAS is apparently unnecessary.


I'd like to point you here, https://news.ycombinator.com/item?id=19527543

This is an interview with D.P. Davies, a test pilot with the Air Registration Board of the UK. He faced many of the same arguments back in the 1950s with regard to the certifiability of the 707-300, I believe, but nevertheless rejected it for non-compliance.

In that case, even if the plane stalled itself, it would immediately pitch back down, making it a "benign" aberrant flight characteristic. However, as he successfully argued, certifying that plane as-is would set a precedent which would deteriorate the overall airworthiness of airframes over time as the goalposts were allowed to slip further and further away from what was required by law.

We call this normalization of deviance, and it's an engineering firm's favorite way to get people killed in high-profile ways.

I am far from the best example of a person who doesn't exceed the boundaries of "rules" when they keep me from getting something done, but once you have the thing done in a non-compliant way, it is absolutely imperative to bring it back into compliance somehow or ensure every stakeholder (regulator/customer/user alike) is aware of the exceptionality of the non-compliant piece, and owning it in whatever capacity is required to get the job done safely.

Creators, designers, manufacturers, etc are given a great deal of leeway when it comes to doing things; part of that is the expectation that you know the rules you are breaking, and you are making the best effort to either communicate or remedy the non-compliance.

Don't know as it will change your mind, but I hope it makes where I'm coming from clearer.


> The problem Boeing face is that with MCAS disabled when this occurs, the plane no longer flies like an older 737.

The other problem Boeing faces is that with MCAS enabled the plane no longer necessarily flies like an older 737 - it can try to force its nose down unexpectedly.


The older 737 could do that too - it's known as a stabilizer runaway, and could happen if (for example) the stabilizer trim switch failed closed in the nose-down-trim position. It's something pilots train for, and there are trim-disable switches to handle this. Presumably Boeing believed that this was sufficient for pilots to cope with MCAS too.

The problem is that MCAS doesn't run continuously - instead it runs in 10 second bursts, so it looks different from a runaway trim event that pilots had trained for. And MCAS doesn't just result in runaway trim - the failed AoA sensor is also used to correct airspeed, so this gave the pilots an unreliable airspeed indication, and it also triggered the stick shaker on the captain's side, indicating an aerodynamic stall. The combination of all these things together seems to have confused the pilots, and resulted in them not disabling electrical trim.

Edit: here's the relevant data from the Lionair flight preliminary report: http://nrg.cs.ucl.ac.uk/mjh/lionair.png


> The problem is that MCAS doesn't run continuously - instead it runs in 10 second bursts, so it looks different from a runaway trim event that pilots had trained for

What's even worse is that a runaway trim could be disabled by pulling hard on the yoke, which would disable the autopilot driving the trim wheel motors. Then the wheels could manually be brought into position.

MCAS is not disabled by pulling on the yoke. Even if you set the trim wheels to the correct angle manually, MCAS would continue to move them, in increments that do not resemble a runaway trim where you need to hold fast on the wheel.

So a sequence of actions commonly used by pilots to fix a runaway trim does not disable MCAS, making the pilots very confused about what's actually pointing the aircraft's nose down.


What's even worse is that a runaway trim could be disabled by pulling hard on the yoke, which would disable the autopilot driving the trim wheel motors. Then the wheels could manually be brought into position.

Not if the runaway is caused by a stuck trim switch --- which is why there are the cutout switches and the procedure for runaway trim involves using them. Finally, if that doesn't work, manually grasp and hold the trim wheels:

https://www.youtube.com/watch?v=cQirIH_DuAs

(...and if that doesn't work, there's a big mechanical problem which is unlikely to be recoverable, since the trim wheels are directly linked to the stabilizer.)


> Finally, if that doesn't work, manually grasp and hold the trim wheels

You might think of grabbing and holding the wheels if there is a continuous movement there looking like an automated system going haywire. But with MCAS, it's not. It appears that the trim wheels move in short bursts of seemingly random movements, which they ALWAYS do. There is simply no reason to suspect that the wheels are moving in a single direction (pointing the nose down), unless you can visually see them. Which you can't, because they're beneath and behind the pilot seat.


How come the pilots weren't aggressively keeping up with/against the MCAS? They simply gave up? Does that require some very serious effort (pushing many buttons, fiddling with the control), or do they have to simply keep "the joystick" in one direction?


Originally the captain was flying, and he was keeping up with MCAS using manual electrical trim. But near the end, he handed control to the first officer, so that he could read the quick-reference handbook and troubleshoot. And the first officer, not knowing how much trim the captain had been using, failed to keep up with MCAS. Very sad - if the captain had kept flying, they may have survived.


From what I've been reading on this, the yoke doesn't have enough control authority to override the MCAS system.


There may not be enough elevator authority in a severely out of trim configuration. However the trim switches can be used to correct that, or failing that the electric trim system can be switched off and the trim wheels can be turned manually to bring the aircraft back into trim.


Of course, the system could be turned off. That's the issue -- they didn't flip the switches to turn it off.

Had they done that, they wouldn't have crashed.


> the failed AoA sensor is also used to correct airspeed, so this gave the pilots an unreliable airspeed indication

Source? This is the first I’ve heard this claim.


From what I read it's a bit different. At high AoA the airspeed sensor doesn't work very well, so at high AoA the computer indicates that the airspeed is unreliable.

The failed AoA sensor indicated high AoA, so the computer indicated unreliable airspeed.


Just as GP said:

> this gave the pilots an unreliable airspeed indication

EDIT to clarify: Apparently this was misread as "unreliable (airspeed indication)" = "displaying an airspeed which was unreliable", rather than "(unreliable airspeed) indication" = "signal that airspeed should not be relied on".


Take a look at the graphs I linked to - they show this quite visibly. Both airspeed and altitude are taken from the pitot-static tubes, but these misread at extreme angles of attack, as the airflow isn't directly in the direction of the pitot tube. Thus the air data computer uses AoA information to correct the raw altitude and airspeed readings.

Edit: can't find the reference I was looking for, but try this thread: https://www.airliners.net/forum/viewtopic.php?t=738067#p1063...
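To illustrate the coupling, a purely schematic sketch; the real air data computer correction is nothing like this simple, and the numbers are invented:

    # Schematic only: a pitot reading adjusted by an AoA-dependent calibration term.
    def corrected_airspeed(raw_airspeed_kts, aoa_deg):
        correction_factor = 1.0 + 0.004 * aoa_deg   # invented; real ADCs use calibration tables
        return raw_airspeed_kts * correction_factor

    left_aoa, right_aoa = 22.5, 4.0     # one vane stuck high
    print(corrected_airspeed(250, left_aoa))    # the two sides now disagree,
    print(corrected_airspeed(250, right_aoa))   # hence the unreliable-airspeed indication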


Those appear to be a few percent off each other. I concede that’s “not ideal”, but I doubt the airspeed indication skew contributed to the accident.


It doesn't really matter if the airspeed is slightly incorrect. What matters is that the aircraft is telling you the airspeed is unreliable, because that distracts from the actual problem you face.

https://aviationweek.com/awincommercial/safety-official-lion...


We can see from the data that the left stick shaker is running almost continuously, while the right one is not. That is nominally saying that the airspeed is dangerously low. I assume that asymmetry is also a consequence of the faulty sensor, and I wonder if pilots are taught that this asymmetric stick-shaker response is a sign of equipment failure (and would it be clear to the pilots that this asymmetry is happening?)

Of course, if you don't know about MCAS, you won't know that it could also be messing with the trim. While trim runaway, at least when considered in isolation, seems straightforward to detect and stop, the pilots were trying to make sense of a set of not-clearly-related symptoms. I am curious as to how the pilot training and certification requirements are presented - for example, are they required to demonstrate handling a trim runaway, or to recognize and handle the various consequences of a given system component failure? It might be argued that there are vastly too many variations of the latter to be covered, but if so, that might be indicative of a broader issue, as real problems don't always present themselves like an exercise in responding to one specific symptom.


The stick shaker is driven from the angle-of-attack system, not airspeed, because that is closest to the physical reality (the wing stalls at an angle of attack, not an airspeed). In this case, the differential stick shaker may have provided an additional clue to the crew. While I'm generally critical of the crew training and performance, particularly the Ethiopian Air crew, even I can't reasonably hang missing the differential shaker indication on the crew.

Your last paragraph raises an excellent point, and one which points to the need to put fully trained, but equally importantly fully seasoned, crews into large transport aircraft in passenger service.

(some) People in aviation cried about the implementation of the 1500 hour requirement to get an ATP (Airline Transport Pilot) certificate after the Colgan Air crash in Buffalo. To me, these two accidents [and others outside of US/Western European carriers] show the wisdom in that decision, especially the most recent one. I have right on either side of 1500 hours (all but 20 in piston airplanes, about 4 hours in 737 level D sim) over a 21 year span. If I went to and passed a type rating school now, I'd hold myself barely qualified to do the walkaround when it's raining, sit right seat, run the radios, carry the captain's bag, and eat whichever meal she didn't want.

350 hours is not an appropriate time (IMO) to have a front-seat position in a transport jet flying passengers.


Ah. Thanks! I misread your previous as "unreliable (airspeed indication)" rather than as "(unreliable airspeed) indication".


God, am I blessed to be building webapps and not responsible for autopilot systems flying fellow humans 11 km above the ground.


Me too. And this is why it sort of rubs me the wrong way when people building chat apps call themselves engineers.


Well this is the most casually elitist thing I've read in a long time.

Anyone that's gotten through a 200 level programming course can probably throw together some kind of chat program that will let a few people talk.

That's lightyears from a scalable, non-buggy, chat system with modern features deployed at scale.


True, but there are still no lives on the line. And no personal liability (beyond maybe losing your job) for mistakes you make.


What defines an "engineer", the work or the risks?


This can actually vary depending on where you are in the world. There are places, like Canada, where the term engineer has a specific definition under the law, and it includes things such as professional ethics and liability.


Some mélange of responsibility, liability, accountability, and legal authority to approve things for usage.


Is gatekeeping like this necessary?


It's not so much that it's gatekeeping, but there's a big difference between writing bloated Electron applications with few, if any, constraints vs. writing Ada in an extremely constrained environment where there are memory constraints and other overhead that doesn't exist in the web development realm. If the former fails, a user gets slightly annoyed. If the latter fails, people die. One takes way more skill/training than the other. To treat the two on the same level is insincere at best.


It doesn't take substantially different training beyond recognizing that one is applying a different tool in a different environment.

It's all programming. Different patterns, stacks, and pressure to test? Yes. Fundamentally the same work loop, however. I'd advise against raising one level of endeavor over another. It seldom produces substantive or useful conversation.


Does anyone know what the typical salary of this kind of software developer is? I hope they are paid several times more than I make as a web developer.


More like a couple times less. I left aerospace partially because the pay sucks.


Does this mean that the software developers aren't actually the people "responsible" for the behavior of the system?


I don't work in that area but could imagine it's much more layered. In consumer/webapp-type software a lot of developers are more or less "full stack." In aerospace I could see it being much more rigorous and more like waterfall, so by the time the specification gets to the programmer it's already been designed, reviewed, and signed off by several other engineers and the actual programming is a rather rote task.

Or I could be totally wrong.


Certain delegated systems engineers are 'responsible' for the system and they would work for Boeing or the FAA.

In a field like Aerospace, you don't have 'software developers' per se. You have systems specialists who write software. You don't move fast and break things, you run simulations of the software over and over again while varying inputs and fixing edge cases, and then you do real world testing.


Not sure how this relates to the issue of pay, but to answer your question: it's complicated, figuratively and literally.

There is a significant amount of technical coordination required to produce a system as complex as a commercial airliner. Systems engineers, at multiple levels, are responsible for wrangling this complexity. The top-level "spec"[1] is codified and broken down by subsystem to create a set of subsystem specs. Systems and lead engineers for each subsystem take those requirements and evaluate the feasibility. If they see a problem with the requirements, they push back to the higher level with requested changes and justifications. The systems and lead engineers at that higher level then evaluate the request in the context of the other subsystems, because a spec change to Subsystem A may require changes to Subsystem B as well, so they need to communicate with the team on Subsystem B (who may have made their own change requests as well) to determine if they can deal with the necessary changes. Subsystems that are themselves composed of subsystems repeat this process for their own subsystems, creating the possibility that a request could ripple up multiple levels and down a different branch in the system architecture. At some point changes may flow back to the top level and require negotiation on the broad top-level requirements, pulling them away from "wish list" territory and towards concrete, deliverable requirements.

This entire process is known as requirements gathering and flow-down. It is this process that defines the behavior of the system, through defining the behavior of each of its subsystems in turn down to the basic modules that are just collections of off-the-shelf or custom-built parts. Any of those basic modules that are entirely or primarily software should have a software engineer as the lead, helping to define what its requirements are and defining its interface with the rest of the system. Systems engineers tasked with managing those interfaces should also have significant familiarity with software development and architecture, perhaps even having started out as software engineers[2]. Ultimately, though, the lead and systems engineers[3] are responsible for making sure this process works effectively, and they have to do so as a team because a system like this is far too complicated for one or even a few engineers to fully understand.

[1] I use this term loosely, because it's more like a wish list of parameters and features than a real spec.

[2] "Systems engineer" is not a title you can drop in to straight out of school. It requires practical, cross-discipline knowledge of the design and architecture of the various subsystems that need to be combined to form the larger system. This is essential to be able to both understand how various subsystems affect each other and to effectively communicate with the engineers working on those subsystems.

[3] At the higher levels, where you run into a "subsystem of subsystems", the lead may be a systems engineer.


They tend to be in the range of aerospace engineer salary: the high end of engineering salaries, but not approaching Silicon Valley pay. Check out O*NET for government data on salaries.


If that's true, I'm not exactly sure how to feel about it. It seems messed up.


Probably 1/4 what Google pays the people working on Gmail.


And blessed to get away with calling yourself an engineer without the rigor that these plane designers normally operate under.


> Suppose they did originally do what the fixed software does now, and disable MCAS if the AoA sensors disagree. The problem Boeing face is that with MCAS disabled when this occurs, the plane no longer flies like an older 737....

But it’s been reported that this was an option you could buy when you bought the planes. And the crashed planes didn’t have this option.

So if that’s correct, then any plane shipped with this optional package would require the recertification. But it appears they don’t either.

If they did it would show up as very suspicious and I’m surprised nobody has reported on it:

Here buy this plane without this optional package and you don’t need new training.

Or buy it with the optional package and you need to learn about these new components we’ve added that may be disabled and undergo new training.

It seems too obvious.


An AoA disagree indicator was an option. But it was never an option for this to disable MCAS, as this would have resulted in a change of the flight characteristics that would have required training.


Would a disagree indicator be an option without the second sensor? With what would it disagree?


All 737s have two AoA sensors. The left one feeds the left air data computer and the right one feeds the right air data computer. The AoA disagree indicator displays when the two disagree, but is optional and was not fitted on the planes that crashed. The old version of MCAS only uses one AoA sensor as input, even if the AoA disagree indicator was fitted. It does, however, alternate which of the two it uses with each flight.
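
For what it's worth, the cross-check in the announced fix is conceptually tiny. A minimal sketch in Python (the threshold and names here are my own illustration, not Boeing's actual logic or numbers):

    # Illustrative only -- not Boeing's actual code or parameters.
    DISAGREE_THRESHOLD_DEG = 5.5  # hypothetical threshold

    def mcas_allowed(left_aoa_deg, right_aoa_deg):
        """Inhibit MCAS if the two AoA vanes disagree significantly."""
        return abs(left_aoa_deg - right_aoa_deg) <= DISAGREE_THRESHOLD_DEG

    # The old behaviour described above: use only one vane per flight,
    # alternating between them, with no cross-check at all.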


I fail to understand how the disagree indicator is useful, considering the MCAS system will use the same AoA input regardless of disagreement. Is it useful for some other function?


If that's true (that it alternates between the sensors each flight) then it's extremely obvious that a business decision was made to only use one sensor at a time.


Oh! thank you I didn’t make that connection.


Edit: to be a bit clearer about what I’m saying:

There’s no indication that before these accidents an airline buying the optional AoA disagree package got extra training to tell them about handling MCAS. And therefore it’s not clear that if these planes had this package they would’ve been saved. The pilots would’ve had a new warning light but not known that it means they need to disable MCAS because there was no training to that effect. If anything I assume the pilots being unaware of MCAS would only think an AoA disagree light impacted autopilot, which is separate from MCAS and disabled at these times. They would still have needed to identify the trim and MCAS problem another way.

(Now it is a different matter since everyone knows it’s there.)


Yes it all comes back to the requirement from on high to not require any retraining or recertification even though they were delivering essentially a different airplane. Trying to simulate the feel of a different plane via software is adding a huge new layer of complexity and failure risk.


> If they'd done this, they'd have needed to provide additional training, and this must have concerned Boeing management that it might jeopardize the common type rating.

Yes. There were no simulators to train pilots (only 4 delivered up to now, vs. 376 planes delivered! -- by the way, the value of all MAX orders, including those still not delivered, is around 600 billion, with a b, dollars!) and if I had to guess, the simulators can't simulate the plane's behavior when MCAS is off. Because the selling point is "the MAX behaves the same as the old one." Which is just not true.


> The problem Boeing face is that with MCAS disabled when this occurs, the plane no longer flies like an older 737.

The bigger problem is that MCAS was only added to fix a major design fault, whereby the aircraft would automatically pitch up when accelerating.

So with the MCAS disabled, the aircraft then runs the risk of stalling when accelerating.

I don't understand how design engineers would ever think a software workaround would be a suitable fix for what appears to be a major aerodynamic design flaw.


All aircraft with underslung engines want to pitch up when accelerating. That's just physics. MCAS is there to counter pitch up at high angles of attack, caused by the aerodynamic characteristics of the larger, more forward engine nacelles.


It's not the pitch that is the problem, but instead the degree of pitch.

This particular aircraft had a tendency to pitch so badly they had to install a software workaround to automatically cater for the behavior.

And presumably it was so extreme they thought an automatic option was needed, out of fear the pilots on their own would not be able to handle the situation.


According to Blancolirio on YT (a wholehearted thumbs up for his journalism, e.g. the video on the Atlas Air / Prime Air crash is worth a watch; he currently flies as FO on the 777, I believe), there exists an angle of attack disagree light already in the 737 MAX options sheet. There's also an option to purchase an AoA indicator dial, and he said one of the major US carriers did buy that option on their aircraft.


Yes, that's correct. But the implications of AoA disagree are different for the older 737 and the Max with the new software installed. On the Max, this means that the plane will now fly differently at high AoAs than it did if the AoA disagree light was not lit. This is something pilots will need to be trained for.


> the plane will now fly differently at high AoAs than it did if the AoA disagree light was not lit.

... "at high indicated AoA", with 50% probability, no?


There is a huge difference between displaying angle of attack as an input to educated pilots and feeding it into a system which has the authority to override the pilot with little warning.

The problem with the 737 MAX airframe is that the pitch-up seems to be so strong that nothing other than immediate MCAS feedback can rein it in.


When politics and egos come into play, even teams of very smart software engineers can end up making silly, seemingly incompetent decisions.


Agreed. I think what this may mean going forward is that the CAAs are going to have to consider demanding that the training specifications be designed around a scenario where some (as-yet-to-be-defined) subset of the smart systems are disabled, and if the airframe behaves differently in that configuration, it demands re-training.

I'm somewhat surprised acceptance criteria weren't already there. You don't plan for only the common case when lives are on the line.


> If they did require a separate type rating, this would likely kill 737 sales

Would it, though? I'm genuinely asking because I don't know how much all this costs. Certainly certifying pilots for a new aircraft isn't free, and probably isn't cheap, but the MAX line promises significant savings in fuel cost. In the long run, would the latter outweigh the former?


It likely would; if you have a fleet full of 737s and the 737 MAX turns into a new type rating, there's a much weaker argument as why they must get the 737 MAX rather than an A320neo family aircraft, and that gives Airbus a big in-road with potential customers.

That's not to say that Airbus would get all the sales (they won't!), but they then have a much easier time to selling to former 737 customers.


So Boeing may have to reduce the price a little. C'est la vie.

However it's unlikely to cause major problems; presumably the tooling and spares for the MAX are pretty similar to those for the non-MAX 737. If you own a fleet of Boeing mid-range jets, why would you suddenly start to buy Airbus?


That depends if the certification of the tooling and spares allows them to be used interchangeably between the pre-MAX and the MAX.


Out of interest, why wouldn’t Airbus get those sales?


Airlines will all have their own long list of criteria to help decide which aircraft to buy.

Removing the common type rating will mean a few fewer points in a given airline's scoring system - it might tip the score to favour the Airbus for some, and for other airlines it might not even be a consideration.


Why would they? It's not like Airbus planes haven't had their own fatal glitches. Heck, it hasn't even been all that long since they had their own version of an MCAS style failure which was only recoverable because it happened a lot farther from the ground.


all those sales


And notably, all the build slots for the A320neo family are sold for the next few years; I believe an order today won't be delivered till 2022 (and if you have other aircraft going off-lease, especially if some other airline is taking them on, you might not be able to wait). And you won't order today, because sales take time.

If you have a delivery date prior to that for the 737 MAX, you may well just want to wait and hope everything gets rectified before your delivery.

Boeing also have an incentive to shift their product, and with their negotiating position weakened will likely offer to sell their product for a lower price than they currently would to customers who already have 737s.


>And notably, all the build slots for the A320neo family are sold for the next few years; I believe an order today won't be delivered till 2022

There's a simple solution for this: increase manufacturing capacity.

How do you do this quickly? It's simple: there's a really huge airplane manufacturing plant in Washington, USA that may be up for sale soon...


^^ I just made the exact same comment, but this puts it more clearly!


I expect it makes switching to an Airbus model more compelling - if you're going to have to retrain your crew, you might as well get the plane you want, not just the one you think will be easiest to train for.

I have no idea if the equivalent Airbus (a320neo maybe?) is any better or worse than the 737Max, but losing the common type rating would be one less vote for the 737 max.


The wings of the A320 have more ground clearance than the 737 NG's. Airbus could easily fit the larger and more efficient engines. Boeing had to move the engines slightly forward of the wings (or face an expensive complete redesign). The resulting different flight behavior would have required a new type rating (which is expensive). Instead, software (cheap!) was used to correct for the different behavior.

This is not an indication on other aspects of the A320 vs. the 737 MAX.


And remember that on the Boeing 737 Classic and NG the engine nacelle has been non-circular and the engines have run with smaller fans than other aircraft with similar engines: engine size isn't a new problem with the MAX, it's been a problem since the move to high-bypass engines.


The problem was not that the redesign was expensive - we are talking about a trillion-dollar market here - but that it was slow: a clean sheet takes about 10 years. Making an octopus by nailing four more legs to the old 737 takes 5-6 years.

I am fairly certain that if Boeing could have solved the timing issue by throwing money at it, they would have done so.


I guess for an airline that already has a mixed Boeing/Airbus fleet, or one that is only 737 and wants to completely switch to 737 MAX (where A320neo might be a decent consideration instead), this might make sense. But surely pilot and crew training is only one part of the equation. You also need new maintenance training, and you probably already have a logistics pipeline for sourcing Boeing-compatible parts that would need to be significantly changed, etc.


I would suspect that training a 737 pilot on differences in the new MAX would still be far less involved than training them to understand an Airbus, which is a totally different philosophy in control systems and piloting.


Disclaimer: I'm not a pilot, I am just a plane enthusiast. Type training for the 737 starts at roughly $13k and a 767 rating starts at $18k, so I guess a 737 MAX rating, having more systems to go through than the 737, should cost somewhere in between if it were a separate type. Currently the 737 MAX needs just type-differences training if the pilot already has a 737 rating, so it costs somewhere in the ballpark of $2k. All of those are prices available to pilots, i.e. if a pilot wants to get the rating and pay for it himself. I assume that airlines would get some kind of discount.


>Certainly certifying pilots for a new aircraft isn't free, and probably isn't cheap, but the MAX line promises significant savings in fuel cost. In the long run, would the latter outweigh the former?

The savings in fuel cost isn't everything, because you're missing the fact that Boeing was competing directly against the Airbus A320neo, which likely offered the same fuel economy but without the extra training costs. Why bother buying a Boeing when you can get the Airbus for less?

It seems pretty obvious to me that Boeing has been milking the 737 airframe for far too long, instead of proactively developing a replacement when they had time. Instead, they just stuck with what they had, and then when Airbus came up with their plane, Boeing tried to come up with something competitive on short notice using old junk they already had. They should have developed a successor to the 737 many years ago and moved to that, spreading the development and certification costs out, but they were short-sighted and cheap, and now it's going to cost them very dearly.


Switching from Boeing to Airbus would be more than just pilot and crew retraining, though, right? Maintenance and part sourcing would be significantly different as well. Obviously there are differences between 737 and 737 MAX there as well, but I can't imagine they'd be as significant.

In addition, since the 737 and 737 MAX are very similar from a piloting standpoint, I would expect that getting type certified on a MAX when you already are certified on a vanilla 737 would be much easier and cheaper than type certifying on a completely different plane from another manufacturer.


The engineers were busy with the 777 and 787.


I read recently that the 737 MAX was specifically designed (and rushed out) as a response to American Airlines ordering Airbus. Boeing made the MAX a bit more efficient, and kept the type rating the same as selling points.


I don't think it's just the personnel retraining. My understanding is that this decision helped keep the aircraft from being a "major" configuration change that would necessitate a more lengthy FAA certification process. Supposedly, a major air carrier gave Boeing a deadline which may have influenced design decisions to avoid that longer certification.

Of course, the FAA may be partly culpable for delegating some of their oversight decisions to Boeing.


I would guess at the end of the day it wouldn't end up being a totally separate type rating. It would be additional training on the differences. But I think Boeing wanted to avoid the requirement even for that.


This is pretty much it in a nutshell as far as I can tell. If the sensors don't agree, and MCAS switches off, then the pilots have to be ready to deal with the plane trying to pitch up and stall on their own.

When would that happen? Take off and go-arounds.

The pilot is coming in for a landing, something goes wrong (too much crosswind, a plane on the taxiway, etc.), and what they do is pull back on the stick and push the throttles up to max to get into a climb. If MCAS is disabled and the pilot hasn't trained to fly the plane without it, there is a risk it will pitch up and stall onto its tail. Not a good place to be.


"Boeing's software fix, announced today, is to compare readings from both angle-of-attack sensors and disable MCAS if they disagree significantly. The obvious question is why they didn't do this in the first place?"

Because you had to pay for the second sensor and the disagree light.

https://www.nytimes.com/2019/03/21/business/boeing-safety-fe...


No, the second sensor is on every plane. The option was just a light that wasn't hooked up to anything else. The light's a red herring here.


The second sensor is on every plane, true, but the MCAS doesn't use it. The light just tells the pilot when the two sensors disagree.

The criminally negligent thing is having MCAS only use input from a single sensor, especially when there's a second sensor already there.


> this would likely kill 737 sales, regardless of whether the plane is now safe.

I suspect a 737 Max is now as saleable as a Samsung Note 7 phone.


If it's indeed solvable by software then it'll still sell fine, but this is one of the articles I have read where the claim is made that it is a fundamental design issue and cannot safely be fixed. I know nothing about aviation but I do spend a lot of time in planes and I sure as hell won't be boarding a MAX until the fix has been flying around without accident for a few years (I try to fly Airbus only anyway but that's not for safety reasons).


Perhaps it'll sell, but were I a buyer I'd be asking what else that Boeing might have skimped on and FAA rubber stamped.

We know the plane was built in a rush, and the single-sensor MCAS is an odd choice. It's unlikely MCAS is the only problem, although hopefully it's the only dangerous one.

Don't forget the perception of your ticket-buying customers.


I might be the only idiot but I don’t plan on flying Boeing 737 Max, no matter what kind of software fixes they put in place. Irrational? Maybe.


> what else that Boeing might have skimped on and FAA rubber stamped

Are we pretending this doesn't happen with other manufacturers too?

I'd rather fly on a plane that's been thoroughly revetted than one which has only been through a flawed process once.


Yes, but how many years will the re-vetting take? And would such an audit even be considered? We are probably talking a 2-3 year time frame with no 737 MAX if they do a full audit.


With manufacturers that aren't US manufacturers the size of Boeing? Personally, I would doubt that.

Is there any example where that happened?


A year from now, only a small fraction of the flying public will remember the 737MAX problem, and even fewer will consider it when they are booking flights.


IIRC the two planes that crashed only had a single AOA sensor (the 2nd redundant one being only present in a premium add-on that those airlines didn't purchase), so this software fix would have not changed anything.

EDIT: alright thanks for the replies.


From what I've gathered, all MAX planes (including the ones that crashed) have two AoA sensors, but MCAS just uses one. As far as I know there was never an option to have a redundant configuration. The premium add-on was the indication that they disagree.


No, all 737s have two AoA sensors. The one on the captain's side feeds the flight control computer on that side, and the one on the first officer's side feeds the flight control computer on that side. At any one time, one computer is in charge. They can compare data, but Boeing decided not to for MCAS.


All 737 MAX have two sensors. The Lion Air flight had sensors disagreeing by 20° right from the start. This made the stick shaker go off on one side right after departure (throughout the entire flight) and led to a multitude of other alarms (altitude disagree, unreliable airspeed).


> the 2nd redundant one

No, it was never redundant. In the "premium variant" only a "they don't match" signal would be displayed/sounded, but still only one sensor was used, and the pilots would have had to turn MCAS off and fly a plane which behaves differently than the one for which they are trained.


I was under the impression the "base model" only came with a single AoA sensor. Adding a second sensor and the warning light if they disagreed was an expensive upgrade that neither of the planes that crashed were equipped with.


Are the angle-of-attack sensors so unreliable to have caused two crashes?


This reminds me of The Slow Winter by James Mickens [0]

> "John was terrified by the collapse of the parallelism bubble, and he quickly discarded his plans for a 743-core processor that was dubbed The Hydra of Destiny and whose abstract Platonic ideal was briefly the third-best chess player in Gary, Indiana. Clutching a bottle of whiskey in one hand and a shotgun in the other, John scoured the research literature for ideas that might save his dreams of infinite scaling. He discovered several papers that described software-assisted hardware recovery. The basic idea was simple: if hardware suffers more transient failures as it gets smaller, why not allow software to detect erroneous computations and re-execute them? This idea seemed promising until John realized THAT IT WAS THE WORST IDEA EVER. Modern software barely works when the hardware is correct, so relying on software to correct hardware errors is like asking Godzilla to prevent Mega-Godzilla from terrorizing Japan. THIS DOES NOT LEAD TO RISING PROPERTY VALUES IN TOKYO. It’s better to stop scaling your transistors and avoid playing with monsters in the first place, instead of devising an elaborate series of monster checks-and-balances and then hoping that the monsters don’t do what monsters are always going to do because if they didn’t do those things, they’d be called dandelions or puppy hugs."

0: http://scholar.harvard.edu/files/mickens/files/theslowwinter...


Software ECC is already a thing. There's a lot of similar theoretical work on equivalent ideas for computation. Just remember that the idea is never about getting to zero. It's more like getting from one failure in 1e12 hours to one failure in 1e18 hours.
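
As a toy calculation of what that buys you, assuming (unrealistically) that the software check's misses are independent of the hardware faults, and with a made-up miss probability:

    # Toy arithmetic only; the miss probability is a placeholder.
    hw_failure_rate_per_hour = 1e-12   # one failure in 1e12 hours
    check_miss_probability = 1e-6      # chance the software check misses it

    uncaught_rate = hw_failure_rate_per_hour * check_miss_probability
    print(uncaught_rate)               # ~1e-18: far fewer failures, never zero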


this is awesome and probably needs a thread of its own


There have been a few, but it looks like none of them really sparked a lot of discussion:

https://hn.algolia.com/?query=Slow%20Winter&sort=byPopularit...

---

I've submitted it again: https://news.ycombinator.com/item?id=19514259


This is wonderful.


There is a debate to be had, but this is a naked propaganda piece. The crux of the article is based on:

“Among Boeing’s critics is Gregory Travis, a veteran software engineer and experienced, instrument-rated pilot who has flown aircraft simulators as large as the Boeing 757.”

... someone who uses flight simulators. This is not credible journalism.


Exactly... it's not clear if the author of the article is even an engineer, and he's speaking authoritatively on something. A good engineer doesn't even speak authoritatively outside their specific area of expertise.

And the author's source is someone who is not an aeronautical engineer and has never flown any airliners apparently.

As a software engineer, I would never consider myself qualified to declare an airframe good or bad.


> ... someone who uses flight simulators. This is not credible journalism.

When talking about "instrument-rated", this most likely means a real rating certification. As for the flight simulators, best class "Full Flight Simulators" actually allow for zero "real" flight time for type rating transfers, as well as being actually used for the required regular training of airline pilots (per https://en.wikipedia.org/wiki/Full_flight_simulator)


When talking about "instrument-rated", this most likely means a real rating certification.

It is a real thing, but he's not type rated on a 737 and it shows. He and eetimes get a number of things plainly wrong. The article was mostly content-free clickbait and I'd encourage you to simply flag it.


Hatchet. Job.

An article written by a non-pilot about an article by a GA pilot with no experience as an ATP.

The elephant in the room is not the type-rating issue so much as the speculative cause of the crashes: if your aircraft is out-of-trim and control pressure cannot restore it, you have a runaway trim condition and you need to disable the electric trim system immediately.

If, in these cases, it turns out that the AOA sensor was faulty, that is only one of many possible causes for a runaway trim condition.

The core problem is not the specific cause, but the failure of pilots to respond appropriately to a common and easily-remedied situation.


We are armchair-engineering the issue on HN; can there be "credible" journalism on this?

I feel for the engineers involved, I assume they all had good intentions and did their best. This seems like one of those things we’d read about in 2030 where some clever engineering and software allowed Boeing to fend off the rivals for a fraction of the cost, or likewise it took them to the brink of survival. As startup and entrepreneur people, we live this stuff daily with smaller stakes, I can’t help but sort of admire their attempt.


The amount of armchair engineering going on here on HN about this particular issue has been disappointing, but educational. I've seen a couple of very good technical write-ups about what's really going on with MCAS, and by my guesstimate probably 90% or more of what is said on HN about this issue is factually incorrect right from the start.


Any links handy to help us learn more about what's really going on with MCAS?


I was thinking 'faulty airframe' was a bit over the top. OK the stall characteristics seem not ideal but I'm sure a lot of planes have aerodynamics that are not ideal in places. I'd be happy enough flying in one if they turn off the MCAS gizmo.


The article reeked of anything but credible journalism as soon as it opened with "The saga of Boeing’s 737 MAX serves as a case study in engineering incompetence, and in engineering ethics – or the lack thereof."

By this point's it's obvious to everyone that the engineering of the plane is pretty far down the line of causes which lead to this.

There was a Twitter thread[1] a few weeks ago which explained it very clearly:

Some people are calling the 737MAX tragedies a #software failure. Here's my response: It's not a software problem. It was an

* Economic problem that the 737 engines used too much fuel, so they decided to install more efficient engines with bigger fans and make the 737MAX.

This led to an

* Airframe problem. They wanted to use the 737 airframe for economic reasons, but needed more ground clearance with bigger engines.The 737 design can't be practically modified to have taller main landing gear. The solution was to mount them higher & more forward.

This led to an

* Aerodynamic problem. The airframe with the engines mounted differently did not have adequately stable handling at high AoA to be certifiable. Boeing decided to create the MCAS system to electronically correct for the aircraft's handling deficiencies. During the course of developing the MCAS, there was a

* Systems engineering problem. Boeing wanted the simplest possible fix that fit their existing systems architecture, so that it required minimal engineering rework, and minimal new training for pilots and maintenance crews.

The easiest way to do this was to add some features to the existing Elevator Feel Shift system. Like the #EFS system, the #MCAS relies on non-redundant sensors to decide how much trim to add. Unlike the EFS system, MCAS can make huge nose down trim changes.

On both ill-fated flights, there was a:

* Sensor problem. The AoA vane on the 737MAX appears to not be very reliable and gave wildly wrong readings. On #LionAir, this was compounded by a

* Maintenance practices problem. The previous crew had experienced the same problem and didn't record the problem in the maintenance logbook. This was compounded by a:

* Pilot training problem. On LionAir, pilots were never even told about the MCAS, and by the time of the Ethiopian flight, there was an emergency AD issued, but no one had done sim training on this failure. This was compounded by an:

* Economic problem. Boeing sells an option package that includes an extra AoA vane, and an AoA disagree light, which lets pilots know that this problem was happening. Both 737MAXes that crashed were delivered without this option. No 737MAX with this option has ever crashed.

All of this was compounded by a:

* Pilot expertise problem. If the pilots had correctly and quickly identified the problem and run the stab trim runaway checklist, they would not have crashed.

Nowhere in here is there a software problem. The computers & software performed their jobs according to spec without error. The specification was just shitty. Now the quickest way for Boeing to solve this mess is to call up the software guys to come up with another band-aid.

I'm a software engineer, and we're sometimes called on to fix the deficiencies of mechanical or aero or electrical engineering, because the metal has already been cut or the molds have already been made or the chip has already been fabbed, and so that problem can't be solved.

But the software can always be pushed to the update server or reflashed. When the software band-aid comes off in a 500mph wind, it's tempting to just blame the band-aid.

[1] https://threadreaderapp.com/thread/1106934362531155974.html


This is an excellent analysis, thanks! Wish I could upvote more than once.


Succinctly put. Ship it!


I am not sure it even uses the term airframe correctly. And it is not faulty. Probably a case of "we haven't had a Boeing-bashing hot take in 48h, find someone that speaks vaguely pilot lingo and interview him."


Even better, he's a software engineer, or in other words exactly the kind of person who likes to armchair quarterback real engineers.


I've read a variety of articles on this and they often said somewhat different things. What I've been able to gather about the timeline of events is:

1. The new engines on the MAX shifted the center of gravity forward (and I assume center of lift stayed the same).

2. Boeing was worried that #1 would cause the plane to nose up during high angles of attack (so, take off and landing?), and added software, MCAS, to pitch up to counteract this.

3. There's some confusion over when this software kicks in and how to cancel it (something about the trim controls not cancelling MCAS?)

4. Regardless of #3, this software seems to have confused pilots and the current belief is that MCAS was active when pilots didn't want it active.

5. ????

6. Planes crash.

Also, I've read about some concerns about the fact that the handling behavior changed so much but the plane wasn't reclassified as a different type. I'm still unclear about how classifications plays into this story.

My core point of confusion is, if MCAS is the culprit why isn't the solution to remove MCAS? Is tendency to pitch during high angles of attack unusual, and something pilots cannot be expected to counteract manually? I've only played sims like DCS and X-Plane (and not very much at that) but "nose goes up when I don't want it to, so I push stick forward" doesn't seem too complicated to me. Of course, I'm no pilot so I'm probably drastically oversimplifying the situation.


Your point #1 is incorrect. The problem with the larger engines is that their larger nacelles placed further forward produce extra lift at high angles of attack. This lift is further forward than the centre of mass.

The certification requirement is that to produce steadily increasing angles of attack, you need to steadily increase back pressure on the yoke. The problem with the Max is that this is no longer true. Past a certain angle of attack, the back pressure needed to further increase angle of attack reduces. The plane is not actually unstable, but it's closer to being so than the certification requirements allow. And it's certainly behaviour that Boeing couldn't claim was similar enough to older 737s to allow a common type rating.

Hence MCAS, which was supposed to detect this condition and make the aircraft fly like an older 737. This allowed a common type rating, and allowed the aircraft to be certified. But fundamentally, the airframe has an undesirable property, and you'd never have designed it this way unless the desire for a common type rating dominated other design decisions.


It is incredible: the airframe has been designed to reuse the certification of the 737. A flaw that could have been fixed in a proper way has instead been worked around using unreliable subterfuges.

The aim of the certification process is to ensure the safety (and reliability) of aircraft. The required mindset should be that the certification process helps to highlight defects in order to build a better aircraft. Here, the mindset was that the certification process is a burden with arbitrary constraints that have to be fulfilled even if this means a worse aircraft.

IMHO, the people (managers) with the wrong mindset should be replaced and the faulty airframe of the 737 should be killed.


This is unfair. If done appropriately, making a plane behave the way pilots expect should also make it safer. A "worse aircraft" with a better UI can actually be a safer aircraft.


Only so far as we can assume the UI never breaks and the sensors are always correct. The issue is that pilots need to be trained to understand where the UI hides the underlying performance of the plane, because when it breaks things can go wrong incredibly quickly. MCAS altered the way the aircraft reacted and would also restart its changes unless completely cut out. Pilots didn't receive enough training to recognize the issue as the MCAS system adjusting the trim fast enough to prevent the crashes during takeoff, and ultimately the whole point of MCAS was to avoid having to retrain and to maintain the common type rating.


Thank you; I've been following this since the second crash and this is the first time I've seen an explanation for why moving the engines forward made the plane unstable at high angles of attack.


So, I must be missing something but all that re-engineering was to put a larger engine on the airframe but still have ground clearance, while on the ground, correct? What was stopping them from increasing clearance by increasing landing gear height? (And thus not impacting flight characteristics)


They did increase the length of the nose landing gear, which helped a bit. But increasing the length of the main landing gear would have required moving the gear further outboard, as there was no space to extend the gear in an inboard direction when it's stowed. You can't move the gear a lot further out, or it ends up behind the engines. See this picture someone else on this thread posted: https://i.stack.imgur.com/GFzcj.jpg And you really don't want to move the engines further out, because that's going to affect the engine-out behaviour, and require a larger vertical stabilizer to handle asymmetric thrust.


I think that image shows plenty of room. Remember that the tires don't count, because those would be farther away from the wing.

There are other height issues with ground support equipment for luggage, fuel, passengers, maid service, sewage, etc.


> Boeing was worried that #1 would cause the plane to nose up during high angles of attack

Your #2 is flawed. Shifting the center of gravity forward would cause the plane to nose down relative to the older model. The problem is that the engines are more powerful and mounted higher and further forward, shifting the point of thrust to a location where the plane rotates further (nose 'up'); that is why MCAS attempts to push the nose down by trimming the tailplane if it senses that the plane is at too high an angle of attack.


From what I read, the purpose of MCAS is to make the stick give the same feedback as on the 737 NG, i.e. a certain linear increase in force would correspond to a certain amount of deflection on the elevator. With the MAX this was no longer true due to airframe geometry and engine changes, and at certain high angles the same amount of force applied to the stick would result in more deflection. All this had to be maintained in order not to have to re-certify all the pilots coming from the NG, and to be allowed to give only minimal training, I think.


> From what I read purpose of MCAS is to make stick give same feedback as on 737 NG

No, the MCAS system is not in any way responsible for feedback to the control stick. It engages the stabilizer (tailplane) trim motors based on AOA sensor readings, the state of the autopilot, and whether the flaps are engaged or not. It has a limit of 2.5 degrees of change per activation, but this limit is reset every time the system resets, in effect allowing it to keep deflecting the tailplane until the mechanical stops kick in.
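
To make the reset behaviour concrete, here is a toy sketch; only the 2.5-degrees-per-activation figure comes from the description above, everything else is illustrative:

    # Illustrative only: why a per-activation limit doesn't bound total trim
    # if the budget resets between activations.
    PER_ACTIVATION_LIMIT_DEG = 2.5

    def total_nose_down(n_activations, pilot_counter_deg_per_cycle=0.0):
        trim = 0.0
        for _ in range(n_activations):
            trim += PER_ACTIVATION_LIMIT_DEG      # limit resets each cycle...
            trim -= pilot_counter_deg_per_cycle   # ...unless the pilot trims it back
        return trim

    print(total_nose_down(4))        # 10.0 degrees -- far past the 2.5-degree "limit"
    print(total_nose_down(4, 2.5))   # 0.0 -- only if the pilot fully counters every cycle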


Same thing, surely? Altering the trim means that pulling a certain amount on the stick doesn't make the plane pitch up as much as it would otherwise, hence you need to pull more on the stick (than a plane without MCAS). So the (intended) end effect is the same, right?


Yes. Stick force curve certification compliance is a primary reason for MCAS to exist.


The bit you put between brackets '(intended)' is the key here. What they intended was one thing, what they got was an entirely different thing. The stabilizer being as large as it is and the degree of authority afforded the MCAS system could cause the tail plane to deflect to a degree that the stick could not overcome it at all, the elevators are simply not large enough to counteract a tailplane that is trimmed purposefully the wrong way to the maximum deflection, in part because trimming the tailplane wrong also limits elevator effectiveness.


No, because what changes is how far you move the yoke, not necessarily how hard you have to pull (on the penultimate Lion Air flight the elevator feel system was non-op, which meant that the yokes were probably extremely light for the whole flight). There will also come a point where you cannot overcome the stabilizer with the elevator.


We don't even know the real problem; how can we speculate like this? Every article seems to cite "faulty sensor data", but is this really true? Would making the sensor more reliable or having an additional backup sensor have prevented the issue? Is it possible for the software to even detect when the sensor is faulty? Maybe the sensor worked fine, and MCAS worked fine, but the plane itself has faulty aerodynamics.

At this point, it seems clear that MCAS had something to do with the crashes, but it's far too early to point to the root cause.


> Would making the sensor more reliable or having an additional backup sensor have prevented the issue? Is it possible for the software to even detect when the sensor is faulty? Maybe the sensor worked fine, and the MCAS worked fine, but the plane itself has faulty aerodynamics.

The sensor on the Lion Air flight was 20 degrees off. While it wouldn't have "prevented the issue", a simple disagree light in the cockpit would have told the pilots that the AOA sensor was "wrong" while they were still on the runway. Simply put, the plane was unfit to fly and had this light been installed on that plane, the flight would have never left the ground and the crash wouldn't have happened.

Boeing charged extra for this light.

Since this has come to light, Boeing have announced that the disagree light will now come as standard on all 737MAX's at no additional cost.


There is no AOA on the ground. It requires airflow. You would not have seen an AOA disagree until the aircraft was in the air or at least moving at a pretty good speed down the runway.


I read that in another heavily upvoted comment on here in an earlier discussion a week or so ago. I really shouldn't assume that randoms on the internet know what they're talking about.

Anyway, thanks for the correction.


To supplement the other comments: http://nrg.cs.ucl.ac.uk/mjh/lionair.png


This.

The engines are further away from the plane's fulcrum, and this means that they can now cause the plane to tilt up more efficiently.


Yes both points 1 and 2 seem the wrong way round. Presumably the centre of lift has moved relatively further forward than the centre of gravity to create a greater pitch up moment (which apparently exists on older models).


MCAS wasn’t introduced to simplify pilots’ life. It was to make the plane way cheaper overall by avoiding the plane being a different one according to regulations. Regulations that were there to make planes safer to begin with by making sure differently enough planes warrant training, processes, reviews, etc.

Sadly, MCAS introduced unintended side effects and is not as transparent as it was supposed to be, from what it looks like.

Therefore, if you remove MCAS, the plane should be easier to predict; but who knows what that implies now that the plane has been sold and flying for a lot of time (financially, legally...).


> if you remove MCAS, the plane should be easier to predict

And the prediction is: the plane would then not pass the "is it safe?" tests, which don't exist "just so" but to make sure that the plane doesn't crash as soon as something small and insignificant happens.

Without MCAS, pilot input could more easily get the plane into trouble at high angles of attack. MCAS was added to avoid that. So MCAS should work, and should not have a single point of failure, as long as the 737 MAX flies. The MAX is indeed less stable at high angles of attack.

All of Boeing's claims that "they can simply turn it off" are misleading. The plane is actually more dangerous without a properly functioning MCAS. But to function properly, the sensors have to be redundant and reliable, and so does the control that activates it. And the control software which deactivates it should also be smart enough that the "activate/deactivate" cycle isn't itself the crash cause, which is what it looks like directly caused the last two crashes:

- the sensor used was only one, i.e. was not redundant.

- the control software blindly trusted it.

- the software, apart from blindly trusting the input of the sensor, was not even properly tested in the condition of the faulty sensor, specifically, the impact on the pilots.

- the whole MCAS feature was intentionally downplayed, and even hidden initially, to hide the fact that the pilots would have to train for the conditions in which MCAS fails (which can happen, as it ultimately depends on mechanical and computer devices) -- the pilots have to be trained to "save" the plane without MCAS functioning, even if the plane is less stable. Note: MCAS should function properly as much as possible to minimize the chance of any small error causing a big crash, but in the case when MCAS is not functioning the pilot still has a chance to save the plane only if he is trained in a simulator which simulates the plane with no MCAS (when the controls don't behave "normally", which is what MCAS provides when working). That kind of training was not done at all, because the selling point of the MAX was "it's the same old plane, no need to pay for pilot training." (Or maybe because then the pilots would know that the chance to save the plane was so reduced that they would complain more about the overall plane design?)

- the decision to "just let pilot turn it off" was based on the false assumption that the faulty MCAS behavior is "obvious" to the pilots so that they turn it off easily.

None of that was properly implemented in the Boeing 737 MAX.

By the way, the sensor looks like this:

https://aviation.stackexchange.com/questions/2317/how-does-a...

And the need for the redundancy was commonly known even before the crash:

"The AOA values are never averaged. Depending on the AOA computer, logic is used to determine if one of the vanes is giving bad data"


On further investigation, it seems #3 is more: MCAS had much more control authority (i.e. was allowed to change control inputs from the pilots by more) than the FAA had approved.

#5 is: a fault in a single, non-redundant sensor could cause the MCAS system to falsely believe the plane was stalling, and push the nose down as hard as it could. Which (see above) was harder than a pilot could override by just pulling in the other direction, as is apparently intuitive for them.


I've always been confused on this: does the 737 MAX have a high chance of faulty readings from its AoA sensors? Or is that pretty much the industry standard right now? This has always seemed like the actual culprit.


Sensors break. They get gunk on them. They ice over.

Generally a sensor with such a critical failure mode would be triple-redundant - if one fails, the discrepancy between sensors is flagged and the aircraft runs on the other two until the broken one is fixed. In this case, the aircraft had two sensors of the type (Angle-of-Attack), but MCAS was only listening to one of them.
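
A common pattern with three sensors is mid-value selection plus a disagree flag. A rough sketch, not drawn from any particular avionics standard, with a made-up threshold:

    # Rough sketch of triple-redundant voting; the threshold is made up.
    DISAGREE_DEG = 5.0

    def select_aoa(a, b, c):
        """Fly on the median reading; flag any vane that strays too far from it."""
        readings = sorted([("A", a), ("B", b), ("C", c)], key=lambda kv: kv[1])
        _, median = readings[1]
        suspect = [name for name, value in readings
                   if abs(value - median) > DISAGREE_DEG]
        return median, suspect

    print(select_aoa(4.1, 3.9, 24.0))  # (4.1, ['C']) -- use 4.1, flag vane C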


How does this AoA sensor work? I'm guessing it has to sense the direction of the airflow and then calculate the angle of the wings relative to it?

I've seen pictures of it ( https://aviation.stackexchange.com/questions/2317/how-does-a... ) and it seems to be a simple mechanical component.

Why isn't this cross-checked against something that estimates the AoA from other factors? A few simple gravity-based sensors would be able to tell the attitude of the plane, and simply assuming that the wind (airflow) is parallel to the ground would go a long way. Or is the vertical component of the local airflow that variable?


Why isn't this compounded with something that works by estimating the AoA from other factors?

Cost, probably. The 777 and 787 still only use two alpha vanes, but they calculate a synthetic angle-of-attack value as well.


When in flight, accelerometers only tell you what forces are being exerted by the wings and the engines, not which way is down. Consider, for example, the way the drinks in a cup don't spill when a plane turns.


I know that an airplane can make a roll that keeps the fluid in the cup, but when typical commercial planes fly, they don't do stunts, so usually there's no centrifugal force to mimic gravity. When a big jet points its nose down or up, people and fluids feel it pretty much as if they were on the ground on a slanted flat surface as one of its edges is being raised or lowered.

Sure, this would probably worth next to nothing in turbulence, but in a simple take off and landing (where MCAS is already active and depends on AoA) it might help.

And of course I might be completely ignorant of most of the relevant problems with using any kind of gyroscopic or acceleration based sensor.


I think general relativity rules out the possibility of making a gravity detector that can distinguish it from acceleration.


No, but cost probably does. There have been a number of satellites flown (GOCE, GRACE, SLATS) with the sort of equipment you'd need. With that equipment, simply measure the strength of gravity in multiple parts of the plane. This gets you altitude, just as GPS or air pressure would, and then you can determine the angle of the plane.

An edit in response to the follow-up mentioning Einstein, due to the HN throttle:

Yes, yes... and it doesn't matter for this purpose, because you can measure gravity at multiple points within the aircraft and because gravity falls off with distance.

https://en.wikipedia.org/wiki/Inverse-square_law#Gravitation

We have built equipment sensitive enough to measure this difference and we have flown it in satellites.


"Einstein’s ground-breaking realization (which he called “the happiest thought of my life”) was that gravity is in reality not a force at all, but is indistinguishable from, and in fact the same thing as, acceleration, an idea he called the “principle of equivalence”.

https://www.physicsoftheuniverse.com/topics_relativity_gravi...


Thanks for the elaboration. Could you help me further understand one more thing? When you say MCAS only listens to one, does that mean during the time when one AoA sensor fails? Or it always listens to one during normal operation?

Also, it does seem like Boeing dropped the ball by not building in further redundancy here.


MCAS alternates between the left and right sensor each time the plane lands. With the Lion Air flight I think cycling the electrical power for diagnostic work caused MCAS to pick up the faulty sensor for two flights in a row.


> MCAS alternates between the left and right sensor each time the plane lands

This sounds like spectacularly bad design that manages to extract negative value from having two sensors. What is the logic behind this?


Pilots and first officers tend to switch who has flight control on each leg of their flights (which is the Pilot Flying vs Pilot Assisting), and the MCAS system uses the AoA vane associated with the side of the cockpit that currently has flight control.

There is no good reason for only listening to one sensor.

There is a sort of good reason for having a split between pilot/copilot side: the instruments are redundant (both physically and electrically), so in the event of malfunction you can failover to the other side.


That actually sounds awful; sorry for my naivety if this is just the industry standard. But for such a mission-critical piece to have no redundancy built on top of it is just poor. Especially since it's prone to failure, being situated on the outside of the plane.

It just seems that this is some terrible engineering on Boeing's end, from not fully understanding the critical situation here.

Generally two failures: 1. a lack of redundancy in a mission critical sensor 2. a blind trust on MCAS's priority over pilots


a lack of redundancy in a mission critical sensor

There is redundancy in the sensors, but the sensors are not being used in a redundant manner. There are whispers that the 767 fuel tanker (KC-46/KC-767) has a system similar to MCAS that will look at both alpha vanes for disagreement, which is a bit damning to say the least.

a blind trust on MCAS's priority over pilots

The entire purpose of MCAS is to engage only when the pilot is flying to prevent the pilot from doing something dangerous. Previous generations of 737 had the same problem but the MAX is more delicate and compounds it with nacelles that generate lift.


Part of the problem was that MCAS was originally designed with very little control authority, and so wasn't considered safety-critical. However, during testing they realized they needed to up the gain, and did a pretty major retuning without reexamining their safety assumptions.

Plus the bug with the resets on the limiter.


I believe the MCAS uses the pilot-side sensor.


Boeing dropped the ball in many, many ways.


The lack of redundancy by default, coupled with extreme profit-seeking by Boeing for the 'upgrade', is inexcusable. TL;DR: they shit out a faulty product and hundreds died as a result.


> Or is that pretty much the industry standard right now

Half the industry uses three or four AoA sensors with majority voting.

The other half ( Boeing ) uses two.


Even redundancy doesn't really solve the issue. The sensors are out in the same environmental conditions, so they will likely all fail at once -- for example, a bad pattern of water followed by cold causing icing.

Instead, the sensors should detect failure, for example by using a motor to detect if the vane is stuck and cannot turn freely.

The flight controls should also be able to fly even if all sensors of a certain type have failed. Angle of attack, for example, can be approximated with an accelerometer and gyro well enough to keep the plane in the air.


>Angle of attack for example can be approximated with an accelerometer and gyro

I'm prepared to be wrong but that sounds impossible to me. A steady-state descending stall is inertially indistinguishable from cruise.


You're right - you need to combine with altimeter or GPS, and for more accuracy you can also combine with wind direction forecasts or airspeed measurements.

The point is, there are lots of data sources, and with even a subset of them, it's possible to fly the plane to a safe landing.
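
For what it's worth, the crudest version of that estimate is just geometry: angle of attack is roughly pitch attitude minus flight-path angle. A toy sketch under big simplifying assumptions (still air, no sideslip, wings level):

    import math

    # Back-of-the-envelope AoA estimate; real systems blend many more sources.
    def estimate_aoa_deg(pitch_deg, vertical_speed_ms, true_airspeed_ms):
        flight_path_angle = math.degrees(
            math.asin(vertical_speed_ms / true_airspeed_ms))
        return pitch_deg - flight_path_angle

    # e.g. 10 deg nose up, climbing at 5 m/s with 80 m/s true airspeed:
    print(round(estimate_aoa_deg(10.0, 5.0, 80.0), 1))  # ~6.4 deg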


But then the motor could break, and so it goes. The more moving parts around you have, the more failure modes you have.


If the motor breaks, the sensor has failed.

Failure isn't really the problem - it's silent failure which is deadly.


It's tangential, but when I read those comments saying that a system could effectively override a pilot, by putting feedback beyond human strength on a critical control, I always think how enormous kudos is owed to the experienced badass that flew the plane before, who during a critical situation figured out he knew better and turned it off.


The thing about the 737 is that you have massively powerful engines essentially hanging from the underside of the wings, and the wings are in a low-wing configuration (i.e. they sprout from the bottom of the fuselage and not the top like a Cessna).

Imagine you're on final into a busy airport and you have to abort your descent because $reasons. If you slam the throttles forward, the thrust is all below the aerodynamic centre of the aircraft, so you have a net nose-up pitching moment created. In other words, when you slam the throttles forward the aircraft tends to nose up. A 737 pilot told me recently that in a take-off/go-around situation you actually need to push the yoke forward to restrain the aircraft's nose-up tendency until you can re-trim for climb.

MCAS was introduced because the new engines on the MAX, essentially, make the airframe more dangerous at high Angle of Attack than the previous generations of the 737, and so MCAS was intended to automagically keep pilots from entering the dangerous regime.

MCAS was required in order to keep to the "zero training delta" commitment. The solution might be removing MCAS, but that will/would require more pilot training which would essentially render the MAX a completely new aircraft as far as regulation is concerned.


> the thrust is all below the aerodynamic centre of the aircraft

Just like 95% of airliners. In fact the Max engine thrust line is higher and closer to the datum than most.

The Max's problem is not thrust line, it is additional lift generated by the forward-mounted engine nacelles at high AoA which reduces stick-load and makes pitch-up to a stall likely. Nothing to do with thrust.


The Max's problem is not thrust line, it is additional lift generated by the forward-mounted engine nacelles at high AoA which reduces stick-load and makes pitch-up to a stall likely. Nothing to do with thrust.

Pitching up on thrust was a problem with the 737 Classic, why wouldn't it be a problem on the MAX?


Is it the amount of lift of the nacelles or the forward position of this extra lift?


Both: the position changed because the size changed.


>Just like 95% of airliners. In fact the Max engine thrust line is higher and closer to the datum than most.

Correct.

>The Max's problem is not thrust line, it is additional lift generated by the forward-mounted engine nacelles at high AoA which reduces stick-load and makes pitch-up to a stall likely. Nothing to do with thrust.

Nothing you've written here contradicts what I wrote previously, so I'm not sure if you're just aggressively agreeing with me, but I'll assume that this is the case. Yes, the aircraft is dangerous at high AoA, as I wrote in my previous comment.


MCAS operates on the horizontal stabilizer (and is not a stick pusher, as the article claims); the yoke operates on the elevators. The elevators are much smaller and lack the pitch authority of the stabilizer. The stabilizer moves much slower, and by the time you've stalled under the conditions MCAS is designed to prevent, you may not have time for the stabilizer to move.


> MCAS operates on the horizontal stabilizer (and is not a stick pusher like the article claims) the yoke operates on the elevators.

https://en.wikipedia.org/wiki/Stabilizer_(aeronautics)#/medi...

So the issue is that MCAS was controlling the big control surfaces, while the pilots' attempts to correct with yoke input operated on the small control surfaces? And the former overpowered the latter.

Interesting that MCAS wasn't like cruise control where any significant input deactivates the automated controls. But then again, I know next to nothing about how big airliner control systems work.


> Interesting that MCAS wasn't like cruise control where any significant input deactivates the automated controls.

It would disengage... but then would immediately re-engage if the bad sensor readings were still there. And would, in the meanwhile, have reset its control limiter so it would push even harder than the first time.
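A toy sketch of that ratcheting behaviour (this is not Boeing's code; the activation threshold is a placeholder, and the 2.5 units per activation is taken from public reporting):

    # Toy simulation of the reported disengage/re-engage behaviour. Not real avionics.
    AOA_THRESHOLD_DEG = 15.0      # placeholder activation threshold
    TRIM_PER_ACTIVATION = 2.5     # stabilizer units per cycle, per public reporting

    def simulate(faulty_aoa_deg, pilot_counter_trim, cycles):
        """Each cycle MCAS re-checks the single AoA value; if it still looks too
        high it adds another dose of nose-down trim. Nothing caps the total."""
        stabilizer_trim = 0.0
        for _ in range(cycles):
            if faulty_aoa_deg > AOA_THRESHOLD_DEG:
                stabilizer_trim -= TRIM_PER_ACTIVATION   # nose-down
            stabilizer_trim += pilot_counter_trim        # pilot trims back, partially
        return stabilizer_trim

    # Pilot counters only part of each activation, so trim ratchets toward full nose-down.
    print(simulate(faulty_aoa_deg=40.0, pilot_counter_trim=1.0, cycles=5))   # -7.5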


Also, thanks to the "zero training delta" commitment, pilots probably didn't even know it existed, or in a high-stress situation like an imminent stall simply forgot about it.


It's worse: the pilots are supposed to "jump in" in place of the software when the plane is typically already in a critical state, and then control a plane that behaves quite differently from the way they were trained. Only when MCAS operates "properly" does the plane behave as they were trained!

That "omission" was intentional, as the Boeing selling point was "it is the same old plane, no new pilot training needed." That's how they "sold" some 600 billion dollars' worth of MAX planes!

The probable strategy for Boeing was "we will blame the pilots," and had the two crashes not happened so quickly one after another, they would probably have gotten away with it!


They're supposed to jump in and resolve the problem within forty seconds apparently. Otherwise they're screwed[1].

Those involved in the testing hadn’t fully understood just how powerful the system was until they flew the plane on a 737 Max simulator, according to the two people.

1: https://www.nytimes.com/2019/03/25/business/boeing-simulatio...


The 40 seconds is misleading. The MCAS system could in the worst case run the trim full nose down in 40 seconds (it's active for 10 seconds with a 5 second delay between cycles) if the pilot does nothing to counter it.
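Using those same numbers, the 40-second figure is roughly three uninterrupted activations with two pauses in between; a trivial back-of-envelope:

    # Back-of-envelope with the numbers above: 10 s active, 5 s pause between cycles.
    active_s, pause_s = 10, 5
    activations = 3
    worst_case_s = activations * active_s + (activations - 1) * pause_s
    print(worst_case_s)   # 40 -> the "40 seconds", assuming nobody intervenes at all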

It's like if you are driving and the road starts to curve. If you do nothing, yes you will get to a point where a crash is unavoidable. But you know that when your car is not following the road, you turn the steering wheel. Pilots know that when they have to pull the yoke to keep the plane level, they need to trim.


The problem is that if you do the intuitive thing (which is what pilots are trained to do for most trim overruns - pull on the stick to counter the trim), the MCAS goes through the increasing cycle to full nose-down. It's 40 seconds to realize that your usual strategy isn't working, figure out what to actually do, and then turn off the exact right system.


From what I have understood, they used MCAS so that they didn't need to reclassify the plane model and so that pilots would not need to retrain for it, since with MCAS the handling would be similar to the older planes. The actual problem might not be the design itself but the use of workarounds to get through the red tape of reclassifying the plane and retraining pilots for a new type.


Wasn't the idea that they wanted to avoid pilot re-training? So if pilots were expected to counteract manually, they would have to be trained to understand when to do that.


Pilots are already trained to handle runaway trim, the idea was that an MCAS failure would behave similarly and the same checklist they already trained on could be used. Due to issues in how MCAS was implemented however, the behavior was different from runaway trim and more confusing.


One comment I saw was that with Boeing aircraft when the autopilot is off you're fully manual. Unlike Airbus.

MCAS has control authority when the autopilot is _off_.


> with Boeing aircraft when the autopilot is off you're fully manual

The 777 and 787 are full-time FBW, there is no manual reversion. There is no way to move the control surfaces directly. The only difference with the AP engaged is the origin of the control demands.


You can have cables, push rods, hydraulics, power assisted hydraulics, full hydraulic, and electrical actuators. As long as their control inputs are only from the pilot they are identical in every way but their failure modes.

There is a big difference between FBW and the aircraft generating control inputs on its own. Boeing aircraft had two modes, 'manual' and 'autopilot'. The 737 MAX adds a third case: 'sometimes'.

Boeing never told the pilots about the 'sometimes' part. And never told them that 'sometimes' has ultimately more control authority than they do.


I think you misunderstood my post. There is literally NO manual command over control surfaces in any current Boeing airliner other than the 737 variants. Neither the 777 nor 787 have any pushrods or cables.

The Max does indeed have a hybrid approach ( manual actuation, FBW spoilers, mystery MCAS ) but that's not Boeing's current philosophy.


You're willfully failing to get my point.


From what I've read, MCAS is disabled with the flaps down, so it wouldn't be for take-off and landing.


Because:

1. earlier 737 models didn't pitch up and Boeing wanted the plane to "feel" the same and not have to re-train the pilots

2. there is some certification requirement that the pitch has to be constant during climb (or something like this); if the plane doesn't have this, it is not certified to fly. It wouldn't pass FAA certification without MCAS.


If MCAS is required for the plane to be airworthy, then it is not airworthy when the pilot disables it via the stabilizer trim cutoff. And if the behavior of the plane is now subtly different in approach to stall, stall, or stall recovery, then the pilot might also not be type certified for the airplane with MCAS disabled.

I'm really suspicious, in a variety of ways, of cutoff switches effectively decertifying a plane in flight. And then how it's OK for pilots to not at least be made aware of that potential situation in advance?

An airplane suddenly rendered not airworthy, and pilot suddenly rendered without a proper type rating. It's absurd. I don't know how a software update gets them out of this predicament, if it's true.

Airbus fly-by-wire aircraft have numerous layers of safeguards (laws) in place. Each can be removed or degraded depending on the circumstances. But pilots are expected to know all of them, and know the consequences of each safeguard being removed, including the ensuing natural flight behavior of the airplane.

Anyway the story still isn't fully out yet.


> there is some certification requirement that the pitch has to be constant during climb (or something like this); if the plane doesn't have this, it is not certified to fly. It wouldn't pass FAA certification without MCAS.

So what I gather is the issue was that the MAX didn't fit this requirement for steady pitch (hence the airframe problem referenced in the linked article), and MCAS was supposed to be the band-aid to fix this essentially by automatically pushing forward on the yoke during high angle of attack.

If that's the case then the following is especially concerning:

> Boeing offered the single angle-of-attack sensor as standard equipment, and charged extra for a second along with a “disagree” indicator that would allow 737 MAX pilots to “cross-check” a faulty sensor.

Seems pretty sketchy to ask airlines to cough up extra dough for redundancy on a safety critical system. Who knows what other systems are subject to the same cost/benefit tradeoffs.
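For what it's worth, the cross-check the announced fix adds is conceptually trivial; something like the sketch below. The disagreement threshold has been reported as about 5.5 degrees, but treat the exact number (and the function name) as an assumption for illustration:

    # Hypothetical AoA cross-check, roughly what the announced software fix is said to do.
    DISAGREE_THRESHOLD_DEG = 5.5   # reported figure; treat as an assumption here

    def mcas_permitted(aoa_left_deg: float, aoa_right_deg: float) -> bool:
        """Allow MCAS only when both vanes broadly agree; otherwise inhibit it
        (and presumably light an AoA DISAGREE annunciation for the crew)."""
        return abs(aoa_left_deg - aoa_right_deg) <= DISAGREE_THRESHOLD_DEG

    print(mcas_permitted(12.0, 13.0))   # True  -> MCAS may act
    print(mcas_permitted(12.0, 35.0))   # False -> large disagreement, inhibit MCAS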


Boeing offered the single angle-of-attack sensor as standard equipment, and charged extra for a second along with a “disagree” indicator that would allow 737 MAX pilots to “cross-check” a faulty sensor.

Yeah, that's factually incorrect. All 737 NG/MAX planes have two alpha vanes to detect the angle-of-attack. MCAS only ever uses one at a time.

Edit: You can see them both plainly in pictures:

https://www.planespotters.net/photo/806268/pk-lqg-lion-air-b...

https://www.jetphotos.com/photo/8686917


From the article.

That design decision meant the 737 MAX would tend to pitch up while accelerating.

So the MCAS was there to counter that behaviour by automatically pushing the nose down when it detected this nose up situation.


There's also the issue of the "faulty sensor". We don't yet know if the sensor was faulty, but if it was, whether MCAS could have detected that fault.


I'm really curious what the Software QA process is like at a company like this (or any company where automation failure can result in loss of life).


From comments I've read, abysmal, in the sense that the Quality people are told they aren't there to keep the plane from falling out of the sky, but rather to generate an audit trail so that if a disaster happens, the cause is traceable.

My personal approach to Software QA is generally more in line with the role fulfilled by a Systems Engineer, so I'm intensely uncomfortable having read Quality may be culturally interpreted in such a perverse manner.

Funny thing about Quality is you find exactly what you look for. Start out making a system that only suffers traceable disasters, and you get a different product than one which focuses on not suffering disasters in the first place.


I don’t have an aeronautical background at all, but a hunch I heard on some aviation forum is that the plane must be certified to be safe when flown by a merely “average” pilot, and if the Max was more difficult because it had this tendency to stall, that might mean it needs a pilot with more skill, and therefore more training for existing 737 pilots. So MCAS was needed to ensure no skill or training difference whatsoever for the pilots, even if a typical pilot should be capable of flying without MCAS.


Is the reason they moved the engines forward that they wanted to avoid lifting the plane higher, which would have kept it from fitting jetways/stairs?


The CFM Leap-1B (new engine on 737 Max) are bigger than the CFM56-7B, so Boeing had to change the mounting point of the engine (further forward and higher on the wings).


Right, but why didn't they just lift the aircraft higher off the ground to fit the larger engines?


Because the gear has to be stowed!

The gear is retracted inboard, so lengthening was a non starter given the rapidly escalating hardware changes that would be required to the gear bays.

Once you start changing the fundamental structure of the aircraft, you also start weakening your case for a common type certificate.


Because they couldn't fit larger landing gear.


#5 The answer is: The aircraft stalls


Well eetimes got some clicks, so job well done for the journalist who wrote this article about a blog post by some guy with experience flying large planes in a flight sim.


So much for journalistic integrity. Might as well have just said “some guy on the internet”.


More accurate to say I've flown a 737 on my computer.


Well, we at HN got some good discourse out of it, no?


Not sure I would call it 'good' but we did sure get a lot more chatter. I wonder if collectively this is the most comments a particular issue has ever received on HN. So many articles posted, each gets hundreds of comments.


The saving grace. Just the headline was enough of a prompt for that.


Yeah but he’s an “instrument-rated pilot” (in his Cessna)

/s


While there is definitely a gap in the physical nature of controlling the two platforms, the fundamentals underlying both pieces of flying equipment are the same, and in most of the ways that really matter there is no real difference between an MCAS-like system on a Cessna vs a 737.

[waits for the multi-engine and fundamentally different airframe crowds to die down]

At the end of the day, strip out every system not at fault, and you have a pilot fighting a machine whose exact details he isn't aware of, in terms of how its response to various control inputs translates into the behavior of the plane.

Any pilot period (and any honest Engineer) should be capable of recognizing the 737 and the MAX 8 are entirely different planes on a fundamental level.

You may have made your comment in sarcasm, but it is kind of an Emperor's New Clothes situation going on for Boeing right now.

When the laypeople can be quickly brought up to speed that something isn't fundamentally the same, yet you (the principal engineering firm) dig in your heels to insist it is, you've committed some grievous breach of public trust.


The paper that this article cites is a far more interesting read than the summary: https://drive.google.com/file/d/1249KS8xtIDKb5SxgpeFI6AD-PSC...


That's pretty awesome -- and chilling. His four-seat Cessna's $20k avionics upgrade (which got automatic pitch regulation, same as the Max) required more training and documentation than the multi-million-dollar 737 Max that killed 157 + 189 people.

The 737 MAX is a dodgy design which probably ought to be banned.


... it’s written by a guy who uses flight simulators, this is crank stuff.


The 757 flight simulator that I've flown was a full-motion flight simulator at UPS's Louisville training facility. I was helping conduct 6-month certification flights as co-pilot. I would say that if those simulators are good enough for UPS's 757 pilots' recurring training, they're good enough for this Cessna driver to understand the difference between his plane and a 757. Answer: not that much.


Greg - Serious question - How is the airframe on the Max8 faulty due to engine placement when the A320NEO has almost the same design?


Because the A320neo can fit the engines under instead of in front of the wings, thereby negating the nacelles' contribution to leading-edge flow separation.


Exactly. Except it’s not flow separation that is the problem. It’s the lift that the nacelles generate combined with their mass and moment far ahead of the center of gravity


Yes, my mistake. Stated the outcome, not the action that gets you there.


Interesting read, but he's still wrong.


This article appears to be fairly thinly sourced. The one named source I can find appears to be a blog post by a fellow software developer who is an instrument-rated pilot but who has flown airliners in simulation only. The article does not claim the source is a professional pilot or that they have ever flown the 737 Max.

With due respect, I am not sure whether that counts as enough expertise to qualify someone's opinion as newsworthy.


Hello,

This is Gregory Travis, who wrote the original article on which the EE Times article is based. If any of you have a specific question regarding my conclusions or how I got them or want to discuss any statements of fact, I'm more than happy to engage.


Gregory,

In your paper, you erroneously claim that the MCAS system creates forward pressure on the control column ("pushes the pilot's control columns in the down direction".) This is incorrect. MCAS only acts on Horizontal Stabilizer Trim, and doesn't work at all when autopilot (which does provide control column forces) is active.

You also claim that the 737MAX doesn't have "mechanical connections between the pilot's controls and the things on the wings, rudder, etc." when the 737MAX does have full mechanical linkages between the control columns and the flight control surfaces. Those surfaces are normally hydraulically actuated, but if all hydraulics fail there is physical manual control of the elevator and ailerons, also known as "manual reversion."

Because of these glaring errors, your paper loses much of its credibility in critiquing Boeing designers.


Thank you for your paper.

It was a worthwhile read, and it filled in a lot of details I hadn't noticed about the crash, while also affirming the criticism I hold of the process of software development -- where all of the decisions are deferred to the magical 'algorithms' without considering who writes the algorithms and how decisions about them are made.

The more persistent problem is how artificial intelligence is used to increase 'engagement' while also fueling hate -- an example of programmers and their managers not listening to social scientists and journalists.

When planes go down, it's a case of programmers and their managers not listening to engineers and people on the ground.


Thanks for checking in, hopefully this gets the visibility it needs. (and sorry it's the eetimes article getting the publicity!)

https://drive.google.com/file/d/1249KS8xtIDKb5SxgpeFI6AD-PSC...


Hey Greg - serious question - The A320NEO has a very similar engine placement as the Max8. Wouldn't that air frame be faulty too if the problem was solely due to engine placement being in front of the wings?


Why is this: “once this thing pitches up, it wants to keep pitching up”? And why is it more of an issue with the engines in their new position? Thanks.


Couple of things here to keep in mind. First, when the A320 came out I wrote extensively about its fly by wire system, which was highly controversial at the time (early 1990s). It’s been nearly thirty years since then and long story short, Airbus has vastly more experience with implementing cockpit automation than Boeing. Boeing simply got in far over their expertise with deadly results

Second, Airbus' 320 airframe does not impose the same issues with larger engines that the 737 does. For starters, the 320 airframe started life in the era of large high-bypass turbofans: its initial engines were much larger than the 737's initial engines.


It wants to keep pitching up because the engine cowlings are now far ahead of both the center of gravity and the center of lift. And the cowlings generate significant lift themselves. Aerodynamically they act as levers that pull the nose up, and the higher the angle of attack, the more they pull. That is dynamic instability, and as I point out, you want to have ejection seats in any dynamically unstable aircraft (i.e. fighter jets).
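A toy model of that lever effect, with all coefficients invented for illustration: nacelle lift that grows with angle of attack, acting ahead of the centre of gravity, produces a nose-up moment that also grows with angle of attack -- the self-reinforcing tendency described above.

    # Toy model of the self-reinforcing nacelle-lift effect. All numbers invented.
    NACELLE_LIFT_PER_DEG_N = 900.0   # assumed extra nacelle lift per degree of AoA
    LEVER_ARM_M = 6.0                # assumed distance of nacelles ahead of the c.g.

    def nacelle_nose_up_moment_nm(aoa_deg: float) -> float:
        return NACELLE_LIFT_PER_DEG_N * aoa_deg * LEVER_ARM_M

    for aoa in (5, 10, 15):
        print(aoa, nacelle_nose_up_moment_nm(aoa))
    # The moment grows with AoA: the higher the nose, the harder the nacelles pull it
    # up, so without a countering input the pitch-up tends to run away.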


Wondering if Boeing will be able to recover from this in regards to keeping the MAX flying at all. I mean, I will always pick a different plane from now on - just not a risk I'm willing to take in the foreseeable future. Not sure if my stance is common though.


Do you look up the safety history of all your potential rides, or just focus on recent headlines?


Airplane crashes are rare, and it's quite easy to remember some unique plane crashes (like I would also not fly on a Concorde). With the Boeing MAX right now there are definitely some bad decisions they made safety-wise, and it would not be surprising if other tradeoffs were made as well.


Why not both? It's not that hard.


The real question isn't whether the autists on HN will be researching every flight they schedule, it's how many Average Joes will care. Short answer: to a close approximation, zero.


At this point no 737 pilot has any excuse not to understand MCAS, including what it does and how to disable it if there's a problem, and they will be watching for issues very closely.

I wouldn't have a problem flying on one.


> At this point no 737 pilot has any excuse not to understand MCAS

Exactly the same after the Lionair crash.


So you think the MCAS system is the only deadly surprise the 737 Max has in store?

I'm not comfortable making that assumption, especially after learning more about Boeing's decision-making process and the way they misled the FAA.


Remains to be seen but I am sure all the major certification authorities will be asking this now. Probably by the time it's flying again there won't be any need to make assumptions.


I will also refuse to fly on one anytime soon.


How easy is it for a consumer to make that choice? Is the information surfaced to consumers?


Many (most?) airlines list the aircraft type during the booking process. Even for those that don't, you can look up the flight number and date on a site like FlightAware or SeatGuru, which will tell you.

But, if the airline needs to, they can swap out the scheduled aircraft for another one at any time up until takeoff. So you might book with the belief that you'll be on a 737 NG, but get to the airport and find (or not even notice) that you're on a 737 MAX.


You can see it in most search engines, and after the Ethiopian crash Kayak announced they'll add an aircraft type filter. Other flight search engines will probably follow suit.


Yes, you can find out which plane you are going to fly on.


and airlines can change that plane at any time after you've booked


I may be irrational but I'll avoid the 737 MAX no matter what software they push.


I guess I'll keep looking for my keys under the lamp post.


I vaguely recall this being a reference to something.

Would you mind elaborating?



Me too. I have booked a trip in the US this summer and I will check that every flight is not on a MAX-type aircraft.

I don't think any of them are, but if one is, I will not take that flight.


Really? My conclusion from this saga has basically been that the 737 Max is fine when flown by first-rate carriers (American, Southwest) and conversely that no plane is safe to fly with 3rd or 4th rate carriers in the third world (Lion Air).

Everything comes down to the Swiss Cheese - you're focusing on Boeing's hole, while I care more about the stack of holes that Lion Air maintained.


Your theory falls flat on its face considering that Ethiopian has quite a stellar safety record and a good reputation.


At the same time, Ethiopian flew a copilot with (IIRC) 200 hours of total flight experience. This changes emergency CRM from "one person flies the airplane, the other diagnoses the problem" to "Captain flies the airplane, babysits the newbie, and attempts to diagnose the problem".

As for Ethiopian Airlines' 'stellar safety record': comparing these two Wikipedia pages [0] [1], they seem to have a similar (recent) safety record to Aeroflot. Yeah, Aeroflot's gotten better, but not that much better. Ethiopian has two incidents that resulted in fatalities from 2000-2018, discounting passenger hijackings; Aeroflot has one. Both have one incident that killed everyone on board. And while I'm not sure about relative flight frequencies, Aeroflot has a fleet of 253 planes to Ethiopian's 112. For comparison, Delta has had 0 incidents resulting in fatalities since 2000 [2] on a fleet of 896. The same goes for United's fleet of 778 if you exclude hijackings; American (with a fleet of 962) had one non-hijacking incident, in 2001. So I'm honestly not sure what people are talking about there.

0: https://en.wikipedia.org/wiki/Ethiopian_Airlines_accidents_a...

1: https://en.wikipedia.org/wiki/Aeroflot_accidents_and_inciden...

2: https://en.wikipedia.org/wiki/Delta_Air_Lines#Accidents_and_...


1) Like most people, I'm far from qualified to give aeronautical engineering advice, but as fly-by-wire technology gets more advanced, won't this be the norm? I.e., airframes that are difficult to fly might always be more efficient, so have a computer do the hard part.

2) This part seems like the real damning misdesign:

Boeing offered the single angle-of-attack sensor as standard equipment, and charged extra for a second along with a “disagree” indicator that would allow 737 MAX pilots to “cross-check” a faulty sensor. Citing those decisions, another observer noted: “Who would design a system with a single point of failure?”


Re: 1, today I learned about this:

> https://en.wikipedia.org/wiki/Relaxed_stability#Unstable_air...

> Relaxed stability designs are not limited to military jets. The McDonnell Douglas MD-11 has a relaxed stability design which was implemented to save fuel. To ensure stability for safe flight, an LSAS (Longitudinal Stability Augmentation System) was introduced to compensate for the MD-11's rather short horizontal stabilizer and ensure that the aircraft would remain stable. However, there have been incidents in which the MD-11's relaxed stability caused an "inflight upset."

> Updates to the software package made the airplane's handling characteristics in manual flight similar to those of the DC-10, despite a smaller tailplane to reduce drag and increase fuel efficiency.

Maybe this isn't as new of a development as we think.


Maybe this isn't as new of a development as we think.

Considering that McDonnell Douglas management is running Boeing these days…


> “Who would design a system with a single point of failure?”

According to the original design, MCAS was only supposed to adjust trim to a level that was easily overridden by the pilot essentially just pushing on the stick or adjusting the trim. If it had been implemented this way, sensor failure would not have been catastrophic and hence wouldn't have required redundancy. At some point this was either changed or implemented incorrectly so that MCAS had much more authority.


It was changed. Flight testing showed that much larger trim was required for MCAS to function, and that was implemented. The failure was not reassessing the risks after that.


Echoes of the Hyatt Regency walkway collapse.


That's pretty damning. I would have assumed Boeing of all people had a comprehensive change management system to automatically trigger a re-assessment in these cases.


They underestimated the aerodynamic instability and then decided just to give this auxiliary system total control. What could possibly go wrong? They should have gone back to the drawing board, putting off the launch and figuring out how to lift the airplane a bit higher. That may have required them to lower 737 prices for a while to stay competitive against the 320neo, and may have cost them a fortune.

Feedback loops with high gain (strong engine pulling up, powerful MCAS pushing down) are difficult to control. Minor failures have out-sized impact. Adding rules to constrain the system introduces new operating modes all of them having the potential to confuse the pilots. And in this system the pilots are close to ground and have very little time to act.


Boeing offered the single angle-of-attack sensor as standard equipment, and charged extra for a second along with a “disagree” indicator that would allow 737 MAX pilots to “cross-check” a faulty sensor.

Unfortunately for the author that's not an accurate representation of reality. I'm a bit surprised as I thought that EETimes was a credible news source.


> “Who would design a system with a single point of failure?”

Relevant: https://news.ycombinator.com/item?id=19158562


>Who would design a system with a single point of failure?

I guess we now have an answer to this question.


Today I saw the 737 MAX frontal view for the first time. Initially I thought it was one of those typical funny plane-themed photoshops. I kid you not, it is the real thing - https://i.stack.imgur.com/GFzcj.jpg

Just look at those nacelles. Deep breath. Look again. Take them in. Besides visually screaming that this Frankenstein thing was quickly and cheaply slapped together, wasn't properly engineered, and thus should never have seen the light of day, these nacelles obviously add more lift than normal symmetrical ones. So:

1. the engines are placed further forward than on the pre-MAX 737 - that results in an additional pitching-up moment, as the engines are below the centers of pressure, gravity, etc.

2. the engines are 2x higher-bypass than the pre-MAX 737's, and thus the center of thrust is shifted even further forward and lower - as a result it adds even more of the pitching up

3. these asymmetrical nacelles generate more lift just due to their shape - and again, due to the position of the engines, that lift results in an additional pitching-up moment.

Basically the thing just can't really fly steady and straight, and looking at all this, some people at Boeing decided that a band-aid software patch would just fix it. Sounds like it was the same people who did the "curl" fix in today's Cisco story https://news.ycombinator.com/item?id=19508472 :)


I will not comment on (1) and (2), but (3) is wrong. I don't think that's even a 737 MAX in the picture.

The 737 was originally designed to be low to the ground to make it easier for ground crew to "bulk" load, i.e. just throw stuff into the cargo area. The reason the 737 could be so low to the ground was that it used slim, low-bypass engines[1]. These are very slim compared to the high-bypass turbofan engines used on later 737 generations. When they moved to engines with a bigger fan diameter, they needed to move them higher up from the ground. So they moved the engines in front of the wings. To gain even more ground clearance they moved the accessory gearbox and fuel pump from underneath the engine to the side. That's why the engine appears flat on the bottom. Obviously the engine is still round because the fan is round, but the lip is flattened. All this allowed Boeing to fit more efficient and quieter engines to the 737 without extending the landing gear. The shape itself does not generate much aerodynamic lift, if any.

On the 737 MAX they actually extended the landing gear, and it does not feature the "flattened" engine shape any more.

[1] https://en.wikipedia.org/wiki/Pratt_%26_Whitney_JT8D


This doesn't sound right: aren't unstable airframes controlled by computers the norm now? If that is the case, a software fix is the answer.

If unstable airframes are not the norm, then the question we need to be asking is how the regulatory regime let an unstable airframe into service.


afaik unstable airframes are the norm for some military/experimental planes (fighter jets for example) to allow more manoeuvrability.

It's definitely not something you want (or need) on a commercial airplane; it adds a ton of complexity, and you're not going to do a high-G turn or a cobra manoeuvre with a 737.

https://www.boldmethod.com/learn-to-fly/aerodynamics/3-types...


If you want to fly from Germany to the Canary Islands for 50 EUR round trip, then you're going to end up with unstable airframes.

The 737 MAX is in part unstable due to the repositioned engines. They are larger and thus significantly more fuel efficient, but they would not keep enough ground clearance if they were positioned the same way as on the 737-NG. MCAS corrects for the repositioning; otherwise the plane may pitch up, which in turn could lead to a stall.


> If you want to fly from Germany to the Canarian Islands for 50 EUR round trip then you're going to end up with unstable airframes.

How does B follow from A? That's quite the non-sequitur.


B follows from A because in part A makes B economically possible.

Reductions in fuel consumption factor greatly into ticket prices. It's easy to reduce services (luggage, refreshments, leg space) but reducing fuel consumption is entirely up to the aircraft manufacturers.


But you don't need instability to get fuel economy; all you need is an airframe that actually matches the current engines, instead of a relic from the 1960s that was designed for tiny, inefficient low-bypass engines. A plane designed for the new engines could be perfectly benign without sacrificing efficiency. Even a more properly adapted 737, with the changes required to solve the problems aerodynamically instead, could be fuel efficient.


Oh that's true, except a new airframe leads to new everything, including certifications and training for pilots. Those do not come free.


Things that lead to instability can help; look at Boeing's MD-11. The horizontal stabilizer was made smaller than the DC-10's for fuel efficiency, but that resulted in a plane that was a handful to land safely.


I can think of many ways that achieve the same goal. Yes, most require re-cert, as they should. Trying to make an old tool do things it wasn't meant to do (see: Internet w.r.t security, 737 MAX w.r.t. engine placement) has a compounding effect that very rarely works out as well as a clean slate design with the right compromises in mind.

This is an example of a tragedy of the commons. Market forces found a solution to a problem (for one actor) that externalized long term costs for the sake of sealing a short-term deal, that in the end, was not guaranteed to stick around.


Manager at Boeing? You might not realize it but that’s most likely the mindset that resulted in all this tragedy.


No, just bad at expressing myself. I wasn't trying to defend Boeing's management and reasoning.

In fact, I am very much against low-cost flights like that and would rather see fewer planes in the air, as long as they burn fossil fuels.


This appears to be a thinly sourced article, based on a blog post opinion by someone who is a software developer and instrument-rated (hobby?) pilot who has flown airliners in simulation only.

With due respect, I'm not sure whether that constitutes enough expertise to qualify an opinion as newsworthy.


It would be interesting if these kinds of companies (aviation, car companies) were forced to publicly disclose the patch they are applying in order to fix a broken piece of software.

Maybe then they'd be more careful because of the extra scrutiny and the potential leaking of secrets.

On the other hand, maybe then they'd patch as little as possible, although in this case, if a second patch were required, a very hefty fine could be imposed on the company, or possibly a full disclosure of all the relevant source code could be forced.

Maybe a blockchain could be used for some accountability here, where hashes of all the software blobs in the system, including the secret ones, could be used as a means to prove that only a specific section of a codebase has been altered.
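The hashing part, at least, needs nothing exotic. A minimal sketch (the directory and file names are hypothetical):

    # Minimal sketch: publish digests of the blobs in an update package so third
    # parties can later verify which components actually changed. Paths are hypothetical.
    import hashlib, json, pathlib

    def digest_file(path: pathlib.Path) -> str:
        return hashlib.sha256(path.read_bytes()).hexdigest()

    update_dir = pathlib.Path("update_package")   # hypothetical package directory
    manifest = {p.name: digest_file(p) for p in sorted(update_dir.glob("*.bin"))}
    print(json.dumps(manifest, indent=2))
    # Anchoring this manifest somewhere append-only (a public log, or a blockchain if
    # you insist) lets outsiders check later that only the claimed blob was altered.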


Everything made in the US has become extremely expensive. As a product manufacturer you have to pay a lot more than your Asian competitors. You can open an R&D subsidiary in Asia to reduce costs, but in a very short time you will see your technology has diffused and now you have even more pressure from competitors.

More bugs and design faults. Growth, innovation, effectiveness, all in a shorter time. All with increasing costs.

And even more complexity, and more pressure. iCloud leaks, empty root password, reboot by WhatsApp message, Meltdown, Spectre, 737 MAX, etc.


As an example of the seriousness of approach to stall, buffeting, and proper recovery (you don't fall out of the sky like a rock, but recoveries take hundreds and possibly a couple of thousand feet), this two-year-old serious incident in a 747, involving an inadvertent stall while entering a hold, just had its final report issued. http://avherald.com/h?article=4a787699&opt=0


I find the concept of fixing an airplane's hardware issues with a software fix incredibly scary.


You can't really fix any hardware fault in software. The best you can do is a workaround, but the whole will never be as solid as a properly designed system would be.


My bet is that Boeing will spend some time banging on this and then the 737 goes back to being a money maker for them for years to come. I'm guessing that at this point they know what the problem is and are probably pretty far along in coming up with multiple solutions and picking one that gets the job done. The way this works is that they have to do something, because they are grounded.

This may or may not involve installing some extra hardware but it will most certainly involve a software update.


The big question is ... who will trust the Boeing-FAA duo after this? The 777X is coming; there will surely be rather pointed questions from airlines, EASA, and more.


I think many do not understand typical practices of regulatory agencies. As a related example, what do you think the FDA requires in terms of genetically engineered foodstuffs? Many seem to think there's extensive oversight and safety testing. There isn't. They treat genetically engineered products and natural products identically. If a company has all their regulatory issues in order to market e.g. corn, they can cook up a new genetically engineered corn in the lab and bring it to market with literally 0 additional oversight necessary. All the FDA offers here is a completely voluntary consultation, and that in turn basically is little more than the company signing off on some checkboxes.

This leads to a bemusing and disconcerting run around.

Monsanto: "The Food and Drug Administration (FDA) is responsible for the safety and appropriate labeling of food and feed products grown from GM crops." [1]

FDA: "It is the manufacturer's responsibility to ensure that the food products it offers for sale are safe and otherwise comply with applicable requirements." [2]

Sound similar? It'll be the exact same story if/when a company inadvertently releases a harmful genetically engineered product. The assurance of safety provided by regulatory agencies is often illusory. As an aside, this is all clearly described on the FDA's page as well. [3] But the phrasing is designed to mislead consumers. They state repeatedly that it is unlawful to ship unsafe food to consumers without ever directly clarifying that they themselves never actually test the foods. Inventions go straight from Monsanto's lab to your plate. Obviously they have a major incentive to ensure their products are safe, but they have a long history of failing in that obligation yet remain a multi billion dollar company.

[1] - https://monsanto.com/company/commitments/safety/statements/a...

[2] - https://health.usnews.com/health-news/health-wellness/articl...

[3] - https://www.fda.gov/Food/IngredientsPackagingLabeling/GEPlan...


Food safety isn't a function of relatively small changes in the genome of plants you eat, this is pseudo-scientific nonsense. The "natural" corn or animals you eat also experience genetic drift, and the FDA isn't tasked with sequencing them and certifying each "change".

If the purpose of aircraft was to feed them to giants who'd digest them for their raw materials Boeing wouldn't need to certify the 737 MAX either. But aircraft are flown, so minute changes to their construction can make a lot of difference. This comparison of yours makes no sense.


Boeing obviously felt there was a basically 0% chance of their decision being in any way unsafe. And they are, arguably, the most qualified people on this Earth to decide this. Of course they probably got blinded by profit a bit, but it's not like this was a Ford Memo moment. A single plane going down is a catastrophe. Two planes going down is something much worse. They obviously felt everything was perfectly safe; they were wrong. Lots of people died. Even though the most likely outcome is they'll get a slap on the wrist, I think there's no way they would have gambled on this.

The reason I mention this is because I don't believe you believe it's impossible to create an unsafe product as you are implying in your statement. Genetic engineering technology enables us to hybridize anything. As a not entirely random example you could combine an orange with genomic data from an arbitrary virus or perhaps certain aspects of various plants in the nightshade family, if you so wished. You can theoretically do great things with genetic engineering, and you can certainly also do awful things. And there is no doubt that you can also accidentally do awful things. And I don't think short term safety is the real concern. You're not going to drop dead after drinking a cola because of some genetically engineered corn syrup in it. My concern would be longterm unforeseen consequences.

For instance weight gain, fertility, and even cognitive and psychological factors are all connected to what we consume in various ways that remain poorly understood. And we're currently running a compulsory experiment in that nearly all foodstuffs in the US now contain substantial components of genetically engineered products. The rest of the world works as a control, to varying degrees, due to radically less consumption of engineered products. What will be the longterm consequences of this? Perhaps we're already seeing them. Or perhaps the issues plaguing the US are caused by something altogether different. The point I was making is that it's ultimately up to the individual to come to their own decisions here. If you're happy to consume any genetically engineered product in full faith then I fully respect your view, even if I might disagree with the soundness of it [1]. I'd ask for nothing but comparable treatment.

[1] - https://link.springer.com/article/10.1186/s12302-014-0034-1


Instead of a single study you should look at systematic reviews.

Here's an article discussing a wide-ranging review the National Academy of Sciences conducted, which is the sort of thing that informs the current FDA policy: https://sciencebasedmedicine.org/national-academy-of-science...

> I fully respect your view, even if I might disagree with the soundness of it. I'd ask for nothing but comparable treatment.

I'm not claiming you have to eat GMO food, or food that's been exposed to cell phone tower waves or whatever.

But you weren't expressing a personal preference. You were suggesting that a government organization like the FDA should be regulating something based on a hypothesis that the current scientific consensus shows is baseless.

At that point you aren't asking for your view to be respected, you're suggesting that government policy should be changed to enforce it on the rest of us.


I did link to an overview of much of the current state of the science. You linked to a pop science article written with the impartiality and professionalism of a Breitbart article, though it does in turn reference something meaningful. Here [1] is the actual report from the NAS that that page references. They comment directly on our little discussion. Page 513: "FINDING: Not having government regulation of GE crops would be problematic for safety, trade, and other reasons and would erode public trust."

It also goes into detail on the problems with "weak" regulatory regimes. I put "weak" in quotes as any genetic engineering specific regulatory regime would be stronger than the US' reliance on self regulation. For instance in one study referenced (page 194) scientists ran a typical regulatory test (90 day whole food study) with rice that was genetically engineered specifically to be toxic. And indeed it was toxic. But over the standard 90 day test, no ill effects were found. This is a quite a serious problem.

And the one final thing I'd hit on is that much of the research on genetic engineering is driven by the companies that stand to profit from proving everything is safe and beneficial. Similar to how at one time nearly all science on e.g. leaded fuel was driven by interests that had a motivation to prove that everything was safe and beneficial, and so that's exactly what they did.

The NAS paper when discussing rat studies mentions, "Some found no statistically significant differences [from consumption of genetically engineered feed], but quite a few found statistically significant differences that the authors generally did not consider biologically relevant, typically without providing data on what was the normal range." later emphasizing again after discussing various dismissed abnormalities detected in rodent studies that "There was no presentation of standards used for judging what would be a biologically relevant difference or for what the normal range was in the measurements." In other words statistically significant differences were simply completely dismissed as "biologically irrelevant", without ever defining what would actually be considered biologically relevant. That's not good science, to say the least - but it's the typical pattern in much of the research for GE products, which tends to rely heavily on direct or indirect industry funding.

And, I think you'll find your view that negligible regulation is acceptable to be something very few outside of those directly connected to the genetic engineering industry would find satisfactory. The only reason more people do not voice concern is because they're generally completely unaware of the lack of safety inspections for these products. This state of 'regulatory subterfuge' is itself reason for a significant degree of cynical skepticism. You want to regulate? Ok. You don't want to regulate? Ok. You don't want to regulate, but strongly imply that you are? That's not ok.

[1] - https://www.nap.edu/read/23395/chapter/1


They are just as trustworthy as any airline is now - making big mistakes like this is one of the best ways to change company culture.


The key failure of the MCAS system I have not seen discussed is that if it is overridden, and triggers again, it cranks the trim another notch. Trigger it five times, and each time it makes the plane less flyable. When Lion Air crashed, it had been triggered many times.

Making MCAS pay attention to two sensors might help a bit, but the disaster is still latent. Once it trims, it should never trim again without a full reset back to baseline. There are standards relating to this sort of thing in flight assist, about how much "authority" an automated system may assert, in total, and they were ignored, apparently because they did not treat it as part of the autopilot system.

If the standard had been observed, the bad sensor could not have had much effect on the flyability of the plane. The pilots would have needed to apply some force to keep the nose up, but would have succeeded, long enough to discover a fix or to turn around and land.
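A hedged sketch of that kind of total-authority budget (invented names and numbers, nothing like real flight-control code): grant automatic trim only up to a fixed total, and restore the budget only on an explicit reset back to baseline.

    # Sketch of a cumulative-authority limiter as described above. Not real avionics code.
    class TrimAuthorityBudget:
        def __init__(self, max_total_units: float = 2.5):   # invented budget
            self.max_total = max_total_units
            self.used = 0.0

        def request(self, units: float) -> float:
            """Grant at most the remaining budget; repeated triggers get nothing extra."""
            granted = max(min(units, self.max_total - self.used), 0.0)
            self.used += granted
            return granted

        def reset_to_baseline(self):
            """Only an explicit reset (e.g. trim returned to baseline) restores the
            budget, so re-triggering cannot ratchet the trim notch by notch."""
            self.used = 0.0

    budget = TrimAuthorityBudget()
    print(budget.request(2.5))   # 2.5 -> first activation gets its full authority
    print(budget.request(2.5))   # 0.0 -> a second trigger cannot crank it further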


Somewhat off-topic but this has been bothering me: Several years ago I was in a program review and ended up in an argument with the lead Boeing software QA person for a particular group. The disagreement was because the person made a blanket statement that their QA process ensures there are no defects in their flight software. My response was that such a statement is absurd and that all software beyond some minimal complexity has defects. (A statement that I still agree with even though it is hyperbolic.) None of this has any direct relationship to the 737 Max issues as this wasn't even an airplane program but I think it points to what might be a cultural flaw if this attitude is widespread.


> ensures there are no defects in their flight software.

"Testing only proves the presence of defects, not the absence"

However, I think you were talking past each other here. The person you talked to was likely defining "no defects" to mean "all 1000 boxes ticked on the spec/testing protocol". They should call it "no known defects".


What if it doesn't? I can see two ways in which they risk ending up with "major" problems:

- They need to make changes that mean it's no longer a 737 for certification purposes.

- They need to make changes that delay production and even mean recalls of built planes, say moving the engines rearward, which in turn would mean big airframe/landing gear changes to manage ground clearance. These would be very expensive, cause tons of cancelled orders, and possibly also cause the same certification issue as the above.

Any of those changes could kill the whole MAX program. And that would leave Boeing without a competitive plane in the most common class. Is this a possibility?


Why does an active MCAS system need to exist in order to tamp down on pitch-up? I wonder if modifying the surface of the wing and/or tail could achieve this. E.g., make a computer model of the plane, verify that it captures the real pitch-up behavior, and "evolve" a wing that counteracts it. I suppose the hard part is that we are searching for a perhaps complicated non-linear response that needs to behave differently across speed, atmospheric conditions, pitch, and turning. But maybe there are enough degrees of freedom when evolving a surface that it can capture it all?
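As a pure thought experiment, the search loop itself is simple; here's a toy random search over two made-up shape parameters against a fake pitching-moment model (real work would need CFD in the loop and vastly more degrees of freedom):

    # Toy sketch of the "evolve a surface" idea. The aero model and parameters are
    # entirely fictional; this only illustrates the optimisation loop.
    import random

    def pitch_moment_error(params, aoa_range=range(0, 16)):
        camber, tail_area = params
        # Fictional model: we want the net nose-up tendency near zero across the range.
        return sum(abs(0.04 * a - 0.1 * camber - 0.002 * tail_area * a) for a in aoa_range)

    candidates = ((random.uniform(0, 1), random.uniform(0, 30)) for _ in range(10_000))
    best = min(candidates, key=pitch_moment_error)
    print("best (camber, tail_area):", best, "error:", pitch_moment_error(best))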


The source article is attracting some heat. But, that aside, does anyone else think the meta-questions about the FAA, their relationship to Boeing, and the 'same type' certification process need asking?

I "get" that people wanted this. But a regulator has to ask a subtly different question: is this actually in the wider public interest, which is not necessarily what Boeing wants?


>> “By laziness, I mean that less and less thought is being given to getting a design correct, and simple – up-front,” he wrote.

Competitors have more, and less expensive, engineering resources. You simply cannot produce high-quality, safe, stable, complex systems in a shorter time if you have limited and very expensive human engineering resources. This is not only related to aircraft manufacturing.


"Among Boeing’s critics is Gregory Travis, a veteran software engineer and experienced, instrument-rated pilot who has flown aircraft simulators as large as the Boeing 757."

Ok, what? I'm a veteran software engineer and I've flown (MS) flight simulators such as the 747 (badly), and even I know that none of that gives me any grounds to weigh in on this situation.


"instrument-rated pilot" - That's the key part that makes this individual able to weigh in on the situation.


There are a lot of instrument-rated pilots; it's not a high bar (https://www.aopa.org/training-and-safety/active-pilots/ratin...).

There are far too many people making pronouncements on this issue who have no business doing so.


What causes such inaccurate AoA readings? 'Vanes' freezing?

From what I understand MCAS is about pointing up/down, so although AoA can of course be a more sophisticated angle in 3D space, couldn't the measurement for MCAS purposes be accomplished with a sensor based on gravity that could be entirely internal to the aircraft and so perhaps more reliable?


There’s no real way to measure “gravity” in a plane, unfortunately.

AoA is about airflow directions anyway, so wind angle plays a role too, beyond just plane angle to the ground.


Suspended mass inside container fixed to plane structure, measure angle of mass relative to container?

> AoA is about airflow directions anyway, so wind angle plays a role too, beyond just plane angle to the ground.

That doesn't sound desirable though, does it?


> Suspended mass inside container fixed to plane structure, measure angle of mass relative to container?

Unfortunately not! This measures acceleration of the airframe, not gravity. Was a huge challenge in the early days of flight, and there’s a great history online that I can’t find right now. Anyway, this might be useful reading: https://en.wikipedia.org/wiki/Attitude_indicator
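A quick numeric illustration of why the suspended mass doesn't work (idealised physics, no aerodynamics): under forward acceleration the 'apparent gravity' the mass feels tilts backwards, so it reports a pitch change even in perfectly level flight.

    # A suspended mass measures apparent gravity: real gravity plus the aircraft's
    # own acceleration. Idealised numbers for illustration.
    import math

    g = 9.81               # m/s^2
    forward_accel = 3.0    # m/s^2, assumed, e.g. during a thrust increase

    apparent_tilt_deg = math.degrees(math.atan2(forward_accel, g))
    print(f"{apparent_tilt_deg:.1f} degrees of false 'pitch' in level flight")
    # ~17 degrees of error from acceleration alone -- and it still says nothing about
    # angle of attack, which is measured relative to the airflow, not the ground.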

> That doesn't sound desirable though, is it?

Given that wing-angle-to-airflow is what matters when it comes to stalls, I’d say it’s eminently desirable?


That is how lift is created. Air currents are not consistent over the plane. Sometimes they move downward, which would require a pitch-up of the wings to catch the air and create lift. This is not the same as the angle relative to gravity, which is easily measured in the plane. It's an entirely different animal.


What's this about "engineering incompetence", "engineering ethics" and "“cultural laziness” within the software development community"?

Do we have any reason not to believe that management is at fault? That management forced engineering to do things quicker/cheaper and cut corners and hack out a solution in those interests?


Management pressure is not an excuse for faulty engineering. As an engineer, your first duty is to the public, then your client, then your employer. That's straight out of the engineering ethics handbook. Your boss comes third.

When you sign-off on a design, it is your approval. Your name goes on the document. There's no passing that off as someone else's fault. If you are unable or unwilling to say "no" to your employer, then you do not meet the criteria to be a professional engineer.

Your employer may find another engineer to sign-off, but perhaps that person will think on it carefully, knowing that someone else refused. If something later went wrong, it could not be passed off as a simple oversight.


While management is definitely culpable, aviation is not like most software development shops in that legally speaking, your engineers do have final say. If they don't sign, nothing ships. An engineer is expected to do more than acquiesce to management's demands. It is, in fact, an engineer's job to say "no" to any implementation plan that jeopardizes the public.

There is a case to be made whether it is reasonable to expect an engineer to extend their scrutiny beyond the small slice they are given, in which case there was a failure at a higher level to distribute appropriate units of work such that the entire thing was thoroughly vetted. The problem is, that is extremely difficult to prove in a court of law, and it is not uncommon for high-level management to hide unethical practices behind indirect communication methods without committing them outright to paper.

Call me a romantic, but at some point, as a bunch of engineers on a project, you need to be able to fully grok and model the entirety of the system as a whole. It seems that level of integration would have brought these problems to light much faster, and without the bloodshed. I can't imagine that work not happening unless it was being actively discouraged. If one or several engineers had pushed back, or a manager been thinking more about product quality/safety rather than delivery/profit, that work may have been done, and we wouldn't be talking about this now.


This article is clearly clickbait. I don't think they've done a good investigative job. Even though I don't have any pilot license, I have flown "full" 747, 767, 777 and 787 simulators, but this doesn't give me the authority to make broad statements about an airframe being faulty or not.


It's odd to call the airframe "faulty" when (a) it flies and (b) the FAA requirements for positive static stability in the airframe itself don't forbid the 737 MAX from being flown.

Does the author mean the FAA airframe acceptance criteria are faulty?


You can only make so many systems redundant before the weight increase and lower capacity outweigh any potential risk of systems failing, or at least this is how it seemed to me when I compared cargo carriers to other aircraft. You need constant dedicated maintenance and regular troubleshooting; I don't understand how more problems like this haven't already occurred. There is only so much you can do without a reliable pilot to disable anything causing problems. And having seen firsthand the problem-solving methods and go-to solutions of maintainers, every time a plane lands successfully it's a miracle.


If the cost of the recertification is more than the expected profits, we don't do one


I can't understand how a switch this simple could fail, frankly. Wouldn't a mercury switch with an arced path and several electrical contacts work? It seems blazingly simple and idiot-proof. We are not being told the entire story! STAY TUNED!


A mercury switch is probably not as reliable as a wind vane in high turbulence.


Seems more like a faulty plan than a faulty plane.


How `faulty` is the airframe compared to the F-35? I hear that one needs massive amounts of software to keep it balanced.


Modern fighter aircraft are often designed to be aerodynamically unstable. This allows the removal or reduction of control surfaces and the design of an airframe shaped to produce a small radar signature; and because the craft is naturally unstable it wants to move around wildly, giving it increased pitch, roll, and yaw potential.

The software is then used to automatically move control surfaces to maintain stability. This is a deliberate design decision and results in a stealthier, faster, more manoeuvrable aircraft.

Passenger aircraft on the other hand are designed to be as stable as possible, to ensure stability and control authority are maintained by default as a failsafe.

Changing the design of an aircraft that is assumed to be stable by default and introducing software to compensate is a very different proposition to building an airframe from scratch with the explicit intent of it being unstable.

The failure mode of an airframe that's presumed stable should be that it will at the very least be able to glide down a reasonable descent slope. The failure mode of an unstable airframe is typically pilot ejection and loss of the airframe, although military aircraft failure tolerances are quite different (the computer should be able to compensate for damaged components no longer responding as intended, etc.).

So in answer to the question, it's not really comparing apples with apples. I would say that a passenger airliner should be able to maintain reasonable balance without constant interjection of a computer.

But moreover, if a change in the airframe causes an imbalance that only software can correct, the manufacturer should inform and train the purchasing airlines and their pilots in these new software systems. The fact they apparently did not has caused the deaths of hundreds of people.

For that reason I'd say as a whole the 737-Max is excessively, perhaps criminally faulty.


Even the F-16 was controlled by software. I'm not sure if the F-16 was aerodynamically unstable, but some newer jet fighters are, specifically for more maneuverability, and they rely on software to stay flyable.

However, those tend to be designed with redundant systems in order to avoid a single point of failure. Having a single poorly tested system rely on a single sensor to fix a fatal flaw in the airframe is the real issue, I think.

But maybe a passenger plane shouldn't be designed to be unstable in the first place; it's not meant to be a jet-fighter.


Fighters also have ejection seats, which provide the final fail safe mechanism. So you can use more radical and risky solutions without unduly putting the pilot at risk. I imagine the calculus works differently for bombers, cargo, and helicopter aircraft.


I don't think having ejection seats would come into consideration for something like this.

Ejection can very badly injure or even kill the pilot and is an absolute last resort.


Military aircraft have very different design goals from passenger aircraft.


Fighter planes also have a higher incident rate than airliners. By orders of magnitude.


If I recall correctly the F-16 is also an unstable airframe dependent on software, yet arguably highly successful.

Although the F-16 is from a different era. Perhaps we’re no longer capable of building complex systems?


The F-16 requires specifically trained pilots to fly safely.

The Boeing in question was flown by pilots who were not trained for or even made aware of significant modifications to the plane's behavior.

A training wheel on a motorbike is not very hard to make well or drive with safely, but if you suddenly discover it during a sharp turn at speed, it's not going to go well.


Further to my comment above, here's the line from the Wikipedia entry, which I vaguely recalled:

The F-16 is the first production fighter aircraft intentionally designed to be slightly aerodynamically unstable, also known as "relaxed static stability" (RSS), to improve maneuverability.



The author mentions this engine in the article and there is a neat video of the behemoth engine. Pure mechanical engineering excellence! https://www.youtube.com/watch?v=5CytG5M5Jcs



