Chris Hastings

Functional Safety Play Book

Activity

Mon

Wed

Fri

Sun

Apr

May

Jun

Jul

Aug

Sep

Oct

Nov

Dec

Jan

Feb

Mar

What is this?

Less

Memberships

Functional Safety Play Book

230 members • Free

5 contributions to Functional Safety Play Book

Richard Kelly

9d •

Functional Safety Question ❓

Maximum Out of Service Time (MOST)

Hi everyone, @Noah Tibasiima has raised the following question, but it was added to another post and may have been overlooked. I have been sleeping on this for a while. I would be interested in hearing how others approach the determination of Maximum Out of Service Time (MOST) when a safety function is bypassed. There is a document out there discussing this (I kinda forgot the title) but it is not mainstream FS if I am not mistaken. However it discusses using time at risk to set maximum time that an IPL can be bypassed. An explanation that stuck with me was this: When an IPL or SIF is bypassed, its PFD during that period is effectively 1.0, since it is guaranteed to fail on demand. Because of that, the time spent in bypass cannot be arbitrary. To keep the average PFD of the function within its tolerable target over the proof test interval, the duration of the bypass has to be limited. The way I saw it derived was by essentially equating the risk contribution accumulated during the bypass period with the allowed risk budget allocated to that IPL/SIF over the full interval. In simplified terms, the MOST becomes the maximum time the function can remain bypassed before the average PFD target is exceeded. My questions to those reading this: 1. How are you determining MOST in practice, do you derive it analytically from the SIF PFD target, or do you rely on more conservative procedural limits? 2. Do you treat the bypass state strictly as PFD = 1, or do you incorporate compensating measures (temporary IPLs, administrative controls, etc.) into the calculation? 3. Are there particular company or industry guidelines you have found useful for setting these limits? Curious to hear how others handle this in operating facilities because I can swear I have told someone before go look up the SRS😂, yet they were dealing with a legacy system

New comment 8d ago

Chris Hastings

3 likes • 8d

We relate this to MTTR which indicates the identified SIF out of service. When this becomes compromised we have always carried out an additional risk assessment to determine what other mitigations can be put in place. We lost a SIL 2 valve for maintenance reasons and increased inspections, testing and procedures to cover the risk gap. Doing a mathematical exercise to mask the shortfall is never good enough.

Richard Kelly

13d •

Announcements 📌

Live Decision Review — Monday 16 March

I’ll be running the first Playbook Decision Review next week. 📅 Monday 16 March 2026⏱️ ~40 minutes Scenario we’ll review A SIF designed to meet SIL 2 was verified assuming independence. During installation review it’s discovered that the installation didnt meet the design requirements — increasing potential common cause failure. The plant is nearly complete and correcting the installation would cost millions. So the question becomes: What would you recommend as the functional safety engineer? Options we’ll pressure-test: - Accept the installation and justify the assumptions - Recalculate the SIL verification with updated CCF assumptions - Require corrective changes despite the cost The aim of the session is simple:Walk through how experienced engineers frame and defend decisions when the standards don’t give a clear answer. 🔒 Attendance is available to premium members. Recording available to all members If you’d like to attend, check out the calendar, comment below or message me and I’ll send the meeting link.

New comment 12d ago

Chris Hastings

1 like • 13d

we need to get the owner involved in the decision process, FSE would advise correction but the PM would fight against cost. There needs to be a confirmed ALARP case conclude and accepted by the owner. plus... lesson learned so we can capture the mistake made in the next project

Richard Kelly

18d •

General Discussion

Proof test coverage

Something that always makes me pause when reviewing designs… Proof test coverage that somehow always ends up being 100% effective. On paper it looks great. The numbers work nicely. The SIL calculation passes comfortably. But in the real world I always find myself thinking: Can we really detecting every dangerous failure with that test? In my experience, this is a major cause of rework. If the design progresses to the point where commissioning documents are written and then a FSA or design review reveals overly optimistic proof test coverage it’s a lot of work to correct. Anyone else experiencing this?

New comment 15d ago

Chris Hastings

1 like • 17d

Yes agree, this is work in progress as the tendency is to assume 100%. Consideration should be made during the Proof test procedure development, "what failure mode does the proof test reveal and more importantly what is not revealed?". In respect of valve coverage "mission time" , "manufacturer recommendation" become relevant, (all though 'Midas' testing for letby can be used), these require that the valve be overhauled AND recertified (clock reset).

Chris Hastings

0 likes • 16d

@Robert Petchey all good comments and realisation that 100% also implies that you know 100% of the failure modes. This can bite hard when future failure catch you by surprise. I note from the dcs/automation world that the process designer is including proof testing within software taking the maintenance team and human error out of the loop?? The trap is that human error as moved from the maintenance team to the design team !!

Iyan Putra

Jan 8 •

Functional Safety Question ❓

Mission Time

Hi all, thanks for accepting. First of all, I am new in functional safety and sorry for my bad english😊. Actually I have some doubt about one of variable in PFDavg calculation namely mission time, couple of question to all: 1. What will happen in the end of mission time?should end user decommissioned the plant?or just replace everything and the mission time will get restarted? 2. If it depend on end user, than based on what consideration usually for them to determine the correct mission time?and what is the reason behind that? 3. Since by the time PFDavg will get derated, and SIL claimed may decreased over the time, shouldn't end user decide to set the mission time before the SIL/RRF drops beyond the rating it should be? Hope you guys can share your knowledge. Thanks,

New comment 6h ago

Chris Hastings

0 likes • 17d

@Noah Tibasiima agree, as Richard, my previous comments on proof test coverage are relevant. Your point on outage program is commercially important, combining Mission time requirements with major outages is critical to the business.

Chris Hastings

Jan 8 •

Introductions 👋

Fascinating F S

Hi everyone it will be great to share experiences, interpretations and understanding of functional safety.

1-5 of 5

Level 2

14points to level up

Chris Hastings

@chris-hastings-1821

Power generation digital control and functional safety expert

Active 8d ago

Joined Jan 8, 2026

Contributions

Followers

Following