| Literature DB >> 29324723 |
Luis Marques1,2, Verónica Vasconcelos3, Paulo Pedreiras4, Luís Almeida5.
Abstract
Data networks are naturally prone to interferences that can corrupt messages, leading to performance degradation or even to critical failure of the corresponding distributed system. To improve resilience of critical systems, time-triggered networks are frequently used, based on communication schedules defined at design-time. These networks offer prompt error detection, but slow error recovery that can only be compensated with bandwidth overprovisioning. On the contrary, the Flexible Time-Triggered (FTT) paradigm uses online traffic scheduling, which enables a compromise between error detection and recovery that can achieve timely recovery with a fraction of the needed bandwidth. This article presents a new method to recover transmission errors in a time-triggered Controller Area Network (CAN) network, based on the Flexible Time-Triggered paradigm, namely FTT-CAN. The method is based on using a server (traffic shaper) to regulate the retransmission of corrupted or omitted messages. We show how to design the server to simultaneously: (1) meet a predefined reliability goal, when considering worst case error recovery scenarios bounded probabilistically by a Poisson process that models the fault arrival rate; and, (2) limit the direct and indirect interference in the message set, preserving overall system schedulability. Extensive simulations with multiple scenarios, based on practical and randomly generated systems, show a reduction of two orders of magnitude in the average bandwidth taken by the proposed error recovery mechanism, when compared with traditional approaches available in the literature based on adding extra pre-defined transmission slots.Entities:
Keywords: CAN; fault-tolerance; flexible time-triggered; real-time systems; scheduling; server; temporal redundancy; time-triggered
Year: 2018 PMID: 29324723 PMCID: PMC5795361 DOI: 10.3390/s18010188
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Elementary Cycle (EC) and Trigger Message (TM) encoding in Flexible Time-Triggered -Controller Area Network (FTT-CAN).
Figure 2Insert idle time in the Synchronous Window (SW).
Bit Error Rate (BER) measurements in Controller Area Network (CAN) [18].
| Environment | BER |
|---|---|
| Benign | 3.0 × 10−11 |
| Normal | 3.1 × 10−9 |
| Aggressive | 2.6 × 10−7 |
Figure 3Error recovery process.
Figure 4Message and replica hit by errors.
Figure 5Scenarios with 1 error (a) and with 2 errors (b) and retransmission with 1 to 3 replicas.
Figure 6Plot of P (≥ n; 1/λ)
Number of replicas needed for a target reliability level in an Aggressive environment.
| Scenario | Replica Number (n msgs) | pfail | Overhead (Number msgs) | |
|---|---|---|---|---|
| 1 error, triple ret | 3 | 1.12 × 10−17 | OK | 3 |
| 2 errors, triple ret | 3 | 3.62 × 10−21 | OK | 6 |
| 3 errors, double ret | 2 | 1.81 × 10−21 | OK | 6 |
| 4 errors, simple ret | 1 | 6.04 × 10−21 | OK | 4 |
Maximum consecutive cycles (max_cycles) with single error and maximum number of errors in one cycle (max_1cycle), for various values of LSW and λ, using pε = 10−16.
| LSW(ms); λ | max_cycles | max_1cycle |
|---|---|---|
| 2.5; 0.026 | 3 | 3 |
| 2.5; 0.26 | 5 | 4 |
| 25; 0.026 | 5 | 4 |
| 25; 0.26 | 7 | 6 |
Figure 7Possible error sequences in consecutive cycles.
Figure 8Possible error and recovery scenarios for indirect interference.
Figure 9Response time for every message in the Updated_SAE set, using each possible interference pattern, when considering Indirect and Direct Interference.
Figure 10FTT-CAN simulator architecture.
Comparing analytical WCRT with that observed in simulation for the Updated_SAE message set with LSW = 55.1% of LEC, considering an Aggressive environment.
| ID | Design | Simul | ID | Design | Simul | ID | Design | Simul | ID | Design | Simul |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 2 | 10 | 3 | 2 | 19 | 3 | 3 | 28 | 4 | 4 |
| 2 | 2 | 2 | 11 | 3 | 2 | 20 | 4 | 3 | 29 | 4 | 4 |
| 3 | 2 | 2 | 12 | 3 | 2 | 21 | 4 | 4 | 30 | 5 | 4 |
| 4 | 2 | 2 | 13 | 3 | 3 | 22 | 4 | 3 | 31 | 5 | 4 |
| 5 | 2 | 2 | 14 | 3 | 3 | 23 | 4 | 3 | 32 | 5 | 4 |
| 6 | 2 | 2 | 15 | 3 | 3 | 24 | 4 | 3 | 33 | 5 | 4 |
| 7 | 2 | 2 | 16 | 3 | 3 | 25 | 4 | 3 | 34 | 5 | 4 |
| 8 | 2 | 2 | 17 | 3 | 3 | 26 | 4 | 4 | 35 | 5 | 4 |
| 9 | 3 | 2 | 18 | 3 | 3 | 27 | 4 | 3 | 36 | 5 | 4 |
Minimum LSW configuration value by design and simulation.
| Message Set | VEIL | PSA | Updated_SAE |
|---|---|---|---|
| LEC (ms) | 5 | 5 | 2.5 |
| RepLevel | 3-2-2-1 | 3-3-2-1 | 3-3-2-1 |
| Bandwidth utilization (@1Mbit/s) | 4.4% | 9.1% | 27.9% |
| Error server bandwidth(configuration) | 0.105% | 0.105% | 0.108% |
| LSW/LEC without errors (by design) | 7.1% | 11.9% | 37.9% |
| LSW/LEC with errors+server (by design) | 23.8% | 28.0% | 55.1% |
| LSW/LEC with errors+server (simulation) | 21.1% | 24.8% | 48.4% |
| Pessimism (design over simulation) | 12.8% | 12.7% | 13.9% |
Comparison of minimum LSW and bandwidth (BW) requirement with different design methods.
| Minimum LSW | Configuration BW | |||||
|---|---|---|---|---|---|---|
| VEIL | PSA | Updated_SAE | VEIL | PSA | Updated_SAE | |
| Controlled Retransmission | 23.8% | 28.0% | ||||
| Automatic Retransmission | 60.0% | 12.6% | 12.6% | 25.2% | ||
| Static TT | 22.3% | 41.4% | X | 15.1% | 29.5% | X |
Figure 11Average requirement for minimum LSW vs message set bandwidth utilization.
Figure 12Bandwidth required by each method (with LEC = 2.5 ms and λ = 0.26).
| AW | Asynchronous Window | |
| BER | Bit-Error Rate | |
| EC | Elementary Cycle | |
| Err_S | Array with error scenarios | |
| FTT | Flexible Time-Triggered | |
| GP | Global success Probability | |
| Interf_P | Array with interference pattern | |
| λ | Process intensity | fauts/second |
| LEC | Length of Elementary Cycle | seconds |
| LTM | Length of Trigger Message | seconds |
| LSW | Length of Synchronous Window | seconds |
| max_cycles | Maximum number of consecutive ECs with 1 error per EC | |
| max_1cycle | Maximum number of errors in one EC | |
| MT | Mission Time | hours/seconds |
| Acceptable failure probability in message | ||
| Probability of obtaining | ||
| RepLevel | Vector with replica number (function of number of errors) | |
| Ri | Worst case response time of message i | seconds or number of ECs |
| SW | Synchronous Window | |
| TM | Trigger Message | |
| Error server period | seconds | |
| WCRT | Worst Case Response Time | seconds |
Updated_SAE message set.
| ID | T/TLEC | DLC | ID | T/TLEC | DLC | ID | T/TLEC | DLC | ID | T/TLEC | DLC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 50/20 | 1 | 10 | 7.5/3 | 1 | 19 | 10/4 | 6 | 28 | 12.5/5 | 5 |
| 2 | 5/2 | 2 | 11 | 7.5/3 | 1 | 20 | 10/4 | 2 | 29 | 12.5/5 | 3 |
| 3 | 5/2 | 1 | 12 | 7.5/3 | 1 | 21 | 10/4 | 3 | 30 | 50/25 | 1 |
| 4 | 5/2 | 2 | 13 | 7.5/3 | 1 | 22 | 10/4 | 2 | 31 | 100/40 | 4 |
| 5 | 5/2 | 1 | 14 | 7.5/3 | 4 | 23 | 12.5/5 | 2 | 32 | 100/40 | 1 |
| 6 | 5/2 | 2 | 15 | 7.5/3 | 4 | 24 | 12.5/5 | 2 | 33 | 100/40 | 1 |
| 7 | 5/2 | 1 | 16 | 7.5/3 | 4 | 25 | 12.5/5 | 2 | 34 | 1000/400 | 3 |
| 8 | 5/2 | 1 | 17 | 10/4 | 1 | 26 | 12.5/5 | 2 | 35 | 1000/400 | 1 |
| 9 | 7.5/3 | 1 | 18 | 10/4 | 2 | 27 | 12.5/5 | 4 | 36 | 1000/400 | 1 |
FTT-CAN configuration parameters—LEC = 2.5 ms; bit-rate = 1000 Kbps, all deadlines equal to periods except D1 = 5 ms, D30 = 20 ms.
PSA message set.
| ID | T/TLEC | DLC | ID | T/TLEC | DLC | ID | T/TLEC | DLC | ID | T/TLEC | DLC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 10/2 | 3 | 7 | 20/4 | 3 | 13 | 50/10 | 8 | 19 | 100/20 | 7 |
| 2 | 10/2 | 5 | 8 | 20/4 | 4 | 14 | 50/10 | 8 | 20 | 100/20 | 7 |
| 3 | 10/2 | 5 | 9 | 20/4 | 5 | 15 | 50/10 | 8 | 21 | 150/30 | 2 |
| 4 | 10/2 | 8 | 10 | 40/8 | 5 | 16 | 100/20 | 1 | 22 | 150/30 | 4 |
| 5 | 15/3 | 2 | 11 | 50/10 | 5 | 17 | 100/20 | 6 | 23 | 200/40 | 4 |
| 6 | 15/3 | 4 | 12 | 50/10 | 5 | 18 | 100/20 | 7 |
FTT-CAN configuration parameters—LEC = 5 ms; bit-rate = 1000 kbps; all deadlines equal to periods.
VEIL message set.
| ID | T/TLEC | DLC | ID | T/TLEC | DLC | ID | T/TLEC | DLC | ID | T/TLEC | DLC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 10/2 | 2 | 6 | 20/4 | 4 | 11 | 100/20 | 8 | 16 | 1000/200 | 1 |
| 2 | 10/2 | 2 | 7 | 50/10 | 2 | 12 | 250/50 | 1 | 17 | 1000/200 | 2 |
| 3 | 10/2 | 4 | 8 | 50/10 | 3 | 13 | 500/100 | 2 | 18 | 1000/200 | 8 |
| 4 | 20/4 | 1 | 9 | 100/20 | 4 | 14 | 500/100 | 2 | 19 | 1000/200 | 8 |
| 5 | 20/4 | 2 | 10 | 100/20 | 8 | 15 | 1000/200 | 1 |
FTT-CAN configuration parameters—LEC = 5 ms; bit-rate = 1000 kbps; all deadlines equal to periods.