PhD position: Time-guaranteed and reliable execution for safety-critical multicore systems
Keywords: Real-time systems, Safety, Fault-tolerance, WCET
Starting date: September/October 2021
- Apply As soon As Possible
- Send an email to firstname.lastname@example.org
The safety-critical embedded industries, such as avionics, automobile, robotics and health-care, require guarantees for hard real-time and correct application execution. As applications become more complex, their computational demands scale rapidly, requiring architectures with multiple processing elements. Although multicore architectures can effectively satisfy the needs of best-effort systems, the same cannot be stated for critical embedded systems due to hard-to-predict timing behaviour and increased fault susceptibility .
Hard-to-predict timing behaviour originates from the complex nature of modern systems. Not only application complexity, but also hardware complexity has been increased. Modern architectures are enhanced with dynamic hardware components with variable timing behaviour, while parallel execution of applications on the same platform leads to concurrent accesses to shared resources affecting applications’ timing behaviour. To provide hard real-time guarantees, safe, but pessimistic, Worst-Case Execution Time (WCET) estimations have to be employed during system design . Increased fault susceptibility stems from the nature of electronic systems. Reliability threats, such as manufacturing process variation, aging and soft errors, depend on transistors size and are expected to significantly increase with transistors shrinking . Due to this unreliable nature of electronic systems, the susceptibility of multicore architectures towards reliability threats is inevitable . However, the majority of existing WCET estimation approaches is fault-unaware; the hardware of the target platform is assumed to be fault-free . As reliability issues become imminent due to technology scaling, such fault-unaware approaches become unsafe. With the technology size reduction, faults in combinational logic and smaller sequential logic of cores cannot be considered negligible anymore .
The goal of this PhD thesis is to provide the means to analyse both functional and timing behaviour of applications, perform fault-aware WCET estimation and design cores with timing guarantees and reliable execution. This will be achieved through novel approaches considering both reliability and WCET aspects. More precisely, a realistic and accurate functional and timing architectural vulnerability analysis will be proposed. This framework will be extended with probabilistic/hybrid WCET estimation techniques to provide fault-aware WCET estimations. Low-level fault-tolerant mechanisms will be designed to mitigate the most important impact of faults. The thesis will be based on open-source cores, e.g., RISC-V .
- Good knowledge of computer architecture, embedded systems and real-time systems
- Mathematical methods and probabilities
- Programming experience in C/C++ and python
- Familiarity with FPGA design and/or High-Level Synthesis.
- Mostly importantly, we seek highly motivated people.
A master in Computer Science, Computer Engineering, or Electrical Engineering is required. An equivalent engineering degree (5th year) enabling to start doctoral studies is also accepted.
- Your CV along with your Bachelor/Master transcripts
- A motivational letter
- Reference letters
- Any additional documents/links that you think can show your experience (reports, notes, papers, github repositories…)
 S. Saidi, R. Ernst, S. Uhrig, H. Theiling, B. Dinechin, The Shift to Multicores in Real-Time and Safety-Critical Systems, in CODES + ISSS, p. 220-229, October 2015.
 C. Maiza. H. Rihani, J. Rivas, J. Goossens, S. Altmeyer, R. Davis, A Survey of Timing Verification Techniques for Multi-Core Real-Time Systems, ACM CS, 52(3):56:1-56:38, June 2019.
 P. Shivakumar, M. Kistler, S. Keckler, D. Burger, L. Alvisi, Modelling the effect of technology trends on soft error rate of combinational logic, in DSN, p. 389-398, June 2002
 S. Rehman, M. Shafique, J. Henkel, Reliable Software for Unreliable Hardware: A Cross Layer Perspective, Springer, 2016.
 A. Lofwenmark, S. Nadjm-Tehrani, Fault and timing analysis in critical multi-core systems: A survey with an avionics perspective, JSA, vol. 87, p. 1-11, 2018
 N. Mahatme S. Jagannathan, T. Loveless, L. Massengill, B. Bhuva, S. Wen, R. Wong, Comparison of Combinational and Sequential Error Rates for a Deep Submicron Process, IEEE NS, vol. 58, p. 2719–2725, Dec. 2011
 S. Rokicki, D. Pala, J. Paturel, and O. Sentieys. What You Simulate Is What You Synthesize: Designing a Processor Core from C++ Specifications, in International Conference on Computer-Aided Design (ICCAD), pages 1–8, Nov. 2019.
Your application has been successfully submitted.