Microsoft's Azure RTOS ThreadX for ARM11 Thumb & 32-bit Mode Using ARM Compiler 5 (AC5) 1. Building the ThreadX run-time Library First make sure you are in the "example_build" directory. Also, make sure that you have setup your path and other environment variables necessary for the ARM AC5 development environment. At this point you may run the build_threadx.bat batch file. This will build the ThreadX run-time environment in the "example_build" directory. You should observe assembly and compilation of a series of ThreadX source files. At the end of the batch file, they are all combined into the run-time library file: tx.a. This file must be linked with your application in order to use ThreadX. 2. Demonstration System Since there is no ARM11 FVP, there are no instructions here for running the demonstration; users are expected to run the demonstration on their platform of choice. Building the demonstration is easy; simply execute the build_threadx_sample.bat batch file while inside the "example_build" directory. You should observe the compilation of sample_threadx.c (which is the demonstration application) and linking with tx.a. The resulting file sample_threadx.axf is a binary file that can be downloaded and executed on the user's platform of choice. 3. System Initialization The entry point in ThreadX for the Cortex-A5 using AC5 tools is at label __main. This is defined within the AC5 compiler's startup code. In addition, this is where all static and global pre-set C variable initialization processing takes place. The ThreadX tx_initialize_low_level.s file is responsible for setting up various system data structures, the vector area, and a periodic timer interrupt source. By default, the vector area is defined to be located in the Init area, which is defined at the top of tx_initialize_low_level.s. This area is typically located at 0. In situations where this is impossible, the vectors at the beginning of the Init area should be copied to address 0. This is also where initialization of a periodic timer interrupt source should take place. In addition, _tx_initialize_low_level determines the first available address for use by the application, which is supplied as the sole input parameter to your application definition function, tx_application_define. 4. Assembler / Compiler Switches The following are compiler switches used in building the demonstration system: Compiler Switch Meaning -g Specifies debug information -c Specifies object code generation --cpu Cortex-A5 Specifies Cortex-A5 instruction set --apcs /interwork Specifies Thumb/32-bit compatibility Linker Switch Meaning -d Specifies to retain debug information in output file -o demo.axf Specifies demo output file name --elf Specifies elf output file format --ro Specifies that Read-Only memory starts at address 0 --first tx_initialize_low_level.o(Init) Specifies that the first area loaded is Init --remove Remove unused areas --list Specifies map file name --symbols Specifies symbols for map file --map Creates a map file Application Defines --PD "TX_ENABLE_FIQ_SUPPORT SETL {TRUE}" This assembler define enables FIQ interrupt handling support in the ThreadX assembly files. If used, it should be used on all assembly files and the generic C source of ThreadX should be compiled with TX_ENABLE_FIQ_SUPPORT defined as well. --PD "TX_ENABLE_IRQ_NESTING SETL {TRUE}" This assembler define enables IRQ nested support. If IRQ nested interrupt support is needed, this define should be applied to tx_initialize_low_level.s. --PD "TX_ENABLE_FIQ_NESTING SETL {TRUE}" This assembler define enables FIQ nested support. If FIQ nested interrupt support is needed, this define should be applied to tx_initialize_low_level.s. In addition, IRQ nesting should also be enabled. -DTX_ENABLE_FIQ_SUPPORT This compiler define enables FIQ interrupt handling in the ThreadX generic C source. This define should also be used in conjunction with the corresponding assembler define. -DTX_DISABLE_ERROR_CHECKING If defined before tx_api.h is included, this define causes basic ThreadX error checking to be disabled. Please see Chapter 2 in the "ThreadX User Guide" for more details. -DTX_MAX_PRIORITIES Defines the priority levels for ThreadX. Legal values range from 32 through 1024 (inclusive) and MUST be evenly divisible by 32. Increasing the number of priority levels supported increases the RAM usage by 128 bytes for every group of 32 priorities. However, there is only a negligible effect on performance. By default, this value is set to 32 priority levels. -DTX_MINIMUM_STACK Defines the minimum stack size (in bytes). It is used for error checking when threads are created. The default value is port-specific and is found in tx_port.h. -DTX_TIMER_THREAD_STACK_SIZE Defines the stack size (in bytes) of the internal ThreadX timer thread. This thread processes all thread sleep requests as well as all service call timeouts. In addition, all application timer callback routines are invoked from this context. The default value is port-specific and is found in tx_port.h. -DTX_TIMER_THREAD_PRIORITY Defines the priority of the internal ThreadX timer thread. The default value is priority 0 - the highest priority in ThreadX. The default value is defined in tx_port.h. -DTX_TIMER_PROCESS_IN_ISR Defined, this option eliminates the internal system timer thread for ThreadX. This results in improved performance on timer events and smaller RAM requirements because the timer stack and control block are no longer needed. However, using this option moves all the timer expiration processing to the timer ISR level. By default, this option is not defined. -DTX_REACTIVATE_INLINE Defined, this option performs reactivation of ThreadX timers in-line instead of using a function call. This improves performance but slightly increases code size. By default, this option is not defined. -DTX_DISABLE_STACK_FILLING Defined, placing the 0xEF value in each byte of each thread's stack is disabled. By default, this option is not defined. -DTX_ENABLE_STACK_CHECKING Defined, this option enables ThreadX run-time stack checking, which includes analysis of how much stack has been used and examination of data pattern "fences" before and after the stack area. If a stack error is detected, the registered application stack error handler is called. This option does result in slightly increased overhead and code size. Please review the tx_thread_stack_error_notify API for more information. By default, this option is not defined. -DTX_DISABLE_PREEMPTION_THRESHOLD Defined, this option disables the preemption-threshold feature and slightly reduces code size and improves performance. Of course, the preemption-threshold capabilities are no longer available. By default, this option is not defined. -DTX_DISABLE_REDUNDANT_CLEARING Defined, this option removes the logic for initializing ThreadX global C data structures to zero. This should only be used if the compiler's initialization code sets all un-initialized C global data to zero. Using this option slightly reduces code size and improves performance during initialization. By default, this option is not defined. -DTX_DISABLE_NOTIFY_CALLBACKS Defined, this option disables the notify callbacks for various ThreadX objects. Using this option slightly reduces code size and improves performance. -DTX_BLOCK_POOL_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on block pools. By default, this option is not defined. -DTX_BYTE_POOL_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on byte pools. By default, this option is not defined. -DTX_EVENT_FLAGS_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on event flags groups. By default, this option is not defined. -DTX_MUTEX_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on mutexes. By default, this option is not defined. -DTX_QUEUE_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on queues. By default, this option is not defined. -DTX_SEMAPHORE_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on semaphores. By default, this option is not defined. -DTX_THREAD_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on threads. By default, this option is not defined. -DTX_TIMER_ENABLE_PERFORMANCE_INFO Defined, this option enables the gathering of performance information on timers. By default, this option is not defined. -DTX_ENABLE_EVENT_TRACE Defined, this option enables the internal ThreadX trace feature. The trace buffer is supplied at a later time via an application call to tx_trace_enable. -DTX_TRACE_TIME_SOURCE This defines the time-stamp source for event tracing. This define is only pertinent if the ThreadX library is built with TX_ENABLE_EVENT_TRACE defined. -DTX_TRACE_TIME_MASK This defines the number of valid bits in the event trace time-stamp source defined previously. If the time-stamp source is 16-bits, this value should be 0xFFFF. Alternatively, if the time-stamp source is 32-bits, this value should be 0xFFFFFFFF. This define is only pertinent if the ThreadX library is built with TX_ENABLE_EVENT_TRACE defined. 5. Register Usage and Stack Frames The AC5 compiler assumes that registers r0-r3 (a1-a4) and r12 (ip) are scratch registers for each function. All other registers used by a C function must be preserved by the function. ThreadX takes advantage of this in situations where a context switch happens as a result of making a ThreadX service call (which is itself a C function). In such cases, the saved context of a thread is only the non-scratch registers. The following defines the saved context stack frames for context switches that occur as a result of interrupt handling or from thread-level API calls. All suspended threads have one of these two types of stack frames. The top of the suspended thread's stack is pointed to by tx_thread_stack_ptr in the associated thread control block TX_THREAD. Offset Interrupted Stack Frame Non-Interrupt Stack Frame 0x00 1 0 0x04 CPSR CPSR 0x08 r0 (a1) r4 (v1) 0x0C r1 (a2) r5 (v2) 0x10 r2 (a3) r6 (v3) 0x14 r3 (a4) r7 (v4) 0x18 r4 (v1) r8 (v5) 0x1C r5 (v2) r9 (v6) 0x20 r6 (v3) r10 (v7) 0x24 r7 (v4) r11 (fp) 0x28 r8 (v5) r14 (lr) 0x2C r9 (v6) 0x30 r10 (v7) 0x34 r11 (fp) 0x38 r12 (ip) 0x3C r14 (lr) 0x40 PC 6. Improving Performance The distribution version of ThreadX is built without any compiler optimizations. This makes it easy to debug because you can trace or set breakpoints inside of ThreadX itself. Of course, this costs some performance. To make it run faster, you can change the build_threadx.bat file to remove the -g option and enable all compiler optimizations. In addition, you can eliminate the ThreadX basic API error checking by compiling your application code with the symbol TX_DISABLE_ERROR_CHECKING defined. 7. Interrupt Handling ThreadX provides complete and high-performance interrupt handling for Cortex-A5 targets. There are a certain set of requirements that are defined in the following sub-sections: 7.1 Vector Area The Cortex-A5 vectors start at address zero. The demonstration system startup Init area contains the vectors and is loaded at address zero. On actual hardware platforms, this area might have to be copied to address 0. 7.2 IRQ ISRs ThreadX fully manages standard and vectored IRQ interrupts. ThreadX also supports nested IRQ interrupts. The following sub-sections define the IRQ capabilities. 7.2.1 Standard IRQ ISRs The standard ARM IRQ mechanism has a single interrupt vector at address 0x18. This IRQ interrupt is managed by the __tx_irq_handler code in tx_initialize_low_level. The following is the default IRQ handler defined in tx_initialize_low_level.s: EXPORT __tx_irq_handler EXPORT __tx_irq_processing_return __tx_irq_handler ; ; /* Jump to context save to save system context. */ B _tx_thread_context_save ; Jump to the context save __tx_irq_processing_return ; ; /* At this point execution is still in the IRQ mode. The CPSR, point of ; interrupt, and all C scratch registers are available for use. Note ; that IRQ interrupts are still disabled upon return from the context ; save function. */ ; ; /* Application ISR call(s) go here! */ ; ; /* Jump to context restore to restore system context. */ B _tx_thread_context_restore 7.2.2 Vectored IRQ ISRs The vectored ARM IRQ mechanism has multiple interrupt vectors at addresses specified by the particular implementation. The following is an example IRQ handler defined in tx_initialize_low_level.s: EXPORT __tx_irq_example_handler __tx_irq_example_handler ; ; /* Call context save to save system context. */ STMDB sp!, {r0-r3} ; Save some scratch registers MRS r0, SPSR ; Pickup saved SPSR SUB lr, lr, #4 ; Adjust point of interrupt STMDB sp!, {r0, r10, r12, lr} ; Store other scratch registers BL _tx_thread_vectored_context_save ; Call the vectored IRQ context save ; ; /* At this point execution is still in the IRQ mode. The CPSR, point of ; interrupt, and all C scratch registers are available for use. Note ; that IRQ interrupts are still disabled upon return from the context ; save function. */ ; ; /* Application ISR call goes here! */ ; ; /* Jump to context restore to restore system context. */ B _tx_thread_context_restore 7.2.3 Nested IRQ Support By default, nested IRQ interrupt support is not enabled. To enable nested IRQ support, the entire library should be built with TX_ENABLE_IRQ_NESTING defined. With this defined, two new IRQ interrupt management services are available, namely _tx_thread_irq_nesting_start and _tx_thread_irq_nesting_end. These function should be called between the IRQ context save and restore calls. Execution between the calls to _tx_thread_irq_nesting_start and _tx_thread_irq_nesting_end is enabled for IRQ nesting. This is achieved by switching from IRQ mode to SYS mode and enabling IRQ interrupts. The SYS mode stack is used during the SYS mode operation, which was setup in tx_initialize_low_level.s. When nested IRQ interrupts are no longer required, calling the _tx_thread_irq_nesting_end service disables nesting by disabling IRQ interrupts and switching back to IRQ mode in preparation for the IRQ context restore service. The following is an example of enabling IRQ nested interrupts in a standard IRQ handler: EXPORT __tx_irq_handler EXPORT __tx_irq_processing_return __tx_irq_handler ; ; /* Jump to context save to save system context. */ B _tx_thread_context_save __tx_irq_processing_return ; ; /* Enable nested IRQ interrupts. NOTE: Since this service returns ; with IRQ interrupts enabled, all IRQ interrupt sources must be ; cleared prior to calling this service. */ BL _tx_thread_irq_nesting_start ; ; /* Application ISR call(s) go here! */ ; ; /* Disable nested IRQ interrupts. The mode is switched back to ; IRQ mode and IRQ interrupts are disable upon return. */ BL _tx_thread_irq_nesting_end ; ; /* Jump to context restore to restore system context. */ B _tx_thread_context_restore 7.3 FIQ Interrupts By default, Cortex-A5 FIQ interrupts are left alone by ThreadX. Of course, this means that the application is fully responsible for enabling the FIQ interrupt and saving/restoring any registers used in the FIQ ISR processing. To globally enable FIQ interrupts, the application should enable FIQ interrupts at the beginning of each thread or before any threads are created in tx_application_define. In addition, the application must ensure that no ThreadX service calls are made from default FIQ ISRs, which is located in tx_initialize_low_level.s. 7.3.1 Managed FIQ Interrupts Full ThreadX management of FIQ interrupts is provided if the ThreadX sources are built with the TX_ENABLE_FIQ_SUPPORT defined. If the library is built this way, the FIQ interrupt handlers are very similar to the IRQ interrupt handlers defined previously. The following is default FIQ handler defined in tx_initialize_low_level.s: EXPORT __tx_fiq_handler EXPORT __tx_fiq_processing_return __tx_fiq_handler ; ; /* Jump to fiq context save to save system context. */ B _tx_thread_fiq_context_save __tx_fiq_processing_return: ; ; /* At this point execution is still in the FIQ mode. The CPSR, point of ; interrupt, and all C scratch registers are available for use. */ ; ; /* Application FIQ handlers can be called here! */ ; ; /* Jump to fiq context restore to restore system context. */ B _tx_thread_fiq_context_restore 7.3.1.1 Nested FIQ Support By default, nested FIQ interrupt support is not enabled. To enable nested FIQ support, the entire library should be built with TX_ENABLE_FIQ_NESTING defined. With this defined, two new FIQ interrupt management services are available, namely _tx_thread_fiq_nesting_start and _tx_thread_fiq_nesting_end. These function should be called between the FIQ context save and restore calls. Execution between the calls to _tx_thread_fiq_nesting_start and _tx_thread_fiq_nesting_end is enabled for FIQ nesting. This is achieved by switching from FIQ mode to SYS mode and enabling FIQ interrupts. The SYS mode stack is used during the SYS mode operation, which was setup in tx_initialize_low_level.s. When nested FIQ interrupts are no longer required, calling the _tx_thread_fiq_nesting_end service disables nesting by disabling FIQ interrupts and switching back to FIQ mode in preparation for the FIQ context restore service. The following is an example of enabling FIQ nested interrupts in the typical FIQ handler: EXPORT __tx_fiq_handler EXPORT __tx_fiq_processing_return __tx_fiq_handler ; ; /* Jump to fiq context save to save system context. */ B _tx_thread_fiq_context_save __tx_fiq_processing_return ; ; /* At this point execution is still in the FIQ mode. The CPSR, point of ; interrupt, and all C scratch registers are available for use. */ ; ; /* Enable nested FIQ interrupts. NOTE: Since this service returns ; with FIQ interrupts enabled, all FIQ interrupt sources must be ; cleared prior to calling this service. */ BL _tx_thread_fiq_nesting_start ; ; /* Application FIQ handlers can be called here! */ ; ; /* Disable nested FIQ interrupts. The mode is switched back to ; FIQ mode and FIQ interrupts are disable upon return. */ BL _tx_thread_fiq_nesting_end ; ; /* Jump to fiq context restore to restore system context. */ B _tx_thread_fiq_context_restore 8. ThreadX Timer Interrupt ThreadX requires a periodic interrupt source to manage all time-slicing, thread sleeps, timeouts, and application timers. Without such a timer interrupt source, these services are not functional. However, all other ThreadX services are operational without a periodic timer source. To add the timer interrupt processing, simply make a call to _tx_timer_interrupt in the IRQ processing. An example of this can be found in the file tx_initialize_low_level.s in the Integrator sub-directories. 9. Thumb/Cortex-A5 Mixed Mode By default, ThreadX is setup for running in Cortex-A5 32-bit mode. This is also true for the demonstration system. It is possible to build any ThreadX file and/or the application in Thumb mode. If any Thumb code is used the entire ThreadX source- both C and assembly - should be built with the "-apcs /interwork" option. 11. VFP Support By default, VFP support is disabled for each thread. If saving the context of the VFP registers is needed, the following API call must be made from the context of the application thread - before the VFP usage: void tx_thread_vfp_enable(void); After this API is called in the application, VFP registers will be saved/restored for this thread if it is preempted via an interrupt. All other suspension of the this thread will not require the VFP registers to be saved/restored. To disable VFP register context saving, simply call the following API: void tx_thread_vfp_disable(void); 12. Revision History For generic code revision information, please refer to the readme_threadx_generic.txt file, which is included in your distribution. The following details the revision information associated with this specific port of ThreadX: 04-02-2021 Release 6.1.6 changes: tx_port.h Updated macro definition 09-30-2020 Initial ThreadX 6.1 version for Cortex-A5 using AC5 tools. Copyright(c) 1996-2020 Microsoft Corporation https://azure.com/rtos