__bit variables must be located after the 0x20 address This force the stack to start after 0x20, thus decrease the stack size by a considerable amount.