Where to place the stack and load the kernel

Tags:

I'm new to operating system development and I'm curious about a problem I've run into while developing my own bootloader. My operating system is going to be written in assembly and will run in 16-bit real mode.

I know what the stack is and I have the impression that it grows downwards into the memory. Correct me if I'm mistaken. I know how to load a basic kernel into the memory from a floppy disc and I don't believe that is the problem.

The problem I'm running into is that I'm unsure where to place the stack and load my kernel into memory. I've tried creating my stack like this and I'm running into problems:

mov ax, 0x0000
mov ss, ax
mov sp, 0xFFFF

I'm loading my kernel at 0x1000:0x0000. When I PUSH and later POP the volatile registers in my print function, my kernel just hangs the second time I do call print. This is my print function:

print:
    push ax
    push bx
    push cx

    mov al, [si]
    cmp al, 0
    je p_Done

    cmp al, 9
    je p_Tab

    mov ah, 0xE
    int 0x10

    cmp al, 10
    je p_NewLine
   p_Return:
    inc si
    jmp print
  p_Tab:
    xor cx, cx
   p_Tab_Repeat:
    cmp cx, 8
    je p_Return
    mov ah, 0xE
    mov al, " "
    int 0x10
    inc cx
    jmp p_Tab_Repeat
  p_NewLine:
    xor bx, bx
    mov ah, 0x3
    int 0x10

    mov dl, 0x00
    mov ah, 0x2
    int 0x10
    jmp p_Return
  p_Done:
    pop cx
    pop bx
    pop ax
    ret

These are the lines I want to display:

db "Kernel successfully loaded!", 10, 0
db 9, "Lmao, just a tab test!", 10, 0

This is the output I get when my kernel runs (the _ is the cursor):

Kernel successfully loaded!
_

It successfully prints the first line, but hangs while printing the second. If I remove the PUSH and POP statements it works just fine. Why does my kernel hang when I attempt save and restore registers in my print function? Where should I place my stack and where should I load my kernel?

468

asked Jul 05 '16 05:07

DrCereal

1 Answers

It doesn't help that this isn't a Minimal Complete Verifiable Example but your question suggests possible things to look for. Usually if code works by removing PUSHes and POPs in function prologue and epilogue, it usually means the stack was becoming unbalanced during the execution of the body of the function. An unbalanced stack will cause the RET instruction to return to whatever semi-random location is on the top of the stack. This will likely lead to apparent hangs and/or reboots. The behaviour will be undefined.

I haven't followed the logic in your code, but this stands out:

print:
    push ax
    push bx
    push cx

    ... snip out code for brevity  

    jmp print

At some point it is possible for your print function to be restarted at a point before all the pushes. This will cause more PUSHes onto the stack without corresponding POPs at the end. I think you might have been trying to get behavior like this:

print:
    push ax
    push bx
    push cx

.prloop:
    ... snip out code for brevity  

    jmp .prloop

The .prloop label appears at the top of the function but after the pushes. This prevents excess values being placed on the stack. .prloop can be any valid label of your choice.

The stack can be placed anywhere in memory that isn't being used by the system and doesn't interfere with your bootloader and/or kernel code. As @RossRidge points out, using an SP of 0xFFFF misaligns the stack because it is an odd address (0xFFFF=-1). The x86 won't complain (absent the Alignment Check flag) but it can hurt stack performance on some x86 architectures.

Note: setting SS:SP to 0x1000:0x0000 will cause the stack to run from 0x1000:0xFFFF down to 0x1000:0x0000. The first 16-bit value pushed will be at 0x1000:0xFFFE.

Your kernel and stack are generally safe anywhere between physical address 0x00520 and 0x90000 as long as they don't conflict with one another. On some systems the upper part of the memory region between 0x90000 and 0xA0000 may not be available. If you want to use this memory area I would avoid the area between 0x9C000 and 0xA0000. This area can be used by the BIOS as part of the Extended BIOS Data Area (EBDA).

The exact amount of usable Low Memory Area (LMA) space can be learned by calling the ROM-BIOS's interrupt 12h service, or directly reading the word at 0x00413. In either case, the result is the amount of KiB of usable memory. If there is less than 640 KiB of actual memory, and/or some of the memory at the LMA's top is used by the EBDA or other software, then the result will be lower than 640 (that is, 0x0280). Technically the result can be higher than 640 too. By multiplying or left-shifting the amount in KiB, the equivalent amount in paragraphs or bytes can be calculated.

The region between 0x00000 and 0x00520 should not be used as it contains the real mode interrupt vector table, the BIOS Data Area (BDA) and 32 bytes of memory that is considered to be reserved.

answered Nov 15 '22 09:11

Michael Petch

Related questions
                            
                                Can't clear entire screen in 16-bit real mode Assembly
                            
                                Adding two vector in assembly x86_64 with AVX2 plus technical clarifications
                            
                                In NASM labels next to each other in memory are printing both strings instead of first one
                            
                                Comma, colon, decorator or end of line expected after operand
                            
                                Unsigned int to unsigned long long well defined?
                            
                                Why do 32-bit applications work on 64-bit x86 CPUs?
                            
                                Error 13: Invalid or unsupported executable while booting simple kernel in grub with string literal
                            
                                Regarding cmp / jg, jle, etc in AT&T syntax assembly
                            
                                Why does ARM distinguish between SDIV and UDIV but not with ADD, SUB and MUL?
                            
                                How does assembler compute segment and offset for symbol addresses?
                            
                                Counting character frequencies in an array of characters - x86 Assembly
                            
                                NASM compiling x86_64 ASM label addresses off by 256 bytes in Mach-O when using multiple db declarations?
                            
                                masm error A2075: jump destination too far : by 30 bytes
                            
                                Can atomic instructions straddle cache lines?
                            
                                Shortest Intel x86-64 opcode for rax=1?
                            
                                Cannot move 8 bit address to 16 bit register
                            
                                How to truncate float values in XMM register
                            
                                Is the assembly language different from one architecture to another?
                            
                                How safe is this swap w/o pushing registers?
                            
                                CPU Registers and Multitasking

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Where to place the stack and load the kernel

Tags:

stack

assembly

x86-16

osdev

real-mode

DrCereal

People also ask

1 Answers

Michael Petch

Recent Activity

Donate For Us