Hello and welcome to Part 14 of my Beginning Logic Design series! In the last episode, I added my ALU operations. For this round, I want to add implement some operators for manipulating a stack and some handling for calling subroutines. Let’s jump to it!

My Stack System

The stack pointer of my cpu will keep track of the “top” of the stack. Most CPUs have a stack that grows “down”, but my CPU already has a lot of inefficiencies and I’m feeling rebellious so my stack will grow up! I current reset the stack to 0 on reset, so at the start of a program it should be ready to go.

I’ll use the first few available opcodes from my EXTRA operation family for my stack related functions.

F0: push A
F1: push B
F2: push C
F3: pop A
F4: pop B
F5: pop C

F0: push A
F1: push B
F2: push C
F3: pop A
F4: pop B
F5: pop C

F0: push A
F1: push B
F2: push C
F3: pop A
F4: pop B
F5: pop C

As before I’ll start by roughly mocking out this organization in my PERFORM state

EXTRA: begin
  case (instruction[3:0])
    // Push A
    0: begin
      
    end
    // Push B
    1: begin
      
    end
    // Push C
    2: begin
      
    end
    // Pop A
    3: begin
      
    end
    // Pop B
    4: begin
      
    end
    // Pop C
    5: begin
      
    end
  endcase
end

EXTRA: begin
case (instruction[3:0])
// Push A
0: begin
end
// Push B
1: begin
end
// Push C
2: begin
end
// Pop A
3: begin
end
// Pop B
4: begin
end
// Pop C
5: begin
end
endcase
end

EXTRA: begin
  case (instruction[3:0])
    // Push A
    0: begin
      
    end
    // Push B
    1: begin
      
    end
    // Push C
    2: begin
      
    end
    // Pop A
    3: begin
      
    end
    // Pop B
    4: begin
      
    end
    // Pop C
    5: begin
      
    end
  endcase
end

Now I’ll start on the PUSH A operations. I’ll need to write A to the memory address my stack pointer is currently set to, then increment the stack pointer. Since this involves some bus interactions it’ll take two cycles.

On the first I’ll put the A register value in the write_data register, set the address_bus to my stack pointer and enable write.

For the second cycle, I’ll clear my write signal, increment my stack and return to FETCH to continue my program, easy as that!

0: begin
  case (cycle)
    0: begin
      write_data <= a;
      address_bus <= stack;
      write <= 1;
    end
    1: begin
      write <= 0;
      stack++;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

0: begin
case (cycle)
0: begin
write_data <= a;
address_bus <= stack;
write <= 1;
end
1: begin
write <= 0;
stack++;
state <= FETCH;
program_counter++;
end
endcase
end

0: begin
  case (cycle)
    0: begin
      write_data <= a;
      address_bus <= stack;
      write <= 1;
    end
    1: begin
      write <= 0;
      stack++;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

And by the magic of copy-pasta, I extend this to my other two registers.

// Push B
1: begin
  case (cycle)
    0: begin
      write_data <= b;
      address_bus <= stack;
      write <= 1;
    end
    1: begin
      write <= 0;
      stack++;
      state <= FETCH;
      program_counter++;
    end
  endcase
end
// Push C
2: begin
  case (cycle)
    0: begin
      write_data <= c;
      address_bus <= stack;
      write <= 1;
    end
    1: begin
      write <= 0;
      stack++;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

// Push B
1: begin
case (cycle)
0: begin
write_data <= b;
address_bus <= stack;
write <= 1;
end
1: begin
write <= 0;
stack++;
state <= FETCH;
program_counter++;
end
endcase
end
// Push C
2: begin
case (cycle)
0: begin
write_data <= c;
address_bus <= stack;
write <= 1;
end
1: begin
write <= 0;
stack++;
state <= FETCH;
program_counter++;
end
endcase
end

// Push B
1: begin
  case (cycle)
    0: begin
      write_data <= b;
      address_bus <= stack;
      write <= 1;
    end
    1: begin
      write <= 0;
      stack++;
      state <= FETCH;
      program_counter++;
    end
  endcase
end
// Push C
2: begin
  case (cycle)
    0: begin
      write_data <= c;
      address_bus <= stack;
      write <= 1;
    end
    1: begin
      write <= 0;
      stack++;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

Now for the inverse operation POP. This means performing a read with the decremented stack pointer and storing that into the desired register, which will also be two cycles. On the first I’ll predecrement stack as I set the address_bus to it. On the second I’ll clear my read, store the returned value and go back into FETCH.

// Pop A
3: begin
  case (cycle)
    0: begin
      address_bus <= --stack;
      read <= 1;
    end
    1: begin
      read <= 0;
      a <= data_bus;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

// Pop A
3: begin
case (cycle)
0: begin
address_bus <= --stack;
read <= 1;
end
1: begin
read <= 0;
a <= data_bus;
state <= FETCH;
program_counter++;
end
endcase
end

// Pop A
3: begin
  case (cycle)
    0: begin
      address_bus <= --stack;
      read <= 1;
    end
    1: begin
      read <= 0;
      a <= data_bus;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

I honestly didn’t think implementing push and pop would be quite so easy, everything was working well on the first attempt. As before I’ll copy my way through to implement this for B and C.

// Pop B
4: begin
  case (cycle)
    0: begin
      address_bus <= --stack;
      read <= 1;
    end
    1: begin
      read <= 0;
      b <= data_bus;
      state <= FETCH;
      program_counter++;
    end
  endcase
end
// Pop C
5: begin
  case (cycle)
    0: begin
      address_bus <= --stack;
      read <= 1;
    end
    1: begin
      read <= 0;
      c <= data_bus;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

// Pop B
4: begin
case (cycle)
0: begin
address_bus <= --stack;
read <= 1;
end
1: begin
read <= 0;
b <= data_bus;
state <= FETCH;
program_counter++;
end
endcase
end
// Pop C
5: begin
case (cycle)
0: begin
address_bus <= --stack;
read <= 1;
end
1: begin
read <= 0;
c <= data_bus;
state <= FETCH;
program_counter++;
end
endcase
end

// Pop B
4: begin
  case (cycle)
    0: begin
      address_bus <= --stack;
      read <= 1;
    end
    1: begin
      read <= 0;
      b <= data_bus;
      state <= FETCH;
      program_counter++;
    end
  endcase
end
// Pop C
5: begin
  case (cycle)
    0: begin
      address_bus <= --stack;
      read <= 1;
    end
    1: begin
      read <= 0;
      c <= data_bus;
      state <= FETCH;
      program_counter++;
    end
  endcase
end

Subroutines

The next two instructions I want to implement are an operation that jumps into a subroutine and a paired operator that returns from that subroutine. I’ll try to keep these operations pretty simple. I’ll first stub out my opcodes.

// Jump subroutine
6: begin
  case (cycle)
    
  endcase
end
// Return from subroutine
7: begin
  case (cycle)
    
  endcase
end

// Jump subroutine
6: begin
case (cycle)
endcase
end
// Return from subroutine
7: begin
case (cycle)
endcase
end

// Jump subroutine
6: begin
  case (cycle)
    
  endcase
end
// Return from subroutine
7: begin
  case (cycle)
    
  endcase
end

For my JSR operation (jump to subroutine), I’ll first push my next instruction address to the top of my stack, then jump the program to the next address. This will take 4 total bus interactions so my current 2-bit cycle variable will not allow for this, I’ll modify my cycle to 3-bits so it can count to 8 and start implementing.

Pretty quickly intro drafting my implementation of this, and right after gloating how easy push/pop was to implement, I noticed this one was going to be a bit trickier! The first thing I need to do is calculate the address of the next instruction and push the most significant byte to the stack.

0: begin
  write <= 1;
  address_bus <= stack;
  program_counter += 3;
  write_data <= program_counter[15:8];
end

0: begin
write <= 1;
address_bus <= stack;
program_counter += 3;
write_data <= program_counter[15:8];
end

0: begin
  write <= 1;
  address_bus <= stack;
  program_counter += 3;
  write_data <= program_counter[15:8];
end

On the next cycle, I complete the return address right by setting the next stack byte to the least significant byte.

1: begin
  address_bus <= stack + 1;
  write_data <= program_counter[7:0];
end

1: begin
address_bus <= stack + 1;
write_data <= program_counter[7:0];
end

1: begin
  address_bus <= stack + 1;
  write_data <= program_counter[7:0];
end

With the pointer written to the stack, I’ll begin reading the next pointer to jump to and increment my stack by the length of the pointer (2 bytes). Since my program counter is now ahead of the pointer to jump to, I need to look back 2 bytes for the most significant byte of the subroutine’s address.

2: begin
  write <= 0;
  read <= 1;
  address_bus <= program_counter - 2;
  stack += 2;
end

2: begin
write <= 0;
read <= 1;
address_bus <= program_counter - 2;
stack += 2;
end

2: begin
  write <= 0;
  read <= 1;
  address_bus <= program_counter - 2;
  stack += 2;
end

I’ll store the returned most signifcant byte for the subroutine in my x register and request the next byte.

3: begin
  x <= data_bus;
  address_bus <= program_counter - 1;
end

3: begin
x <= data_bus;
address_bus <= program_counter - 1;
end

3: begin
  x <= data_bus;
  address_bus <= program_counter - 1;
end

Then finally I’ll be done with the bus and can jump into the subroutine.

4: begin
  read <= 0;
  program_counter <= {x, data_bus};
  state <= FETCH;
end

4: begin
read <= 0;
program_counter <= {x, data_bus};
state <= FETCH;
end

4: begin
  read <= 0;
  program_counter <= {x, data_bus};
  state <= FETCH;
end

Phew! I had a few issues with implementing this at first, primarily from not managing my pointers properly. With time, patience and debugging in the simulator it did eventually work out.

The ReTurn from Subroutine (RTS) thankfully is a bit easier, and will only take three cycles. First I’ll begin the read for the least significant byte of where to jump back to.

0: begin
  read <= 1;
  address_bus <= --stack;
end

0: begin
read <= 1;
address_bus <= --stack;
end

0: begin
  read <= 1;
  address_bus <= --stack;
end

On the second cycle, I’ll store that byte in x and read the most significant byte of the return pointer.

1: begin
  address_bus <= --stack;
  x <= data_bus;
end

1: begin
address_bus <= --stack;
x <= data_bus;
end

1: begin
  address_bus <= --stack;
  x <= data_bus;
end

On the last cycle we can stop the read and jump to the return pointer!

2: begin
  read <= 0;
  program_counter <= {data_bus, x};
  state <= FETCH;
end

2: begin
read <= 0;
program_counter <= {data_bus, x};
state <= FETCH;
end

2: begin
  read <= 0;
  program_counter <= {data_bus, x};
  state <= FETCH;
end

That’ll do it! I’ll use this program to test it, annotated with addresses and comments for brevity:

8000: c0 de     ; Set A = 0xDE
8002: f0        ; Push A to stack
8003: f6 80 07  ; Jump into subroutine at 0x8007
8006: e0        ; Halt machine
8007: c1 20     ; Set B = 0x20
8009: c2 17     ; Set C = 0x12
800b: f7        ; Return

8000: c0 de ; Set A = 0xDE
8002: f0 ; Push A to stack
8003: f6 80 07 ; Jump into subroutine at 0x8007
8006: e0 ; Halt machine
8007: c1 20 ; Set B = 0x20
8009: c2 17 ; Set C = 0x12
800b: f7 ; Return

8000: c0 de     ; Set A = 0xDE
8002: f0        ; Push A to stack
8003: f6 80 07  ; Jump into subroutine at 0x8007
8006: e0        ; Halt machine
8007: c1 20     ; Set B = 0x20
8009: c2 17     ; Set C = 0x12
800b: f7        ; Return

In simulation it works like a charm!

With that working I am done with the initial set of goals I had for this CPU, and this series along with that! I hope some folks have found this series interesting and/or useful. If you have any improvements to suggest or would like me to cover the implementation of any of this in further detail please leave a note in the comments. Keep tinkering!!

This is a cached copy of a post I have not migrated to the new design. Some links may no longer work.

Beginning Logic Design – Part 14

My Stack System

Subroutines

Leave a Reply Cancel reply