I found this document somewhere on a gopher archive in an Apple2's programming directory and I think this could be very useful information for example for emulator writers :)
2-Nov-1994 Ivo van Poorten <ipoorten=www@cs.vu.nl>
With all the books on 6502 programming, and all the years that the 6502 has been around and popular, you'd think that any quirks would have been well documented by now. But nooooo! So... for those of you involved in assembler or machine-language programming, here are some 6502 booby-traps -- those marked with * are supposedly fixed in the CMOS parts such as 65C02 and 65C102 (but not the C-64's 6510).
- Return address pushed on the stack by JSR is one less than actual next instruction. RTS increments PC after popping. RTI doesn't.
- The status bits pushed on the stack by PHP have the breakpoint bit set.
- The D (decimal mode) flag is not defined after RESET.
- The D (decimal mode) flag is not cleared by interrupts.
- The ADC and SBC instructions don't set N,V, and Z status bits if decimal mode is on. C status is set correctly.
- An indirect JMP (xxFF) will fail because the MSB will be fetched from address xx00 instead of page xx+1.
- If an interrupt occurs on a BRK instruction, the breakpoint is ignored.
- The ROR instruction didn't exist in the very earliest (pre-'77) chips.
- Undefined op-codes do strange things, some lock up the CPU.
- When adding a carry to the MSB of an address, a fetch occurs at a garbage address. The CMOS chips refetch the last byte of the instruction.
- When doing a fetch-modify-store instruction (INC, DEC, ASL, LSR, ROL, ROR) garbage is stored into the location during the "modify" cycle... followed by the "real" store cycle which stores the correct data. The CMOS chips do a second fetch instead of a garbage store.
These aren't really "bugs", but there are some instructions that just seem like they ought to be there, but aren't:
- INC A and DEC A.
- BRA relative. Unconditional branch.
- BIT #immediate. The CMOS chips also have BIT absolute,X and BIT zeropage,X.
- STX absolute,Y and STY absolute,X. How come LDX and LDY can use these addressing modes, but the matching store instructions can't?
- SEV. Set overflow bit. Probably not really useful, but you can set and clear all of the other status bits, and there *is* a CLV.
- Personally, I'd also like to have "Clear A" and "Test A" instructions. These would be twice as fast and half as big as LDA #0 and CMP #0. Ironically, the CMOS chips have an STZ (Store Zero) instruction which can clear a memory location, but still can't clear the Accumulator.
- I'd also like to have BSR relative. A subroutine call that's relocatable.
- Fixed in the CMOS versions (65C02, 65C102, etc.)