Hi @marcos. With NOR you should be fine. Use a good stackup to minimise crosstalk between tracks and meet impedance. It may not be necessary, but you may want to do some kind of length matching, e.g. you may want to make data signals similar lengt, strobe signals a little bit longer then the longest data. Be sure, the data are correctly routed also on the board where this flash card is inserted. Be sure, the delay in address decoder will not influence your timing.
If you attached some screenshots from your simulations and what is wrong, that would help.
Impedance and track width will depend on your stackup. Be sure you get the track geometry from your PCB manufacturer or you can re-use some of our stackups:
Download PCB Stackups – Free for your ProjectsFor more chips used in parallel I would go for Balanced T-Branch topology:
...............................--------CHIP 1
.......................------<
......................| .......--------CHIP 2
Connector -----<
......................| .......--------CHIP 3
.......................------<
...............................--------CHIP 4
I normally do not simulate this kind of connections, but if you are not sure, you can try at least reflection and crosstalk. Just send a pulse from the source (output pin) and see how it looks in destination (input pin(s)). Do it for different scenarios - e.g. read, write - when in one situation a pin can be output and in different situation the pin can be input.
BTW: I do not simulate in Altium as I am not convinced about results from the Altium simulator, but maybe I will see something unusual.