2005-04-16 15:20:36 -07:00
/ * $ Id : VISsave. S ,v 1 . 6 2 0 0 2 / 0 2 / 0 9 1 9 : 4 9 : 3 0 d a v e m E x p $
* VISsave. S : C o d e f o r s a v i n g F P U r e g i s t e r s t a t e f o r
* VIS r o u t i n e s . O n e s h o u l d n o t c a l l t h i s d i r e c t l y ,
* but u s e m a c r o s p r o v i d e d i n < a s m / v i s a s m . h > .
*
* Copyright ( C ) 1 9 9 8 J a k u b J e l i n e k ( j j @ultra.linux.cz)
* /
# include < a s m / a s i . h >
# include < a s m / p a g e . h >
# include < a s m / p t r a c e . h >
# include < a s m / v i s a s m . h >
# include < a s m / t h r e a d _ i n f o . h >
.text
.globl VISenter, V I S e n t e r h a l f
/* On entry: %o5=current FPRS value, %g7 is callers address */
/* May clobber %o5, %g1, %g2, %g3, %g7, %icc, %xcc */
/ * Nothing s p e c i a l n e e d b e d o n e h e r e t o h a n d l e p r e - e m p t i o n , t h i s
* FPU s a v e / r e s t o r e m e c h a n i s m i s a l r e a d y p r e e m p t i o n s a f e .
* /
.align 32
VISenter :
ldub [ % g 6 + T I _ F P D E P T H ] , % g 1
brnz,a ,p n % g 1 , 1 f
cmp % g 1 , 1
stb % g 0 , [ % g 6 + T I _ F P S A V E D ]
stx % f s r , [ % g 6 + T I _ X F S R ]
9 : jmpl % g 7 + % g 0 , % g 0
nop
1 : bne,p n % i c c , 2 f
srl % g 1 , 1 , % g 1
vis1 : ldub [ % g 6 + T I _ F P S A V E D ] , % g 3
stx % f s r , [ % g 6 + T I _ X F S R ]
or % g 3 , % o 5 , % g 3
stb % g 3 , [ % g 6 + T I _ F P S A V E D ]
rd % g s r , % g 3
clr % g 1
ba,p t % x c c , 3 f
stx % g 3 , [ % g 6 + T I _ G S R ]
2 : add % g 6 , % g 1 , % g 3
cmp % o 5 , F P R S _ D U
be,p n % i c c , 6 f
sll % g 1 , 3 , % g 1
stb % o 5 , [ % g 3 + T I _ F P S A V E D ]
rd % g s r , % g 2
add % g 6 , % g 1 , % g 3
stx % g 2 , [ % g 3 + T I _ G S R ]
add % g 6 , % g 1 , % g 2
stx % f s r , [ % g 2 + T I _ X F S R ]
sll % g 1 , 5 , % g 1
3 : andcc % o 5 , F P R S _ D L | F P R S _ D U , % g 0
be,p n % i c c , 9 b
add % g 6 , T I _ F P R E G S , % g 2
andcc % o 5 , F P R S _ D L , % g 0
be,p n % i c c , 4 f
add % g 6 , T I _ F P R E G S + 0 x40 , % g 3
2005-10-07 13:30:49 -07:00
membar #S y n c
2005-04-16 15:20:36 -07:00
stda % f0 , [ % g 2 + % g 1 ] A S I _ B L K _ P
stda % f16 , [ % g 3 + % g 1 ] A S I _ B L K _ P
2005-10-07 13:30:49 -07:00
membar #S y n c
2005-04-16 15:20:36 -07:00
andcc % o 5 , F P R S _ D U , % g 0
be,p n % i c c , 5 f
4 : add % g 1 , 1 2 8 , % g 1
2005-10-07 13:30:49 -07:00
membar #S y n c
2005-04-16 15:20:36 -07:00
stda % f32 , [ % g 2 + % g 1 ] A S I _ B L K _ P
stda % f48 , [ % g 3 + % g 1 ] A S I _ B L K _ P
5 : membar #S y n c
[SPARC64]: Avoid membar instructions in delay slots.
In particular, avoid membar instructions in the delay
slot of a jmpl instruction.
UltraSPARC-I, II, IIi, and IIe have a bug, documented in
the UltraSPARC-IIi User's Manual, Appendix K, Erratum 51
The long and short of it is that if the IMU unit misses
on a branch or jmpl, and there is a store buffer synchronizing
membar in the delay slot, the chip can stop fetching instructions.
If interrupts are enabled or some other trap is enabled, the
chip will unwedge itself, but performance will suffer.
We already had a workaround for this bug in a few spots, but
it's better to have the entire tree sanitized for this rule.
Signed-off-by: David S. Miller <davem@davemloft.net>
2005-06-27 15:42:04 -07:00
ba,p t % x c c , 8 0 f
nop
.align 32
80 : jmpl % g 7 + % g 0 , % g 0
2005-04-16 15:20:36 -07:00
nop
6 : ldub [ % g 3 + T I _ F P S A V E D ] , % o 5
or % o 5 , F P R S _ D U , % o 5
add % g 6 , T I _ F P R E G S + 0 x80 , % g 2
stb % o 5 , [ % g 3 + T I _ F P S A V E D ]
sll % g 1 , 5 , % g 1
add % g 6 , T I _ F P R E G S + 0 x c0 , % g 3
wr % g 0 , F P R S _ F E F , % f p r s
2005-10-07 13:30:49 -07:00
membar #S y n c
2005-04-16 15:20:36 -07:00
stda % f32 , [ % g 2 + % g 1 ] A S I _ B L K _ P
stda % f48 , [ % g 3 + % g 1 ] A S I _ B L K _ P
membar #S y n c
[SPARC64]: Avoid membar instructions in delay slots.
In particular, avoid membar instructions in the delay
slot of a jmpl instruction.
UltraSPARC-I, II, IIi, and IIe have a bug, documented in
the UltraSPARC-IIi User's Manual, Appendix K, Erratum 51
The long and short of it is that if the IMU unit misses
on a branch or jmpl, and there is a store buffer synchronizing
membar in the delay slot, the chip can stop fetching instructions.
If interrupts are enabled or some other trap is enabled, the
chip will unwedge itself, but performance will suffer.
We already had a workaround for this bug in a few spots, but
it's better to have the entire tree sanitized for this rule.
Signed-off-by: David S. Miller <davem@davemloft.net>
2005-06-27 15:42:04 -07:00
ba,p t % x c c , 8 0 f
nop
2005-04-16 15:20:36 -07:00
[SPARC64]: Avoid membar instructions in delay slots.
In particular, avoid membar instructions in the delay
slot of a jmpl instruction.
UltraSPARC-I, II, IIi, and IIe have a bug, documented in
the UltraSPARC-IIi User's Manual, Appendix K, Erratum 51
The long and short of it is that if the IMU unit misses
on a branch or jmpl, and there is a store buffer synchronizing
membar in the delay slot, the chip can stop fetching instructions.
If interrupts are enabled or some other trap is enabled, the
chip will unwedge itself, but performance will suffer.
We already had a workaround for this bug in a few spots, but
it's better to have the entire tree sanitized for this rule.
Signed-off-by: David S. Miller <davem@davemloft.net>
2005-06-27 15:42:04 -07:00
.align 32
80 : jmpl % g 7 + % g 0 , % g 0
2005-04-16 15:20:36 -07:00
nop
.align 32
VISenterhalf :
ldub [ % g 6 + T I _ F P D E P T H ] , % g 1
brnz,a ,p n % g 1 , 1 f
cmp % g 1 , 1
stb % g 0 , [ % g 6 + T I _ F P S A V E D ]
stx % f s r , [ % g 6 + T I _ X F S R ]
clr % o 5
jmpl % g 7 + % g 0 , % g 0
wr % g 0 , F P R S _ F E F , % f p r s
1 : bne,p n % i c c , 2 f
srl % g 1 , 1 , % g 1
ba,p t % x c c , v i s1
sub % g 7 , 8 , % g 7
2 : addcc % g 6 , % g 1 , % g 3
sll % g 1 , 3 , % g 1
andn % o 5 , F P R S _ D U , % g 2
stb % g 2 , [ % g 3 + T I _ F P S A V E D ]
rd % g s r , % g 2
add % g 6 , % g 1 , % g 3
stx % g 2 , [ % g 3 + T I _ G S R ]
add % g 6 , % g 1 , % g 2
stx % f s r , [ % g 2 + T I _ X F S R ]
sll % g 1 , 5 , % g 1
3 : andcc % o 5 , F P R S _ D L , % g 0
be,p n % i c c , 4 f
add % g 6 , T I _ F P R E G S , % g 2
add % g 6 , T I _ F P R E G S + 0 x40 , % g 3
2005-10-07 13:30:49 -07:00
membar #S y n c
2005-04-16 15:20:36 -07:00
stda % f0 , [ % g 2 + % g 1 ] A S I _ B L K _ P
stda % f16 , [ % g 3 + % g 1 ] A S I _ B L K _ P
membar #S y n c
[SPARC64]: Avoid membar instructions in delay slots.
In particular, avoid membar instructions in the delay
slot of a jmpl instruction.
UltraSPARC-I, II, IIi, and IIe have a bug, documented in
the UltraSPARC-IIi User's Manual, Appendix K, Erratum 51
The long and short of it is that if the IMU unit misses
on a branch or jmpl, and there is a store buffer synchronizing
membar in the delay slot, the chip can stop fetching instructions.
If interrupts are enabled or some other trap is enabled, the
chip will unwedge itself, but performance will suffer.
We already had a workaround for this bug in a few spots, but
it's better to have the entire tree sanitized for this rule.
Signed-off-by: David S. Miller <davem@davemloft.net>
2005-06-27 15:42:04 -07:00
ba,p t % x c c , 4 f
nop
.align 32
2005-04-16 15:20:36 -07:00
4 : and % o 5 , F P R S _ D U , % o 5
jmpl % g 7 + % g 0 , % g 0
wr % o 5 , F P R S _ F E F , % f p r s