2005-09-26 16:04:21 +10:00
/ *
* Memory c o p y f u n c t i o n s f o r 3 2 - b i t P o w e r P C .
*
* Copyright ( C ) 1 9 9 6 - 2 0 0 5 P a u l M a c k e r r a s .
*
* This p r o g r a m i s f r e e s o f t w a r e ; you can redistribute it and/or
* modify i t u n d e r t h e t e r m s o f t h e G N U G e n e r a l P u b l i c L i c e n s e
* as p u b l i s h e d b y t h e F r e e S o f t w a r e F o u n d a t i o n ; either version
* 2 of t h e L i c e n s e , o r ( a t y o u r o p t i o n ) a n y l a t e r v e r s i o n .
* /
# include < a s m / p r o c e s s o r . h >
# include < a s m / c a c h e . h >
# include < a s m / e r r n o . h >
# include < a s m / p p c _ a s m . h >
2016-01-13 23:33:46 -05:00
# include < a s m / e x p o r t . h >
2018-08-09 08:14:41 +00:00
# include < a s m / c o d e - p a t c h i n g - a s m . h >
2019-04-26 16:23:26 +00:00
# include < a s m / k a s a n . h >
2005-09-26 16:04:21 +10:00
# define C O P Y _ 1 6 _ B Y T E S \
lwz r7 ,4 ( r4 ) ; \
lwz r8 ,8 ( r4 ) ; \
lwz r9 ,1 2 ( r4 ) ; \
lwzu r10 ,1 6 ( r4 ) ; \
stw r7 ,4 ( r6 ) ; \
stw r8 ,8 ( r6 ) ; \
stw r9 ,1 2 ( r6 ) ; \
stwu r10 ,1 6 ( r6 )
# define C O P Y _ 1 6 _ B Y T E S _ W I T H E X ( n ) \
8 # # n ## 0 : \
lwz r7 ,4 ( r4 ) ; \
8 # # n ## 1 : \
lwz r8 ,8 ( r4 ) ; \
8 # # n ## 2 : \
lwz r9 ,1 2 ( r4 ) ; \
8 # # n ## 3 : \
lwzu r10 ,1 6 ( r4 ) ; \
8 # # n ## 4 : \
stw r7 ,4 ( r6 ) ; \
8 # # n ## 5 : \
stw r8 ,8 ( r6 ) ; \
8 # # n ## 6 : \
stw r9 ,1 2 ( r6 ) ; \
8 # # n ## 7 : \
stwu r10 ,1 6 ( r6 )
# define C O P Y _ 1 6 _ B Y T E S _ E X C O D E ( n ) \
9 # # n ## 0 : \
addi r5 ,r5 ,- ( 1 6 * n ) ; \
b 1 0 4 f ; \
9 # # n ## 1 : \
addi r5 ,r5 ,- ( 1 6 * n ) ; \
b 1 0 5 f ; \
2016-10-13 16:42:53 +11:00
EX_ T A B L E ( 8 ## n # # 0 b ,9 ## n # # 0 b ) ; \
EX_ T A B L E ( 8 ## n # # 1 b ,9 ## n # # 0 b ) ; \
EX_ T A B L E ( 8 ## n # # 2 b ,9 ## n # # 0 b ) ; \
EX_ T A B L E ( 8 ## n # # 3 b ,9 ## n # # 0 b ) ; \
EX_ T A B L E ( 8 ## n # # 4 b ,9 ## n # # 1 b ) ; \
EX_ T A B L E ( 8 ## n # # 5 b ,9 ## n # # 1 b ) ; \
EX_ T A B L E ( 8 ## n # # 6 b ,9 ## n # # 1 b ) ; \
EX_ T A B L E ( 8 ## n # # 7 b ,9 ## n # # 1 b )
2005-09-26 16:04:21 +10:00
.text
.stabs " arch/ p o w e r p c / l i b / " ,N _ S O ,0 ,0 ,0 f
2010-09-01 07:21:21 +00:00
.stabs " copy_ 3 2 . S " ,N _ S O ,0 ,0 ,0 f
2005-09-26 16:04:21 +10:00
0 :
2005-10-17 11:50:32 +10:00
CACHELINE_ B Y T E S = L 1 _ C A C H E _ B Y T E S
LG_ C A C H E L I N E _ B Y T E S = L 1 _ C A C H E _ S H I F T
CACHELINE_ M A S K = ( L 1 _ C A C H E _ B Y T E S - 1 )
2005-09-26 16:04:21 +10:00
2019-04-26 16:23:26 +00:00
# ifndef C O N F I G _ K A S A N
powerpc/32: add memset16()
Commit 694fc88ce271f ("powerpc/string: Implement optimized
memset variants") added memset16(), memset32() and memset64()
for the 64 bits PPC.
On 32 bits, memset64() is not relevant, and as shown below,
the generic version of memset32() gives a good code, so only
memset16() is candidate for an optimised version.
000009c0 <memset32>:
9c0: 2c 05 00 00 cmpwi r5,0
9c4: 39 23 ff fc addi r9,r3,-4
9c8: 4d 82 00 20 beqlr
9cc: 7c a9 03 a6 mtctr r5
9d0: 94 89 00 04 stwu r4,4(r9)
9d4: 42 00 ff fc bdnz 9d0 <memset32+0x10>
9d8: 4e 80 00 20 blr
The last part of memset() handling the not 4-bytes multiples
operates on bytes, making it unsuitable for handling word without
modification. As it would increase memset() complexity, it is
better to implement memset16() from scratch. In addition it
has the advantage of allowing a more optimised memset16() than what
we would have by using the memset() function.
Signed-off-by: Christophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2017-08-23 16:54:32 +02:00
_ GLOBAL( m e m s e t 1 6 )
rlwinm. r0 ,r5 , 3 1 , 1 , 3 1
addi r6 , r3 , - 4
beq- 2 f
rlwimi r4 ,r4 ,1 6 ,0 ,1 5
mtctr r0
1 : stwu r4 , 4 ( r6 )
bdnz 1 b
2 : andi. r0 , r5 , 1
beqlr
sth r4 , 4 ( r6 )
blr
EXPORT_ S Y M B O L ( m e m s e t 1 6 )
2019-04-26 16:23:26 +00:00
# endif
powerpc/32: add memset16()
Commit 694fc88ce271f ("powerpc/string: Implement optimized
memset variants") added memset16(), memset32() and memset64()
for the 64 bits PPC.
On 32 bits, memset64() is not relevant, and as shown below,
the generic version of memset32() gives a good code, so only
memset16() is candidate for an optimised version.
000009c0 <memset32>:
9c0: 2c 05 00 00 cmpwi r5,0
9c4: 39 23 ff fc addi r9,r3,-4
9c8: 4d 82 00 20 beqlr
9cc: 7c a9 03 a6 mtctr r5
9d0: 94 89 00 04 stwu r4,4(r9)
9d4: 42 00 ff fc bdnz 9d0 <memset32+0x10>
9d8: 4e 80 00 20 blr
The last part of memset() handling the not 4-bytes multiples
operates on bytes, making it unsuitable for handling word without
modification. As it would increase memset() complexity, it is
better to implement memset16() from scratch. In addition it
has the advantage of allowing a more optimised memset16() than what
we would have by using the memset() function.
Signed-off-by: Christophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2017-08-23 16:54:32 +02:00
2015-05-19 12:07:48 +02:00
/ *
* Use d c b z o n t h e c o m p l e t e c a c h e l i n e s i n t h e d e s t i n a t i o n
* to s e t t h e m t o z e r o . T h i s r e q u i r e s t h a t t h e d e s t i n a t i o n
* area i s c a c h e a b l e . - - p a u l u s
2015-09-16 12:04:53 +02:00
*
* During e a r l y i n i t , c a c h e m i g h t n o t b e a c t i v e y e t , s o d c b z c a n n o t b e u s e d .
* We t h e r e f o r e s k i p t h e o p t i m i s e d b l o c t h a t u s e s d c b z . T h i s j u m p i s
* replaced b y a n o p o n c e c a c h e i s a c t i v e . T h i s i s d o n e i n m a c h i n e _ i n i t ( )
2015-05-19 12:07:48 +02:00
* /
2019-04-26 16:23:26 +00:00
_ GLOBAL_ K A S A N ( m e m s e t )
2017-08-23 16:54:36 +02:00
cmplwi 0 ,r5 ,4
blt 7 f
2015-05-19 12:07:52 +02:00
rlwimi r4 ,r4 ,8 ,1 6 ,2 3
rlwimi r4 ,r4 ,1 6 ,0 ,1 5
2017-08-23 16:54:36 +02:00
stw r4 ,0 ( r3 )
2015-05-19 12:07:48 +02:00
beqlr
2017-08-23 16:54:36 +02:00
andi. r0 ,r3 ,3
2015-05-19 12:07:48 +02:00
add r5 ,r0 ,r5
2017-08-23 16:54:36 +02:00
subf r6 ,r0 ,r3
2015-05-19 12:07:52 +02:00
cmplwi 0 ,r4 ,0
2017-08-23 16:54:38 +02:00
/ *
* Skip o p t i m i s e d b l o c u n t i l c a c h e i s e n a b l e d . W i l l b e r e p l a c e d
* by ' b n e ' d u r i n g b o o t t o u s e n o r m a l p r o c e d u r e i f r4 i s n o t z e r o
* /
2018-08-09 08:14:41 +00:00
5 : b 2 f
patch_ s i t e 5 b , p a t c h _ _ m e m s e t _ n o c a c h e
2015-05-19 12:07:52 +02:00
2015-05-19 12:07:48 +02:00
clrlwi r7 ,r6 ,3 2 - L G _ C A C H E L I N E _ B Y T E S
add r8 ,r7 ,r5
srwi r9 ,r8 ,L G _ C A C H E L I N E _ B Y T E S
addic. r9 ,r9 ,- 1 / * t o t a l n u m b e r o f c o m p l e t e c a c h e l i n e s * /
ble 2 f
xori r0 ,r7 ,C A C H E L I N E _ M A S K & ~ 3
srwi. r0 ,r0 ,2
beq 3 f
mtctr r0
4 : stwu r4 ,4 ( r6 )
bdnz 4 b
3 : mtctr r9
li r7 ,4
10 : dcbz r7 ,r6
addi r6 ,r6 ,C A C H E L I N E _ B Y T E S
bdnz 1 0 b
clrlwi r5 ,r8 ,3 2 - L G _ C A C H E L I N E _ B Y T E S
addi r5 ,r5 ,4
2015-05-19 12:07:52 +02:00
2 : srwi r0 ,r5 ,2
2005-09-26 16:04:21 +10:00
mtctr r0
bdz 6 f
1 : stwu r4 ,4 ( r6 )
bdnz 1 b
6 : andi. r5 ,r5 ,3
beqlr
mtctr r5
addi r6 ,r6 ,3
8 : stbu r4 ,1 ( r6 )
bdnz 8 b
blr
2017-08-23 16:54:36 +02:00
7 : cmpwi 0 ,r5 ,0
beqlr
mtctr r5
addi r6 ,r3 ,- 1
9 : stbu r4 ,1 ( r6 )
bdnz 9 b
blr
2017-08-23 16:54:34 +02:00
EXPORT_ S Y M B O L ( m e m s e t )
2019-04-26 16:23:26 +00:00
EXPORT_ S Y M B O L _ K A S A N ( m e m s e t )
2005-09-26 16:04:21 +10:00
2015-05-19 12:07:48 +02:00
/ *
* This v e r s i o n u s e s d c b z o n t h e c o m p l e t e c a c h e l i n e s i n t h e
* destination a r e a t o r e d u c e m e m o r y t r a f f i c . T h i s r e q u i r e s t h a t
* the d e s t i n a t i o n a r e a i s c a c h e a b l e .
* We o n l y u s e t h i s v e r s i o n i f t h e s o u r c e a n d d e s t d o n ' t o v e r l a p .
* - - paulus.
2015-09-16 12:04:51 +02:00
*
* During e a r l y i n i t , c a c h e m i g h t n o t b e a c t i v e y e t , s o d c b z c a n n o t b e u s e d .
* We t h e r e f o r e j u m p t o g e n e r i c _ m e m c p y w h i c h d o e s n ' t u s e d c b z . T h i s j u m p i s
* replaced b y a n o p o n c e c a c h e i s a c t i v e . T h i s i s d o n e i n m a c h i n e _ i n i t ( )
2015-05-19 12:07:48 +02:00
* /
2019-04-26 16:23:26 +00:00
_ GLOBAL_ K A S A N ( m e m m o v e )
2015-05-19 12:07:55 +02:00
cmplw 0 ,r3 ,r4
bgt b a c k w a r d s _ m e m c p y
/* fall through */
2019-04-26 16:23:26 +00:00
_ GLOBAL_ K A S A N ( m e m c p y )
2018-08-09 08:14:41 +00:00
1 : b g e n e r i c _ m e m c p y
patch_ s i t e 1 b , p a t c h _ _ m e m c p y _ n o c a c h e
2015-05-19 12:07:48 +02:00
add r7 ,r3 ,r5 / * t e s t i f t h e s r c & d s t o v e r l a p * /
add r8 ,r4 ,r5
cmplw 0 ,r4 ,r7
cmplw 1 ,r3 ,r8
crand 0 ,0 ,4 / * c r0 . l t & = c r1 . l t * /
2015-05-19 12:07:55 +02:00
blt g e n e r i c _ m e m c p y / * i f r e g i o n s o v e r l a p * /
2015-05-19 12:07:48 +02:00
addi r4 ,r4 ,- 4
addi r6 ,r3 ,- 4
neg r0 ,r3
andi. r0 ,r0 ,C A C H E L I N E _ M A S K / * # b y t e s t o s t a r t o f c a c h e l i n e * /
beq 5 8 f
cmplw 0 ,r5 ,r0 / * i s t h i s m o r e t h a n t o t a l t o d o ? * /
blt 6 3 f / * i f n o t m u c h t o d o * /
andi. r8 ,r0 ,3 / * g e t i t w o r d - a l i g n e d f i r s t * /
subf r5 ,r0 ,r5
mtctr r8
beq+ 6 1 f
70 : lbz r9 ,4 ( r4 ) / * d o s o m e b y t e s * /
addi r4 ,r4 ,1
addi r6 ,r6 ,1
2015-05-19 12:07:57 +02:00
stb r9 ,3 ( r6 )
2015-05-19 12:07:48 +02:00
bdnz 7 0 b
61 : srwi. r0 ,r0 ,2
mtctr r0
beq 5 8 f
72 : lwzu r9 ,4 ( r4 ) / * d o s o m e w o r d s * /
stwu r9 ,4 ( r6 )
bdnz 7 2 b
58 : srwi. r0 ,r5 ,L G _ C A C H E L I N E _ B Y T E S / * # c o m p l e t e c a c h e l i n e s * /
clrlwi r5 ,r5 ,3 2 - L G _ C A C H E L I N E _ B Y T E S
li r11 ,4
mtctr r0
beq 6 3 f
53 :
dcbz r11 ,r6
COPY_ 1 6 _ B Y T E S
# if L 1 _ C A C H E _ B Y T E S > = 3 2
COPY_ 1 6 _ B Y T E S
# if L 1 _ C A C H E _ B Y T E S > = 6 4
COPY_ 1 6 _ B Y T E S
COPY_ 1 6 _ B Y T E S
# if L 1 _ C A C H E _ B Y T E S > = 1 2 8
COPY_ 1 6 _ B Y T E S
COPY_ 1 6 _ B Y T E S
COPY_ 1 6 _ B Y T E S
COPY_ 1 6 _ B Y T E S
# endif
# endif
# endif
bdnz 5 3 b
63 : srwi. r0 ,r5 ,2
mtctr r0
beq 6 4 f
30 : lwzu r0 ,4 ( r4 )
stwu r0 ,4 ( r6 )
bdnz 3 0 b
64 : andi. r0 ,r5 ,3
mtctr r0
beq+ 6 5 f
2015-05-19 12:07:57 +02:00
addi r4 ,r4 ,3
addi r6 ,r6 ,3
40 : lbzu r0 ,1 ( r4 )
stbu r0 ,1 ( r6 )
2015-05-19 12:07:48 +02:00
bdnz 4 0 b
65 : blr
2016-01-13 23:33:46 -05:00
EXPORT_ S Y M B O L ( m e m c p y )
EXPORT_ S Y M B O L ( m e m m o v e )
2019-04-26 16:23:26 +00:00
EXPORT_ S Y M B O L _ K A S A N ( m e m c p y )
EXPORT_ S Y M B O L _ K A S A N ( m e m m o v e )
2015-05-19 12:07:48 +02:00
2016-03-16 21:36:06 +11:00
generic_memcpy :
2005-09-26 16:04:21 +10:00
srwi. r7 ,r5 ,3
addi r6 ,r3 ,- 4
addi r4 ,r4 ,- 4
beq 2 f / * i f l e s s t h a n 8 b y t e s t o d o * /
andi. r0 ,r6 ,3 / * g e t d e s t w o r d a l i g n e d * /
mtctr r7
bne 5 f
1 : lwz r7 ,4 ( r4 )
lwzu r8 ,8 ( r4 )
stw r7 ,4 ( r6 )
stwu r8 ,8 ( r6 )
bdnz 1 b
andi. r5 ,r5 ,7
2 : cmplwi 0 ,r5 ,4
blt 3 f
lwzu r0 ,4 ( r4 )
addi r5 ,r5 ,- 4
stwu r0 ,4 ( r6 )
3 : cmpwi 0 ,r5 ,0
beqlr
mtctr r5
addi r4 ,r4 ,3
addi r6 ,r6 ,3
4 : lbzu r0 ,1 ( r4 )
stbu r0 ,1 ( r6 )
bdnz 4 b
blr
5 : subfic r0 ,r0 ,4
mtctr r0
6 : lbz r7 ,4 ( r4 )
addi r4 ,r4 ,1
stb r7 ,4 ( r6 )
addi r6 ,r6 ,1
bdnz 6 b
subf r5 ,r0 ,r5
rlwinm. r7 ,r5 ,3 2 - 3 ,3 ,3 1
beq 2 b
mtctr r7
b 1 b
_ GLOBAL( b a c k w a r d s _ m e m c p y )
rlwinm. r7 ,r5 ,3 2 - 3 ,3 ,3 1 / * r0 = r5 > > 3 * /
add r6 ,r3 ,r5
add r4 ,r4 ,r5
beq 2 f
andi. r0 ,r6 ,3
mtctr r7
bne 5 f
1 : lwz r7 ,- 4 ( r4 )
lwzu r8 ,- 8 ( r4 )
stw r7 ,- 4 ( r6 )
stwu r8 ,- 8 ( r6 )
bdnz 1 b
andi. r5 ,r5 ,7
2 : cmplwi 0 ,r5 ,4
blt 3 f
lwzu r0 ,- 4 ( r4 )
subi r5 ,r5 ,4
stwu r0 ,- 4 ( r6 )
3 : cmpwi 0 ,r5 ,0
beqlr
mtctr r5
4 : lbzu r0 ,- 1 ( r4 )
stbu r0 ,- 1 ( r6 )
bdnz 4 b
blr
5 : mtctr r0
6 : lbzu r7 ,- 1 ( r4 )
stbu r7 ,- 1 ( r6 )
bdnz 6 b
subf r5 ,r0 ,r5
rlwinm. r7 ,r5 ,3 2 - 3 ,3 ,3 1
beq 2 b
mtctr r7
b 1 b
_ GLOBAL( _ _ c o p y _ t o f r o m _ u s e r )
addi r4 ,r4 ,- 4
addi r6 ,r3 ,- 4
neg r0 ,r3
andi. r0 ,r0 ,C A C H E L I N E _ M A S K / * # b y t e s t o s t a r t o f c a c h e l i n e * /
beq 5 8 f
cmplw 0 ,r5 ,r0 / * i s t h i s m o r e t h a n t o t a l t o d o ? * /
blt 6 3 f / * i f n o t m u c h t o d o * /
andi. r8 ,r0 ,3 / * g e t i t w o r d - a l i g n e d f i r s t * /
mtctr r8
beq+ 6 1 f
70 : lbz r9 ,4 ( r4 ) / * d o s o m e b y t e s * /
71 : stb r9 ,4 ( r6 )
addi r4 ,r4 ,1
addi r6 ,r6 ,1
bdnz 7 0 b
61 : subf r5 ,r0 ,r5
srwi. r0 ,r0 ,2
mtctr r0
beq 5 8 f
72 : lwzu r9 ,4 ( r4 ) / * d o s o m e w o r d s * /
73 : stwu r9 ,4 ( r6 )
bdnz 7 2 b
2016-10-13 16:42:53 +11:00
EX_ T A B L E ( 7 0 b ,1 0 0 f )
EX_ T A B L E ( 7 1 b ,1 0 1 f )
EX_ T A B L E ( 7 2 b ,1 0 2 f )
EX_ T A B L E ( 7 3 b ,1 0 3 f )
2005-09-26 16:04:21 +10:00
58 : srwi. r0 ,r5 ,L G _ C A C H E L I N E _ B Y T E S / * # c o m p l e t e c a c h e l i n e s * /
clrlwi r5 ,r5 ,3 2 - L G _ C A C H E L I N E _ B Y T E S
li r11 ,4
beq 6 3 f
/* Here we decide how far ahead to prefetch the source */
li r3 ,4
cmpwi r0 ,1
li r7 ,0
ble 1 1 4 f
li r7 ,1
# if M A X _ C O P Y _ P R E F E T C H > 1
/ * Heuristically, f o r l a r g e t r a n s f e r s w e p r e f e t c h
MAX_ C O P Y _ P R E F E T C H c a c h e l i n e s a h e a d . F o r s m a l l t r a n s f e r s
we p r e f e t c h 1 c a c h e l i n e a h e a d . * /
cmpwi r0 ,M A X _ C O P Y _ P R E F E T C H
ble 1 1 2 f
li r7 ,M A X _ C O P Y _ P R E F E T C H
112 : mtctr r7
111 : dcbt r3 ,r4
addi r3 ,r3 ,C A C H E L I N E _ B Y T E S
bdnz 1 1 1 b
# else
dcbt r3 ,r4
addi r3 ,r3 ,C A C H E L I N E _ B Y T E S
# endif / * M A X _ C O P Y _ P R E F E T C H > 1 * /
114 : subf r8 ,r7 ,r0
mr r0 ,r7
mtctr r8
53 : dcbt r3 ,r4
54 : dcbz r11 ,r6
2016-10-13 16:42:53 +11:00
EX_ T A B L E ( 5 4 b ,1 0 5 f )
2005-09-26 16:04:21 +10:00
/* the main body of the cacheline loop */
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 0 )
2005-10-17 11:50:32 +10:00
# if L 1 _ C A C H E _ B Y T E S > = 3 2
2005-09-26 16:04:21 +10:00
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 1 )
2005-10-17 11:50:32 +10:00
# if L 1 _ C A C H E _ B Y T E S > = 6 4
2005-09-26 16:04:21 +10:00
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 2 )
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 3 )
2005-10-17 11:50:32 +10:00
# if L 1 _ C A C H E _ B Y T E S > = 1 2 8
2005-09-26 16:04:21 +10:00
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 4 )
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 5 )
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 6 )
COPY_ 1 6 _ B Y T E S _ W I T H E X ( 7 )
# endif
# endif
# endif
bdnz 5 3 b
cmpwi r0 ,0
li r3 ,4
li r7 ,0
bne 1 1 4 b
63 : srwi. r0 ,r5 ,2
mtctr r0
beq 6 4 f
30 : lwzu r0 ,4 ( r4 )
31 : stwu r0 ,4 ( r6 )
bdnz 3 0 b
64 : andi. r0 ,r5 ,3
mtctr r0
beq+ 6 5 f
40 : lbz r0 ,4 ( r4 )
41 : stb r0 ,4 ( r6 )
addi r4 ,r4 ,1
addi r6 ,r6 ,1
bdnz 4 0 b
65 : li r3 ,0
blr
/* read fault, initial single-byte copy */
100 : li r9 ,0
b 9 0 f
/* write fault, initial single-byte copy */
101 : li r9 ,1
90 : subf r5 ,r8 ,r5
li r3 ,0
b 9 9 f
/* read fault, initial word copy */
102 : li r9 ,0
b 9 1 f
/* write fault, initial word copy */
103 : li r9 ,1
91 : li r3 ,2
b 9 9 f
/ *
* this s t u f f h a n d l e s f a u l t s i n t h e c a c h e l i n e l o o p a n d b r a n c h e s t o e i t h e r
* 1 0 4 f ( i f i n r e a d p a r t ) o r 1 0 5 f ( i f i n w r i t e p a r t ) , a f t e r u p d a t i n g r5
* /
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 0 )
2005-10-17 11:50:32 +10:00
# if L 1 _ C A C H E _ B Y T E S > = 3 2
2005-09-26 16:04:21 +10:00
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 1 )
2005-10-17 11:50:32 +10:00
# if L 1 _ C A C H E _ B Y T E S > = 6 4
2005-09-26 16:04:21 +10:00
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 2 )
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 3 )
2005-10-17 11:50:32 +10:00
# if L 1 _ C A C H E _ B Y T E S > = 1 2 8
2005-09-26 16:04:21 +10:00
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 4 )
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 5 )
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 6 )
COPY_ 1 6 _ B Y T E S _ E X C O D E ( 7 )
# endif
# endif
# endif
/* read fault in cacheline loop */
104 : li r9 ,0
b 9 2 f
/* fault on dcbz (effectively a write fault) */
/* or write fault in cacheline loop */
105 : li r9 ,1
92 : li r3 ,L G _ C A C H E L I N E _ B Y T E S
mfctr r8
add r0 ,r0 ,r8
b 1 0 6 f
/* read fault in final word loop */
108 : li r9 ,0
b 9 3 f
/* write fault in final word loop */
109 : li r9 ,1
93 : andi. r5 ,r5 ,3
li r3 ,2
b 9 9 f
/* read fault in final byte loop */
110 : li r9 ,0
b 9 4 f
/* write fault in final byte loop */
111 : li r9 ,1
94 : li r5 ,0
li r3 ,0
/ *
* At t h i s s t a g e t h e n u m b e r o f b y t e s n o t c o p i e d i s
* r5 + ( c t r < < r3 ) , a n d r9 i s 0 f o r r e a d o r 1 f o r w r i t e .
* /
99 : mfctr r0
106 : slw r3 ,r0 ,r3
add. r3 ,r3 ,r5
beq 1 2 0 f / * s h o u l d n ' t h a p p e n * /
cmpwi 0 ,r9 ,0
bne 1 2 0 f
/* for a read fault, first try to continue the copy one byte at a time */
mtctr r3
130 : lbz r0 ,4 ( r4 )
131 : stb r0 ,4 ( r6 )
addi r4 ,r4 ,1
addi r6 ,r6 ,1
bdnz 1 3 0 b
/* then clear out the destination: r3 bytes starting at 4(r6) */
132 : mfctr r3
120 : blr
2016-10-13 16:42:53 +11:00
EX_ T A B L E ( 3 0 b ,1 0 8 b )
EX_ T A B L E ( 3 1 b ,1 0 9 b )
EX_ T A B L E ( 4 0 b ,1 1 0 b )
EX_ T A B L E ( 4 1 b ,1 1 1 b )
EX_ T A B L E ( 1 3 0 b ,1 3 2 b )
EX_ T A B L E ( 1 3 1 b ,1 2 0 b )
2016-01-13 23:33:46 -05:00
EXPORT_ S Y M B O L ( _ _ c o p y _ t o f r o m _ u s e r )