Fix PReLU Broadcasting Bug for Multiple Parameters #565
Open
hishambarakat16 wants to merge 41 commits into Jittor:master from hishambarakat16:master
Commits:
add complex matmul, inv, qr, eig, and svd
fix issue 531,530;update jt.nn.PixelShuffle/jt.histc
fix issue 525;update jt.nn.Reflection2d/Replication2d
fix issue 527,526;update jt.zeros/ones/full/randn/randint/random
fix issue 529;update contrib.argmax_pool()
fix issue 528;update conv_transpose
Update mnist.py
polish rocm support
fix issue 521;update jt.nn.MaxUnpool2d/MaxUnpool3d
fix issue 522,520,519,516; update jt.Pool/Pool3d
fix issue 523;update jt.nn.Conv1d/Conv3d/conv2d/conv3d
Update ACL library and fix bugs in ACL integration
fix: fix for issue Jittor#544
…ty of concat of issue Jittor#459
fix: some function&class input illegal parameters
fix numpy version
check parameters' positive in jt.nn.fold
#################Summary#################
Fixed a bug in the PReLU module in jittor/nn.py where broadcasting the weight parameter caused runtime errors when num_parameters was greater than 1. The previous implementation broadcast the weight over hard-coded dimensions [0, 2, 3], which assumes a 4D input, so the weights were not correctly matched to the input dimensions for other shapes.
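A minimal reproduction of the failure mode (the shapes here are illustrative, not taken from the original report): with the old code, a non-4D input combined with num_parameters > 1 hit the hard-coded broadcast and errored at runtime.

import jittor as jt
from jittor import nn

x = jt.randn(4, 3)                  # 2D input: (batch, channels), C = 3
prelu = nn.PReLU(num_parameters=3)  # one learnable slope per channel
y = prelu(x)                        # errored before this fix: the old code
                                    # broadcast the weight over dims [0, 2, 3],
                                    # which assumes a 4D (N, C, H, W) input
print(y.shape)                      # expected after the fix: [4, 3]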
#################Changes Made#################
Modified the execute method of the PReLU class so that the weight parameter is broadcast correctly when num_parameters is greater than 1.
#################Code Changes#################
#################Original Code:#################
def __init__(self, num_parameters=1, init_=0.25):
    self.num_parameters = num_parameters
    self.weight = init.constant((num_parameters,), "float32", init_)

def execute(self, x):
    if self.num_parameters != 1:
        assert self.num_parameters == x.size(1), f"num_parameters does not match input channels in PReLU"
        return jt.maximum(0, x) + self.weight.broadcast(x, [0,2,3]) * jt.minimum(0, x)
    else:
        return jt.maximum(0, x) + self.weight * jt.minimum(0, x)
#################Updated Code:#################
def __init__(self, num_parameters=1, init_=0.25):
    self.num_parameters = num_parameters
    self.weight = init.constant((num_parameters,), "float32", init_)

def execute(self, x):
    if self.num_parameters != 1:
        assert self.num_parameters == x.shape[1], f"num_parameters does not match input channels in PReLU"
        weight_broadcasted = self.weight.broadcast([x.shape[0], self.num_parameters, *([1] * (len(x.shape) - 2))])
        return jt.maximum(0, x) + weight_broadcasted * jt.minimum(0, x)
    else:
        return jt.maximum(0, x) + self.weight * jt.minimum(0, x)
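The key change is the explicit target shape passed to broadcast: for an input of shape (N, C, ...), the (C,) weight is expanded to [N, C, 1, 1, ...], with one trailing 1 per dimension past the channel axis, so the multiplication applies one slope per channel for any input rank. A minimal NumPy sketch of the same shape logic (illustration only, not the PR's code; the helper name is hypothetical):

import numpy as np

def prelu_weight_shape(input_shape, num_parameters):
    # [N, C, 1, 1, ...]: one trailing 1 for every dimension past the channel axis
    return [input_shape[0], num_parameters, *([1] * (len(input_shape) - 2))]

x = np.arange(-48, 48, dtype=np.float32).reshape(2, 3, 4, 4)  # (N, C, H, W)
w = np.full((3,), 0.25, dtype=np.float32)                     # one slope per channel

target = prelu_weight_shape(x.shape, 3)                       # [2, 3, 1, 1]
w_b = np.broadcast_to(w.reshape(1, 3, 1, 1), target)          # per-channel weights
out = np.maximum(0, x) + w_b * np.minimum(0, x)               # PReLU: x if x > 0 else w * x
assert out.shape == x.shape and (out[x > 0] == x[x > 0]).all()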
#################Testing#################
Tested the updated PReLU function with various configurations to ensure proper functionality:
import jittor as jt
from jittor import nn

# Create input data with the specified shape
def create_input_data(shape):
    num_elements = 1
    for dim in shape:
        num_elements *= dim
    return jt.array(list(range(-num_elements // 2, num_elements // 2)), dtype=jt.float32).reshape(shape)

# Test the PReLU activation function
def test_prelu(num_parameters, input_shape):
    prelu_layer = nn.PReLU(num_parameters=num_parameters)
    input_data = create_input_data(input_shape)
    print(f"Testing PReLU with num_parameters={num_parameters} and input_shape={input_shape}")
    print(f"Input Data:\n{input_data.numpy()}")
    output_data = prelu_layer(input_data)
    print(f"Output Data (PReLU):\n{output_data.numpy()}\n")

if __name__ == "__main__":
    test_configs = [
        (1, (5,)),    # Single parameter
        (5, (5, 5)),  # Five parameters matching the number of channels
        (3, (3, 3)),  # Three parameters matching the number of channels
    ]
    for num_parameters, input_shape in test_configs:
        test_prelu(num_parameters, input_shape)
#################Test Results:#################
Testing PReLU with num_parameters=1 and input_shape=(5,)
Input Data:
[-3. -2. -1. 0. 1.]
Output Data (PReLU):
[-0.75 -0.5 -0.25 0. 1. ]
Testing PReLU with num_parameters=5 and input_shape=(5, 5)
Input Data:
[[-13. -12. -11. -10. -9.]
[ -8. -7. -6. -5. -4.]
[ -3. -2. -1. 0. 1.]
[ 2. 3. 4. 5. 6.]
[ 7. 8. 9. 10. 11.]]
Output Data (PReLU):
[[-3.25 -3. -2.75 -2.5 -2.25]
[-2. -1.75 -1.5 -1.25 -1. ]
[-0.75 -0.5 -0.25 0. 1. ]
[ 2. 3. 4. 5. 6. ]
[ 7. 8. 9. 10. 11. ]]
Testing PReLU with num_parameters=3 and input_shape=(3, 3)
Input Data:
[[-5. -4. -3.]
[-2. -1. 0.]
[ 1. 2. 3.]]
Output Data (PReLU):
[[-1.25 -1. -0.75]
[-0.5 -0.25 0. ]
[ 1. 2. 3. ]]
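As an extra sanity check (not part of this PR's test script; assumes PyTorch is installed), the same data could be run through torch.nn.PReLU, which also defaults every slope to 0.25, and the outputs compared:

import numpy as np
import torch
import jittor as jt
from jittor import nn

x_np = np.arange(-13, 12, dtype=np.float32).reshape(5, 5)  # same 5x5 data as above

jt_out = nn.PReLU(num_parameters=5)(jt.array(x_np)).numpy()
torch_out = torch.nn.PReLU(num_parameters=5)(torch.from_numpy(x_np)).detach().numpy()

# Both layers start with every slope at 0.25, so outputs should agree elementwise.
assert np.allclose(jt_out, torch_out)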
##################################
This fix ensures that the PReLU activation function can handle multiple parameters correctly by properly broadcasting the weight parameter to match the input tensor dimensions.