data source
training data
phase currents sampled from normal distribution
rotor phase angle sampled from uniform distribution
resulting static torque
1000 training, 500 validation
network architecture:
input: 3-phase currents & rotor phase angle
output: torque
currents[3]
->dense[3](activation)->dense[3](activation)->dense[5](activation)
->c1
angle[1]->sin_and_cos[2]
->dense[3](activation)->dense[3](activation)->dense[5](activation)
->c2
(c1[5] * c2[5]) -> dense[1] -> torque[1]
91 free parameters
objective
training
Adam optimizer
minimize mean-squared difference (MSE) of predicted angle and actual angle
motor torque driven by 3-phase sine wave, actual gt_torq
vs predicted pred_torq
left plot x-axis: time elapsed (seconds)
integral of sigmoid
what ReLU used to look like
bad for real-valued data (lacked minus part)
best for real-valued data
least overfit + best performance among all activations tested
care more about local details
turn underfit into overfit instantly
visual coolness of ReLU
smoothness of softplus
gradient-friendliness of tanh
works with both real-valued data and images