3.1. Experimental Platform and Evidence Pipeline
Experiments were conducted on a Niryo NED3 Pro manipulator using direct Dynamixel current-command control. The manuscript evidence is deliberately concentrated on a single actuator, namely Dynamixel ID 6 (Niryo J5) in the distal wrist chain, used here as the controlled, decoupled test bed of the study.
The Nussbaum adaptation mechanism interacts with several hardware-induced effects whose individual contributions are extremely difficult to separate when the controller is exercised on a full multi-degree-of-freedom arm. These effects include unmodeled Coulomb and viscous friction at the actuator, severe encoder quantization at the gearbox output, current-saturation and slew limits in the Mode 0 current loop, communication latency on the Dynamixel bus, and the dead-band introduced by the low-level current driver near zero command. On a multi-joint arm these effects are superimposed on the time-varying Coriolis, centrifugal, and gravity terms of (
1), so that a single tracking-error trace at any joint reflects a sum of (i) the Nussbaum adaptation transient, (ii) the hardware-oriented regularization studied here, (iii) the actuator and sensing non-idealities, and (iv) the multi-body coupling. Isolating the distal wrist joint reduces the dominant contribution of term (iv), because the upstream joints are held at a fixed configuration during every reported run, the wrist payload is fixed, and the gravity component at the operating centre is approximately constant. This design does not eliminate all residual joint-level effects: friction, backlash, compliance, cable and gearbox effects, gravity residuals, and load-dependent behaviour may still influence the same actuator. It does, however, reduce the dominant multi-body coupling confound sufficiently for the remaining residual to be interpreted primarily in terms of the Nussbaum mechanism and the actuator/sensing chain, which is exactly the deployment-layer behaviour this paper aims to characterize.
This single-joint study is the necessary first step of a multi-stage evaluation programme: characterize and improve the deployment-layer behaviour of an adaptive controller on a single decoupled joint, establish the safety envelope, the regularization architecture, and the hardware-aware Optuna scoring, and then carry the resulting controller forward to whole-arm trajectory tracking in subsequent work. The Dynamixel ID 6 / Niryo J5 actuator is well suited to this role: it sits at the distal end of the kinematic chain, so its dynamics are dominated by its own rotor inertia and friction rather than by the rest of the arm; it is the actuator most exposed to encoder-quantization and low-speed effects in the NED3 Pro platform; and it is the actuator for which sustained current-mode operation is most demanding because the available current ceiling is modest relative to the demanded torque at low speeds. The paper uses the dual identifier “Dynamixel ID 6 / Niryo J5” to avoid ambiguity between actuator IDs and whole-arm joint numbering conventions used in earlier working notes.
The controller was executed outside the higher-level ROS command path for the final evidence runs. This decision was made after early experiments showed that command-state conflicts and timing jitter in the middleware path could obscure the low-level behaviour of the adaptive law. The final evidence loop therefore logged the reference position, measured position, filtered velocity, tracking error, intermediate Nussbaum variables, commanded current, measured current, and saturation indicators in a single direct loop.
Figure 2 shows the annotated distal actuator map and the robot used to ground the experiments in the physical platform.
Table 1.
Main experimental protocol used in the main real-hardware Nussbaum runs.
Table 1.
Main experimental protocol used in the main real-hardware Nussbaum runs.
| Item |
Setting |
| Robot platform |
Niryo NED3 Pro |
| Low-level interface |
Direct Dynamixel current command (Mode 0/current-control path) |
| Primary tested actuator |
Dynamixel ID 6 / Niryo J5 |
| Dynamixel current scale used for conversion |
XM430-W350 / XM430 X-series current-register convention,
|
| Trajectory type |
Locked-center sinusoid |
| Reference amplitude |
|
| Reference frequency |
|
| Operating center |
start-centered operating region |
| Control rate |
24 Hz |
| Main validation duration |
300 s |
| Evidence artifacts |
CSV logs, JSON summaries, and publication-style PDF/PNG plots |
| Core safety envelope |
Current clamp, current slew limit, velocity filtering, anti-windup, q-guard, and adaptation freeze on satx. |
3.2. Baseline Nussbaum-PID Formulation
The manipulator model adopted in the source NPID work [
3] is
where
is the joint position vector,
is the symmetric positive-definite inertia matrix,
collects Coriolis and centrifugal terms,
is gravity,
is the applied joint torque,
u is the control input, and
is an unknown control-direction matrix. In the real-hardware deployment of
Section 4, unmodeled friction, encoder quantization, communication latency, and current-command nonlinearities additionally act on the joint; these are treated explicitly at the deployment layer rather than absorbed into (
1).
For tracking, the joint-level error signals are
and the source formulation defines the generalized intermediate variable
which couples proportional, integral, and derivative tracking error into a single signal. The role of
is established in [
3]: if
as
, then
e,
, and
remain bounded and converge to zero.
For dimensional consistency in Eq. (
3), the parameter
has units of
. Therefore,
,
, and
all have angular-rate units. In the implementation reported in this paper, the controller signals
q,
,
e,
, and
were represented in degrees, degree-seconds, and degrees per second, respectively, because these are the native units used in the logged actuator data and in the reported tracking-error metrics. The same mathematical structure can be implemented using radians if all signals and gains are rescaled consistently. However, the optimized parameters reported in this paper are degree-basis implementation parameters and should not be transferred directly to a radian-basis controller without the corresponding unit conversion.
To avoid ambiguity between the source theoretical formulation and the implemented single-joint controller, the RBF input used in this hardware implementation is defined explicitly as
The corresponding RBF approximation is
, with Gaussian basis functions
where
is the centre of the
i-th basis function and
is its covariance matrix. This implementation-level regressor differs from the broader source-paper notation
. The reduced regressor
was used because the present validation is a single-joint hardware deployment study and because
already combines the proportional, integral, and derivative tracking-error information used by the controller. All RBF centres, widths, and tuned gains reported in this paper therefore correspond to the implemented regressor
, not to the full source-paper vector
x. The adaptive gain correction is
and the RBF weight update is
with
,
, and
the
-modification leakage term (denoted
in [
3]; renamed here only to avoid collision with the RBF width symbol).
The Nussbaum function used throughout this work is
and the associated Nussbaum gain is
. Although this function satisfies
, this is a property of the Nussbaum function rather than an initial condition for the adaptation state. The hardware implementation therefore used the non-zero initial adaptation state
which was selected during the offline Optuna tuning process together with the remaining deployment parameters, rather than chosen manually after the validation run. The general source control law is
and the source then imposes the linking relations
which yield the characteristic polynomial
. For
, this polynomial is Hurwitz, and the control law reduces to the compact form
with adaptation-state dynamics
Equation (
13) is mathematically concise and was implemented directly in the initial hardware baseline. In the reported real-motor tests, this direct implementation showed a deployment-layer degradation mechanism. Since
,
can accumulate over long runs when
remains predominantly positive in the tested regime. As
approaches regions where
becomes small, the effective multiplicative action of
decreases; after crossing such regions, the effective sign of the Nussbaum modulation can change. On a current-limited actuator with friction and quantized sensing, this behaviour can reduce practical control authority. This observation concerns the direct hardware transfer of the baseline law and should not be read as a contradiction of the source Lyapunov analysis.
3.3. Enhanced NPID Formulation
The goal of the enhanced NPID formulation was to preserve the Nussbaum core while preventing the actuator from being driven into the most damaging parts of the baseline adaptation trajectory. Three additions were kept in the final manuscript configuration. First, the
-law was regularized using a soft leakage term:
This term is used here as an implementation-level adaptation governor. It does not remove the Nussbaum mechanism because remains in the control law. However, it is not claimed as a general replacement for the original Nussbaum adaptation law. Its purpose is to slow excessive -growth in the tested actuator setting once enters a range that was empirically associated with poor long-duration hardware behaviour.
Second, a low-speed velocity-reference feedforward term was added:
where
is the feedforward magnitude and
is the velocity scale used to smooth the sign transition. The term is motivated by Coulomb-friction-like low-speed hesitation, but it uses the desired velocity
, not the measured velocity
. It is therefore described here as velocity-reference feedforward rather than as a direct friction observer or measured-velocity friction compensator. This design was used to provide a smooth reversal bias without injecting measured-velocity noise into the current command.
Third, a tail-region damping term was activated only when the absolute tracking error became large enough to justify extra damping. In the implementation, the activation weight is a smoothstep map between an error threshold and a full-activation error level. In compact notation,
where
increases monotonically from 0 to 1 as
grows between a lower error threshold and a full-activation level through a smoothstep (cubic Hermite) interpolation, so that the additional damping stays inactive during normal small-error tracking and engages only on large excursions; the two activation thresholds are reported with the other deployment parameters in the released configuration files. The additional tail damping is
The implemented enhanced NPID control signal before current mapping is therefore
Here denotes symmetric saturation at the internal controller limit . The variable is a dimensionless controller-side command and should not be interpreted as a joint torque. These three additions define the enhanced NPID controller studied here. Other engineering safeguards, including anti-windup, velocity filtering, saturation-aware freezing of adaptation, current slew limiting, and a joint-angle guard, remained active as the fixed deployment envelope during all real-hardware trials. They are important for reproducible execution, but they were not treated as the main scientific novelty of the manuscript.
Figure 3 summarizes the full closed-loop architecture and the role of every equation introduced above. The signal flow runs from left to right along the main horizontal chain. The
Reference block emits the locked-centre sinusoidal reference
with the validated amplitudes
and frequencies
. The
Error shaping block applies Eqs. (
2)–(
3) to produce the generalized error
, with integral anti-windup acting on the integrator inside
. The
Adaptive Nussbaum mechanism expands its internal structure into six sub-blocks arranged in a
grid: the left column holds the
filtered error (
routed in from PID shaping), the
adaptation law of Eq. (
7) with hyperparameters
, and the
pure adaptive law of Eq. (
12); the right column holds the
RBF regressor with input
, the
ζ update law (
driven by
with the
tail-leak input from the regularization block), and the
Nussbaum gain that supplies unknown-sign handling. The block-level output
of Eq. (
6) is shown along the bottom of the mechanism panel, where the RBF regressor output
is combined with the adapted weights
to drive the pure adaptive law.
The three hardware-oriented additions are grouped in the
Regularization block at the bottom of the figure and are routed via dashed green arrows back into the controller: the soft
-leak
is injected directly into the
-update sub-block of the adaptive mechanism, while the velocity-reference feedforward
of Eq. (
15) and the tail damping
of Eq. (
17) are summed with
at the explicit
junction outside the mechanism panel. The summed pre-deployment command
then enters the
Current mapping + guards block, which applies the current mapping of Eq. (
19), the current and slew clamps, and the saturation-aware adaptation freeze before issuing the raw current command
to the
Actuator (Dynamixel ID 6 in Mode 0 current loop). The measured joint state
is fed back to the input summing junction to close the position loop.
A second, offline loop is shown in dashed purple. The
Optuna tuning and evidence block consumes per-trial telemetry from the actuator during the tuning phase only, and distributes the selected hyperparameters along the
selected parameter bus to every tunable block:
in error shaping, the core/RBF gains
, the regularization parameters
, and the current and slew limits in the current-mapping block. The amber
OFFLINE TUNING badge at the top-right of the figure, the
(offline, frozen at deploy) qualifier on the parameter bus, and the
(tuning phase only) qualifier on the telemetry arrow all reinforce that this Optuna loop is
not part of the deployed real-time controller: once the parameter set of
Table 6 has been selected, the Optuna bus and tuning telemetry are disconnected, the parameters are frozen, and the controller runs as a fixed-parameter real-time loop. The colour key at the bottom of the figure separates the four functional concerns: controller math (blue), hardware/safety (orange), regularization (green), and Optuna evidence (purple).
3.4. Current-Command Deployment Layer
The actuator does not accept a torque command directly. The dimensionless controller command
is therefore mapped into raw Dynamixel current-command counts as
where
maps the dimensionless controller command into raw current counts,
is a raw-count bias,
is the raw current clamp, and
is the experimentally identified command-sign convention. For the final manuscript configuration,
,
,
, and
. Since the raw Dynamixel current command is integer-valued, the small optimized bias
has no independent physical meaning except through its effect on rounding close to integer thresholds. It is retained here only for exact reproducibility of the released configuration selected by Optuna. In practice, this value is approximately equivalent to zero raw current count.
If a physical current value is required, the raw command can be converted using the current least-significant bit of the tested Dynamixel actuator,
where
is the model-specific current scale in
. The deployment configuration used the XM430-W350 current-register convention for the tested Dynamixel actuator, for which the project configuration records
. Therefore, the clamp
corresponds to approximately
. The quantitative controller claims in this paper are nevertheless reported in raw Dynamixel current units and internal command units because those are the quantities logged by the deployment layer. The sign must be validated experimentally; it cannot be copied blindly from a different controller family.
The command sent to the motor was also subject to current clamping and a first-order slew constraint. Although these safety layers are conceptually simple, they are crucial for real-hardware credibility because they define the part of the theoretical control law that the actuator can actually realize.
3.5. Theoretical Scope of the Implemented Controller
The baseline NPID controller follows the theoretical formulation in the source work, where boundedness is established under the assumptions of the considered manipulator model and the corresponding adaptive control law. The enhanced controller implemented in this paper includes hardware-facing components, namely soft adaptation-state leakage, low-speed velocity-reference feedforward, tail-region damping, current clamping, slew-rate limiting, anti-windup, and saturation-aware adaptation handling. These components are introduced for real-motor deployment and are not claimed here as a new Lyapunov-based theoretical extension of the source controller. Therefore, the claims of the present paper are restricted to experimentally observed bounded operation under the reported actuator, reference trajectory, operating region, and safety envelope. The objective is to characterize and improve practical deployability, not to replace the original theoretical analysis.
A clarification on the role of the Nussbaum mechanism in this single-joint study is warranted, because it pre-empts a natural objection. On the isolated Dynamixel ID 6 test bed the effective command sign is in fact identifiable, and indeed the deployment layer fixes it to a constant value (
in Eq. (
19)). One might therefore ask why a Nussbaum-gain controller, whose defining purpose is to resolve an
unknown control direction, is exercised at all on a joint whose direction is known. The answer is methodological. The unknown-sign capability of the Nussbaum law is not the property under test in this paper; it is the property the source controller will be relied upon to provide in the multi-joint and variable-transmission settings targeted in subsequent work, where effective control-direction and low-level command-mapping uncertainty genuinely arise once joints are coupled, reconfigured, or driven through gear trains of mixed convention. Before that sign-handling capability can be trusted on real hardware, the
deployment dynamics of the very same adaptation law—the growth of
, the behaviour of
near its critical region, and the interaction of both with current limits and friction—must first be characterized in the cleanest practical setting. A single decoupled joint with a known, fixed sign is precisely that setting: it reduces the influence of multi-body coupling and removes sign ambiguity, so that the
-drift degradation reported in
Section 4 can be interpreted as a deployment-layer effect rather than as a sign-resolution transient. In other words, the known-sign single joint is used here not because the Nussbaum gain is needed to find the sign, but because it is the controlled experiment in which the deployability of the Nussbaum adaptation mechanism can be established before its sign-handling property is exercised on harder, genuinely sign-uncertain configurations.
3.6. Optuna-Guided Real-Hardware Tuning
Optuna is used in this paper
only as an offline tuning framework. It runs on a host computer that is connected to the robot during the parameter-search phase, drives a sequence of finite-duration real-hardware trials, and stores trial telemetry, scores, and the selected parameter set. Once the parameter set of
Table 6 has been chosen, the Optuna loop is disconnected and the controller runs as a fixed-parameter real-time loop. No live optimization, online hyperparameter adaptation, or online Optuna optimization is used during deployment; the purple Optuna paths in
Figure 3 are exercised only during the tuning phase.The manual tuning was useful for initial safety checks, but it was insufficient for the final controller because several parameters interact nonlinearly:
,
,
,
, RBF width, current ceiling, slew limit,
-leakage, velocity-reference feedforward, and tail-region damping. Optuna was therefore used as the main search framework [
7]. The implementation uses Optuna’s Tree-structured Parzen Estimator (TPE) sampler with multivariate sampling enabled and a fixed random seed for reproducibility, together with a short startup phase of random trials before the TPE model is engaged. The complete tuning archive contains 79 physical closed-loop trials, of which 61 satisfied the score-validity criterion used in the archived logs. The final manuscript configuration is therefore not presented as a blind single-trial winner; instead, it is a long-run validated configuration selected from the parameter basin exposed by the cumulative search effort and then confirmed in 300 s operation. For full transparency, the 79-trial figure refers specifically to the headline
/
archive; the additional operating points reported in
Table 4 and
Table 5 required further finite-duration Optuna passes, so the total real-hardware search effort behind the paper amounts to several hundred trials (
Table 2). All passes shared the same admissibility filters and the same scoring function of Eq. (
21); the quantitative selection claims in this subsection are nonetheless reported only from the single 79-trial headline archive so that every headline number is traceable to one campaign rather than aggregated across heterogeneous search budgets.
The Optuna score was deliberately hardware-aware. Trials were rejected immediately if they violated core admissibility conditions such as insufficient motion span, excessive maximum error, excessive 95th-percentile error, or too much
-growth. The remaining trials were scored using a weighted sum of tracking accuracy, internal command saturation ratio, effort ratio, low control activity, and soft penalties on adaptation growth. To avoid ambiguity about mixed units, the objective is written below in its implemented scaled form: error quantities are in degrees, ratios are dimensionless, and the numerical weights therefore carry the effective reciprocal units needed to make
J a scalar score. Equivalently, the degree-valued terms can be interpreted as normalized by
, while the ratio terms are normalized by one.
where
is the internal command saturation ratio,
is the mean absolute raw-current demand divided by
,
is the achieved motion-span ratio,
penalizes insufficient controller activity,
collects soft exceedances of the configured
-magnitude/growth limits, and
is the internal command clamp. For the headline archive, the active weights were
,
,
,
,
,
,
,
,
for the active
-soft-limit term, and
. Tail-window and step-specific weights were available in the tuning code but set to zero for the headline sinusoidal archive. This structure is closer to real experimental design than to ordinary curve-fitting because it explicitly encodes what counts as a successful and defensible hardware run.