Bug Journal 2026-02-27

Completed storage analysis on the Tianhe HPC cluster, documented the Slurm GPU node request workflow, and confirmed M14 baseline evaluation results (Pi0/BC-RNN achieved near-zero success rates on error recovery scenarios)

February 27, 2026 · 6 min

Bug Journal 2026-02-26

Rewrote BC-RNN training configs to image mode on the HPC cluster and successfully launched 5-task parallel training. Extended the evaluation framework to support 5 MimicGen tasks, then identified and fixed the task distribution mismatch causing Pi0.5’s 0% success rate.

February 26, 2026 · 8 min

Bug Journal 2026-02-23

Systematically optimized MIHD spatial transcriptomics fusion training on the DCC node (3x CPU acceleration + architecture decoupling + full-slide benchmarking + Vision Refine ablation experiments). Concurrently on the Tianhe cluster, completed MimicGen data preparation, fixed M14 three-way evaluation environment fingerprint crashes, and resolved Pi0.5 full fine-tuning OOM issues. Successfully brought Pi0.5 LoRA training (Job 46553) to stable operation.

February 23, 2026 · 17 min

Bug Journal 2026-02-22

DCC completed MIHD project cleanup and Vision Refinement two-stage fusion implementation with batch experiments launched; tianhe advanced Error Recovery Benchmark Phase II (M14 evaluation pipeline validation, Pi0.5 OOM diagnosis) and completed the Phoenix pi0.5 reproduction full data pipeline (9 MimicGen task datasets ingested at 18.4GB, training config ready).

February 22, 2026 · 14 min

Bug Journal 2026-02-20

Ran targeted tests on STAIG fusion for the MIHD project, discovered a double-normalization bug in eval_scan_fusion.py, and introduced pipeline-level embedding caching into run_benchmark.py.

February 20, 2026 · 5 min

Bug Journal 2026-02-19

In the MIHD project, completed a systematic literature review of H&E Image-Only clustering methods (establishing ARI 0.11–0.16 baselines and five root causes for Foundation Model failure), built four core technical documents, and implemented and validated three self-supervised clustering enhancement approaches (STEGO/BYOL+GAT/SCAN). SCAN improved image-only ARI from 0.251 to 0.303 (+20.6%).

February 19, 2026 · 14 min

Bug Journal 2026-02-14

Implemented force injection enhancements for a robotic arm error-recovery benchmark, but 30N still produces no visible perturbation in video — root cause unresolved

February 14, 2026 · 7 min

Bug Journal 2026-02-13

Translated the MIHD project enhancement plan document to Chinese and wrote it to a new file

February 13, 2026 · 3 min

Bug Journal 2026-02-12

Implemented and debugged GLM billing support for ccusage on tianhe; organized the Chinese version of the MIHD enhancement plan on DCC

February 12, 2026 · 7 min

Bug Journal 2026-02-10

Progress across three projects: finalized contributor documentation for the robotics benchmark project, organized robobrain_pi history and prepared SAC reinforcement learning training, and kicked off documentation updates for the gadget research module

February 10, 2026 · 7 min