docs(execution): sync phase6 gate truth and task verification
@@ -73,9 +73,9 @@
```
**Next priorities**:
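1. Close out the authenticity and noise-reduction quality of review / cron / verifier
2. Keep tightening the Phase 6 stability gate, sample-window summary, and false-positive propagation
3. Maintain the run-semantics boundary between the official daily report and historical rebuilds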
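1. Fix the live blocker in `run_real_pipeline.sh` when fetching Cloudflare's public pricing, restoring the Phase 6 real-rerun gate
2. Clear the `precondition_missing` failure samples from the collector window and bring the success rate of the last 7 collection runs back to `>=95%`
3. Once both items above are done, rerun `verify_phase6.sh`, then resume work on the authenticity and noise-reduction quality of review / cron / verifier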
### Current runtime truth
@@ -83,6 +83,11 @@
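- `bash scripts/verify_phase3.sh` passes; the official scheduling chain of `run_daily.sh` now applies stricter real-collection checks and writes per-source run audits
- `bash scripts/verify_phase5.sh` passes; the repository now includes `.github/workflows/ci.yml`
- `bash scripts/verify_pre_phase6.sh` passes, which shows the Phase 1~5 gates currently still hold end to end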
- `bash scripts/verify_phase6.sh` currently has only 2 remaining failure points:
  - `live_run_result=FAIL`: the Cloudflare pricing signature guard gets `EOF` when fetching `https://developers.cloudflare.com/workers-ai/platform/pricing/index.md`
  - `window_gate_result=FAIL`: the success rate over the last 7 collection runs is `71.43%`, and every failed sample is `precondition_missing_only`
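- `bash scripts/verify_importer_smoke.sh`, `bash scripts/importer_smoke_gate_test.sh`, and `bash scripts/pipeline_runtime_alignment_test.sh` pass; the 4 official importers for Baichuan / 01.AI / SenseNova / iFlytek are wired into the runtime + smoke + docs loop and have been pushed to all three remotes
- Official daily reports, historical rebuilds, and manual real reruns are now routed to separate run semantics
- `fetchLatestReport` shows only official daily reports by default and does not treat a historical rebuild as the latest official output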
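The window gate figures above can be checked with simple arithmetic. The sketch below is illustrative only: the function names and the sample window are hypothetical and are not taken from `verify_phase6.sh`. It assumes the gate compares a plain success percentage against the `>=95%` threshold.

```python
from typing import Sequence


def window_success_rate(outcomes: Sequence[bool]) -> float:
    """Return the percentage of successful runs, rounded to two decimals."""
    if not outcomes:
        raise ValueError("window is empty")
    return round(100.0 * sum(outcomes) / len(outcomes), 2)


def window_gate(outcomes: Sequence[bool], threshold: float = 95.0) -> str:
    """Return PASS when the success rate meets the threshold, else FAIL."""
    return "PASS" if window_success_rate(outcomes) >= threshold else "FAIL"


# Hypothetical window: 5 successes and 2 failures across 7 runs.
last_seven = [True, True, False, True, True, False, True]
print(window_success_rate(last_seven))  # 71.43
print(window_gate(last_seven))          # FAIL
```

Under these assumptions, a 7-run window only reaches `>=95%` when all 7 runs succeed, since 6 of 7 is about 85.71%.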
TASKS.md
@@ -384,8 +384,8 @@
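- **Delivery semantics**: implementation complete, meaning Tencent Cloud plan subscription prices now have a dedicated API query endpoint; frontend consumption and display enhancements can still evolve separately later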
- **verification**:
  - mode: `test_pass`
  - command: `cd /home/long/project/llm-intelligence && go test ./cmd/server >/tmp/llm_tdata9_test.log 2>&1 && bash scripts/verify_phase6.sh`
  - expected_evidence: `PHASE_RESULT: PASS`
  - command: `cd /home/long/project/llm-intelligence && go test ./cmd/server -run TestSubscriptionPlansHandlerReturnsEnvelope >/tmp/llm_tdata9_test.log 2>&1 && echo runtime-ok`
  - expected_evidence: `runtime-ok`
  - evidence_grade: `runtime-verified`
  - task_type: `automation`
  - timeout_seconds: 180
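A verification record with `command`, `expected_evidence`, and `timeout_seconds` fields could be evaluated roughly as follows. This is a minimal sketch, not the project's verifier: `run_verification` is a hypothetical helper, and it assumes the evidence string is matched as a substring of standard output on a POSIX shell.

```python
import subprocess


def run_verification(command: str, expected_evidence: str, timeout_seconds: int) -> bool:
    """Run a shell command; pass only if it exits 0 and prints the expected evidence."""
    try:
        completed = subprocess.run(
            command,
            shell=True,            # the record stores a full shell pipeline
            capture_output=True,
            text=True,
            timeout=timeout_seconds,
        )
    except subprocess.TimeoutExpired:
        # Exceeding timeout_seconds counts as a failed verification.
        return False
    return completed.returncode == 0 and expected_evidence in completed.stdout


# Stand-in command (hypothetical); a real record would chain `go test ... && echo runtime-ok`.
print(run_verification("true && echo runtime-ok", "runtime-ok", 180))
```

Because the record chains its steps with `&&`, the evidence string is printed only when every preceding step exits successfully, so a substring match on the output is enough in this sketch.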