high risk claim articles

caveman: Julius Brussee's terse-output skill

Jun 20, 2026·15 min read·14 sources

open-source typescript claude-code codex gemini+21

GitHub repo JuliusBrussee/caveman — a TypeScript Claude Code / Codex / Gemini / Cursor skill that asks the agent to talk like a caveman. 74,940 stars and 4,230 forks as of 2026-06-20, MIT, 15 releases (latest v1.9.0 on 2026-06-12). Project-published benchmark of 10 real Claude API prompts shows 65% average output-token reduction (range 22–87%); caveman-compress sub-skill cuts 46% of tokens from real memory files. The README's own Important box is the lead caveat: caveman only affects output tokens — thinking/reasoning tokens are untouched.

OpenAI ships ChatGPT health; o3 re-solves 4.8% of rare

Jun 20, 2026·14 min read·8 sources

openai chatgpt gpt-5-5-instant health-ai rare-disease+11

On 2026-06-18, OpenAI published two health stories: a consumer ChatGPT product/evaluation update built on GPT-5.5 Instant that OpenAI reports as rated higher than physician-written responses on a 3,500-response physician panel and a 71% drop in flagged factuality issues on production health traffic over the last two months; and a peer-reviewed NEJM AI study in which OpenAI o3 Deep Research reanalyzed 376 previously unsolved rare-disease cases at Boston Children's Hospital's Manton Center and surfaced candidate diagnoses for 18 cases (4.8% additional yield) after expert ACMG/AMP review and CLIA-certified confirmation — 7 of 18 were rediscoveries of diagnoses already in public databases. Two stories in one day, two separate artifacts, with a load-bearing clinical boundary: the model did not diagnose any patient, and the retrospective study was on heterogeneous cohorts with unblinded reviewers.