AI proves math — and the developer toolchain burns

OpenAI’s model overturned a decades-old conjecture while a trojan VS Code extension exposed thousands of repos — a day of capability leaps and infrastructure fragility.

Editorial: Two themes ran through today’s headlines: AI models pushing intellectual frontiers, and the brittle human systems that let those systems into production. One story shows what large models can discover; the other shows the simplest developer convenience can become the largest attack vector.

Top Signal

An OpenAI model disproved a decades‑old Erdős conjecture

Why this matters now: OpenAI’s general‑purpose reasoning model cracking a famous 80‑year‑old problem signals that AI systems can now produce verifiable, original contributions to core academic fields — a capability that could reshape research workflows and peer review.

OpenAI published a writeup describing a model that produced a counterexample and proof for a long‑standing discrete geometry question about unit distances in the plane; the company framed this as “the first time AI has autonomously solved a prominent open problem central to a field of mathematics” in their post.

"AI is helping us to more fully explore the cathedral of mathematics we have built over the centuries."

That quote captures both the exhilaration and the hard follow‑up work: independent verification. OpenAI released an abridged chain‑of‑thought and companion remarks and invited mathematicians to vet the result; several domain experts praised the work, but the community is rightly cautious about provenance, reproducibility, and attribution. If the proof holds, this is more than a flashy benchmark — it's evidence that models can hold long, structured chains of reasoning and combine ideas across subfields in ways humans missed.

Practically, this should accelerate R&D pipelines (faster conjecture testing, automated counterexample search) but also force journals, arXiv and universities to update policies for AI‑authored results: expect demands for machine‑readable provenance, reproducible runbooks and human sign‑offs. For product teams, the near‑term takeaway is straightforward: treat such models as potent research accelerants, but keep rigorous verification and versioned audit trails before publishing.

In Brief

GitHub confirms breach of ~3,800 internal repos via a malicious VS Code extension

Why this matters now: GitHub’s compromise via a trojanized Visual Studio Code extension exposes how easily developer productivity tools can become supply‑chain attack vectors, putting any org that trusts dev‑side plugins at risk.

GitHub says it "detected and contained a compromise of an employee device involving a poisoned VS Code extension" that allowed exfiltration of internal repositories; the company removed the extension and isolated the endpoint, according to the report. An attacker group, calling itself TeamPCP, claimed the haul and offered it for sale.

The practical risk is high because extensions often run with wide filesystem and network privileges. Short checklist for engineering teams: prefer remote editors or dev‑containers, limit extension installs on corporate devices, enforce extension allow‑lists, rotate secrets often, and gate sensitive CI actions behind isolated runners.

Mozilla is retiring asm.js optimizations (move to WebAssembly)

Why this matters now: Mozilla removing asm.js optimizations from Firefox forces teams still shipping asm.js builds to migrate to WebAssembly for performance and security benefits.

Mozilla announced it will turn off asm.js optimizations in Firefox 148 and remove the code later; asm.js will still run as plain JavaScript, but maintainers recommend compiling to WebAssembly for speed and smaller binaries (details). If your team ships compiled C/C++ to the browser, start planning the migration path now.

A tokens‑per‑second demo that makes latency tangible

Why this matters now: The token‑speed demo converts abstract tok/s numbers into an experience you can feel — useful for product decisions around responsiveness and model selection.

A practical interactive explainer lets you compare generation rates across modes and chips, illustrating how the same tok/s produces different user experiences for code versus prose (demo). Product and infra teams should play with this when sizing SLAs for interactive agents: perceived latency is as much about token density and prefill as it is raw decode speed.

Deep Dive

GitHub’s VS Code supply‑chain compromise — why this fracture matters

Why this matters now: The GitHub incident is a real‑world case study in how developer ergonomics (installing a helpful editor extension) becomes a single point of failure for code integrity across an org.

The attack vector is familiar and ugly: a malicious extension appeared as a seemingly benign plugin, an employee installed it, the extension ran with broad permissions, and attackers used that foothold to access internal repos. GitHub says the incident "involved exfiltration of GitHub‑internal repositories only" and has contained the device, but the attack underlines systemic weaknesses.

Operational lessons for engineering leaders:

Reduce trust surface: require extension allow‑lists, and restrict installs to curated sets on corporate machines.
Move sensitive work off local hosts: remote dev containers, ephemeral workspaces and cloud‑hosted IDEs (where the environment is centrally controlled) drastically reduce the value of local compromises.
Harden keys and secrets: never store long‑lived credentials on dev machines, enforce short TTLs, and require hardware or vault‑backed signing for deploys.
CI as a choke point: implement immutable, isolated runners for production deployments; treat repository access as high‑risk and log/alert on unusual patterns.
Vendor & supply‑chain governance: scan installed extensions and third‑party tools automatically and require code signing and provenance for any plugin that touches the filesystem.

Beyond technical fixes, this event accelerates a cultural shift: security teams can no longer treat developer tools as “user convenience” — they are critical infrastructure. Expect more platform‑level controls from Microsoft/GitHub (sandboxed extension models, stricter marketplace reviews) and a renewed market for “enterprise‑grade” plugin catalogs.

"Developers install community plugins because they solve real problems and speed work, but that convenience creates massive attack surface."

Closing Thought

AI capability and platform fragility are flipping together: models are now capable of original, verifiable discovery, and the tools we trust to build and ship that work are the same ones adversaries target. That dual reality argues for two concurrent investments — better validation and provenance for AI‑generated knowledge, and tougher supply‑chain hygiene for developer workflows. Teams that move on both fronts will get the upside of faster research without the downside of an avoidable breach.