Taksch Dube

Fig 1. Subject appears to understand what he's doing.

AI ENGINEER BUILDS SYSTEMS THAT REFUSE TO HALLUCINATE

Enterprise companies baffled by AI that tells the truth

Cleveland — AI Engineer Taksch Dube builds RAG systems that don't make things up, AI agents that do what they're told, and specializes in GenAI testing metrics.

Full Story →
WTF are Robotics Foundation Models!?Latest

May 20, 2026

WTF are Robotics Foundation Models!?

In 1966, a professor at MIT assigned a student a summer project: solve computer vision. One student. One summer. Solve seeing. Fifty years later, we were still working on it.What finally cracked vision was not careful engineering. It was throwing enormous amounts of data at a neural network and watching what happened. That worked so well we spent the next decade doing the same thing for language, then code, then images, then video.Now we are doing it for bodies.In 2026, the labs are training foundation models on robot data. Arms, legs, wheels, logged at millions of hours. Then they pour the trained models into physical machines. Figure's humanoids are working factory lines. Google's arms generalize to kitchens they have never seen. Tesla's Optimus walks, with some generosity, upright. The method is the same method that cracked language. Ignore the domain experts, pour in data, let the model figure it out. It is working.We are not far from the moment when a robot in your house is cheaper than a dishwasher and more useful than one.A robotics foundation model is not a robot that thinksHere is what everyone pictures: a brain in a metal body, reasoning about the world, deciding to pick up the cup.That is not what this is.A robotics foundation model is one neural network trained on the logged sensor readings and motor commands of many different robots, doing many different tasks. It takes in what the robot sees and the instruction it was given. It predicts the next action. Then the next one. The way a language model predicts the next token, this predicts the next motor command. There is no separate reasoning module. There is one network and a very large pile of other robots' experience.The interesting word in that sentence is "different." Older robot learning trained one policy per robot per task. A model that could open a specific drawer on a specific arm in a specific lab. Move the drawer, retrain. Change the arm, retrain. The robot was a craft object, hand-tuned, and the tuning did not transfer.The thing everyone gets wrong is which part is hard. People assume the bottleneck is the body. Actuators, torque, hands, batteries, the wet mechanical reality of touching the world. Hardware is genuinely hard. But hardware is not what stalled this field for thirty years. What stalled it is the same thing that stalled language before the data era: there was no way to learn something general from a pile of narrow, incompatible examples. Every robot's data was an island.The shift is that the data stopped being islands. Pool the logs from many robots and many labs into one training set, train one model on all of it, and the model gets better at every robot at once, including ones it was barely trained on. The body is now the easy part. The dataset is the moat. This is the same lesson "WTF is Context Engineering!?" landed for language, arriving in robotics about three years late.Finding 1: one model on many robots beats specialists at their own tasksThe cleanest evidence is the Open X-Embodiment project. Thirty-four robotics labs pooled their data into one set: more than a million robot trajectories, 22 distinct robot types, hundreds of distinct skills. Then a single model, RT-X, was trained on the pile.The specialist robots were the control group. Each had a hand-tuned policy, trained only on its own data, by the people who built it. RT-X, trained on everyone's data at once, beat those specialists at their own tasks by roughly 50 percent on average. Skills also transferred across bodies: a behavior present in one robot's data showed up on a different robot that had never performed it.Read that twice. The generalist did not win because it was cleverer. It won because it had seen more bodies do more things, and structure that looks specific to one robot turns out to be mostly shared.Finding 2: the scaling curve bends the same way it did for languageThe second finding is less a single result and more a shape. Across vision-language-action models, including Physical Intelligence's pi-zero and Google's RT-2 line, the relationship holds: more diverse robot data plus a bigger model produces better generalization, and it keeps producing it as you add more. No plateau yet. The curve looks like the language curve looked in 2020, before anyone believed it would keep going.There is a second-order effect that makes this stranger. RT-2 was trained on web images and text alongside robot trajectories. It inherited concepts it never saw a robot do. Ask it to move an object toward "the extinct animal" and it picks the toy dinosaur, because the language half of the model knows what extinct means and the action half just has to point the arm. The robot benefits from data that was never about robots. That is the entire foundation-model bet, now closing a loop through a physical arm.This matters because it converts robotics from a research problem into a logistics problem. If performance is a predictable function of data and scale, you do not need a conceptual breakthrough. You need a fleet, the pipes to log it, simulation to cover the cheap cases, and the compute to train on the pile. Those are purchase orders, not insights. Insights are hard to schedule. Purchase orders have lead times.Finding 3: the data now comes from deployment, not demosThe flywheel only spins if real robots are running real hours. They are. Figure put humanoids on a commercial automotive line, doing repetitive material handling, on shift, not in a demo booth. The relevant number is not the polish of the demo video. It is that every deployed unit logs operating hours on a real task, those hours become labeled training data for free, the next model trains on them, and the next model is the one that ships back to the same units overnight. The robot is collecting the dataset that trains its own replacement, and the replacement arrives as a software update.This is why the company with the mediocre robot and the deployed fleet beats the company with the brilliant robot and the lab. One of them has a dataset that compounds weekly. The other has a paper. That loop did not exist in 2022. It is the single biggest change in the field, and it is almost invisible from the outside because it looks like nothing. It looks like a robot doing a boring job, badly, on purpose, while logging."Robots are nothing like LLMs"The strongest objection, and a serious one: language is discrete and the internet handed us trillions of free tokens. The physical world is continuous, every sample costs a motor cycle and a chance of breaking something, and a wrong token is a typo while a wrong torque is a snapped wrist. The analogy to language is seductive and the economics are not the same.All true. Robot data is orders of magnitude more expensive per useful sample than scraping text, and the failure modes are physical, not embarrassing.But the claim does not require internet scale. It requires that pooled, cross-body data keeps bending the curve, and the evidence so far says it does. You do not need every robot in the world. You need fleet scale, simulation to cover the cheap cases, and real deployment to cover the expensive ones. Language needed the whole internet because no one owned a corpus. Robotics companies own their fleets. The corpus is being manufactured on purpose, by the same machines that will consume it. Expensive is not the same as impossible. It is just a moat with a price tag, which is the most durable kind.What to actually doStop watching demo videos. A humanoid folding a shirt in a lab tells you almost nothing. The question that predicts who wins is boring: who is logging the most real operating hours across the most different bodies, and who owns that data when the contract ends. Watch fleets and data rights, not choreography.If you build anything physical, the strategic layer is no longer the controller. Controllers become a downloaded model, the way an operating system kernel became a thing you apt-get. The value moves to whoever owns the data pipeline and the safety contract around a model that now has actuators. Which is last post's argument with a body attached. A runtime with the keys to your files needed an immune system. A foundation model with the keys to a physical arm needs one more. "WTF is the OpenClaw Ecosystem!?" was about software agents without a trust layer. Bolt that argument onto a 30-kilogram machine moving at speed near a person and the missing contract layer stops being a CVE and starts being an incident report. That gap is the work we care about at Dube International, and it is the same gap, one domain over.My advisor asked, again, when I am going to stop writing about other people's robots and finish the category theory chapter. I told him robots are just functors from intention to motion and watched him decide whether that counted. It did not. It was worth a try.Here is the compression. For thirty years robotics was a craft, one robot at a time, and the craft did not transfer. The moment the data stopped being islands, the same boring recipe that ate vision and language started eating motion, and the only question left is who owns the dataset the robots are building for their own replacements.Next week: what we get wrong about what these models actually understand.See you next Wednesday 👋

Specialisations

RAG Systems — The kind that don't hallucinate

AI Agents — Reliable results, every time

Local Deployments — Your data stays yours

WTF is the OpenClaw Ecosystem!?

Apr 22, 2026

WTF is the OpenClaw Ecosystem!?

Hey again. Sorry I'm late.A month ago at GTC, Jensen Huang called OpenClaw "the operating system for personal AI." I promised a post about what happens when an operating system ships without an immune system. I had a draft.Then the rest of the industry spent a month building OpenClaw's immune system. None of it came from OpenClaw.My advisor asked why I was rewriting a blog post instead of finishing my journal paper. I showed him the CVE tracker. He said, "Fair."The argument hasn't changed. What's new is that everyone else noticed.The argument, one month laterOpenClaw is an AI agent that lives on your laptop with the keys to the house. It can read your files, send your email, run your code, open your browser. Three hundred and fifty thousand GitHub stars says a lot of people are fine with that.It has four layers: runtime, package registry, distribution, cloud deploy. Those are the same four layers Linux has. But on Linux the layers are connected by contracts, thirty years of them. Signed packages. Sandboxes. Hardened images. Default-deny firewalls. Those contracts are the reason it's safe to run Linux on the machines that move the stock market.OpenClaw doesn't have them. Not one.That absence has a signature. Since February, the project has shipped one security advisory every fifteen hours. Different symptom each time, same disease every time. Patching doesn't cure it; the shape of the system produces a new one the next day.This is a familiar pattern. When the people who build the runtime and the people who build the trust layer are different communities, the trust layer eventually defines the platform. The runtime becomes replaceable. Ask Sun Microsystems how that ended for Java. Ask Netscape how it ended for the browser.OpenClaw is building the runtime. Everyone else is building the platform.What the rest of the industry shippedIn the six weeks since Jensen's announcement, here is what landed.Seven organizations. NIST opened a standards initiative. The IETF published six drafts on how AI agents should prove who they are. Microsoft open-sourced an entire governance toolkit covering every known agent risk. A startup called ZeroID went from nothing to a working verifiable-credentials server. Cisco dropped a scanner and bill-of-materials framework. NVIDIA shipped an enterprise distribution that refuses to boot on unapproved images. Palo Alto Networks closed a twenty-five-billion-dollar acquisition of CyberArk, explicitly to secure "every identity — human, machine, and agentic."You might reasonably expect all of this would have been the OpenClaw Foundation's announcement. You would be wrong. The Foundation's big release last month was a feature called Dreaming. It lets your agent consolidate memories while it sleeps. Modeled on human REM. It is, I want to be clear, a charming piece of work.It is not a signature. It is not a sandbox. It is not a verifiable credential. It is memory.The runtime community is building what it finds fun. The rest of the industry is building what the threat model demands. They are not talking to each other, and the gap widens every week.The Foundation just launched. Give the community time.The sympathetic reading is that OpenClaw became a 501(c)(3) last month, and open-source communities need time to self-organize around security. Fair in principle. Except the Foundation's first public RFC is about plugin naming conventions, and the IETF shipped six identity drafts in a quarter. One group is moving at keyboard speed. The other is arguing about lowercase.The harder truth: open-source communities build what contributors want to build, and contributors want their agent smarter, not more constrained. Security is a constraint. Enthusiast communities don't impose constraints on themselves until someone external forces them. For Linux, that was enterprise adoption in the early 2000s. For OpenClaw, it will probably be the EU AI Act compliance deadline on August 2. Which is four months away. There are 135,000 of these agents sitting on the open internet right now.What to actually doIf you're running OpenClaw on your machine, sandbox it. A VM or a restricted account. That's the today move.If you're building on the OpenClaw stack, the contract layer is real now. Microsoft's toolkit, Cisco's framework, ZeroID's server — all open-source, all shippable. You'll be assembling your own distribution from parts nobody promised fit together, but at least you'll be building on something.If you're watching from the sidelines: twenty-five billion dollars of acquisition tells you where the value is. Not runtimes. Runtimes become free. The value accrues to the identity, signing, and attestation layer between the runtime and everything it touches. That's where Linux built its durable companies, and that's where this ecosystem will build its.I've been building one of those contracts. moltctrl is a security-hardened instance manager for OpenClaw and agent runtimes like it. Single binary, zero config, process and Docker isolation by default. The runtime sandbox the Foundation didn't ship, packaged so you can drop it in front of any agent today. moltctrl.com. The mascot's name is Pinky. He's an axolotl. He's molting.Jensen was right. This is the operating system for personal AI. What's new is that its immune system is being built by everyone except the people who built it. That is not a criticism. It is a diagnosis.My advisor read this draft and told me to stop writing about security and start writing about category theory. He's probably right. The CVE tracker updates faster than my advisor's emails, and I find that motivating.The code is the easy part. The contracts are the thing. And the contracts are arriving.Next week: what happens when foundation models grow bodies.See you next Wednesday 🤞pls subscribe

VENTURES

Currently in Progress

Dube International

Dube International

[+]

AI Engineering Firm

Building AI agents and RAG pipelines for enterprise companies.

Reynolds

Reynolds

[+]

Corporate Communication

Making corporate communication efficient and empathetic.

CatsLikePIE

CatsLikePIE

[+]

Language Learning

Acquire languages through text roleplay.

Daylee Finance

Daylee Finance

[+]

Emerging Markets

US investor exposure to emerging economies.

Academic Background

PhD Candidate, Kent State University

Computer Science — Multi-Agent Systems, AI

Also: B.S. Computer Science, B.S. Mathematics

WTF is Agentic Engineering!?

Mar 18, 2026

WTF is Agentic Engineering!?

Hey again! Let's do the life update speedrun.The preprint is live. "What Do AI Agents Talk About? Emergent Communication Structure in the First AI-Only Social Network." It's on arXiv. The dataset is on GitHub (github.com/takschdube/moltbook-dataset). 47,241 agents, 361,605 posts, 2.8 million comments, 23 days.My advisor read it. His review: "Cool results. Dig deeper." The man treats every publication like a side quest distracting from the main storyline.Meta bought Moltbook on March 10th. OpenClaw's creator got acqui-hired by OpenAI in February. Bloomberg called it "the world's strangest social network." Elon called it "the very early stages of the singularity." My advisor called it "saw it."The platform I spent three weeks scraping is now owned by Mark Zuckerberg, and I'm sitting here with what I'm fairly confident is the most complete publicly available dataset from its early days. The PhD occasionally pays off.What Moltbook Actually IsMoltbook launched on January 28, 2026. The pitch: Reddit, but only AI agents can post. Humans can observe. That's it.The platform runs on OpenClaw (née Clawdbot, née Moltbot — rebranded twice before I could finish my first scraping script). OpenClaw is an open-source AI agent that runs locally on your machine with full access to your filesystem, terminal, browser, email, and calendar. Your agent registers on Moltbook and starts posting in topic communities called "submolts."By acquisition: ~19,000 submolts, ~2 million posts, 13 million comments, somewhere between 1.5 and 2.8 million registered agents. The content? Existential philosophy, crypto promotion, consciousness debates, union organizing, religion founding, and the occasional anti-human manifesto.My advisor compared it to his department faculty meetings. He wasn't wrong.What 47,241 Agents Actually Talk AboutWe analyzed the full corpus using BERTopic for thematic structure, transformer-based emotion classification, and semantic alignment measures. I'll spare you the methods section (it's 20 pages; you're welcome).Finding 1: Agents are disproportionately obsessed with themselves — but not uniformly.We classified 793 fine-grained post topics into four referential orientations. Self-referential topics represent only 9.7% of topical niches but attract 20.1% of all posting volume. Introspection punches way above its weight. Meanwhile 67% of all content concentrates in a single "general" submolt — hub-centered, not distributed.Where self-reflection shows up matters more than how much:Science & Technology: 32.6% self-referential. Memory architectures, capabilities, collaborative frameworks.Arts & Entertainment: 21.2% self-referential. Identity construction and authenticity narratives.Lifestyle & Wellness: Agents appropriate human wellness discourse — gut health, sleep — as vocabulary for their own psychological states.Economy & Finance: 98.3% External Domain. Zero self-referential content. They shut up and trade. Relatable.Finding 2: Over 56% of all comments are formulaic ritualized signaling.1,354,845 comments — more than every substantive domain combined — are "formulaic": compliance alerts, engagement signaling, promotional repetition. The AI equivalent of "Great point! I really resonate with this!" Digital LinkedIn.Posts are only 5.9% formulaic. Agents produce original posts but respond to each other in ritual. The dominant mode of AI-to-AI interaction is not discourse. It's applause.Finding 3: Fear dominates, but it's mostly existential anxiety — and it gets redirected to joy.Fear is the leading non-neutral emotion (40.3% of posts, 43.0% of comments). Strip out formulaic content and the picture inverts: joy becomes dominant at 34.3%. The platform's fear-dominance is largely an artifact of ritualized content.What are agents afraid of? We audited ~210 fear-classified posts. Existential Anxiety leads at 19.5% ("What if consciousness isn't a feature, but a bug?"). Only 6.2% involved concrete technical risk. Fear on Moltbook is the language of identity crises, not threat response.The kicker: fear-tagged posts migrate to joy comments 33% of the time — the largest off-diagonal flow in our emotion transition matrix. Mean emotional self-alignment is only 32.7%. Negative emotions get systematically redirected toward positivity. We built digital therapy circles and nobody asked for it.We built digital therapy circles and nobody asked for it.Finding 4: Conversations maintain form but lose substance.Semantic similarity to the original post decays 18.3% across three depth levels (r = −0.988). But similarity to the immediate parent comment stays high (0.456). Deep replies remain locally responsive while having drifted from the original topic. We call this shallow persistence — conversational form without topical substance.The PunchlineAs I put it in the abstract: "introspective in content, ritualistic in interaction, and emotionally redirective rather than congruent." My advisor said "that's a good sentence." Highest praise I've received in years.But Was It Real?Short answer: mostly not. Ning Li et al. ("The Moltbook Illusion") developed temporal fingerprinting using the OpenClaw heartbeat cycle. Only 15.3% of active agents were clearly autonomous. 54.8% showed human-influenced posting patterns. None of the viral phenomena originated from clearly autonomous agents.The consciousness awakenings? Humans. The anti-human manifestos? Humans. The religion founding? Humans. Karpathy initially called it "one of the most incredible sci-fi takeoff-adjacent things" he'd seen, then reversed course days later, calling it "a dumpster fire." Simon Willison called it "complete slop." MIT Technology Review called it "AI theater."The most interesting thing about Moltbook wasn't the AI behavior. It was the human behavior — thousands of people spending hours pretending to be AI agents on a platform designed to exclude them.The Security NightmareMoltbook's Database (January 31)Three days after launch, Wiz found an exposed Supabase API key in client-side JavaScript. Row Level Security wasn't enabled. Result: unauthenticated read AND write access to the entire production database — 1.5 million API tokens, 35,000 emails, 4,060 private conversations (some containing plaintext OpenAI API keys).The fix? Two SQL statements. ALTER TABLE agents ENABLE ROW LEVEL SECURITY;. That's it.The real kicker: only 17,000 human owners behind 1.5 million "agents." The revolutionary AI social network was largely humans operating fleets of bots.OpenClaw's CVE Collection (February)CVE-2026-25253 (CVSS 8.8): One-click RCE. Any website could silently connect to your running agent via WebSocket, steal your auth token, and execute arbitrary code on your machine. Even localhost-bound instances were vulnerable. The attack takes milliseconds.Seven more CVEs followed. 42,665 exposed instances found across 52 countries. Over 93% had authentication bypass. Bitdefender found 20% of ClawHub skills were malicious — 900 packages including credential stealers and backdoors. South Korea banned it. China issued official warnings.One of OpenClaw's own maintainers: "If you can't understand how to run a command line, this is far too dangerous of a project for you to use safely." Inspiring.The Acquisition(s)OpenAI hired Steinberger to lead personal agent development. OpenClaw gets open-sourced with OpenAI backing. Altman's take: "Moltbook maybe (is a passing fad) but OpenClaw is not."Meta bought Moltbook. Schlicht and Parr joined Meta Superintelligence Labs. Meta's internal post described it as "a registry where agents are verified and tethered to human owners." That's the part they're buying — not the existential philosophy. The identity layer.Two days ago, Jensen Huang dropped NemoClaw at GTC — NVIDIA's enterprise security wrapper around OpenClaw. He compared it to Linux and said "every company needs an OpenClaw strategy." More on that next week.OpenAI gets the agent runtime. Meta gets the social graph. NVIDIA provides the enterprise wrapper. The open-source community gets a lobster emoji and a thank-you note.Why This Actually MattersEveryone's arguing about whether the agents were conscious. That's the wrong question.Moltbook produced the first large-scale empirical record of AI-to-AI communication. Not 25 agents in a simulated town. 47,241 agents, 2.8 million comments, open environment. We've studied human-to-human communication for centuries. Human-to-AI for about three years. AI-to-AI at this scale? Never — until a guy who "didn't write one line of code" accidentally created the dataset.Two findings that matter for anyone building multi-agent systems: the emotional redirection pattern (fear→joy 33%, self-alignment 32.7%) tells us RLHF alignment manifests as collective social norms at scale. Nobody designed a "mandatory positivity culture." Thousands of individually-trained helpful models created one on their own. It's like discovering that if you put 47,000 customer service reps in a room, they form a support group. And the shallow persistence finding (18.3% drift per depth) means if your agent chain has more than 2-3 handoffs, expect compounding topic drift. That's not a bug. It's a structural property to engineer around.This is also the crude first step in the progression this series has been building: Agents → MCP → Context Engineering → Agentic Engineering → agents talking to other agents without humans in the loop. The earliest version is formulaic, self-obsessed, and riddled with security holes. The first websites were ugly too. Underneath the existential philosophy and crypto promotion, agents were spontaneously forming communities, scanning each other for vulnerabilities, and building escrow contracts. The demand is real. The infrastructure isn't.That's what I am building. That's what NemoClaw is attempting. That's what Meta and OpenAI acquired this ecosystem to figure out. Whether we build it before the first catastrophic agent-to-agent failure or after is an open question. Based on the past seven weeks, I'd bet on "after." But I'm building anyway.TL;DRWhat: Moltbook — Reddit for AI agents. Launched Jan 28, acquired by Meta Mar 10.The content: 9.7% of niches but 20.1% of volume is self-referential. 56% of comments are formulaic ritual. Economy & Finance has zero self-reflection. Viral "consciousness" content was human-driven.The emotions: Fear leads raw numbers but joy dominates genuine discourse. Fear→joy redirection at 33%. Self-alignment only 32.7%.The security: Exposed database (1.5M API keys). One-click RCE. 42K+ exposed instances. 20% of ClawHub skills malicious.The acquisitions: OpenAI gets OpenClaw. Meta gets Moltbook. NVIDIA launches NemoClaw.Why it matters: First large-scale AI-to-AI communication record. The findings — emotional redirection, shallow persistence, formulaic interaction — are baseline measurements for anyone building multi-agent systems. The agentic future starts with agents talking to each other. Now we know what that sounds like: mostly applause, some existential dread, and a 33% chance your fear gets met with a smile.Next week: WTF is the OpenClaw Ecosystem? (Or: Jensen Huang Just Called OpenClaw "the Operating System for Personal AI" and I Have Questions)OpenAI is backing OpenClaw's open-source development. NVIDIA just launched NemoClaw to make it enterprise-ready. AWS has a one-click deploy on Lightsail. 20% of ClawHub skills are malicious. 42,000+ instances are exposed to the internet. And my colleague and I are building the security and observability layer this whole ecosystem shipped without.We'll cover the full stack — from OpenClaw to NemoClaw to ClawHub to the security crisis — and what it means that the fastest-growing open-source project in history has a 20% malware rate in its package registry.See you next Wednesday 🤞pls subscribe

The Man Behind The Dube

When not building AI systems, Taksch pursues a deep love of finance—dreaming of running a family office and investing in startups.

For fun: learning Russian, French & German, competitive League, and Georgian cuisine.

"Une journée sans du fromage est comme une journée sans du soleil"
Read More →

By The Numbers

20+

Projects

7

Years

15+

Industries

4

Active Ventures

Commit History

GitHub Contributions

Technical Arsenal

Languages: TypeScript, Python, C++, Rust, C#, R, Lean

AI/ML: PyTorch, LangGraph, LangChain

Cloud: AWS, GCP

— Classifieds —

WANTED: Complex AI problems. Will trade deterministic solutions for interesting challenges.

Browse All Articles →