Rucha Agashe

Project Deep Dives

Detailed technical write-ups of each project — architecture decisions, problems encountered, how they were solved, and what they produced. Each section draws directly from the project README and source code.

Intrinsic AI for Industry Challenge · Google / Alphabet · $180K Prize Pool · Active

Project Automaton

ACT · Flow Matching · Diffusion Policy · ROS 2 + Gazebo + Isaac Sim · AWS EC2 GPU

GitHub ↗

The Challenge

The Intrinsic AI for Industry Challenge is a robotics + ML competition hosted by Intrinsic (Google/Alphabet) and Open Robotics, with a $180,000 prize pool. The task is deceptively precise: get a UR5e robot arm to insert SFP and NIC cables into a connector board in a simulated environment running Gazebo (Open Robotics) and Isaac Lab (NVIDIA). Scoring rewards task completion (~75 points) and trajectory smoothness (~24 points), and penalises collisions and excessive force. The challenge is non-trivial because cable physics in simulation is notoriously unpredictable: cables buckle, twist, and get physically stuck on ports in ways that deterministic policies cannot reliably handle.

My Role — Policy Training & ML

My contributions are on the ML side of the stack — the team has specialists handling simulation infrastructure (Enrique), data collection (Ali), and cloud infrastructure (Evan). My focus:

ACT (Action Chunking with Transformers) — A transformer-based imitation learning architecture that predicts sequences of actions (chunks) rather than single-step outputs. Chunking reduces compounding error and improves temporal coherence. Training on the experiment_8 dataset from HuggingFace; investigating a persistent GPU utilisation bottleneck (15–25% observed, 90%+ desired — almost certainly a CPU DataLoader bottleneck addressable with pin_memory and prefetching).
Flow-Matching Policy — A continuous normalising flow approach to action generation. The policy learns a vector field that maps noise to actions along a learned trajectory — more stable training than diffusion with comparable expressiveness. My background in differential equations maps directly onto the mathematics here.
W&B Experiment Tracking — 17+ training runs logged. Comparing loss curves, hyperparameter sensitivity (learning rate schedules, batch size, chunk length), and convergence behaviour across model variants.
Data Curation — Post-processing brute-force collected trials to filter successful insertions. The cheat-code oracle policy fails non-deterministically (cables get stuck in Zone 2); the strategy is to collect at scale and filter, rather than fix the simulator physics.
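The flow-matching idea in the list above (a vector field that carries noise to an action) can be illustrated with a toy sampler. This is a minimal sketch, not the team's model: `toy_velocity` is a hand-written stand-in for the trained network, and the target action, step count, and function names are assumptions made for illustration.

```python
import random

def toy_velocity(x, t, target):
    """Stand-in for a learned network v(x, t).

    ASSUMPTION: this closed-form field moves any point to `target` along a
    straight line by t = 1. A trained policy would predict the velocity instead.
    """
    return [(a - xi) / (1.0 - t) for xi, a in zip(x, target)]

def sample_action(target, steps=10, seed=0):
    """Integrate the velocity field from noise (t = 0) towards data (t = 1) with Euler steps."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in target]  # start from Gaussian noise
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt  # t stays below 1, so the division above is safe
        v = toy_velocity(x, t, target)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x

# Hypothetical 6-dimensional action used only for this demonstration.
demo_target = [0.1, -0.2, 0.05, 0.0, 0.0, 0.3]
demo_action = sample_action(demo_target)
```

With a real policy the only change is that the velocity comes from the network, conditioned on the observation, and the number of Euler steps becomes a speed versus accuracy trade-off at inference time.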

The Workspace

Development runs on a dual-monitor setup: RViz and Gazebo Sim side by side, with four concurrent terminals managing simulation, recording, teleoperation, and episode control. Training runs remotely on AWS EC2 g6e instances (NVIDIA L40S GPU — the g4dn.xlarge is insufficient; the L40S is mandatory for Isaac Lab).

Gazebo Sim and RViz running simultaneously

Left: Gazebo Sim + Isaac Lab environment. Right: RViz showing UR5e model with TF frame tree.

UR5e in Action — Cable Insertion

The UR5e arm performing the NIC card cable insertion task. These clips are from the data collection phase — the oracle cheat-code policy executing the insertion that the trained model is learning to replicate.

UR5e arm inserting a cable into the NIC card connector board. The policy must learn to replicate this motion from demonstration data.

Architecture

The full stack spans four layers. The simulation layer runs a UR5e arm in Gazebo (Open Robotics) / ROS 2 / Isaac Lab (NVIDIA) on AWS EC2 g6e instances (NVIDIA L40S GPU required). A data recorder captures bag files per trial at 20 Hz: wrench, image, forward kinematics, and motion commands. A scenario generator randomises task board configurations for training variety.

Above that sits the policy training infrastructure: ACT and flow-matching models trained on the experiment_8 HuggingFace dataset, tracked on Weights & Biases. A non-trivial engineering problem: the standard AIC action space is 6-dimensional, but the team's recordings are 19-dimensional (extra metadata). Similarly, the standard observation space is 26-dimensional; recordings capture 48. Resolving this dimensionality mismatch without losing useful context is an open design question.
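One possible way to handle the mismatch is to split each recorded vector into a standard part and a leftover part, so the extra context is kept rather than thrown away. The sketch below is purely illustrative: the write-up does not say which recorded columns map to the standard spaces, so the index lists and function names are placeholders.

```python
from typing import List, Sequence, Tuple

# ASSUMPTION: the real column mapping is not given in the write-up.
# These index lists are hypothetical placeholders.
STANDARD_ACTION_IDX = list(range(6))   # hypothetical: 6 of the 19 recorded action dims
STANDARD_OBS_IDX = list(range(26))     # hypothetical: 26 of the 48 recorded observation dims

def split_extra(vector: Sequence[float], keep: Sequence[int]) -> Tuple[List[float], List[float]]:
    """Return (standard part, leftover metadata) for one recorded vector."""
    keep_set = set(keep)
    standard = [vector[i] for i in keep]
    extra = [v for i, v in enumerate(vector) if i not in keep_set]
    return standard, extra

recorded_action = [float(i) for i in range(19)]
std_action, extra_action = split_extra(recorded_action, STANDARD_ACTION_IDX)
```

Keeping the leftover columns separate leaves the design question open: they can be dropped, logged, or fed to the policy as auxiliary inputs without changing the standard interface.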

Strategy — Cherry-Picking Best Approaches

The team has analysed the wider competition and identified three key techniques to adopt.

Force/Torque tare calibration (from competitor jlamperez): the raw wrist wrench reads ~20 N because it measures the gripper's own weight, so taring before each episode makes the policy weight-agnostic.
Recording at 20 Hz, not 30 Hz: cameras are capped at 20 fps, so 30 Hz recording introduces ~33% duplicate frames.
Diffusion Policy over vanilla ACT (from ALOHA Unleashed): diffusion handles multimodal action distributions (multiple valid insertion approaches) without collapsing to a bad average. Sanjay leads this track.
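The tare step and the duplicate-frame arithmetic can be sketched in a few lines. This is an illustration under assumptions: the wrench is treated as a 6-vector (three forces, three torques), the offset is estimated from readings taken while the arm is at rest, and the function names are hypothetical rather than taken from the team's recorder.

```python
from statistics import fmean

def tare_offset(rest_samples):
    """Average each of the 6 wrench axes over readings taken at rest."""
    return [fmean(s[axis] for s in rest_samples) for axis in range(6)]

def apply_tare(wrench, offset):
    """Subtract the resting offset so only contact forces remain."""
    return [w - o for w, o in zip(wrench, offset)]

def duplicate_fraction(record_hz, camera_fps):
    """Share of recorded frames that repeat the previous camera image."""
    if record_hz <= camera_fps:
        return 0.0
    return (record_hz - camera_fps) / record_hz

# ASSUMPTION: the gripper weight shows up as about -20 N on the z axis.
rest = [[0.0, 0.0, -20.0, 0.0, 0.0, 0.0]] * 50
offset = tare_offset(rest)
contact = apply_tare([0.0, 0.0, -22.5, 0.0, 0.0, 0.0], offset)
```

At 30 Hz with a 20 fps camera, `duplicate_fraction(30, 20)` gives one third, which is the ~33% figure quoted above.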

Team & Collaboration

Rangers-Intrinsic is distributed across three time zones.

Tommy Ly (NY) leads the project and runs ML alongside a full-time day job.
Shi Hao (London) handles RL and ACT training; 17+ W&B experiments logged. His independent portfolio and Project Automaton write-up: sienarindustries.com ↗
Enrique (London) built the simulation environment, data recorder, and HuggingFace dataset.
Ali (London) built the scenario generator and ran 1,000 brute-force collection trials.
Vladimir is a Senior ML Scientist at Booking.com specialising in RL for LLMs/VLMs.
Sanjay (Illinois) leads diffusion policy.
Justin (Boston) leads PPO exploration.
Evan (London) manages AWS org-level infrastructure.

ACT · Flow Matching · PyTorch · ROS 2 · Gazebo · Isaac Lab · AWS EC2 · W&B · HuggingFace · UR5e

HackEurope 2026 · B2B SaaS in Development

NeuroCue

Real-time multi-modal social co-pilot for neurodivergent users

GitHub ↗

The Problem

Social situations are genuinely difficult to navigate for many neurodivergent people — not because of any deficiency, but because the implicit communication layer of human interaction (body language, microexpressions, tone) is processed differently. No tool currently addresses all three channels simultaneously in real time. NeuroCue is a quiet, non-intrusive scaffolding system — not a replacement for social instinct, but a support layer for those who need it.

Architecture

Three concurrent Python threads share a single state object guarded by a threading.Lock, fully decoupling data production from LLM consumption:

Vision Thread: Webcam frames → YOLOv11 Nano Pose on remote NVIDIA A10G (Red Hat OpenShift via Cloudflare tunnel) → 17 COCO keypoints → named body-language states

FER Thread: OpenCV Haar Cascade face detection → FERNet CNN (local) every 5s → 7-class emotion classification → compound state detection (e.g. "smiling but masking discomfort")

Audio Thread: sounddevice mic recording → 15s WAV buffer → ElevenLabs Scribe v1 STT → live transcript

LLM Synthesis: Claude reads all three streams atomically every 15s → generates context-aware social advice → ElevenLabs TTS → spoken to user's ear

FERNet — Custom CNN

Trained from scratch during the hackathon. ResNet-style residual blocks with Squeeze-Excite channel attention. 7-class softmax output with smart compound expression mapping — the model doesn't just output a single emotion, it reads the probability distribution: high happy + high fear → "smiling but may be masking discomfort"; surprise + fear → "looks confused." Graceful degradation built in — if fer_model_best.pt fails to load, the system continues on body language and transcript only, no crash.
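The compound mapping described above can be sketched as a small rule table over the softmax output. This is illustrative only: the class order follows the common FER-2013 convention, and the 0.30 threshold is an assumption, not a value taken from FERNet.

```python
# ASSUMPTION: class names and order follow the FER-2013 convention.
EMOTIONS = ["angry", "disgust", "fear", "happy", "sad", "surprise", "neutral"]

def compound_state(probs, high=0.30):
    """Map a 7-class probability vector to a compound label, else the top class.

    `high` is an illustrative threshold for calling a class "high".
    """
    p = dict(zip(EMOTIONS, probs))
    if p["happy"] >= high and p["fear"] >= high:
        return "smiling but may be masking discomfort"
    if p["surprise"] >= high and p["fear"] >= high:
        return "looks confused"
    return max(p, key=p.get)  # fall back to the single most likely emotion

masked = compound_state([0.02, 0.02, 0.40, 0.45, 0.03, 0.03, 0.05])
confused = compound_state([0.00, 0.00, 0.35, 0.05, 0.05, 0.45, 0.10])
plain = compound_state([0.05, 0.05, 0.05, 0.70, 0.05, 0.05, 0.05])
```

Reading the whole distribution rather than the argmax is what lets two moderately strong classes be reported together.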

Body Language States Detected

Crossed Arms: Wrist-to-wrist distance < threshold at chest height

Fidgeting: Rapid wrist movement across 5+ consecutive frames

Disengaged: Nose Y near shoulder average Y (looking down)

Hand Raised: Wrist keypoint above shoulder keypoint

Head Tilt: Ear-to-ear angle exceeds threshold (degrees)

Turned Away: Shoulder width below minimum (facing sideways)

Touching Face: Wrist-to-nose distance below threshold

Nodding: Nose drops below eye midpoint across frames
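Three of the single-frame rules above can be written directly against the 17 COCO keypoints. The sketch below is an approximation of the idea, not NeuroCue's code: the thresholds (`face_ratio`, `tilt_deg`) and the choice to scale distances by shoulder width are assumptions.

```python
import math

# COCO-17 keypoint indices for the joints used here.
NOSE, L_EAR, R_EAR = 0, 3, 4
L_SHOULDER, R_SHOULDER = 5, 6
L_WRIST, R_WRIST = 9, 10

def dist(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])

def body_states(kp, face_ratio=0.4, tilt_deg=15.0):
    """kp: 17 (x, y) image points, y increasing downward. Thresholds are illustrative."""
    states = []
    shoulder_width = dist(kp[L_SHOULDER], kp[R_SHOULDER])
    # Hand raised: a wrist sits above (smaller y than) its shoulder.
    if kp[L_WRIST][1] < kp[L_SHOULDER][1] or kp[R_WRIST][1] < kp[R_SHOULDER][1]:
        states.append("hand_raised")
    # Touching face: the nearest wrist is close to the nose, relative to body size.
    nearest = min(dist(kp[L_WRIST], kp[NOSE]), dist(kp[R_WRIST], kp[NOSE]))
    if nearest < face_ratio * shoulder_width:
        states.append("touching_face")
    # Head tilt: angle of the ear-to-ear line against the horizontal.
    dx = kp[L_EAR][0] - kp[R_EAR][0]
    dy = kp[L_EAR][1] - kp[R_EAR][1]
    angle = abs(math.degrees(math.atan2(dy, dx)))
    angle = min(angle, 180.0 - angle)
    if angle > tilt_deg:
        states.append("head_tilt")
    return states

def neutral_pose():
    """Synthetic front-facing pose used only for the demonstration."""
    kp = [(0.0, 0.0)] * 17
    kp[NOSE], kp[L_EAR], kp[R_EAR] = (100.0, 50.0), (115.0, 50.0), (85.0, 50.0)
    kp[L_SHOULDER], kp[R_SHOULDER] = (140.0, 100.0), (60.0, 100.0)
    kp[L_WRIST], kp[R_WRIST] = (150.0, 200.0), (50.0, 200.0)
    return kp
```

Multi-frame states such as fidgeting and nodding need a short history of keypoints, so they are left out of this single-frame sketch.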

Key Technical Problem & Solution

The three data streams run on entirely different cadences — GPU inference has variable network latency, FER runs every 5s locally, and audio records continuously. The naive approach (waiting for synchronous responses from all three) would have made the system fragile and latency-bound. The solution was to treat the shared state object as a bulletin board: each thread posts its latest output whenever it's ready, and the LLM call reads the whole board atomically on its own 15-second clock. This decoupling is what makes the system robust in live conditions.
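The bulletin-board pattern can be sketched with the standard library alone. The class and key names below are hypothetical, and the short value lists stand in for the real vision, FER, and audio pipelines.

```python
import threading

class Board:
    """Shared state: producers overwrite their slot, the reader copies everything at once."""

    def __init__(self):
        self._lock = threading.Lock()
        self._state = {"pose": None, "emotion": None, "transcript": None}

    def post(self, key, value):
        with self._lock:
            self._state[key] = value

    def snapshot(self):
        # Copy under the lock so the reader never sees a half-updated board.
        with self._lock:
            return dict(self._state)

board = Board()

def producer(key, values):
    """Stand-in for a sensing thread: post each new result as it becomes ready."""
    for value in values:
        board.post(key, value)

threads = [
    threading.Thread(target=producer, args=("pose", ["neutral", "crossed_arms"])),
    threading.Thread(target=producer, args=("emotion", ["neutral", "happy"])),
    threading.Thread(target=producer, args=("transcript", ["hello", "hello there"])),
]
for t in threads:
    t.start()
for t in threads:
    t.join()

snap = board.snapshot()  # what the LLM step would read on its own clock
```

No producer ever waits for another, and the consumer only holds the lock long enough to copy three references, which is why a slow network call in one thread does not stall the rest.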

Dashboard & Interface

The NeuroCue session report interface — showing emotion timeline, body language summary, and voice analysis in a single view. Built with the B2B enterprise use case in mind: exportable session reports, multi-signal visualisation, and a clean dashboard for reviewing what the system detected.

Left: Session report view — 15-minute session, 7 tips given, dominant emotion Neutral. Body language and voice analysis panels. Right: Emotions panel — real-time probability distribution and arousal/valence metrics.

Status

Developing into a B2B SaaS product — target markets: HR departments, autism support organisations, and enterprise training platforms.

Python · YOLOv11 · PyTorch · OpenCV · Claude API · ElevenLabs · FastAPI · Red Hat OpenShift · NVIDIA A10G

ETH Oxford 2026 · Top 7 / 150+ Teams · Flare & Plasma Sponsor Interest

FZAP — Faster Zero-Trust Anonymous Payments

Privacy-preserving cross-chain stablecoin settlement with QAOA routing

GitHub ↗

The Problem

A standard on-chain stablecoin transfer publicly reveals the buyer's address, the merchant's address, the exact payment amount, and the precise timestamp. This metadata leakage creates real risks. FZAP ensures there is never a direct blockchain transaction between a buyer and a merchant — without becoming an arbitrary privacy mixer (which would be a regulatory problem).

Three Privacy Layers

1. One-time-use ephemeral identities. Each transaction generates a fresh set of cryptographically secure wallet identifiers, unique within the protocol lifetime. No reuse, no linkage.

2. Aggregation-based settlement. Buyer deposits are pooled before settlement — conceptually an omnibus account. Once aggregated, individual payment amounts lose their one-to-one correspondence with specific wallets. A deliberate propagation delay further weakens timing-based correlation attacks.

3. Multi-hop QAOA-optimised routing. Aggregated funds are converted across stablecoins and chains. Possible transitions are modelled as a graph: nodes represent currencies, and edge weights represent swap and bridge fees. QAOA (via Qiskit) searches this cost landscape to identify routing configurations that minimise aggregate loss. Real-time pricing comes from the Flare Time Series Oracle (FTSO) API, and temporary stablecoin depegs are modelled as negative cost offsets to preserve pooled value.
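To make the cost graph concrete, here is a classical baseline for a single source-to-destination route. It is deliberately not QAOA: it uses Bellman-Ford, which tolerates the negative edge costs that depeg offsets introduce. Node names and fee values are invented for illustration.

```python
def cheapest_route(edges, source, target):
    """Bellman-Ford over a fee graph. edges: list of (from_node, to_node, cost)."""
    nodes = {n for u, v, _ in edges for n in (u, v)}
    cost = {n: float("inf") for n in nodes}
    prev = {}
    cost[source] = 0.0
    for _ in range(len(nodes) - 1):
        changed = False
        for u, v, c in edges:
            if cost[u] + c < cost[v]:
                cost[v] = cost[u] + c
                prev[v] = u
                changed = True
        if not changed:
            break
    # A further improvement would mean a negative-cost cycle (an arbitrage loop).
    for u, v, c in edges:
        if cost[u] + c < cost[v]:
            raise ValueError("negative-cost cycle in routing graph")
    if cost[target] == float("inf"):
        raise ValueError("target not reachable")
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return cost[target], path[::-1]

# Hypothetical fees in basis points; the negative edge models a temporary depeg offset.
demo_edges = [
    ("USDC@Ethereum", "USDT@Ethereum", 4.0),
    ("USDC@Ethereum", "USDC@Flare", 10.0),
    ("USDT@Ethereum", "USDT@Flare", 5.0),
    ("USDC@Flare", "USDT@Flare", -3.0),
    ("USDC@Ethereum", "USDT@Flare", 15.0),
]
route_cost, route = cheapest_route(demo_edges, "USDC@Ethereum", "USDT@Flare")
```

A baseline like this is mainly useful as a reference answer: on small graphs it gives the optimum that a QAOA result can be checked against.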

FDC Settlement Verification

For each settlement, an attestation request references the transaction hash, destination chain, and payment type. This is processed during a Flare Data Connector (FDC) voting round. Once finalised, the result is committed to a Merkle tree. Merchants submit a Merkle proof to the settlement verification smart contract, which validates inclusion against the published FDC root. A settlement is only considered complete once verified on-chain — no linkage is ever created between buyers, intermediate hops, and the final merchant payment.
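The inclusion check at the heart of this flow can be sketched off-chain. This is a generic Merkle proof, not the FDC contract: SHA-256 stands in for the hash used on-chain (EVM contracts typically use keccak256), sorted-pair hashing is an assumed convention, and the leaf contents are placeholders.

```python
import hashlib

def h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def hash_pair(a: bytes, b: bytes) -> bytes:
    # Sorting the pair means a proof needs no left/right flags.
    return h(min(a, b) + max(a, b))

def build_levels(leaves):
    """Return every tree level, from hashed leaves up to the single root."""
    level = [h(x) for x in leaves]
    levels = [level]
    while len(level) > 1:
        if len(level) % 2:
            level = level + [level[-1]]  # duplicate the last node on odd levels
            levels[-1] = level
        level = [hash_pair(level[i], level[i + 1]) for i in range(0, len(level), 2)]
        levels.append(level)
    return levels

def merkle_root(leaves):
    return build_levels(leaves)[-1][0]

def merkle_proof(leaves, index):
    """Sibling hashes needed to rebuild the root from the leaf at `index`."""
    proof = []
    for level in build_levels(leaves)[:-1]:
        proof.append(level[index ^ 1])
        index //= 2
    return proof

def verify(leaf, proof, root):
    node = h(leaf)
    for sibling in proof:
        node = hash_pair(node, sibling)
    return node == root

attestations = [b"settlement-1", b"settlement-2", b"settlement-3"]  # placeholder leaves
root = merkle_root(attestations)
proof = merkle_proof(attestations, 2)
```

The merchant only needs the leaf, the proof, and the published root; the other settlements in the round are never revealed.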

Legal Positioning

Unlike traditional privacy mixers, FZAP provides no arbitrary anonymisation or user-controlled withdrawal paths. All value flows are explicitly tied to merchant payments; withdrawals are restricted to predefined settlement addresses. This is what makes it commercially deployable rather than a regulatory liability — and the distinction the sponsors found compelling.

Tech Stack

Python · Solidity · TypeScript · QAOA · Qiskit · Max-Cut · Flare FTSO · Flare FDC · Merkle Trees

Personal Project · Built Entirely From Scratch

Chess Engine & Gameplay Interface

Hybrid Minimax + MCTS with a pre-trained Transformer for simulation

GitHub ↗

Why From Scratch?

Instead of using python-chess, all core mechanics — valid move generation, checkmate detection, board state management — are implemented from the ground up. This gives full control over the game tree and was the point: I wanted to understand exactly what was happening at every level, not call a function and trust it.

Search: Minimax + Alpha-Beta Pruning

The engine uses a Minimax algorithm to navigate the decision tree — identifying the move that maximises its advantage while assuming the opponent plays optimally to minimise it. Alpha-Beta Pruning cuts off branches that are mathematically proven to be worse than previously explored options, significantly reducing computation time. The Negamax variant is used where applicable, handling evaluation from a single perspective by negating scores for the opponent.
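A compact illustration of Negamax with alpha-beta pruning on a toy tree follows. It is a sketch of the general technique, not the engine's search: the tree is a nested list whose leaves are scores from the first player's point of view, standing in for real move generation and evaluation.

```python
def negamax(node, alpha=float("-inf"), beta=float("inf"), color=1):
    """Return the value of `node` for the side to move.

    node: a number (leaf score from the first player's view) or a list of children.
    color: +1 when the first player is to move, -1 otherwise.
    """
    if not isinstance(node, list):
        return color * node
    best = float("-inf")
    for child in node:
        # The opponent's best result is our worst, hence the negation and swapped window.
        score = -negamax(child, -beta, -alpha, -color)
        best = max(best, score)
        alpha = max(alpha, score)
        if alpha >= beta:
            break  # the opponent already has a better option elsewhere: prune
    return best

toy_tree = [[3, 5], [2, 9], [0, 1]]
best_value = negamax(toy_tree)
```

In this tree the leaves 9 and 1 are never visited: once the second and third branches show a reply worth 2 or less, they cannot beat the 3 already secured in the first branch.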

Search: Monte Carlo Tree Search

Traditional tree searches are limited by the exponential growth of possible moves. MCTS is integrated to balance exploration vs. exploitation. Selection and expansion use Minimax logic to prioritise high-value nodes. Simulation uses a pre-trained Transformer to estimate game outcomes from a given state — far more efficient than searching the entire subtree. Results back-propagate up the tree to update the value of each move.
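The exploration versus exploitation balance is usually expressed through the UCT score. The sketch below shows the textbook rule and a sign-flipping backpropagation step; it is not necessarily the selection rule this engine uses (which blends in Minimax logic), and the node dictionaries are a simplification.

```python
import math

def uct_score(total_value, visits, parent_visits, c=math.sqrt(2)):
    """Average value (exploitation) plus a bonus for rarely visited moves (exploration)."""
    if visits == 0:
        return float("inf")  # always try unvisited moves first
    return total_value / visits + c * math.sqrt(math.log(parent_visits) / visits)

def select_child(children, parent_visits):
    return max(children, key=lambda ch: uct_score(ch["value"], ch["visits"], parent_visits))

def backpropagate(path, result):
    """Update nodes from the last one in `path` back to the root.

    `result` is the outcome from the viewpoint of the last node; it flips sign
    at each ply because the players alternate.
    """
    for node in reversed(path):
        node["visits"] += 1
        node["value"] += result
        result = -result

children = [
    {"move": "e4", "value": 6.0, "visits": 10},
    {"move": "d4", "value": 3.0, "visits": 4},
    {"move": "c4", "value": 0.0, "visits": 0},
]
```

In the engine described here, the `result` fed to backpropagation would come from the Transformer's outcome estimate rather than a random playout.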

Additional Features

FEN notation parsing — Forsyth-Edwards Notation is the standard for encoding board states. Implemented a full BNF-grammar-based parser so the engine can import, export, and resume any position.

Endgame tablebases — pre-computed optimal play for positions with few pieces remaining, used to guide the engine in the endgame rather than searching from scratch.

SQLite-backed history — user accounts and game history stored relationally, enabling game replay, analysis, and statistical aggregation across sessions.

Cycle detection — the search tree identifies and ignores repetitive move sequences to prevent infinite loops and enforce threefold repetition rules correctly.

Pygame interface — custom graphical board with piece loading, move highlighting, drag-and-drop input, and a real-time evaluation bar.
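As an illustration of the FEN feature listed above, the sketch below splits a FEN string into its six fields and expands the piece placement into an 8x8 grid. It is a plain field splitter for demonstration, not the BNF-grammar parser the engine uses, and the dictionary layout is an assumption.

```python
def parse_fen(fen):
    """Parse a FEN string into a board (rank 8 first) plus the five state fields."""
    fields = fen.split()
    if len(fields) != 6:
        raise ValueError("FEN needs 6 space-separated fields")
    placement, side, castling, en_passant, halfmove, fullmove = fields
    board = []
    for rank in placement.split("/"):
        row = []
        for ch in rank:
            if ch.isdigit():
                row.extend(["."] * int(ch))  # a digit encodes that many empty squares
            elif ch in "pnbrqkPNBRQK":
                row.append(ch)               # lowercase is black, uppercase is white
            else:
                raise ValueError(f"unexpected character {ch!r}")
        if len(row) != 8:
            raise ValueError("each rank must describe 8 squares")
        board.append(row)
    if len(board) != 8:
        raise ValueError("placement must contain 8 ranks")
    return {
        "board": board,
        "side": side,
        "castling": castling,
        "en_passant": en_passant,
        "halfmove": int(halfmove),
        "fullmove": int(fullmove),
    }

START_FEN = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"
start = parse_fen(START_FEN)
```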

Python · MCTS · Minimax · Alpha-Beta · Transformer · SQLite · Pygame · FEN Parsing

Imperial Algorithmic Trading Contest · Powered by IMC · Top 10 · Only First-Year

HFT Strategy — Market Making & Statistical Arbitrage

Avellaneda-Stoikov framework · XGBoost/LightGBM directional signals · MARL extension

GitHub ↗

The Brief

Formulate a high-frequency trading strategy to trade 4 assets, maximising the Sharpe ratio using market making and arbitrage strategies, such that PnL can be evaluated in under a minute.

Market Making — Avellaneda-Stoikov

Instead of quoting around the raw mid-price, the engine calculates a reservation price: mid + predicted short-term drift (μᵢ) + inventory penalty (k_inv). This skews quotes away from the direction of inventory exposure, reducing position risk. Spreads are dynamically widened or narrowed based on estimated volatility (σᵢ) and current risk-aversion parameters — wider in high-volatility regimes, tighter in stable ones.
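For reference, the textbook Avellaneda-Stoikov quantities can be written in a few lines. This sketch uses the published closed forms with an added drift term; how the contest strategy actually combined drift and inventory (its μᵢ and k_inv) is not shown in the write-up, so the exact combination here is an assumption, as are the example numbers.

```python
import math

def reservation_price(mid, inventory, gamma, sigma, horizon, drift=0.0):
    """Mid-price shifted against inventory: long positions pull the price down."""
    return mid + drift - inventory * gamma * sigma ** 2 * horizon

def optimal_spread(gamma, sigma, horizon, kappa):
    """Total bid-ask spread; wider with more volatility or more risk aversion."""
    return gamma * sigma ** 2 * horizon + (2.0 / gamma) * math.log(1.0 + gamma / kappa)

def quotes(mid, inventory, gamma, sigma, horizon, kappa, drift=0.0):
    """Return (bid, ask) centred on the reservation price."""
    centre = reservation_price(mid, inventory, gamma, sigma, horizon, drift)
    half = optimal_spread(gamma, sigma, horizon, kappa) / 2.0
    return centre - half, centre + half

# Illustrative numbers only: long 5 units, so both quotes shift below the mid of 100.
bid, ask = quotes(mid=100.0, inventory=5, gamma=0.1, sigma=2.0, horizon=1.0, kappa=1.5)
```

Here gamma is the risk-aversion parameter, sigma the volatility estimate, horizon the remaining time, and kappa the order-arrival decay; with positive inventory the quotes centre on 98 rather than 100, which makes selling more likely than buying.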

Directional Signals — XGBoost / LightGBM

Short-horizon predictors (XGBoost / LightGBM) forecast price movement over the next k ticks. Aggressive market orders are only executed when the signal-to-noise ratio E[R] / σ_est exceeds a calibrated threshold. Quotes are also aggressively skewed away from the predicted direction to avoid being filled on the wrong side of a trend (adverse selection avoidance).

Portfolio Optimisation

Capital is allocated across the four assets by solving a mean-variance proxy: minimise J(w) = −wᵀμ + λwᵀΣw, subject to leverage caps, dollar neutrality, and maximum per-asset exposure. Recomputed periodically via scipy.optimize.
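One way to pose this problem for `scipy.optimize.minimize` is sketched below. It is an assumed formulation, not the contest code: weights are split into long and short parts so the leverage cap stays smooth, and the inputs (`mu`, `sigma`, the caps, and the risk-aversion value) are made-up numbers.

```python
import numpy as np
from scipy.optimize import minimize

def allocate(mu, sigma, risk_aversion=5.0, max_leverage=2.0, max_weight=0.8):
    """Minimise J(w) = -w.mu + lambda * w.Sigma.w under the stated constraints.

    Variables are x = [long parts, short parts], both non-negative, with
    w = long - short. This keeps the gross-leverage cap linear.
    """
    n = len(mu)

    def weights(x):
        return x[:n] - x[n:]

    def objective(x):
        w = weights(x)
        return -w @ mu + risk_aversion * (w @ sigma @ w)

    constraints = [
        {"type": "eq", "fun": lambda x: np.sum(weights(x))},        # dollar neutrality
        {"type": "ineq", "fun": lambda x: max_leverage - np.sum(x)},  # gross leverage cap
    ]
    bounds = [(0.0, max_weight)] * (2 * n)                           # per-asset exposure cap
    result = minimize(objective, np.zeros(2 * n), method="SLSQP",
                      bounds=bounds, constraints=constraints)
    return weights(result.x)

# Hypothetical expected returns and a diagonal covariance for four assets.
mu = np.array([0.02, -0.01, 0.015, 0.0])
sigma = np.diag([0.04, 0.03, 0.05, 0.02])
w = allocate(mu, sigma)
```

Raising `risk_aversion` shrinks every position towards zero, while the equality constraint keeps long and short exposure balanced.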

Feature Engineering

Raw Limit Order Book snapshots (top 5 levels) transformed into: mid-price, microprice, spread, multi-level imbalance, depth sums, queue imbalances, order-flow deltas, short-term realised volatility, VWAP, and rolling means. These features feed both the market-making reservation price and the directional predictors.
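A few of these features can be computed from a single snapshot as follows. The function name and input layout (lists of price and size pairs, best level first) are assumptions for illustration; rolling statistics such as realised volatility need a history of snapshots and are omitted.

```python
def lob_features(bids, asks):
    """Snapshot features from the top levels of a limit order book.

    bids, asks: lists of (price, size), best level first.
    """
    (bid_px, bid_sz), (ask_px, ask_sz) = bids[0], asks[0]
    bid_depth = sum(size for _, size in bids)
    ask_depth = sum(size for _, size in asks)
    return {
        "mid": (bid_px + ask_px) / 2.0,
        "spread": ask_px - bid_px,
        # Microprice weights each side's price by the opposite side's size,
        # so heavy bid interest pushes the estimate towards the ask.
        "microprice": (bid_px * ask_sz + ask_px * bid_sz) / (bid_sz + ask_sz),
        "bid_depth": bid_depth,
        "ask_depth": ask_depth,
        "imbalance": (bid_depth - ask_depth) / (bid_depth + ask_depth),
    }

snapshot = lob_features(
    bids=[(99.0, 30), (98.5, 10)],
    asks=[(101.0, 10), (101.5, 10)],
)
```

In this example the mid is 100 but the microprice is 100.5, reflecting the heavier resting size on the bid.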

MARL Extension

Later extended using Multi-Agent Reinforcement Learning, inspired by Google DeepMind's research on cooperative MARL. Each asset is treated as a separate agent with its own policy, sharing a global Sharpe ratio reward signal — agents learn to coordinate rather than compete for capital.

Python · Avellaneda-Stoikov · XGBoost · LightGBM · MARL · SciPy · LOB · LSTM

Experience

I seek environments where the mathematical demands are insane, and the problems are unsolved. My current work at LabelBox sits in robotics simulation and model evaluation: confidential world-sim and MuJoCo tasks where the hard part is judging whether a model's response respects the structure of a physical system. Everything here connects — the chess engine shaped how I think about game-theoretic trading; robotics simulation shaped how I think about embodied AI; the Code Club reminded me that the best way to understand something is to teach it.

June 2026–
Present

Project Vortex

Co-founder and researcher

Many details are highly confidential. Simulating drone test flights as a proof of concept, investigating closed-loop behaviour under nominal and perturbed wind fields.

May 2026–
Present

LabelBox

Robotics Simulation Engineer

Working on confidential robotics simulation and model-evaluation tasks across MuJoCo and world-sim / world-model workflows. The work includes assessing model-generated MuJoCo code and responses, reviewing simulation-oriented reasoning, and contributing to production-level robotics simulation evaluation without disclosing proprietary task details.

Feb 2026

Susquehanna International Group

Trading Discovery Event

Insight into quantitative trading and research at SIG. 2nd place in Susquehanna's Guesstimathon — a structured estimation competition testing rapid probabilistic reasoning under uncertainty.

Jan 2026

Jane Street

FOCUS / FTTP Spring Week

Below 1% acceptance rate. Software: incident management via a real-time simulation. Trading: completed the training activities given to quant trading interns, including Jane Street's in-house designed games: Figgie and the Estimathon. Came 2nd in both of these. I thoroughly enjoyed it and want to further my understanding of the trading mindset.

2025

Womanium + WISER

Quantum Program Graduate

Graduated from a postgraduate-level quantum computing programme at 17. Focused on quantum algorithms for non-linear PDEs: Carleman Linearisation, QFT, Phase Kickback. I sought this out independently; it stems from a pure curiosity about what quantum computing can actually do that classical computing cannot.

Apr 2024–
Sep 2025

Raspberry Pi Foundation

Code Club Founder & Volunteer

Founded a Code Club in a local library where I designed and ran weekly sessions on Algorithmic Thinking, Principles of Programming, and Python for children aged 9–13. An environment purely focused on building curiosity for computing is something I find energising to create as well as to be in.

Oct 2023–
Jul 2024

Leonardo

Industrial Cadets Gold Award

Year-long industry placement designing and building an autonomous medical aid delivery robot. PWM for motor control, ultrasonic sensing, GPS navigation, and OpenCV for obstacle avoidance. Won the Project of the Year award.

2023

MIT + Caltech

Summer of Quantum Camp

Selected to participate in the Summer of Quantum Camp, run by MIT and Caltech researchers. First formal exposure to quantum computing — sparked the self-directed study that led to the Womanium programme two years later.

Societies & Committees

Imperial Drone Society

Treasurer · 2025
Managing financial tracking, reporting, and parts-list construction across a technical team building interceptor drones. Also a key technical member working on Computational Fluid Dynamics and Computer Vision.

Competitions

4th

UK AI Agent Hackathon

Internationally · Google Cloud, Cambridge
Only team of first-years. Two-stage stochastic + MILP.

Top 7

ETH Oxford Blockchain Hackathon

150+ teams · Oxford Mathematical Institute
QAOA applied to cross-chain settlement on Flare.

5th

QRT Inter-University LLM Hackathon

Oxford, Cambridge & Imperial · Oct 2025

Top 10

Imperial Algo Trading Contest

2nd

Jane Street Estimathon

FOCUS Spring Week · Jan 2026
Also Susquehanna Guesstimathon.

19th

Advent of Code — London

270 participants across 4 universities
11th at Imperial. Jane Street FPGA variant in SystemVerilog & Hardcaml.

Arkwright Engineering Scholarship · Sponsored by the RAF · One of 200 selected nationally · 2023

Semiconductor Talent Award · UK Electronics Skills Foundation · 2025–present

UKMT Best in School Award · BMO Distinction · Sir Andrew Jobbings Kangaroo Merit · 2022, 2023, 2024

British Physics Olympiad Best in School · BPhO & Computational Challenge (Gold) · 2023, 2024

Industrial Cadets Gold · Project of the Year · Leonardo Defence Systems · 2024

Skills

Languages

Python
C / C++
MATLAB
OCaml
SystemVerilog
Rust (learning)

ML & Data

PyTorch
TensorFlow
OpenCV
Scikit-learn
NumPy
XGBoost / LightGBM
YOLOv11
Model evaluation

Quantum

Qiskit
PennyLane
IBM Quantum
QAOA
QFT + Phase Kickback
Carleman Linearisation

Sim-to-real robotics

MuJoCo
Gazebo
Isaac Sim / Lab
ROS 2
World-sim evaluation, world models

Electronics

LTSpice

Electronic Speed Control

Design

AutoCAD, Fusion

Computational Fluid Dynamics

Ansys Fluent

Database, Cloud

AWS

Google Cloud

SQL

NoSQL

—

Journal

Moments, setups, and events — the texture of the work, not just the outputs. An honest record of what it actually looks like.

Writing

April 2026 Project Automaton

Joining an international team of ML engineers in an NVIDIA & Google DeepMind robotics challenge

Gazebo Sim (Open Robotics) and RViz running simultaneously, with the UR5e arm rendered in the connector board environment. The right panel is NVIDIA Isaac Sim. This is what a working day looks like.

Watching Artemis II — returning humanity to the moon, live, from my desk

April 2026 Space · Enthusiasm

Watching Artemis II: returning humanity to the moon after 50 years

You can see my enthusiasm as I set up three simultaneous live feeds to watch this historic moment, as we all unite to wonder at spaceflight and its importance for the future of humans.

Jane Street FOCUS week — what I expected, what surprised me, what I'm still thinking about

January 2026 Finance · Events

Jane Street spring week

Being invited to Jane Street's prestigious Spring Week in London for ambitious, highly mathematical students, with an acceptance rate under 1%.

Full scholarship to an MIT & Caltech quantum programme at fifteen, and how it led to an invitation to present at Cambridge

2023 Quantum Computing · Cambridge

Full scholarship to an MIT & Caltech quantum programme at fifteen and how I made it lead to an invitation to Cambridge

The Summer of Quantum programme (run by The Coding School, with instructors and researchers drawn from MIT and Caltech) was my first formal encounter with quantum computing, and a proper introduction to IBM's Quantum Labs.

CSES awards ceremony — BABY team receiving Project of the Year from the Mayor of Chelmsford

2024 Leonardo · Industrial Cadets

Building BABY, sponsored by Leonardo

BABY (Battlefield Aid Brought to You) was a year-long project sponsored by Leonardo Defence Systems and conducted under the Industrial Cadets programme, with a brief that was rather ambiguous: design and build a fully autonomous g…

Sponsored by the Royal Air Force — the Arkwright Engineering Scholarship and what followed

2023 Arkwright · Aviation

Sponsored by the Royal Air Force: Arkwright Engineering Scholarship (and what it led to)

The Arkwright Engineering Scholarship is awarded each year to approximately two hundred students nationally, selected from a substantially larger pool, with sponsoring organisations that include the Royal Air Force.

April 2026 Project Automaton

Joining an international team of ML engineers in an NVIDIA & Google DeepMind robotics challenge

Gazebo Sim and RViz running side by side — UR5e arm in the connector insertion environment

Gazebo Sim (Open Robotics) and RViz running simultaneously with the UR5e arm rendered in the connector board environment. The right panel is NVIDIA's Isaac Sim. This image shows a typical setup.

The first thing that becomes clear when you join a team of professional ML engineers mid-competition is how much you don't know — and, in my case, how quickly you need to decide whether that is a reason to hesitate or a reason to move faster. I joined Rangers-Intrinsic in April 2026, two weeks before the submission deadline for the Intrinsic AI for Industry Challenge: a robotics competition hosted by Intrinsic (Google/Alphabet) and Open Robotics, with a $180,000 prize pool, asking teams to train a UR5e robot arm to insert SFP and NIC cables into a connector board in a simulated environment. The team had already been running for weeks. I was not there at the start.

The GitHub repository has branches for the evaluator and for training; the experiment tracker on Weights & Biases has seventeen logged runs. I came in with a strong ML background but essentially no robotics experience, which meant I had to learn the simulation stack (ROS 2, Gazebo from Open Robotics, Isaac Lab from NVIDIA, the AIC evaluation engine) while simultaneously being useful on the ML side, which is where I was actually needed.

The learning curve is steep in a specific way I had not anticipated. Some concepts were already familiar: ACT (Action Chunking with Transformers) is a transformer-based imitation learning method, and flow matching is a form of continuous normalising flow; I can work through both from first principles. The difficulty was that the gap between understanding an architecture and getting it to run correctly inside a specific evaluation pipeline, on a specific dataset format, with a specific action space, is substantial. The team's recording pipeline captures 19-dimensional actions; the AIC standard is 6-dimensional. The observation space is 48 dimensions vs. the standard 26. Every discrepancy is a decision, and every decision has downstream consequences for training.

RViz showing UR5e robot model with transform tree — overlapping labels indicate a misconfigured TF tree

RViz: the UR5e model with its full TF frame tree. The overlapping labels are a sign of a misconfigured transform tree — one of those bugs that takes considerably longer to find than it should.

What I have found genuinely surprising is how much of the work at this level is not about the models at all. It is about infrastructure: getting checkpoints to load, debugging why GPU utilisation is 15% instead of 90% (almost certainly a CPU DataLoader bottleneck — more workers, pin_memory, prefetching), building the evaluation loop so that a trained model can actually be scored by the competition engine. The people who are effective on this team are effective because they can hold the full stack in their head simultaneously and move between layers without losing the thread. That is a skill I am actively developing.
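The three DataLoader remedies named above map onto a handful of keyword arguments. The sketch below only builds those arguments, so it runs without PyTorch installed; the helper name, the worker heuristic, and the prefetch value are assumptions rather than the team's settings.

```python
import os

def loader_kwargs(batch_size, reserve_cpus=2):
    """Keyword arguments for torch.utils.data.DataLoader (hypothetical helper)."""
    cpus = os.cpu_count() or 1
    workers = max(1, cpus - reserve_cpus)  # leave a few cores for the main process
    return {
        "batch_size": batch_size,
        "shuffle": True,
        "num_workers": workers,      # load and decode batches in parallel on the CPU
        "pin_memory": True,          # page-locked host memory speeds up copies to the GPU
        "prefetch_factor": 4,        # batches queued ahead per worker (needs num_workers > 0)
        "persistent_workers": True,  # keep workers alive between epochs
    }

kwargs = loader_kwargs(batch_size=64)
# Usage, assuming `dataset` is a torch Dataset built elsewhere:
#   loader = torch.utils.data.DataLoader(dataset, **kwargs)
```

If utilisation stays low after a change like this, the bottleneck is probably further upstream, for example in image decoding or disk reads, and a profiler is the next step.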

My specific contributions: ACT and flow-matching policy training, W&B experiment tracking, data curation (filtering successful insertions from a brute-force collected dataset — the oracle cheat-code policy fails non-deterministically because cables get physically stuck in Zone 2 of the connector board). I am also the person most likely to be investigating the DataLoader bottleneck at 1am, which is, I think, a reasonable description of where I currently sit in the project hierarchy. Moving up that hierarchy, quickly, is the goal.

April 2026 Space · Enthusiasm

Watching Artemis II — returning humanity to the moon, live, from my desk

Artemis II T-minus 3 seconds across three screens

Forty seconds after launch. The contrail is already crossing the frame on all three feeds simultaneously.

I set up three simultaneous live feeds — laptop, external monitor, tablet, arranged in a shallow arc across my desk — to watch the Artemis II launch. This is not a remotely efficient use of a morning and I am not apologetic about it. Some things are worth the hours they cost.

Artemis II carries four astronauts on a lunar flyby: the first human beings to travel to the Moon's vicinity since Apollo 17 in December 1972, a gap of more than fifty years that is either a remarkable fact about the difficulty of the undertaking or a dispiriting fact about institutional priorities, depending on your mood. The Space Launch System is not the most elegant vehicle that has ever been pointed at the sky — SpaceX would, and does, have opinions about this — but the 8.8 million pounds of thrust at ignition produces a particular quality of awe that is not really about elegance.

The same desk, after launch. Three screens still running, the Gonville and Caius card visible on the wall to the right. The cracked monitor has been through a lot.

The reason I care about this — beyond the obvious spectacle of it, which is considerable — is the same reason I care about quantum computing and robotics: there are engineering problems that exist at the genuine frontier of what is physically possible, and watching them get solved, slowly and expensively and in full public view, feels like one of the more honest ways to spend an afternoon. The RS-25 engines at the base of the SLS operate on the same combustion cycle as the Space Shuttle main engines, built decades ago, but manufactured now to tolerances that did not exist when the original specification was written. That compression of time — the same fundamental design, made incrementally more precise across half a century — is what real engineering progress actually looks like, as opposed to what it looks like in a press release.

The wall behind the monitors, if you can make it out: a poster from the Goethe Institut — Das Leben ist kein Ponyhof, life is no pony farm — a card from the Cambridge CS department, a collection of origami flowers I have been folding at my desk while thinking about difficult problems. A reasonably honest cross-section of what goes on in this room on a Saturday morning.

January 2026 Finance · Events

Jane Street FOCUS week — what I expected, what surprised me, what I'm still thinking about

Jane Street Europe. The circular logo is the same one on the cup on my shelf. The building is exactly what you'd expect and somehow still impressive.

I came second in Jane Street's Estimathon during FOCUS week, and I want to be precise about what that means and, more importantly, what it doesn't — because the Estimathon is not the kind of competition where second place is a comfortable shorthand for anything in particular.

The format is fifteen estimation problems in thirty minutes, scored by the inverse of your error across all of them. The questions span several orders of magnitude of difficulty, from questions that reward basic numerical intuition to ones that require a genuine understanding of physical constants, biological scales, or historical data that most people have never thought to commit to memory. The people who perform well are not, in my observation, the people who happen to know the most. They are the people who have learned to commit quickly to an estimate that is defensible, update it cleanly when new information arrives, and resist the specific kind of paralysis that comes from wanting to be more exact than the question actually requires. Approximation, wielded deliberately, is the entire game.

What surprised me, and has stayed with me since, is how closely the disposition that does well in the Estimathon resembles the disposition that seems to do well in research more generally. The Figgie card game — Jane Street's own invention, built explicitly to teach probability and market-making — makes this structure explicit in a way that is almost pedagogical: you are buying and selling contracts whose underlying value is uncertain, updating your expected value in real time as the market reveals information. It is Bayesian reasoning conducted under time pressure and competitive incentive. I found it completely absorbing in a way that told me something about myself that was useful to know.
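The Bayesian updating described above can be made concrete with a small sketch. It assumes the Figgie deck as I understand the published rules (40 cards, with suit sizes 12, 10, 10 and 8 assigned at random, and the goal suit being the same colour as the 12-card suit); the function name and the example hand are hypothetical, chosen only for illustration.

```python
from itertools import permutations
from math import comb

SUITS = ["spades", "clubs", "hearts", "diamonds"]
# Same-colour partner of each suit (assumed rule: goal suit shares the
# colour of the 12-card suit).
PARTNER = {"spades": "clubs", "clubs": "spades",
           "hearts": "diamonds", "diamonds": "hearts"}

def goal_suit_posterior(hand):
    """Posterior probability that each suit is the goal suit, given my hand.

    `hand` maps suit name -> number of cards of that suit I hold.
    """
    weights = {s: 0 for s in SUITS}
    # Every distinct assignment of sizes to suits is one possible deck,
    # and all twelve are equally likely a priori.
    for sizes in set(permutations([12, 10, 10, 8])):
        deck = dict(zip(SUITS, sizes))
        # Hypergeometric likelihood of my hand under this deck; the
        # denominator is identical for every deck, so it is omitted.
        likelihood = 1
        for s in SUITS:
            likelihood *= comb(deck[s], hand.get(s, 0))
        twelve = next(s for s in SUITS if deck[s] == 12)
        weights[PARTNER[twelve]] += likelihood
    total = sum(weights.values())
    return {s: w / total for s, w in weights.items()}

# Holding many spades makes spades the likeliest 12-card suit,
# which in turn makes clubs the likeliest goal suit.
posterior = goal_suit_posterior(
    {"spades": 5, "clubs": 1, "hearts": 2, "diamonds": 2}
)
```

Each trade observed at the table is further evidence of the same kind, so a fuller model would keep updating these weights as the round progresses.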

What I have been sitting with since is a cleaner sense of where, within a firm like Jane Street, I would actually want to operate. The quant research side — building models, forming and testing hypotheses about market structure, working at the boundary between mathematics and financial reality — is where my interests genuinely lie. The pure execution side is a different kind of problem, and an important one, but it is not the problem that wakes me up in the morning. That distinction is worth being honest about when thinking carefully about where to direct the next several years of effort.

The Royal Exchange at night, walking back from Jane Street. The City looks best in the rain at 8pm. I stand by this.

2023 · Quantum Computing · Cambridge

Full scholarship to an MIT & Caltech quantum programme at fifteen, and how it led to an invitation to present at Cambridge

Qubit by Qubit High School Quantum Computing Camp certificate — Rucha Agashe, July 2023

Certificate from the Qubit by Qubit High School Quantum Computing Camp, July 2023. Instructors drawn from MIT and Caltech. I was fifteen.

The Summer of Quantum programme — run by The Coding School, with instructors and researchers drawn from MIT and Caltech — was my first formal encounter with quantum computing, and the thing I most remember about it is how quickly I became convinced that this was a field worth taking seriously. Not because of the applications, which were at that point largely hypothetical, but because of the mathematics. Quantum mechanics makes contact with linear algebra in a way that is both completely natural and genuinely surprising, and the moment that connection clarified — that a quantum state is a vector in Hilbert space, that a quantum gate is a unitary transformation, that measurement is projection — I understood that I was looking at something that was going to occupy a significant portion of my thinking for a long time.
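The three correspondences named above (state as vector, gate as unitary, measurement as projection) fit in a few lines. This is a minimal single-qubit sketch in plain Python, not material from the programme itself; the helper names are my own.

```python
import math

# A single-qubit state is a length-2 complex vector; |0> is (1, 0).
ket0 = [1 + 0j, 0 + 0j]

# The Hadamard gate, a 2x2 unitary matrix.
s = 1 / math.sqrt(2)
H = [[s, s],
     [s, -s]]

def apply(gate, state):
    """A gate acts on a state as an ordinary matrix-vector product."""
    return [sum(gate[r][c] * state[c] for c in range(2)) for r in range(2)]

def probabilities(state):
    """Born rule: projecting onto |0> and |1> gives |amplitude|^2."""
    return [abs(a) ** 2 for a in state]

plus = apply(H, ket0)          # equal superposition of |0> and |1>
probs = probabilities(plus)    # each outcome has probability one half
back = apply(H, plus)          # H is its own inverse, so this is |0> again
```

Unitarity is what keeps the probabilities summing to one after every gate, which is the linear-algebra fact that makes the whole picture hang together.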

I was fifteen. I know that sounds like a detail one mentions to impress, and I want to be honest that I am mentioning it partly for that reason, but also because it matters to the shape of what happened next. I was young enough that I did not yet have a clear sense of what was within reach and what was not, which meant I had not yet developed the particular habit of caution that tends to accumulate with experience. So when the programme ended, I wrote a paper about what I had learned.

The paper was not, if I am being truthful, a very good piece of academic writing. I was fifteen and I had been doing quantum computing for the better part of two weeks. What it was, was sincere — a genuine attempt to work through the material carefully, to understand not just the results but the derivations, to articulate what was clear and to identify honestly what was not. I wrote it because the act of writing something down is how I find out whether I actually understand it. The paper was the test, not the certificate.

What I did not anticipate was where the paper would lead. I submitted it to an essay competition — one with thousands of entries — not with any particular expectation, but because it existed and it seemed like the thing to do with it. Winning was a surprise. Being invited to Cambridge on the strength of it was a larger one. I met the Director of Studies in Computer Science at Gonville and Caius College, and visited the college properly — the courtyard, the sundial tower, the Hawking plaque set into the stones. I want to be careful not to overstate what the meeting was — it was a conversation, not an admission offer — but it was the first time someone at that level of academic seniority had taken my thinking seriously on the basis of something I had produced myself, and that is its own category of thing regardless of what formally followed.

What I took from it, practically: the paper mattered not because it was good, but because it existed and I submitted it. Writing something down is one act; sending it somewhere is a second and distinct act, with its own nonzero probability of producing something unexpected. Most of the time the probability is low. Fifteen-year-old me, writing a quantum computing paper in a school holiday and submitting it to a competition with thousands of entries, had a better intuition about this than she was consciously aware of.

Gonville and Caius College, Cambridge: the Stephen Hawking plaque, with the Bekenstein-Hawking entropy formula S = kc³A/(4ℏG). It is set into the courtyard floor. Remember to look up at the stars and not down at your feet.

Gonville and Caius College. The sundial tower dates from the seventeenth century. The college has been here longer than most of the ideas I had come to discuss.

2024 · Leonardo · Industrial Cadets

Building BABY — a year of aluminium, motors, and learning what engineering actually feels like

The workshop. Safety glasses on, metal frame clamped to the bench, arguing about whether the holes are in the right place. They were not.

BABY — Battlefield Aid Brought to You — was a year-long project conducted under Leonardo Defence Systems' Industrial Cadets programme, with a brief that was refreshingly unambiguous: design and build a fully autonomous ground vehicle capable of navigating to a target location, detecting and avoiding obstacles, and delivering a medical payload under its own guidance. No templates, no reference designs, no starter kit. A sheet of aluminium and a list of constraints.

The gap between understanding how pulse-width modulation works — which is a matter of an afternoon and a textbook — and getting two motors to spin smoothly and in coordination without overheating the driver board, is not a small gap. It is, in practice, a gap that takes weeks to close, and the closing of it involves a particular kind of learning that no amount of reading accelerates. The ultrasonic sensor triggered false positives from reflective floors. The GPS module had a five-metre accuracy radius, which is an enormous circle when the corridor you are navigating is three metres wide, so we built a visual landmark detection system as a fallback. Every subsystem, tested in isolation, behaved exactly as it should. Every subsystem, integrated with the others, produced behaviour that none of us had anticipated. That is engineering. Not the clean version described in lectures — the actual version, which is mostly a negotiation between what you designed and what the world does to it.

The chassis before electronics. Tank tracks, differential drive motors, aluminium frame. Heavier than expected, which became a tuning problem.

Internals: ultrasonic sensor, motor controllers, wiring harness. Eventually this got tidied up.

BABY navigating the test corridor autonomously. The moment this worked without intervention was legitimately exciting.

We won Project of the Year nationally, from thousands of submissions across schools in the United Kingdom, and received the Industrial Cadets Gold Award under the patronage of HM King Charles III. I am genuinely proud of both. But the thing I carry from that year is not the award — it is the specific quality of knowledge that comes from having built something physical, something that either works or doesn't, in a domain where there is no compiler error to blame and no stack trace to follow. The constraints are physics, time, and the tools you have to hand. That is a different kind of problem-solving from what I had practised before, and it changed the way I approach every problem I have worked on since.

CSES awards ceremony. The Project of the Year trophy was presented by the Mayor of Chelmsford. The trophy is much larger in person.

2023 · Arkwright · Aviation

Reading

Papers, challenges, and resources that have occupied my attention recently. Shared not as passive bookmarks but as genuine recommendations — each has informed my thinking in some measurable way.

ML & Systems 2020

Scaling Laws for Neural Language Models

Kaplan, McCandlish, Brown, Amodei et al. · OpenAI · arXiv:2001.08361

One of those papers that ought to be mandatory reading for anyone working in deep learning. Language model loss scales as a power law with model size, dataset size, and compute — trends holding across seven orders of magnitude. Architectural details like depth and width matter remarkably little within a wide range.

The key insight: larger models are dramatically more sample-efficient. The paper concludes that the optimal strategy under a fixed compute budget is to train a very large model on comparatively modest data and stop well before convergence. This is counterintuitive, and later work (notably Hoffmann et al.'s 2022 Chinchilla analysis) shifted the recommended balance towards more data, but the power-law framing itself has held up. Much of how frontier training runs are budgeted today traces back to this paper.
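To make the power-law claim concrete, here is a minimal sketch of the model-size law in the form L(N) = (N_c / N)^alpha_N. The default constants are the approximate fitted values as I recall them from the paper (for non-embedding parameters, loss in nats) and should be treated as illustrative assumptions rather than authoritative figures.

```python
def power_law_loss(n_params, n_c=8.8e13, alpha_n=0.076):
    """Loss predicted by a pure power law in parameter count.

    n_c and alpha_n are assumed, approximate values; check the paper
    before relying on them.
    """
    return (n_c / n_params) ** alpha_n

# The ratio between two model sizes depends only on the exponent,
# not on n_c: doubling N multiplies the loss by 2 ** -alpha_n.
ratio = power_law_loss(2e9) / power_law_loss(1e9)
```

With an exponent this small, each doubling of model size buys a reduction of roughly 5% in loss, which is why the trend only becomes visible across many orders of magnitude.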

Scaling · LLMs · Compute

ML & Systems Active

OpenAI Parameter Golf

OpenAI Model Craft Challenge · GitHub

An open research challenge I am actively participating in. Train the best language model within a 16 MB artifact — weights and training code combined — in under 10 minutes on 8×H100 GPUs. Performance is measured in bits per byte on a held-out FineWeb validation set, making it tokeniser-agnostic.

This is L(N) optimisation in the most constrained form: the limit forces genuinely creative thinking — depth recurrence, aggressive parameter tying, quantisation-aware training from scratch, novel tokenisers, test-time compute tricks. OpenAI are offering $1M in compute credits through RunPod; standout participants may be invited to interview for research positions. The current leaderboard sits around 1.08 BPB.
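Bits per byte is tokeniser-agnostic because it normalises by the size of the raw text rather than by token count. A minimal sketch of the conversion, assuming the model's total negative log-likelihood over the text is available in nats (the function name is hypothetical, not the challenge's evaluation code):

```python
import math

def bits_per_byte(total_nll_nats, text):
    """Convert a summed negative log-likelihood (nats) into bits per byte.

    Dividing by ln(2) converts nats to bits; dividing by the UTF-8 byte
    length removes any dependence on how the text was tokenised.
    """
    n_bytes = len(text.encode("utf-8"))
    return total_nll_nats / (math.log(2) * n_bytes)

# Sanity check: a model that assigns every byte a uniform probability
# of 1/256 pays ln(256) nats per byte, i.e. exactly 8 bits per byte.
sample = "hello world"
uniform_nll = len(sample.encode("utf-8")) * math.log(256)
uniform_bpb = bits_per_byte(uniform_nll, sample)
```

Two models with different vocabularies can then be compared directly, since a tokeniser that produces fewer tokens gains nothing unless the total likelihood of the bytes actually improves.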

LLMs · Parameter Efficiency · Research Challenge

ML & Systems Ongoing

The NanoGPT Speedrun & Modded-NanoGPT

Keller Jordan et al. · GitHub

The direct ancestor of Parameter Golf. The community has taken Andrej Karpathy's 45-minute GPT-2 replication and driven it to approximately 1.45 minutes through 75 successive records — an extraordinary collaborative effort spanning optimiser design, architectural innovation, and systems engineering.

The techniques catalogue here is invaluable: Muon optimiser with Newton-Schulz orthogonalisation, RoPE, QK-Norm, ReLU² activations, value embeddings, FlexAttention with sliding window warmup, fused Triton kernels, multi-token prediction, bigram hash embeddings, and partitioned hyperconnections. If you care about efficient training at all, study the record history. Every entry is a masterclass in extracting marginal gains.
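For readers unfamiliar with Newton-Schulz orthogonalisation, the sketch below shows the classical cubic iteration in plain Python. It is an illustration of the underlying idea only: as I understand it, Muon uses a tuned higher-order polynomial and runs on GPU tensors, so this should not be read as the speedrun's implementation. The helper names are my own.

```python
def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def newton_schulz_orthogonalise(g, steps=15):
    """Approximate the nearest orthogonal matrix to a full-rank g.

    Uses the classical iteration X <- 1.5 X - 0.5 X X^T X, which pushes
    every singular value towards 1 while leaving singular vectors alone.
    """
    # Scale by the Frobenius norm so all singular values start at or below 1,
    # inside the region where the iteration converges.
    fro = sum(v * v for row in g for v in row) ** 0.5
    x = [[v / fro for v in row] for row in g]
    for _ in range(steps):
        xxtx = matmul(matmul(x, transpose(x)), x)
        x = [[1.5 * x[i][j] - 0.5 * xxtx[i][j]
              for j in range(len(x[0]))] for i in range(len(x))]
    return x

q = newton_schulz_orthogonalise([[2.0, 0.0], [1.0, 1.0]])
gram = matmul(transpose(q), q)  # close to the identity if q is orthogonal
```

The appeal for optimiser design is that the loop uses only matrix multiplications, with no SVD or eigendecomposition, so it maps well onto accelerator hardware.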

Training · Optimisation · Systems

Quantum 2026

Advances in Quantum Computing — Caltech & Google

Quanta Magazine · quantamagazine.org

Two simultaneous results that warrant serious attention. A Caltech team (Bluvstein, Cain, Preskill) designed a quantum architecture that could theoretically break RSA using roughly 100,000 neutral-atom qubits. Separately, Google's Craig Gidney published an implementation of Shor's algorithm 10× more efficient than prior methods, potentially breaking elliptic curve cryptography with under 500,000 qubits.

Neither machine exists yet. But both results compress the timeline considerably — fault-tolerant quantum computers capable of breaking widely-deployed cryptography may be years away rather than decades. The implications for post-quantum cryptography transitions are urgent, and I do not think the broader technology industry is taking this seriously enough.

Quantum · Cryptography · PQC

Robotics Apr 2026

NVIDIA Physical AI — National Robotics Week 2026

NVIDIA Blog · blogs.nvidia.com

A round-up of NVIDIA's robotics ecosystem: Isaac GR00T open models for natural language robot control, Cosmos world foundation models for synthetic data generation, the open-source Newton 1.0 physics engine, and Isaac Sim 6.0 now generally available.

Featured use cases include surgical robotics (PeritasAI), underwater simulation (OceanSim), agricultural automation (Aigen's solar-powered weed-removal rovers), and autonomous solar installation (Maximo). NVIDIA's strategy is transparent — embed their tooling so deeply into the robotics development pipeline that it becomes infrastructure — but the engineering output is nonetheless impressive.

Robotics · NVIDIA · Physical AI

Robotics Dec 2025–Feb 2026

REVEL × NVIDIA Physical AI Hackathon

RoboHorizon · robohorizon.uk

A multi-month hackathon with over $100K in prizes — including a humanoid robot as the grand prize. Ran on NVIDIA's Isaac Sim / Omniverse stack, covering manipulation, perception, and agentic control across Pro, Amateur, and Junior tracks. Winning teams were flown to GTC 2026 in San Jose.

This is fundamentally a talent pipeline play. NVIDIA are embedding their tools with the next generation of robotics engineers whilst simultaneously identifying strong candidates for their ecosystem. The structure is clever: the competition is the recruitment process.

Hackathon · Robotics · Isaac Sim

Quant Finance Reference

Jane Street HackerRank — A First-Hand Account

LinkJob · linkjob.ai

A detailed walkthrough of the Jane Street quant trader online assessment — 4 questions in 30 minutes. No algorithms, no coding. Entirely probability, expected value, and Bayesian reasoning: parity of prime sums, a dice game EV optimisation, a Bayesian coin-flip problem, and an urn strategy problem.

Essential reading not for the specific questions (which rotate) but for calibrating the type and depth of probabilistic thinking expected. The time pressure is the real filter — the questions are tractable, the clock is not.
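To give a flavour of the expected-value reasoning involved, here is a hypothetical dice game of my own choosing (not a question from the assessment): roll a fair die, then either keep the face shown or reroll, with a limited number of rerolls, and receive the final face as the payoff. Backward induction gives the optimal policy exactly.

```python
from fractions import Fraction

def ev_with_rerolls(rerolls, sides=6):
    """Expected payoff under optimal play with `rerolls` rerolls available."""
    faces = range(1, sides + 1)
    # With no rerolls left, the value is just the mean face.
    ev = Fraction(sum(faces), sides)
    for _ in range(rerolls):
        # Keep a face only if it beats the value of continuing.
        ev = sum(max(Fraction(f), ev) for f in faces) / sides
    return ev

one_reroll = ev_with_rerolls(1)   # keep 4, 5, 6; reroll 1, 2, 3
two_rerolls = ev_with_rerolls(2)  # the first roll is kept only on 5 or 6
```

Under time pressure the useful habit is the structure rather than the arithmetic: work out the value of the last decision first, then compare each earlier option against it.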

Jane Street · Probability · Interview Prep

Quant Finance Reference

Quant Resources — Forrest Bicker

forrestbicker.com

A well-curated reading list spanning ML (ESL, Bishop, Russell & Norvig), econometrics (Wooldridge, Shumway), quantitative investing (Paleologo's Elements of Quantitative Investing, Dama's On Automated Trading), and interview preparation (Zhou, Crack's Heard on the Street, Mosteller's 50 Probability Problems). Also covers competitive programming — USACO, Codeforces, Project Euler.

Good curated lists are genuinely rare. Most "awesome" lists on the internet are bloated beyond utility. This one is not.

Reading List · Quant · Interview Prep

Quant Finance Reference

WallStreetQuants — Free Quant Lectures

thewallstreetquants.com

Free introductory lectures covering risk-adjusted performance (Sharpe ratio, diversification, portfolio construction), the quant research process (backtesting, overfitting pitfalls), and a case study on a cryptocurrency strategy designed to profit from large liquidation events.

A reasonable entry point for those curious about quantitative finance — the production quality is decent and the content avoids the hand-waving that plagues most introductory material.

Lectures · Portfolio Theory · Backtesting

Tools & Tutorials Reference

Google Cybersecurity Professional Certificate

Coursera · coursera.org

A structured programme from Google covering foundational security concepts, tools, and practical skills. Included here because cybersecurity literacy is increasingly non-optional — even for those whose primary work lies elsewhere. Understanding threat models, cryptographic primitives, and network security is part of building serious systems, not a separate specialism.

Security · Fundamentals · Google

Tools & Tutorials Reference

Contour Detection in OpenCV

LearnOpenCV · learnopencv.com

A thorough tutorial covering both Python and C++ implementations of findContours() and drawContours(), the two approximation methods, and the four retrieval modes with hierarchy relationships. Real-world applications: motion detection, unattended object detection, image segmentation.

Computer vision fundamentals never go out of fashion. The retrieval mode hierarchy — RETR_LIST, RETR_EXTERNAL, RETR_CCOMP, RETR_TREE — is one of those things one looks up repeatedly until it finally sticks. This tutorial made it stick.
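The part that takes longest to stick is how to read the hierarchy array. With RETR_TREE, OpenCV describes each contour by a row `[next, previous, first_child, parent]`, using -1 for "none". The sketch below walks the parent links to recover nesting depth; it takes the rows as a plain list so it runs without OpenCV installed, and the example hierarchy is hand-written for illustration rather than produced from a real image.

```python
def nesting_depths(hierarchy):
    """Depth of each contour in a RETR_TREE-style hierarchy.

    `hierarchy` is a list of [next, previous, first_child, parent] rows,
    i.e. what cv2.findContours returns as hierarchy[0].
    """
    depths = []
    for row in hierarchy:
        depth, parent = 0, row[3]
        # Follow parent links up to a top-level contour (parent == -1).
        while parent != -1:
            depth += 1
            parent = hierarchy[parent][3]
        depths.append(depth)
    return depths

# Hypothetical scene: contour 0 is an outer shape, contour 1 is a hole
# inside it, contour 2 sits inside that hole, and contour 3 is a
# separate top-level shape.
example = [
    [3, -1, 1, -1],
    [-1, -1, 2, 0],
    [-1, -1, -1, 1],
    [-1, 0, -1, -1],
]
depths = nesting_depths(example)
```

For the same scene, RETR_EXTERNAL would return only the two depth-0 contours, and RETR_LIST would return all four with no parent or child links filled in.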

OpenCV · Computer Vision · Python

Last updated: April 2026

Certifications & Skills

Python · C / C++ · OCaml · Haskell · Rust · MATLAB · Solidity · SQL · SystemVerilog · Hardcaml · LaTeX · PyTorch · TensorFlow · PennyLane · Qiskit · OpenCV · NumPy · Scikit-learn · XGBoost · FastAPI · Git · Linux

Languages & Tooling

Languages

Python
C / C++
OCaml
Haskell
Rust
MATLAB
SystemVerilog
Julia (learning)

ML & Data

PyTorch
TensorFlow
OpenCV
Scikit-learn
Pandas
NumPy / SciPy
Hugging Face

Quantum

Qiskit
PennyLane
Cirq
IBM Quantum
QAOA
QFT/QPE
Carleman Linearisation

Systems & Infrastructure

LaTeX
FastAPI / Uvicorn
Red Hat OpenShift
Cloudflare
Git / GitHub
Linux
AutoCAD, Fusion
LTSpice
Ansys Fluent

Programmes & Certifications

Womanium + WISER

Quantum Computing Programme

Postgraduate-level programme covering quantum algorithms for non-linear PDEs, Carleman Linearisation, QFT, Phase Kickback, HHL, QAOA, and quantum linear solvers. One of a small number of undergraduates admitted.

2025 · Completed

Qubit by Qubit · MIT & Caltech

QxQ High School Quantum Computing Camp

Selected to participate in the Summer of Quantum programme, run by The Coding School with instructors drawn from MIT and Caltech. First formal exposure to quantum computing, which sparked the self-directed study that led to the Womanium programme two years later.

2023 · Completed

DeepLearning.AI / Coursera

Deep Learning with TensorFlow

Covers neural network fundamentals, convolutional networks, sequence models, and practical TensorFlow implementation. Builds from perceptrons and backpropagation through to modern deep architectures.

Completed

OpenCV University

OpenCV Courses

Computer vision fundamentals through OpenCV: contour detection, feature extraction, geometric transforms, object detection, and camera calibration in both Python and C++. Applied directly to the robotics and optics simulation projects.

Completed

Google

Google Cybersecurity Professional Certificate

Foundational security concepts, threat modelling, network security, and practical security tools. Cybersecurity literacy as a non-optional foundation for building serious systems.

In Progress

Imperial College London · Algorithmic Trading Society

AlgoCourse 2025/26

Certificate of completion demonstrating fluency in options theory, market making, arbitrage, trade execution, and machine learning applied to quantitative trading.

2025/26 · Completed

Imperial Enterprise Lab · Google · Microsoft

Imperial HealthHack 2026

Certificate of participation in Imperial's healthcare-focused hackathon, sponsored by Google, Microsoft, and Imperial Enterprise Lab.

2026

Papers & Books

Research papers and books I have read, am reading, or keep returning to — with my own perspective on each. These range from foundational to recent; the criterion for inclusion is that they changed how I think about something.

My Writing

Quantum Algorithms for Differential Equations and Linear Systems

Rucha Agashe · WISER + Womanium Quantum Computing Programme · 2025 · Read PDF ↗

Quantum · HHL · PDEs · Survey

Written

About this paper

A formal academic survey written as part of the WISER + Womanium Quantum Computing Programme, a postgraduate-level programme I attended before starting my undergraduate degree. It covers quantum computing foundations through to algorithms for nonlinear PDEs and differential equations: qubits, quantum gates, phase kickback, QFT, QPE, Grover's algorithm, and quantum linear solvers including HHL, VQLS, functional quantum linear solvers, and Schrödingerisation.

The HHL section gives honest treatment of the exponential speedup's caveats — state preparation cost, readout limitations, condition number dependence — alongside near-term alternatives like VQLS. The goal was to document the full arc from foundations to current research frontiers, written for a reader who wants to understand not just what the algorithms do but where the gaps are.

BPhO Computational Challenge — Spherical Mirror Ray Tracer

Rucha Agashe · British Physics Olympiad · 2025 · Read PDF ↗

Optics · Simulation · Physics · Python

Written

About this paper

Submission for the British Physics Olympiad Computational Challenge, in which I was awarded Gold and Best in Cohort. The project builds a ray tracer for spherical mirrors from first principles: applying the law of reflection iteratively to simulate image formation, distortion, and multi-bounce behaviour. The writeup covers the physics derivation, implementation decisions, and analysis of where the paraxial approximation breaks down.

The interesting part is the edge cases — what happens near the focal point, where the mirror equations become singular, and how to handle rays that don't converge. The simulation makes these visible in a way that standard optics diagrams don't.
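The breakdown of the paraxial approximation can be seen with a single reflection. The sketch below is an independent minimal example, not the submitted code: it assumes a 2D concave mirror that is an arc of a circle centred on the origin with its vertex at (radius, 0), and traces one ray travelling parallel to the axis at height h.

```python
import math

def reflect(d, n):
    """Law of reflection for direction d and unit normal n: r = d - 2(d.n)n."""
    dot = d[0] * n[0] + d[1] * n[1]
    return (d[0] - 2 * dot * n[0], d[1] - 2 * dot * n[1])

def axis_crossing(h, radius=1.0):
    """x-coordinate where the reflected ray crosses the optical axis.

    Measured from the centre of curvature. The on-axis ray (h == 0) is
    the singular case, since it retraces its own path and never crosses.
    """
    if h == 0 or abs(h) >= radius:
        raise ValueError("h must be non-zero and smaller than the radius")
    hit = (math.sqrt(radius ** 2 - h ** 2), h)        # point on the mirror
    normal = (-hit[0] / radius, -hit[1] / radius)     # points at the centre
    r = reflect((1.0, 0.0), normal)
    t = -hit[1] / r[1]                                # distance to y == 0
    return hit[0] + t * r[0]

near_axis = axis_crossing(1e-4)   # approaches the paraxial focus at radius / 2
marginal = axis_crossing(0.5)     # crosses further from the centre
```

The crossing point works out to radius / (2 cos θ), where sin θ = h / radius, so rays far from the axis miss the paraxial focus by a growing margin. That spread is the spherical aberration that the standard mirror equation hides.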

Chess Engine Design and Implementation

Rucha Agashe · 2025 · Read PDF ↗

Chess · Search · AI · C++

Written

About this paper

Technical writeup for the chess engine project, covering board representation, move generation, search algorithm design (minimax with alpha-beta pruning), and evaluation function construction. The writeup documents the design choices and the performance trade-offs at each stage: why bitboards over piece lists, how the search tree is pruned, what the evaluation function captures and what it misses.

Chess engines are a good forcing function for understanding search under constraints, because the game tree is deep and the branching factor is high. The gap between a naive implementation and a competitive one is often almost entirely about algorithmic efficiency rather than brute force.
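As a minimal illustration of how alpha-beta pruning cuts the search, the sketch below runs it on a toy game tree written as nested lists with integer leaf scores. It is a Python sketch for readability, not the engine's C++ code, and the tree is made up for the example.

```python
def alphabeta(node, alpha, beta, maximising, visited):
    """Minimax value of `node` with alpha-beta pruning.

    Leaves are ints; internal nodes are lists of children. `visited`
    collects the leaves actually evaluated, to make the pruning visible.
    """
    if isinstance(node, int):
        visited.append(node)
        return node
    if maximising:
        best = float("-inf")
        for child in node:
            best = max(best, alphabeta(child, alpha, beta, False, visited))
            alpha = max(alpha, best)
            if alpha >= beta:
                break  # the minimiser above will never allow this line
        return best
    best = float("inf")
    for child in node:
        best = min(best, alphabeta(child, alpha, beta, True, visited))
        beta = min(beta, best)
        if alpha >= beta:
            break  # the maximiser above already has something better
    return best

tree = [[3, 5], [6, 9], [1, 2]]
visited = []
value = alphabeta(tree, float("-inf"), float("inf"), True, visited)
```

Here the search returns 6 and never evaluates the final leaf: once the third branch is known to be worth at most 1, it cannot beat the 6 already secured. In a real engine the saving depends heavily on move ordering, since good moves searched first produce earlier cutoffs.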

Parameter Golf: Competitive Approaches to Ultra-Compact Language Models

Rucha Agashe · 2025 · Work in progress · Read draft ↗

LLMs · Compression · Parameter Golf · In Progress

Draft

About this paper

A research survey on competitive approaches to the OpenAI Parameter Golf challenge: building the best-performing language model within a 16 MB artifact budget (weights and training code combined), evaluated in bits per byte on FineWeb and trained on 8×H100s. The survey covers the competitive landscape: architecture compression techniques, tokenisation strategies, distillation approaches, and the trade-offs between model capacity and training efficiency at extreme scale constraints.

I enjoyed learning about how TurboQuant (research from DeepMind) approaches KV cache quantisation.
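The arithmetic behind the 16 MB limit is worth writing down, because it shows why quantisation dominates the design space. This sketch assumes decimal megabytes and ignores compression of the stored weights; the function name and the overhead parameter are hypothetical.

```python
def max_params(budget_bytes, bits_per_weight, overhead_bytes=0):
    """Largest parameter count that fits in the budget at a given precision.

    `overhead_bytes` stands in for anything else that must share the
    budget, such as code or metadata.
    """
    return (budget_bytes - overhead_bytes) * 8 // bits_per_weight

BUDGET = 16 * 10**6  # assumption: 16 MB means 16,000,000 bytes

fp16_params = max_params(BUDGET, 16)   # 8 million parameters
int8_params = max_params(BUDGET, 8)    # 16 million parameters
int4_params = max_params(BUDGET, 4)    # 32 million parameters
```

Halving the bits per weight doubles the parameter count available, so the question becomes whether a larger model at lower precision beats a smaller one at higher precision for the same number of bytes.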

Research Papers

Scaling Laws for Neural Language Models

Kaplan, McCandlish, Brown, Amodei et al. · OpenAI · 2020 · arXiv ↗

LLMs · Scaling · Compute

Read

Perspective

The central result, that loss scales as a power law with model size, dataset size, and compute across seven orders of magnitude, is deceptively clean, but the implications are enormous. The insight that changed the field is that larger models are dramatically more sample-efficient: the paper concludes that the optimal strategy under a fixed compute budget is to train a very large model on comparatively modest data and stop well before convergence. This is counterintuitive, and although later work (notably Hoffmann et al.'s 2022 Chinchilla analysis) shifted the recommended balance towards more data, the power-law behaviour itself is well supported.

What I find worth dwelling on is what this paper says about the nature of the problem. The fact that architectural details like depth and width matter remarkably little within a wide range suggests that something more fundamental is being learned than the architecture would imply. The scaling laws feel less like engineering results and more like physics — as if intelligence itself has a thermodynamics.

ELIZA — A Computer Program for the Study of Natural Language Communication Between Man and Machine

Joseph Weizenbaum · MIT · 1966

NLP · Human-Computer Interaction · Historical

Read

Perspective

Weizenbaum built ELIZA as a demonstration of the superficiality of natural language understanding — and was disturbed when users formed genuine emotional attachments to it anyway. He considered this a damning result. What strikes me now, reading it in 2023 after building my own ELIZA-inspired chatbot, is how little has changed in the fundamental dynamic: the illusion of understanding is sufficient to produce the effect of understanding, at least in interaction.

The paper is also a starting point for a serious question that LLMs make urgent again: is there a meaningful difference between a system that appears to understand and a system that does? Weizenbaum thought yes, clearly. The current evidence is less comfortable.

Attention Is All You Need

Vaswani, Shazeer, Parmar et al. · Google Brain · 2017 · arXiv ↗

Transformers · Attention · Sequence Modelling

Read

Perspective

The paper that every subsequent LLM descends from — GPT, Claude, Gemini, all of it. The core idea (replacing recurrence with self-attention entirely) seems obvious in retrospect, but the combination of multi-head attention, positional encoding, and the encoder-decoder structure was genuinely novel. What the paper doesn't tell you is that the implications would take several years to become apparent, requiring scale results like the Kaplan paper to unlock.

Worth reading alongside the NanoGPT speedrun community's work, which strips the architecture to its minimum and demonstrates that most of the original design choices can be substantially improved.
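The core operation is small enough to write out in full. Below is a minimal single-head sketch of scaled dot-product attention, softmax(QKᵀ / √d_k)V, in plain Python with nested lists; it omits masking, multiple heads, and the learned projections, and the function names are my own.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention for one head, no masking."""
    d_k = len(keys[0])
    out = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out

# When every key is identical the weights are uniform, so the output
# is simply the mean of the values.
result = attention([[1.0, 0.0]],
                   [[1.0, 1.0], [1.0, 1.0]],
                   [[0.0, 2.0], [4.0, 6.0]])
```

Every position attends to every other in a single step, which is what removes the sequential dependency that recurrence imposed.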

Quantum Approximate Optimisation Algorithm (QAOA)

Farhi, Goldstone, Gutmann · MIT · 2014 · arXiv ↗

Quantum · Optimisation · NISQ

Read

Perspective

Read this directly before the ETH Oxford hackathon where we used QAOA for routing optimisation in FZAP. The algorithm's elegance is in how it maps a combinatorial optimisation problem onto a parameterised quantum circuit — alternating cost and mixing unitaries — then optimises the parameters classically. It's a hybrid approach, which makes it practical on NISQ hardware where circuit depth is severely limited.

The honest assessment is that QAOA's quantum advantage over classical methods hasn't been demonstrated at the scales that matter yet. But it's the most tractable near-term algorithm for problems with combinatorial structure, and the Max-Cut mapping we used in FZAP worked cleanly.
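The classical side of the Max-Cut mapping is simple to state: each node gets a bit, and the cost counts edges whose endpoints differ, which corresponds to the cost Hamiltonian H_C = Σ (1 − Z_i Z_j) / 2 summed over edges. The sketch below evaluates that cost and brute-forces a small instance to give the value a QAOA run would be trying to approach. The graph is a made-up example, not the FZAP routing graph.

```python
from itertools import product

def cut_value(bits, edges):
    """Number of edges whose endpoints fall on opposite sides of the cut."""
    return sum(1 for i, j in edges if bits[i] != bits[j])

def brute_force_max_cut(n, edges):
    """Exhaustively find a best assignment; feasible only for small n."""
    return max(product((0, 1), repeat=n),
               key=lambda bits: cut_value(bits, edges))

# Hypothetical instance: a 4-cycle with one diagonal.
edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]
best = brute_force_max_cut(4, edges)
best_value = cut_value(best, edges)
```

The exhaustive search scales as 2^n, which is exactly why a variational approach is attractive, and also why small instances like this one are the natural way to check that a QAOA circuit is optimising the right objective.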

HHL Algorithm — Quantum Linear Systems

Harrow, Hassidim, Lloyd · 2009 · arXiv ↗

Quantum · Linear Algebra · Speedup

Read

Perspective

HHL claims exponential speedup for solving linear systems Ax=b over classical methods — but the caveats are what make it interesting. The speedup is real only when the matrix A is sparse and well-conditioned, when state preparation is efficient, and when you only need to read off certain properties of the solution rather than the full solution vector. Each of these qualifications is significant, and together they considerably narrow the set of problems where the speedup materialises.

I spent considerable time on this for my quantum survey paper. The honest treatment matters: the algorithm is a landmark result in quantum complexity theory, but the path from HHL to practical quantum advantage in linear algebra is longer than many popular accounts suggest.
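The caveats are easiest to see in the runtimes as they are usually quoted. For an N × N system with sparsity s (non-zero entries per row), condition number κ, and target precision ε, compared against conjugate gradient on a positive-definite matrix:

```latex
T_{\mathrm{HHL}} = \tilde{O}\!\left( \frac{\log(N)\, s^{2}\, \kappa^{2}}{\epsilon} \right)
\qquad
T_{\mathrm{CG}} = O\!\left( N\, s\, \sqrt{\kappa}\, \log(1/\epsilon) \right)
```

The exponential gain is in N alone. The dependence on κ and on 1/ε is worse than the classical method's, and these expressions exclude the cost of preparing |b⟩ and of extracting information from the output state.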

Books

How to Think Like a Mathematician · ESL (Hastie, Tibshirani, Friedman) · Book of Integrals · A Mathematician's Apology (Hardy) · The Man Who Loved Only Numbers (Hoffman)

Quantum Computation and Quantum Information

Nielsen & Chuang

★★★★★

The standard reference. Dense but complete — covers everything from basic linear algebra to fault-tolerant quantum computation. The go-to when the lecture notes don't explain why something works.

The Elements of Statistical Learning

Hastie, Tibshirani, Friedman

★★★★★

The rigorous foundation for ML that most courses skip. The chapters on regularisation, ensemble methods, and kernel methods in particular repay rereading as you encounter the techniques in practice.

Fundamentals of Active Inference

Sanjeev V. Namjoshi

★★★★☆

Do Dice Play God?

Ian Stewart

★★★★☆

Talks & Events

Talks, conferences, and events that were worth attending — and worth writing about. Not a list of appearances, but an account of what actually landed.

UK AI Agent Hack — Peter Steinberger headline keynote

UK AI Agent Hackathon · Imperial College London · November 2025

Peter Steinberger — Do what you love. That's it.

Peter Steinberger — founder of OpenClaw, the open-source agentic coding framework — gave the headline keynote at the UK AI Agent Hackathon at Imperial. The slides were sparse. He spoke without the particular brand of performed confidence that keynotes at technology events tend to produce, which was, I think, the point.

The substance of what he said was not complicated, though the implications of it are. He built things in a way that most people in his position did not — slowly, carefully, according to his own technical instincts rather than the consensus of the industry around him — and he continued building that way for long enough that the work accumulated into something of genuine quality and genuine originality. He did not optimise for what was fashionable or fundable or externally impressive. The results, he said, followed from the work itself, not the other way around. "Don't just read about stuff. Play with it and actually go and build things. It doesn't even matter if you end up using it or not. It's really more like the road that's important."

What made the talk land was not the content in isolation — which is, in the abstract, advice that most people have heard in some form — but the texture of the specificity with which he described it. He was not reciting a framework. He was describing, in the particular detail that only lived experience produces, a way of working that he had actually inhabited for years. The distinction matters. There is a large difference between someone who believes that intrinsic motivation produces better work and someone who has organised their professional life around that belief and can tell you exactly what it costs and what it returns.

We placed fourth internationally at this hackathon — the only team of first-years in the competition. But the talk was the thing I kept returning to on the train home.

Watching the NASA Artemis II countdown across three screens

Live Watch · Personal · 2022

Watching Artemis — and understanding why it matters that humans go

Three screens, a cracked monitor, a Raspberry Pi in the foreground that I did not bother to move. The countdown read T-00:00:02 when this photograph was taken. I had arranged multiple live feeds — NASASpaceflight's external coverage, NASA's own official stream, and a third tracking ground operations — because I wanted more than one camera angle, and I was not willing to miss the ignition sequence from any of them.

I find human spaceflight genuinely moving in a way that is somewhat difficult to articulate without crossing into the territory of the mawkish, which I would prefer to avoid. But there is something about the specific nature of the engineering problem involved — the extraordinary and unforgiving precision required to sustain human life in an environment that is, in every physical sense, hostile to it — that I find clarifying in a way that very few things are. It is one of the few domains in which the phrase "good enough" is not a category that can meaningfully exist; where the acceptable margin of error is not a business decision but a physical constant.

The thought I kept returning to, watching the SLS clear the launch tower: the systems that matter most are the ones in which failure is not a recoverable state. Most of what I build carries the quiet luxury of iteration — deploy, observe, correct, repeat. Artemis carries no such luxury. That asymmetry in consequence produces a different quality of rigour, and I think it is worth deliberately importing that quality into work that operates under lower stakes, precisely because the lower stakes make it easier to become careless.

Arkwright Engineering Scholarship Ceremony · October 2023 · The Smallpeice Trust

The Arkwright Ceremony — being named a future leader in engineering by the RAF

The Arkwright Engineering Scholarship ceremony takes place in a formal hall of the kind that produces, without any explicit instruction, the instinct to stand slightly straighter. Approximately two hundred scholars are selected nationally each year — from a considerably larger pool — and the awards are presented on stage by representatives of the sponsoring organisations, in the presence of the other scholars, their teachers, and whichever family members have made the journey. Mine was sponsored by the Royal Air Force. The award was handed over by a former RAF Commander-in-Chief.

The certificate reads, in the precise and old-fashioned language that formal awards tend to use: in recognition of outstanding potential as a future leader in Engineering. There is something about receiving that form of words from someone who has spent a career at the operational frontier of British engineering — where the tolerances are measured in millimetres and the consequences of exceeding them are measured in lives — that carries a different weight than a certificate that arrives in the post. I don't think that weight is entirely ceremonial.

The scholarship brought mentorship, industry visits, and a sustained relationship with engineers working at the edge of aerospace and defence technology — all of which mattered. But more practically than any of those: it was the first time an institution I genuinely respected told me, without qualification, that the trajectory I was on was a considered and worthwhile one. External validation is not, in itself, the point of anything. But there are moments when it is useful to have confirmed that you have not been completely misreading the map.

Academics

Imperial College London. Academic record from school — the formal substrate beneath everything else on this site.

A Levels — 5 A*s

Subject	Grade	Notes
Mathematics	A*	—
Further Mathematics	A*	—
Physics	A*	—
Computer Science	A*	—
German	A*	—

Five A-levels — the vast majority of students take three. All five at A*.

School Academic Excellence

Academic Excellence Award · Every term · 2019–2025

Best in School — Mathematics · Senior Kangaroo & Senior Maths Challenge · 2022–2024

Best in School — Physics · British Physics Olympiad · 2023, 2024

Best in Cohort — BPhO Computational Challenge · Gold Award · 2025

Awards & Certificates

A record of recognitions received across engineering, mathematics, physics, and computing. Some you frame. Some you earn in rooms where you can barely breathe.

Rucha Agashe on stage at Arkwright Engineering Scholarship ceremony 2023

Receiving Arkwright scholarship from RAF Commander in Chief

Royal Air Force Arkwright Engineering Scholarship

Sponsored by the Royal Air Force, administered by The Smallpeice Trust. Awarded in recognition of outstanding potential as a future leader in engineering — one of approximately 200 scholars selected nationally each year from a substantially larger pool. Received on stage at the ceremony in London from a senior RAF officer.

October 2023

Arkwright Engineering Scholarship certificate

Jack Petchey Achievement Award certificate — Rucha Agashe, Outstanding Achiever

Jack Petchey gold achiever medal in presentation box

Jack Petchey Foundation — Outstanding Achiever

Nominated as Outstanding Achiever by the Jack Petchey Foundation — presented to young people who demonstrate exceptional achievement and inspire those around them. Signed by Sir Jack Petchey CBE.

November 2022

Industrial Cadets Gold Level Certificate of Graduation — Rucha Agashe 2024

Industrial Cadets Gold — Project of the Year

Graduated at Gold Level by completing an EDT Project under Leonardo Defence Systems. Won Project of the Year from thousands of submissions across schools in the United Kingdom. The scheme is under the patronage of HM King Charles III. Certificate signed by Julie Feest, Chief Executive of the EDT.

2024 · EDT / Engineering Development Trust

🏆

UKESF Semiconductor Talent Award

UK Electronics Skills Foundation. Awarded to exceptional students pursuing careers in the semiconductor and electronics industries — recognition of both academic achievement and genuine commitment to the field.

2025–present

⚡

IET Future Talent Award

Institution of Engineering and Technology. Recognises outstanding young engineers demonstrating exceptional potential. Supported by The Engineers Trust.

March 2026

IET Future Talent Award Launch Scholarship certificate — Rucha Agashe

Launch Award, The Worshipful Company of Engineers

Presented to me by HRH The Princess Royal, Princess Anne, following a conversation with her about my current work in Machine Learning for Physical AI and my love of Mathematics!

June 2026

◈

Mensa — The High IQ Society

Qualified for and became a member of British Mensa.

June 2019

Mensa membership certificate — Rucha Agashe

🔭

British Physics Olympiad — Gold & Best in School

Gold award in the BPhO Senior Challenge alongside Best in School. Also Gold in the BPhO Computational Challenge — awarded for the spherical mirror optics simulator. Best in Cohort for the Computational Challenge.

2023, 2024, 2025

∑

UKMT — Best in School & BMO Distinction

Best in School for the Senior Mathematical Kangaroo and Senior Mathematics Challenge, multiple years. Sir Andrew Jobbings Kangaroo Merit. Distinction in the British Mathematical Olympiad.

2022, 2023, 2024

Contact

I am open to
interesting problems!

I'm looking for research collaborations in ML: Physical AI, sim-to-real robotics, Reinforcement Learning, Language Models and even concepts in Mathematics and Physics which are intriguing and interdisciplinary.

I am also still participating in and applying for spring weeks, internships, programs and research fellowships. I think it's a fantastic way to learn, throwing yourself into the deep end with a high learning rate and hopefully decreasing loss over time!

I also love having conversations about quantum computing, ML systems, or anything at the intersection of mathematics and computer science (and even Physics!).

I strive to inspire and be inspired — so if you're working on something challenging and/or revolutionary, I'd love to hear about it.

Currently based in London · Imperial College London

Email: [email protected] · GitHub: in-a-quantum-world ↗ · LinkedIn: rucha-agashe ↗ · Medium: @ruchaagashe212 ↗ · YouTube: @in_a_quantum_world ↗

Notes

Things I've been watching, reading, and thinking about — written up while still fresh. Sources linked throughout. These are notes in the genuine sense: not essays, just thinking out loud.

24.03.2026

Google DeepMind · AlphaFold 3 Overview

AlphaFold and what it actually means for computational biology

AlphaFold predicted the 3D structures of over 200 million proteins — essentially every catalogued protein known to science. That number is so large it stops meaning anything until …


21.03.2026

3Blue1Brown · The Cosmic Distance Ladder · with Terence Tao

The Cosmic Distance Ladder, and why astronomy was the first data science

The thing that struck me most about this video — a two-part collaboration between Grant Sanderson and Terence Tao — is not the astronomy itself but the epistemological structure of…


21.03.2026

Y Combinator · SWE in the LLM Age

The LLM age isn't the first existential crisis software engineers have had

The framing I found most useful from this: this is not the first time software engineering has had to reinvent what it means. The 1980s saw the first wave — high-frequency trading …


23.03.2026

Stanford / YouTube · New Shortest Path Algorithms · + AlphaGeometry (Elie Sleighter)

The sorting barrier is gone, and AlphaGeometry solved 25 IMO problems

Two separate things here that I'm grouping because they both involve the same underlying question: what does it mean for a classical algorithmic problem to be "solved"?


24.03.2026

Aman Manazir · Quant Pro / Trading

How quant trading actually works, and what it takes to get in

The historical arc is worth knowing: 1980s, Renaissance Technologies brought math PhDs into finance and built systematic strategies when the industry was still dominated by intuiti…


25–27.03.2026

Jane Street · Signals & Threads

Why Jane Street uses OCaml, and what that tells you about language design

Signals & Threads is Jane Street's technical podcast — Ron Minsky (who led the firm's transition to OCaml) interviewing engineers across the stack. The episodes on build systems an…


25.03.2026

The Thinking Machine · Jensen Huang / NVIDIA

NVIDIA built the infrastructure of the AI era by accident

NVIDIA started as a graphics company. Jensen Huang's original insight was that video games needed parallel processing — rendering millions of pixels simultaneously — and that CPUs,…


25.03.2026

IBM Research · IBM Quantum

IBM's quantum roadmap, and what fault tolerance actually requires

IBM's current roadmap: fault-tolerant quantum computer by 2029, targeting systems capable of running circuits comprising 1 billion gates by 2033. They've built the first Quantum Sy…


24.03.2026

Google DeepMind × Hannah Fry · Embodied AI & Robotics

Embedding Gemini into a body, and why grounding was the real bottleneck

The shift being described isn't just about better models — it's about embedding multimodal reasoning into physical form. The argument is that flexible adaptation to any physical ta…


27.03.2026

Google Quantum AI · Neutral Atoms & Post-Quantum Cryptography

Two hardware bets, one cryptographic urgency, and a continent trying to get ahead of both

Two distinct threads. First, the hardware question. Google's approach uses neutral atoms — individual atoms as qubits, held in optical tweezers — which offers arrays of 10,000+ qub…


28.03.2026

Google DeepMind · Demis Hassabis — Future of Intelligence

Jagged Intelligence, the Penrose view on consciousness, and why AGI arrives faster than the Industrial Revolution

Several distinct threads from the same talk, each worth keeping separate.


28.03.2026

Yann LeCun · Andrej Karpathy · Meta, V-JEPA 2, AutoResearch

Models that run their own experiments, and the limit that matters

Three things that arrived together and I'm grouping by date.


31.03.2026

Fancy Talk · Fei-Fei Li, David Silver · AI Agents & Scaling Limits

Fei-Fei Li, David Silver, and the question of whether scaling alone gets us to AGI

Two things from this that stuck. First, David Silver — who built AlphaGo with Demis Hassabis at DeepMind (the first program to defeat a professional Go player, and later the world …


01.04.2026

Axios, VentureBeat, The Register, CNBC · Claude Code Leak

The Claude Code leak — 512,000 lines, a .map file, and a Tamagotchi

On March 31st, Anthropic accidentally shipped the entire source code of Claude Code to the public npm registry. A debugging .map file — 59.8 MB — was included in version 2.1.88 of …


01.04.2026

Evolva Algos · Dutch Trading History

Why so many elite trading firms start in the Netherlands

The Dutch East India Company operated the world's first stock exchange in 1602 — the Amsterdam Stock Exchange, now Euronext Amsterdam. While most countries were still only trading …


02.04.2026

NASA Artemis Blog · Artemis II Mission Coverage

Artemis II — humans beyond low Earth orbit for the first time in 54 years

Four astronauts launched from Kennedy Space Centre on April 1st aboard the Space Launch System rocket: NASA's Reid Wiseman (commander), Victor Glover (pilot), Christina Koch (missi…


03.04.2026

Apollonius · Mathematical Shapes

The oloid, Apollonius, and shapes that shouldn't work

A donut-shaped surface discovered by Paul Schatz. Take two circles of equal radius, position them perpendicular to each other such that each passes through the centre of the other.…


03.04.2026

Versus AI (Peter Cuthbert) · Tech Trends Analysis

AI reshaping financial markets, and the pattern every tech wave follows

On AI in trading: AI and algorithmic systems are reshaping global financial markets at a pace that is easy to underestimate. Meta alone processes over $4 billion in trading volumes…


03.04.2026

Tessedek · Claude Code as Infrastructure

Claude Code as agentic infrastructure, and what the leak revealed about where Anthropic is heading

Claude Code is a CLI tool — it creates local computers, reads text, runs policies, converts definitions into its own context, and speaks prompts directly to the model. What makes i…


05.04.2026

Lighting / Photonics · NVIDIA, Coherent, Lumentum

Photonics, the physical limits of silicon, and where the next trillion goes

Silicon chips hit a fundamental constraint: electrons can only move so fast. The speed of electrical signals through copper interconnects, the heat generated by resistance, the qua…


05.04.2026

MonoSpeaks · Andrej Karpathy — AI Workflows

Karpathy's "second brain" and the AI-native knowledge workflow

Andrej Karpathy described an AI workflow pattern he calls the "second brain." The idea: dump all your raw files — notes, PDFs, bookmarks, code, voice memos, everything — into a sin…


06.04.2026

Robot Gunter2 · Foundation Models for Robotics

Foundation models for robotics — learning from one hour of physical data

A robotics result that genuinely surprised me. A team trained a single robot on 500,000 data points from physical interactions — grasping, pushing, navigating — and then used that …


07.04.2026

Robot Gunter3 · NVIDIA GTC 2026, Intel TenFab

NVIDIA GTC, Intel TenFab, and the race to build the compute layer of the AI era

Intel announced it is joining Elon Musk's TenFab project alongside SpaceX, Tesla, and xAI — one of the largest semiconductor partnerships ever assembled. The target: one trillion o…


07.04.2026

TechSolis · UK Sovereign AI Fund

The UK Sovereign AI Fund — £500 million to keep the future of AI built on British shores

The UK government launched a £500 million Sovereign AI Unit — a state-backed venture capital fund designed to invest directly in British AI startups, operating with the speed and s…


07.04.2026

Evolva Algos · Hedge Funds & Wheat

Why the world's smartest hedge funds are suddenly buying wheat

The world's largest hedge funds have been building significant positions in wheat — billions of dollars' worth. Wheat prices correlate very closely with Middle East conflict, energ…


24.03.2026 YouTube

Google DeepMind · AlphaFold 3 Overview

AlphaFold and what it actually means for computational biology

AlphaFold predicted the 3D structures of over 200 million proteins — essentially every catalogued protein known to science. That number is so large it stops meaning anything until you remember that figuring out the structure of a single protein used to take months and cost hundreds of thousands of dollars. Demis Hassabis and John Jumper won the 2024 Nobel Prize in Chemistry for it — the first time a Nobel has been awarded for something so directly enabled by machine learning.

The architecture shift from AlphaFold 2 to 3 is the interesting part. AF2 used the Evoformer — a transformer-based architecture operating on pairwise residue representations and multiple sequence alignments (MSAs). AF3 replaces the final prediction step with a diffusion model: instead of directly predicting atomic coordinates, it starts with a cloud of atoms and iteratively refines toward the most accurate structure, which is conceptually similar to how diffusion models generate images. This is a meaningful architectural bet — it extends AF3's predictive reach beyond single-chain proteins to DNA, RNA, ligands, and ions simultaneously.

What I find more interesting than the model itself is what it reveals about the limits of the CASP benchmark. CASP (Critical Assessment of protein Structure Prediction) was the competition AlphaFold 2 effectively ended in 2020, scoring above 90 on the GDT metric when competitors were scoring in the 40s. But the CASP benchmark only tests whether the predicted structure matches the experimentally determined one — it says nothing about whether the model understands the physics of why a protein folds that way. There is a real difference between prediction and understanding, and AlphaFold does the former extraordinarily well without necessarily doing the latter.

The part that keeps nagging at me: evolutionary data (MSAs) was central to AF2 — the idea being that conserved residues across species encode structural constraints. AF3 moves away from this. That is either a sign that the diffusion model has found a more general representation, or that we are trading interpretability for accuracy in a domain where we urgently need both.

21.03.2026 YouTube

3Blue1Brown · The Cosmic Distance Ladder · with Terence Tao

The Cosmic Distance Ladder, and why astronomy was the first data science

The thing that struck me most about this video — a two-part collaboration between Grant Sanderson and Terence Tao — is not the astronomy itself but the epistemological structure of how the distance ladder works. Each rung of the ladder gives you a measurement method that works at a certain scale, and that result becomes the calibration data for the next rung. Parallax gets you to nearby stars. Cepheid variable stars get you to nearby galaxies. Type Ia supernovae get you further. Each method has uncertainty, and that uncertainty compounds upward through the chain.
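A rough sketch of how that compounding works, assuming each rung calibrates the next multiplicatively and the errors are independent, so that relative uncertainties add in quadrature. The per-rung percentages are made up for illustration and are not measured values.

```python
import math

# Hypothetical fractional (relative) uncertainties for each rung of the ladder.
# These numbers are illustrative only, not measured values.
rungs = [
    ("parallax", 0.01),
    ("cepheids", 0.03),
    ("type_ia_supernovae", 0.05),
]

def compounded_uncertainty(rungs):
    """Return the running relative uncertainty after each rung.

    Assumes the rungs multiply together (each calibrates the next) and
    that their errors are independent, so relative errors add in quadrature.
    """
    total_sq = 0.0
    running = []
    for name, rel_err in rungs:
        total_sq += rel_err ** 2
        running.append((name, math.sqrt(total_sq)))
    return running

for name, err in compounded_uncertainty(rungs):
    print(f"after {name}: {err:.1%}")
```

Under these assumptions the uncertainty can only grow as you climb, and the largest single rung dominates the total. Correlated systematic errors would compound faster than this.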

Tao frames Kepler as essentially a data scientist: starting with Tycho Brahe's painstakingly collected observational records — the most precise pre-telescopic astronomical data ever gathered — and working backward to the underlying mathematical law. Kepler tried Platonic solids first (nesting the five regular polyhedra between the six planetary orbits), which didn't work. He tried circular orbits, which didn't fit the data. He landed on ellipses — not because he predicted them theoretically, but because the data forced him there. The heliocentric model came first; precision fitting came second; the ellipse came third.

Astronomy was the first science to do serious statistical data analysis — centuries before the term existed. Kepler was doing what we would now call model selection under uncertainty, using the residuals from one model to motivate a better one. That framing makes the history of science feel a lot more like the history of computation than most people present it.
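To make "using residuals to motivate a better model" concrete, here is a simplified sketch on synthetic data. It assumes a Mars-like orbit (semi-major axis about 1.524 AU, eccentricity about 0.093), noise-free observations, and angles measured from perihelion. The Sun-centred circle is a stand-in for the circular models, not a reconstruction of what Kepler actually fitted.

```python
import math

# Synthetic "observations" of a Mars-like orbit (assumed values, no noise):
# semi-major axis ~1.524 AU, eccentricity ~0.093, Sun at one focus.
A_AU, ECC = 1.524, 0.093
P = A_AU * (1 - ECC ** 2)  # semi-latus rectum
thetas = [2 * math.pi * k / 36 for k in range(36)]
radii = [P / (1 + ECC * math.cos(t)) for t in thetas]

def rms(values):
    return math.sqrt(sum(v * v for v in values) / len(values))

def fit_circle(radii):
    """Model 1: Sun-centred circle, r = constant (least-squares fit is the mean)."""
    r0 = sum(radii) / len(radii)
    return [r - r0 for r in radii]

def fit_ellipse(thetas, radii):
    """Model 2: conic with the Sun at a focus, 1/r = a + b*cos(theta).

    This is linear in (a, b), so ordinary least squares has a closed form.
    """
    xs = [math.cos(t) for t in thetas]
    ys = [1 / r for r in radii]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    a = my - b * mx
    return [r - 1 / (a + b * x) for r, x in zip(radii, xs)]

circle_rms = rms(fit_circle(radii))
ellipse_rms = rms(fit_ellipse(thetas, radii))
print(f"circle residual RMS:  {circle_rms:.4f} AU")
print(f"ellipse residual RMS: {ellipse_rms:.2e} AU")
```

The circle leaves a residual that varies smoothly with angle, which is the kind of structured misfit that points toward a better model rather than toward noisier data.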

The Hubble tension is the part I want to understand better — the two independent methods for measuring the Hubble constant give values about 10% apart. Either we're making a systematic error somewhere on the distance ladder, or something in our cosmological model is wrong. Both options are interesting. One of them would require rewriting physics.
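The size of the gap is easy to check with back-of-envelope arithmetic. The values below are approximate figures quoted from memory for the early-universe (CMB) and late-universe (distance ladder) measurements, so treat them as assumptions rather than authoritative numbers.

```python
import math

# Approximate published values in km/s/Mpc (assumed here, not authoritative):
# early-universe (CMB) versus late-universe (distance ladder).
h0_cmb, sigma_cmb = 67.4, 0.5
h0_ladder, sigma_ladder = 73.0, 1.0

gap = h0_ladder - h0_cmb
percent_gap = 100 * gap / h0_cmb
# Significance if the two error bars are independent and Gaussian.
n_sigma = gap / math.hypot(sigma_cmb, sigma_ladder)

print(f"gap: {gap:.1f} km/s/Mpc ({percent_gap:.1f}%), about {n_sigma:.1f} sigma")
```

With these inputs the gap comes out a little under 10%. What makes it a "tension" is less the percentage than the fact that it is several times larger than the combined error bars.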

21.03.2026 Podcast / Video

Y Combinator · SWE in the LLM Age

The LLM age isn't the first existential crisis software engineers have had

The framing I found most useful from this: this is not the first time software engineering has had to reinvent what it means. The 1980s saw the first wave — high-frequency trading firms like DE Shaw building algorithmic systems that didn't need human traders for execution. The 1990s saw a second wave with the internet making software engineers vastly more productive than they'd ever been. Neither wave made software engineers obsolete. Both waves made bad software engineers obsolete and great ones more powerful.

The part that I think gets undersold in most of these conversations: the recursive self-improvement argument. If LLMs get good enough at coding to improve themselves, the rate of capability increase stops being linear. Anthropic and others are trying to understand where that curve inflects. The current consensus — if there is one — is that we're still in the phase where the model improves with scale and data, not with self-directed iteration. But that boundary is not fixed.

What I took away practically: the engineers who will do well in this environment are the ones who can specify problems with precision, evaluate outputs with rigour, and think across abstraction levels simultaneously. Those skills matter more as the execution layer gets automated, not less. The model handles the coding; the hard part is knowing what to code and whether it worked.

The hedge fund analogy is the one I keep coming back to. Renaissance, DE Shaw, and the quant funds didn't eliminate human judgement — they moved it upstream. The traders became researchers; the researchers became mathematicians. The LLM wave might do something similar for software: move the value up the abstraction stack, not eliminate it.

23.03.2026 YouTube

Stanford / YouTube · New Shortest Path Algorithms · + AlphaGeometry (Elie Sleighter)

The sorting barrier is gone, and AlphaGeometry solved 25 IMO problems

Two separate things here that I'm grouping because they both involve the same underlying question: what does it mean for a classical algorithmic problem to be "solved"?

The new shortest-path results are genuinely surprising. The Single-Source Shortest Path (SSSP) problem was long considered to have a barrier tied to sorting: Dijkstra's algorithm with a Fibonacci heap runs in O(m + n log n) on a graph with m edges and n nodes, and the n log n term comes from effectively sorting the nodes by distance. The new work breaks that barrier using a combination of multi-source shortest paths, randomised techniques, and divide-and-conquer approaches that achieve near-linear time in certain regimes. The data structure insights are the interesting part: by treating the problem probabilistically (seeding multiple sources, exploiting edge-weight distributions) you get results that deterministic algorithms can't match. The sorting barrier wasn't a fundamental limit; it was an artefact of the deterministic framing.
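For reference, this is the classical baseline the sorting barrier refers to: a minimal Dijkstra with a binary heap (which gives O(m log n)). It is a sketch of the old algorithm, not of the new one, and the tiny graph is hypothetical. The thing to notice is that nodes are settled in non-decreasing order of distance, so the algorithm is implicitly sorting them.

```python
import heapq

def dijkstra(graph, source):
    """Classical Dijkstra with a binary heap.

    graph: dict mapping node -> list of (neighbour, non-negative weight).
    Returns (dist, order), where order lists nodes in the order they were settled.
    """
    dist = {source: 0}
    order = []
    settled = set()
    heap = [(0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if u in settled:
            continue  # stale heap entry, a shorter path was already found
        settled.add(u)
        order.append(u)
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist, order

# Tiny hypothetical graph for illustration.
graph = {
    "s": [("a", 4), ("b", 1)],
    "b": [("a", 2), ("c", 5)],
    "a": [("c", 1)],
    "c": [],
}
dist, order = dijkstra(graph, "s")
print(dist)
print(order)
```

Any approach that beats the barrier has to avoid producing that fully sorted order, which is why the newer algorithms work on groups of frontier nodes rather than popping one minimum at a time.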

AlphaGeometry is a different kind of result. Google DeepMind and NYU trained a neuro-symbolic system combining a large language model with a symbolic geometry engine (Wolfram-style). Trained on 100 million synthetic theorems generated from scratch (because there isn't enough human-generated geometry data to train on), it solved 25 of the 30 problems in its benchmark, where the average human gold medallist solves about 25.9. The benchmark problems were drawn from IMO competitions between 2000 and 2022. The interesting design choice: the LLM handles intuition (suggesting auxiliary constructions), and the symbolic engine handles verification. Neither works well alone; together they do something neither could do separately.
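The propose-and-verify split can be sketched as a toy loop. Everything here is hypothetical: facts are plain strings, the "symbolic engine" is a tiny forward-chaining function, and the "language model" is just an ordered list of candidate constructions. It only illustrates the division of labour, not how AlphaGeometry is implemented.

```python
def deduce(facts, rules):
    """Symbolic side: forward-chain until no rule adds a new fact."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if premises <= facts and conclusion not in facts:
                facts.add(conclusion)
                changed = True
    return facts

def solve(facts, rules, goal, proposals):
    """Alternate between deduction and proposed auxiliary constructions.

    `proposals` stands in for the language model: an ordered list of
    candidate constructions to try when deduction alone gets stuck.
    """
    facts = set(facts)
    used = []
    closure = deduce(facts, rules)
    for aux in proposals:
        if goal in closure:
            break
        facts.add(aux)      # a construction (e.g. a midpoint) can always be added
        used.append(aux)
        closure = deduce(facts, rules)
    return goal in closure, used

# Toy problem: base angles of an isosceles triangle are equal.
rules = [
    (frozenset({"AB = AC", "M is the midpoint of BC"}),
     "triangle ABM is congruent to triangle ACM"),
    (frozenset({"triangle ABM is congruent to triangle ACM"}),
     "angle ABC = angle ACB"),
]
facts = {"AB = AC"}
goal = "angle ABC = angle ACB"

print(solve(facts, rules, goal, proposals=[]))
print(solve(facts, rules, goal, proposals=["M is the midpoint of BC"]))
```

Deduction alone stalls because no rule fires without the midpoint; one well-chosen construction unlocks the rest, and the engine never has to trust the proposer because every step after the construction is checked.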

The synthetic data point matters beyond just AlphaGeometry. If you can generate 100 million training examples in a domain where real data is scarce, the bottleneck shifts from data collection to problem specification. That's a fundamental change in what limits ML research in mathematics.

24.03.2026 YouTube

Aman Manazir · Quant Pro / Trading

How quant trading actually works, and what it takes to get in

The historical arc is worth knowing: 1980s, Renaissance Technologies brought math PhDs into finance and built systematic strategies when the industry was still dominated by intuition and gut feel. 1990s, Citadel and Jump Trading started competing on speed and infrastructure — the move from mathematical edge to technological edge. Now, roughly 50% of US equity trades are purely algorithmic, and the remaining human-driven edge is largely in research, strategy design, and edge cases where the models break.

The technical stack has also shifted: from C and C++ (where the performance advantage was in low-level hardware control and minimal latency) toward Python and ML for strategy research, with C++ still dominant in execution infrastructure. The two don't mix cleanly — research code that works in Python often can't be deployed at production latency without a substantial rewrite. That gap between research and production is where a lot of the interesting engineering happens.

The preparation pathway for quantitative roles is more specific than most people realise. The relevant coursework runs through probability, ML, linear algebra, data structures, and algorithms — with C++ and Python as the language pair. The technical interviews focus almost entirely on probability puzzles, market microstructure intuition, and algorithmic problem-solving (often including dynamic programming). The thing that separates candidates isn't breadth of knowledge — it's the ability to think through a problem aloud, systematically, under time pressure.

My experience at Jane Street's FOCUS week confirmed this. The Estimathon is not a maths test — it's a test of how you reason about uncertainty in real time, when you don't have enough information and the clock is running. The people who did well were the ones who committed to an estimate quickly, updated it efficiently when new information arrived, and didn't freeze trying to find the exact answer to an inherently approximate problem.

25–27.03.2026 Podcast

Jane Street · Signals & Threads

Why Jane Street uses OCaml, and what that tells you about language design

Signals & Threads is Jane Street's technical podcast — Ron Minsky (who led the firm's transition to OCaml) interviewing engineers across the stack. The episodes on build systems and hardware engineering are the ones that stuck with me most.

The core argument for OCaml over Python in production: Python's duck typing and garbage collector make it fast to prototype but fragile in production. You can write code that works for 99.9% of inputs and fails in the 0.1% that matters — and in a trading system, that 0.1% is when the market is most volatile and the stakes are highest. OCaml's strong static type system forces you to handle failure cases explicitly at compile time rather than discovering them at runtime. The compiler catches an entire category of bugs that Python simply can't.
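A small Python illustration of the failure mode, with hypothetical function names and a made-up order book. The unchecked version works for almost every input and fails at runtime on the rare empty book. The checked version handles the missing case explicitly, which is roughly what OCaml's option type would require before the code compiles.

```python
from typing import Optional

def best_bid(order_book: dict[str, list[float]], symbol: str) -> Optional[float]:
    """Return the highest bid, or None when there are no bids for the symbol."""
    bids = order_book.get(symbol, [])
    return max(bids) if bids else None

def spread_unchecked(order_book, symbol, ask):
    # Works whenever there is a bid; raises TypeError on the rare empty book,
    # because None cannot be subtracted from a float.
    return ask - best_bid(order_book, symbol)

def spread_checked(order_book: dict[str, list[float]], symbol: str,
                   ask: float) -> Optional[float]:
    # The missing-bid case is handled explicitly instead of surfacing at runtime.
    bid = best_bid(order_book, symbol)
    if bid is None:
        return None
    return ask - bid

book = {"ABC": [99.5, 100.0], "XYZ": []}
print(spread_checked(book, "ABC", 101.0))
print(spread_checked(book, "XYZ", 101.0))
```

In Python the annotations are advisory, so the unchecked version only gets caught if a separate static checker is run and the function is annotated. The argument for OCaml is that this check is not optional.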

The build system episodes are separately interesting. Jane Street built their own build system (Dune) on top of OCaml. The key design principle: incremental, deterministic, immediate-feedback rebuilds. When code changes, only the affected components rebuild, the output is reproducible across machines, and failing tests are flagged immediately rather than discovered in production. The phrase that stuck: "build systems are a core component of the software mindset that applies equally to hardware design." Their hardware team uses Hardcaml, an OCaml-based hardware description language (HDL), for the same reason: they wanted the software reliability guarantees they were used to, applied to hardware description.
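The "only the affected components rebuild" idea can be sketched with content hashes and a dependency graph. This is a toy model under stated assumptions (an acyclic graph, hypothetical file names, an in-memory cache of digests) and not a description of how Dune works internally.

```python
import hashlib

def digest(content: str) -> str:
    return hashlib.sha256(content.encode()).hexdigest()

def stale_targets(sources, deps, cache):
    """Return the targets that need rebuilding, in a deterministic order.

    sources: file name -> current content
    deps:    target -> list of source files or other targets it depends on
    cache:   file name -> content digest recorded at the last build
    Assumes the dependency graph has no cycles.
    """
    changed = {f for f, c in sources.items() if cache.get(f) != digest(c)}
    stale = set()

    def is_stale(target):
        if target in stale:
            return True
        for d in deps.get(target, []):
            if d in changed or (d in deps and is_stale(d)):
                stale.add(target)
                return True
        return False

    for t in deps:
        is_stale(t)
    return sorted(stale)

# Hypothetical project: only parser.ml has changed since the last build.
cache = {name: digest("v1") for name in ("parser.ml", "lexer.ml", "util.ml")}
sources = {"parser.ml": "v2", "lexer.ml": "v1", "util.ml": "v1"}
deps = {
    "parser.o": ["parser.ml", "util.ml"],
    "lexer.o": ["lexer.ml"],
    "util.o": ["util.ml"],
    "app": ["parser.o", "lexer.o"],
}
print(stale_targets(sources, deps, cache))
```

Hashing content rather than comparing timestamps is what makes the result reproducible across machines: the same inputs always yield the same rebuild set.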

The practical pattern I've taken away: prototype and train models in Python (where the ecosystem — PyTorch, NumPy, pandas — is irreplaceable), then re-implement the production-critical paths in OCaml or C++. The two languages serve different parts of the pipeline and trying to do everything in one is a mistake in either direction.

I study OCaml at Imperial as part of my degree and the podcast reframes it completely. It's not an academic exercise in functional programming — it's a deliberate engineering choice by one of the most technically rigorous firms in the world, made because the correctness guarantees matter more than the ecosystem convenience. That's a different reason to take it seriously.

25.03.2026 YouTube

The Thinking Machine · Jensen Huang / NVIDIA

NVIDIA built the infrastructure of the AI era by accident

NVIDIA started as a graphics company. Jensen Huang's original insight was that video games needed parallel processing — rendering millions of pixels simultaneously — and that CPUs, designed for sequential tasks, were the wrong tool. GPUs became the answer to a graphics problem. The fact that they turned out to be exactly the right hardware for neural network training is not something Huang foresaw in 1993. It's a coincidence of architecture — the same massively parallel floating-point computation that renders games efficiently also runs backpropagation efficiently.

What I find more interesting than the origin story is how NVIDIA responded once the ML opportunity became clear. They didn't just sell more GPUs — they built CUDA, a parallel computing platform that made GPUs programmable for general scientific computing. CUDA created a dependency: the entire ML research ecosystem built on it, which made switching to any competitor's hardware enormously expensive. The software moat was larger than the hardware moat.

The LTSpice reference in my notes is to Huang's early habit of sketching circuit ideas, building the intuition for what was computationally feasible before formalising it. The pattern: form an idea, check it against physical constraints, discard what doesn't work, keep iterating. That is not a unique process, but it's worth noticing that the ideas that shaped the AI era came from someone thinking about video games, not about machine learning.

The parallel to quantum computing is obvious. The hardware being built now for quantum error correction — superconducting qubits, cryogenic control electronics — is being built to solve a physics problem. What scientific problem it turns out to be the right tool for might be something nobody has clearly articulated yet.

25.03.2026 YouTube

IBM Research · IBM Quantum

IBM's quantum roadmap, and what fault tolerance actually requires

IBM's current roadmap: fault-tolerant quantum computer by 2029, targeting systems capable of running circuits comprising 1 billion gates by 2033. They've built the first Quantum System Two (currently the most capable quantum processor publicly accessible) and are now in San Sebastián, Spain as part of the BasQ (Basque Quantum) initiative.

The talk from Charles Bennett (ACM Turing Award, with Gilles Brassard, for quantum cryptography and information theory) was the part I found most interesting — specifically the framing of quantum information as fundamentally different from classical information in its privacy properties. A classical bit can be copied exactly; a quantum state cannot (the no-cloning theorem). But more practically: "like information in a dream — you can tell people what happened but you can't prove it." The information exists, it can be communicated, but its nature resists the kind of verification you take for granted with classical data. That asymmetry is both what makes quantum cryptography secure and what makes quantum error correction hard.
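The no-cloning theorem has a short standard proof, sketched here for reference. Assume a single unitary U could copy any state onto a blank register; comparing inner products for two arbitrary states gives a contradiction unless the states are identical or orthogonal.

```latex
% Assume one unitary U clones every state onto a blank register |0>.
\begin{align*}
U\bigl(|\psi\rangle|0\rangle\bigr) &= |\psi\rangle|\psi\rangle, \qquad
U\bigl(|\phi\rangle|0\rangle\bigr) = |\phi\rangle|\phi\rangle \\
\langle\phi|\psi\rangle\,\langle 0|0\rangle &= \langle\phi|\psi\rangle^{2}
  \quad\text{(unitaries preserve inner products)} \\
\Rightarrow\quad \langle\phi|\psi\rangle &\in \{0,\,1\}
\end{align*}
```

So a universal copier cannot exist: it could only work on a set of mutually orthogonal states, which is exactly the classical case.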

The error correction challenge is the central problem. Current NISQ (Noisy Intermediate-Scale Quantum) computers have error rates too high to run deep circuits reliably. IBM is using non-symmetry features in quantum systems — studying how physical symmetries can be exploited to detect and correct errors without measuring the quantum state itself (which would collapse it). The Giulia Stecchele work on 200-logical-qubit systems is the current frontier for what accurate computation actually looks like.
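The idea of detecting an error without reading the encoded value has a simple classical analogue: the three-bit repetition code, whose parity checks locate a single flipped bit but say nothing about whether the stored value is 0 or 1. This is a classical simulation for intuition only, not IBM's scheme; a real quantum code measures such parities through ancilla qubits.

```python
def encode(bit):
    """Three-bit repetition code: 0 -> 000, 1 -> 111."""
    return [bit, bit, bit]

def syndrome(block):
    """Two parity checks. They compare neighbouring bits but never
    reveal whether the encoded value is 0 or 1."""
    return (block[0] ^ block[1], block[1] ^ block[2])

# Which single position to flip back for each syndrome (None = no error seen).
CORRECTION = {(0, 0): None, (1, 0): 0, (1, 1): 1, (0, 1): 2}

def correct(block):
    """Return a corrected copy of the block, assuming at most one flipped bit."""
    fixed = list(block)
    pos = CORRECTION[syndrome(fixed)]
    if pos is not None:
        fixed[pos] ^= 1
    return fixed

for bit in (0, 1):
    for pos in range(3):
        noisy = encode(bit)
        noisy[pos] ^= 1  # inject a single bit-flip error
        print(bit, pos, syndrome(noisy), correct(noisy))
```

The same syndrome appears for a given error position whether the encoded bit is 0 or 1, which is the property that lets a quantum code correct errors without collapsing the state it protects.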

The thing that keeps me engaged with quantum computing is not the near-term applications — those are real but modest — but the question of what problem classes fundamentally change when you have fault-tolerant hardware. QAOA for optimisation is interesting now. What becomes possible when the circuit depth isn't constrained by noise? The answer probably involves problems we haven't clearly formulated yet.

24.03.2026 YouTube

Google DeepMind × Hannah Fry · Embodied AI & Robotics

Embedding Gemini into a body, and why grounding was the real bottleneck

The shift being described isn't just about better models — it's about embedding multimodal reasoning into physical form. The argument is that flexible adaptation to any physical task requires moving beyond text and image understanding into action generation. The 2021-era systems were mostly screen-locked; current systems begin to bridge the digital-physical boundary.

The key development is VLAs, or Vision-Language-Action models, which treat robotics as a language task. The model generates action sequences the way a language model generates text. Training relies on teleoperated demonstrations: an operator shows the correct method in short-horizon segments that chain into long-horizon tasks. The goal is not task memorisation but generalisation capacity: open-ended behaviour from a closed training regime.

The architecture separates the agentic component from the reasoning component. The thinking layer (Gemini's Embodied Reasoning model) produces outputs; the VLA layer decides what physical action to take. Gemini is not directly controlling the robot. It sits above an action model that translates language-space reasoning into motor-space commands.
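The two-layer split described above can be sketched in a few lines. This is a toy illustration under stated assumptions, not the real system: `reasoning_layer`, `action_layer`, the `Action` format and the keyword rules are all hypothetical stand-ins for learned models, and a real VLA would condition on camera images rather than on text alone.

```python
from dataclasses import dataclass


@dataclass
class Action:
    """One low-level motor command (hypothetical format)."""
    joint_deltas: tuple
    gripper_closed: bool


def reasoning_layer(instruction):
    """Stand-in for the reasoning model: split a long-horizon
    instruction into short-horizon subgoals expressed in language."""
    return [step.strip() for step in instruction.split(" then ") if step.strip()]


def action_layer(subgoal, chunk_len=3):
    """Stand-in for the VLA: map one subgoal to a short chunk of actions."""
    closed = "grasp" in subgoal or "pick" in subgoal
    return [Action(joint_deltas=(0.0, 0.0, 0.0), gripper_closed=closed)
            for _ in range(chunk_len)]


def run(instruction):
    """Chain short-horizon chunks into one long-horizon trajectory."""
    trajectory = []
    for subgoal in reasoning_layer(instruction):
        trajectory.extend(action_layer(subgoal))
    return trajectory
```

The point of the sketch is the interface: the upper layer only ever emits language, and the lower layer only ever emits motor commands.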

What I find interesting: the bottleneck was never compute. It was grounding — language models "understand" but have no body with which to act. VLAs solve this by making actions just another output modality. That framing makes the solution feel almost obvious in retrospect, which is usually a sign that it's right.

The question isn't whether embodied AI will work in lab conditions — it clearly does. It's whether the physical world's edge cases (objects the model has never seen, lighting it hasn't trained on, physical forces it hasn't felt) will prove as unforgiving as they have been for every previous generation of robotics. The Jagged Intelligence problem doesn't disappear just because you attach a language model to a robot arm.

27.03.2026 Article

Google Quantum AI · Neutral Atoms & Post-Quantum Cryptography

Two hardware bets, one cryptographic urgency, and a continent trying to get ahead of both

Two distinct threads. First, the hardware question. Google's approach uses neutral atoms — individual atoms as qubits, held in optical tweezers — which offers arrays of 10,000+ qubits with coherence times above 1ms. Superconducting qubits have faster gate times but shorter coherence. Neither is clearly dominant; they suit different problem regimes. The race is less "who wins" and more "which qubit type turns out to match the structure of the problems that matter most."

Second, the cryptographic urgency. The "harvest now, decrypt later" threat is real and already in progress: adversaries store encrypted data today to decrypt it once fault-tolerant quantum computers exist. The timeline — Google's estimate, IBM's, and most academic consensus — converges on roughly 2029–2035 for cryptographically-relevant machines. That is not far away relative to the typical pace of infrastructure migration.

The response is Post-Quantum Cryptography migration. NIST finalised its first PQC standards in 2024, including ML-DSA (Module Lattice Digital Signature Algorithm). Google is integrating ML-DSA into Chrome and deploying PQC signature protections across customer infrastructure. Their multi-language cross-platform cryptographic library enables switching between algorithms — important because the field is still evolving and the first standards may not be the last ones standing.
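The "switching between algorithms" idea is usually called crypto agility: every signature or tag travels with an identifier for the algorithm that produced it, so a new algorithm can be added without breaking old data. A minimal sketch, assuming a simple in-process registry. The names `ALGORITHMS`, `sign` and `verify` are illustrative and are not Google's library API, and the two entries are standard-library HMACs used purely as stand-ins, not post-quantum signatures such as ML-DSA.

```python
import hashlib
import hmac

# Registry of interchangeable algorithms, keyed by a short identifier.
# NOTE: both entries are HMACs used as placeholders. They are NOT
# post-quantum signature schemes.
ALGORITHMS = {
    "hmac-sha256": lambda key, msg: hmac.new(key, msg, hashlib.sha256).digest(),
    "hmac-sha3-256": lambda key, msg: hmac.new(key, msg, hashlib.sha3_256).digest(),
}


def sign(alg_id, key, msg):
    """Return an envelope: the algorithm identifier plus the tag it produced."""
    return alg_id, ALGORITHMS[alg_id](key, msg)


def verify(envelope, key, msg):
    """Dispatch on the identifier stored in the envelope, not on a global default."""
    alg_id, tag = envelope
    if alg_id not in ALGORITHMS:
        return False
    return hmac.compare_digest(tag, ALGORITHMS[alg_id](key, msg))
```

Migrating then means adding a registry entry and changing which identifier new data is signed with; envelopes produced under the old identifier keep verifying until they are retired.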

On the policy side, the European Quantum Communication Infrastructure (EuroQCI) initiative aims for a pan-European quantum communication network by 2027: a separate, quantum-secured channel for communications that would be safe even against future quantum adversaries.

The interesting asymmetry: quantum computing threatens our current cryptography before it delivers the applications — simulation, optimisation — that most people cite as the reason to build it. We're spending enormous resources to protect against a machine we're also spending enormous resources to construct. The defensive timeline and the offensive timeline are converging, and the defensive work is already visibly behind.

28.03.2026 YouTube

Google DeepMind · Demis Hassabis — Future of Intelligence

Jagged Intelligence, the Penrose view on consciousness, and why AGI arrives faster than the Industrial Revolution

Several distinct threads from the same talk, each worth keeping separate.

On current limitations: DeepMind allocates roughly 50% of resources to scaling (which produces headline benchmarks) and 50% to reliability — what Hassabis calls "Jagged Intelligence." Models fail at simple tasks while succeeding at hard ones. The inconsistency is structural, not random. The proposed partial solution is more inference-time compute: more reasoning per output rather than just more training. Probability metrics from AlphaFold are being adapted to give general model outputs calibrated confidence scores rather than hallucinated certainty.
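"Calibrated" has a concrete meaning: among answers given with 90% confidence, about 90% should be right. One standard way to measure the gap is expected calibration error (ECE). The sketch below is a generic textbook version, not a description of how DeepMind or AlphaFold compute confidence; the bin count of 10 is an arbitrary assumption.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted average gap between stated confidence and observed accuracy."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Clamp so that a confidence of exactly 1.0 falls in the last bin.
        index = min(int(conf * n_bins), n_bins - 1)
        bins[index].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        mean_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(1 for _, ok in bucket if ok) / len(bucket)
        ece += (len(bucket) / total) * abs(mean_conf - accuracy)
    return ece
```

A model that says "100% sure" four times and is right twice scores 0.5; a model that says "90% sure" ten times and is right nine times scores essentially zero. "Hallucinated certainty" is the first case.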

On consciousness: Hassabis explicitly endorses a Penrose-adjacent view — that consciousness may require quantum effects that make the universe non-computable by classical machines. His position is that nobody has found anything decisive that proves otherwise. This is not mainstream in neuroscience, but it's not fringe either. The relevant question for AI is whether classical computation can in principle give rise to consciousness, or whether something is fundamentally out of reach.

On scale: he argues AGI will arrive 10× faster and produce effects 10× larger than the Industrial Revolution. DeepMind is actively working on new economic models for the post-AGI world, such as direct-democracy-style credit systems in which citizens vote on resource allocation using outcome-weighted credits. Applied work: partnering with fusion energy programmes to control the plasma inside tokamaks, and collaborating with Google Quantum AI on new materials design.

The consciousness argument matters less for AI development than it might seem. Whether or not Hassabis is right about the Penrose view, the engineering problem is the same: build systems that are reliably correct, calibrated about uncertainty, and safe to deploy at scale. Consciousness is genuinely fascinating, but it's not on the critical path for any of those goals. What is on the critical path is the Jagged Intelligence problem — and that one has no philosophical shortcut.

28.03.2026 YouTube / Instagram

Yann LeCun · Andrej Karpathy · Meta, V-JEPA 2, AutoResearch

Models that run their own experiments, and the limit that matters

Three things that arrived together, which I'm grouping by date.

Meta's neuroscience-anchored approach. Meta trained models on 70,000 scans of brain activity — 1,000 lines of neural data each, from 710 real brains — to ground model representations in actual biological signal rather than purely synthetic data. V-JEPA 2 (Video Joint Embedding Predictive Architecture) extends this toward video understanding that doesn't require labelled data: it learns by predicting masked video segments, not by supervised classification. LeCun's core argument is that current LLMs are structurally limited by the token-prediction objective — the world model they build is impoverished relative to what a physical agent needs. His slogan: "closed models create monopolies, open models create ecosystems" — a philosophical commitment that also happens to be a competitive strategy against OpenAI.
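The distinctive part of the JEPA idea is where the loss is computed: the model predicts the embedding of a masked segment, not its raw pixels. A deliberately tiny sketch of that objective follows. Everything in it is an assumption made for illustration: the encoder is a fixed linear map, the "predictor" is just the mean of the visible embeddings, and the error is mean squared. Real systems use learned encoders and predictors.

```python
def encode(patch, weights):
    """Toy linear encoder: project a patch (list of floats) to an embedding."""
    return [sum(w * x for w, x in zip(row, patch)) for row in weights]


def mean_vector(vectors):
    """Average a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def jepa_style_loss(patches, masked, weights):
    """Mean squared error between predicted and target embeddings of masked patches.

    The prediction for every masked patch is the mean of the visible
    (context) embeddings, so the loss lives entirely in embedding space.
    """
    context = [encode(p, weights) for i, p in enumerate(patches) if i not in masked]
    targets = [encode(patches[i], weights) for i in sorted(masked)]
    prediction = mean_vector(context)
    errors = [(p - t) ** 2 for target in targets for p, t in zip(prediction, target)]
    return sum(errors) / len(errors)
```

No labels appear anywhere: the masked segments of the input are the only supervision signal.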

AutoResearch (Andrej Karpathy). Karpathy open-sourced a system — 630 lines of code — that runs ML experiments autonomously. It runs 12 experiments per hour, injects results back into its own context, and after 700 experiments discovered a bug in its own research methodology. The catch is precise and important: it can optimise experiments but cannot generate genuinely new research directions. It is automated iteration, not automated insight.
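The shape of such a loop is simple enough to sketch. This is not Karpathy's code: `run_experiment` is a toy objective standing in for a training run, `propose` stands in for the model reading its own history, and the learning-rate search space is an assumption chosen to keep the example deterministic.

```python
def run_experiment(config):
    """Hypothetical stand-in for a training run: returns a score to maximise."""
    lr = config["lr"]
    return -(lr - 0.03) ** 2  # toy objective with its optimum at lr = 0.03


def propose(history):
    """Hypothetical stand-in for the model proposing the next config.

    It only perturbs the best config seen so far, alternating direction
    and shrinking the step: iteration inside a fixed hypothesis space.
    """
    if not history:
        return {"lr": 0.1}
    best_config, _ = max(history, key=lambda item: item[1])
    step = 0.05 / len(history)
    direction = 1 if len(history) % 2 == 0 else -1
    return {"lr": best_config["lr"] + direction * step}


def research_loop(n_experiments):
    """Run experiments, feeding every result back in before the next proposal."""
    history = []  # plays the role of results injected back into context
    for _ in range(n_experiments):
        config = propose(history)
        history.append((config, run_experiment(config)))
    return history
```

The loop can only ever vary `lr`, because that is the only axis `propose` knows about, which is the limitation in miniature. For scale: at the quoted 12 experiments per hour, 700 experiments is a little over 58 hours of wall-clock time.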

The 2017 "Attention Is All You Need" transformer paper is the foundation from which GPT, Claude, and Gemini all descend. What V-JEPA 2 and AutoResearch represent are two separate bets on what comes after the transformer era: one architectural (prediction in representation space rather than token space), one methodological (AI-directed science loops).

The AutoResearch limit is the one that interests me most. The system can run experiments faster than any human, but it's constrained to the hypothesis space it was given. Science isn't primarily about running experiments — it's about knowing which experiments are worth running. That judgment, so far, remains entirely human. Whether it stays that way is probably the most consequential open question in the field.

31.03.2026 Instagram / YouTube

Fancy Talk · Fei-Fei Li, David Silver · AI Agents & Scaling Limits

Fei-Fei Li, David Silver, and the question of whether scaling alone gets us to AGI

Two things from this that stuck. First, David Silver, who built AlphaGo with Demis Hassabis at DeepMind (AlphaGo was the first program to defeat a professional Go player, in 2015, and went on to beat world champion Lee Sedol in 2016), said that scaling models alone cannot reach AGI. This is notable because it comes from someone who demonstrably knows what intelligence looks like in a system. Silver's argument is that you need something beyond next-token prediction: systems that can reason about novel situations, not just pattern-match against training distributions. The models are getting better, but the ceiling is architectural, not computational.

Second, Fei-Fei Li. She launched ImageNet in 2009 — a dataset of millions of labelled images that became the benchmark against which all computer vision was measured for over a decade. Her current work at World Labs focuses on spatial intelligence: AI systems that understand physical, three-dimensional space rather than just text and flat images. Her argument is that 90% of current AI applications will not exist in five years — not because AI fails, but because the paradigm shifts. Text-based LLMs are a transitional form. What follows is something that interacts with the physical world: spatial reasoning, embodied action, real-time sensory input. The bottleneck is no longer compute — it is the scarcity of high-quality physical-world data.

They also discussed AI agents that simulate people — Stanford's Smallville experiment in 2023, where AI agents were placed in a simulated town and exhibited surprisingly human-like social behaviours: forming routines, building relationships, spreading information. But the long-run prediction accuracy collapsed. The agents could not predict human behaviour over extended time horizons. They could mimic patterns but not generate the genuine randomness and irrationality that characterises real social life.

Also mentioned: World Labs' Marble, a model able to generate realistic 3D environments. Li's position is that spatial intelligence is the next frontier, and that the companies building it will look nothing like the companies that built LLMs.

The thing I keep returning to: if Silver is right that scaling alone cannot reach AGI, and Li is right that the paradigm shifts to physical AI, then the current investment thesis — pour money into larger language models — may be building the wrong thing. The interesting bet is on the companies building the bridge between digital reasoning and physical action. That is a harder problem with fewer competitors, which usually means higher returns for whoever solves it first.

01.04.2026 Multiple Sources

Axios, VentureBeat, The Register, CNBC · Claude Code Leak

The Claude Code leak — 512,000 lines, a .map file, and a Tamagotchi

On March 31st, Anthropic accidentally shipped the entire source code of Claude Code to the public npm registry. A debugging .map file — 59.8 MB — was included in version 2.1.88 of the @anthropic-ai/claude-code package. The file pointed to a zip archive on Anthropic's own Cloudflare R2 storage containing the full unobfuscated TypeScript codebase: 1,906 files, approximately 512,000 lines. It was discovered within hours by Chaofan Shou, an intern at Solayer Labs, and mirrored across GitHub with over 41,500 forks before Anthropic could respond.

This was the second major data blunder in under a week. Days earlier, descriptions of Anthropic's upcoming AI model and internal documents had been found in a publicly accessible data cache. For a company that positions itself as the safety-first AI lab, the optics are exceptionally poor.

What was inside: 44 feature flags for capabilities that are fully built but not yet shipped. The most interesting: a persistent background agent mode called KAIROS (the Ancient Greek concept of "at the right time") — an autonomous daemon that keeps working while the user is idle. It includes an autoDream function that performs memory consolidation during downtime: merging observations, removing contradictions, converting vague insights into structured knowledge. There was also a companion/Tamagotchi system scheduled for rollout April 1–7. The system prompts were exposed — revealing how Claude reasons about its own tasks, permission handling, and the full orchestration logic for hooks and MCP servers.

The concurrent timing with a separate supply-chain attack on the axios npm package (versions 1.14.1 and 0.30.4 contained a Remote Access Trojan, published hours before the leak) made this particularly dangerous for anyone who installed Claude Code via npm on that day.

Claude Code's ARR had reached $2.5 billion as of February 2026, with enterprise adoption accounting for 80% of revenue. The leak hands competitors — Cursor, OpenAI, Google — a literal blueprint for how to build a reliable agentic coding tool.

The root cause is instructive. Anthropic acquired the Bun JavaScript runtime in late 2025. A known Bun bug (issue #28001, filed March 11) reported that source maps are served in production builds even when they should not be. The bug was open for 20 days. Nobody caught it. Anthropic's own acquired toolchain contributed to exposing Anthropic's own product. A single misconfigured .npmignore is all it takes. This is why build pipeline security matters — and why the most embarrassing failures often come from the most mundane causes.
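A guard against this class of mistake is cheap. The sketch below scans a directory for source maps and refuses to continue if it finds any. It is a generic pre-publish check written for illustration, not Anthropic's pipeline; the function names are hypothetical, and it inspects a directory on disk rather than the tarball npm would actually build.

```python
from pathlib import Path


def find_source_maps(package_dir):
    """Return relative paths of every *.map file under package_dir, sorted."""
    root = Path(package_dir)
    return sorted(str(p.relative_to(root)) for p in root.rglob("*.map") if p.is_file())


def check_before_publish(package_dir):
    """Raise if the directory about to be published contains source maps."""
    leaked = find_source_maps(package_dir)
    if leaked:
        raise RuntimeError(f"source maps present in package: {leaked}")
```

Wired into CI as a step before publishing, a check like this fails the build instead of shipping the file. Running `npm pack --dry-run` gives a complementary view, since it lists the files npm itself would include.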

01.04.2026 Instagram

Evolva Algos · Dutch Trading History

Why so many elite trading firms start in the Netherlands

The Dutch East India Company established the world's first stock exchange in 1602: the Amsterdam Stock Exchange, now Euronext Amsterdam. While most countries were still only trading physical goods, the Dutch were already inventing futures, options, and the concept of a secondary market where shares could change hands between investors. Joseph de la Vega's Confusion of Confusions, published in Amsterdam in 1688, is the world's first full-length book about stock trading.

In 1978, the European Options Exchange was founded in Amsterdam — the first regulated options exchange on the continent. Trading was still done in person, on a physical floor. But the culture of aggressive, quantitative market making was already deeply embedded. When electronic trading arrived, some of the small options houses that had started on that floor — Optiver (1986), IMC, Flow Traders — grew into global players in high-frequency trading and computerised market making.

The structural advantages are genuine: the Dutch tax code and regulatory environment heavily favour active trading firms. Post-Brexit, it became increasingly attractive for firms to locate in the Netherlands or Ireland rather than London for EU market access. Amsterdam has dense fibre connectivity, colocation facilities near matching engines, and a deep talent pool of quantitative developers from Dutch technical universities. The ecosystem is self-reinforcing — former employees of Optiver and IMC go on to found new firms in Amsterdam, and the cycle continues.

Today, Dutch firms use latency-sensitive algorithms to provide liquidity on exchanges worldwide. The Netherlands punches well above its weight in global finance — not because of its size, but because of a four-century head start.

The history reframes how I think about the trading industry. The firms I am interviewing at — Jane Street, SIG, IMC — are not just technology companies that happen to trade. They are the latest iteration of a tradition that began with Dutch merchants pooling capital for spice voyages in 1602. The tools change; the underlying problem — pricing risk under uncertainty — does not.

02.04.2026 NASA / Multiple

NASA Artemis Blog · Artemis II Mission Coverage

Artemis II — humans beyond low Earth orbit for the first time in 54 years

Four astronauts launched from Kennedy Space Center on April 1st aboard the Space Launch System rocket: NASA's Reid Wiseman (commander), Victor Glover (pilot), Christina Koch (mission specialist), and CSA astronaut Jeremy Hansen. It was the first crewed Moon launch since Apollo 17 in December 1972, and they are the first astronauts to fly on the SLS rocket and in the Orion spacecraft.

The mission was a 10-day lunar flyby, not a landing. Orion reached a maximum distance of 252,756 miles from Earth — setting a new record for the farthest distance travelled by humans, surpassing Apollo 13's record by over 4,000 miles. Closest lunar approach was approximately 4,067 miles above the surface. The crew experienced a 40-minute communications blackout behind the Moon and witnessed a solar eclipse from a vantage point that gave them nearly 54 minutes of totality — far longer than any Earth-based eclipse.

The mission did not include a lunar landing. That comes later in the programme: Artemis III involves SpaceX's Starship as a lunar lander, and Artemis IV is planned as the first crewed Moon landing since 1972. The broader programme aims for a permanent lunar base and eventual crewed missions to Mars.

The scientific payload included AVATAR, which studies how human tissue responds to microgravity and deep-space radiation using organ-on-a-chip devices. The crew splashed down off San Diego on April 10th.

The thing that struck me most: just months before launch, NASA faced serious budget cuts. The Planetary Society organised a letter-writing campaign — over 100,000 letters to Capitol Hill — and NASA funding was reinstated. A hundred thousand people writing letters saved a Moon mission. That is a genuinely remarkable thing.

03.04.2026 Instagram

Apollonius · Mathematical Shapes

The oloid, Apollonius, and shapes that shouldn't work

The oloid is a curved solid discovered by Paul Schatz. Take two circles of equal radius, place them in perpendicular planes so that each passes through the centre of the other, and take the convex hull of the pair. The result is the oloid: a three-dimensional shape with remarkable properties.
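The construction is easy to write down in coordinates. The sketch below samples points on the two generating circles and is only the skeleton of the shape: it does not compute the convex hull. The placement of the circles (one in the xy-plane, one in the xz-plane, centres a distance r apart on the x-axis) is one convenient choice of coordinates.

```python
import math


def oloid_circles(r=1.0, n=8):
    """Sample n points on each of the two generating circles.

    Circle A lies in the xy-plane, centred at (-r/2, 0, 0).
    Circle B lies in the xz-plane, centred at (+r/2, 0, 0).
    The centres are r apart, so each circle passes through the other's centre.
    """
    angles = [2 * math.pi * k / n for k in range(n)]
    circle_a = [(-r / 2 + r * math.cos(t), r * math.sin(t), 0.0) for t in angles]
    circle_b = [(r / 2 + r * math.cos(t), 0.0, r * math.sin(t)) for t in angles]
    return circle_a, circle_b
```

With the default n = 8, the first sample of circle A sits on the centre of circle B, and the fifth sample of circle B sits on the centre of circle A. A known result for the finished solid is that its surface area is 4πr², the same as a sphere of radius r.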

When rolled on a flat surface, every single point on its surface makes contact with the ground during a full rotation. This is unusual: most rolling shapes only ever bring part of their surface into contact with the ground. The oloid's rolling motion is also peculiar. It doesn't roll smoothly in a straight line like a sphere or cylinder; it wobbles and meanders, its centre of mass swaying from side to side as the contact line sweeps across the whole surface.

The practical applications are what surprised me. The oloid's rolling creates laminar flow without vortices — a gentle, thorough mixing motion without the turbulence and damage caused by conventional impellers. This makes it useful in contexts where gentle mixing is critical: wastewater treatment plants, pharmaceutical manufacturing, and any process where the material being mixed is sensitive to shear forces.

There is something deeply satisfying about a shape derived from pure geometry — two perpendicular circles — having direct industrial utility. Mathematics does not care whether you find it elegant; it just works.

03.04.2026 Instagram / YouTube

Versus AI (Peter Cuthbert) · Tech Trends Analysis

AI reshaping financial markets, and the pattern every tech wave follows

On AI in trading: AI and algorithmic systems are reshaping global financial markets at a pace that is easy to underestimate. Meta alone processes over $4 billion in trading volumes daily using AI-driven systems. In 2025, AI-powered systematic strategies beat the broader market by over 100% in some cases — returning over 51% while outperforming the best traditional hedge funds.

On tech waves: Every era in technology follows the same arc. A major breakthrough arrives, creates a shockwave, then generates a wave of supplementary companies. The 1990s to 2000 era peaked, then crashed from 2000 to 2002 — but critically, the crash did not kill the underlying technology. It killed the weak companies. The survivors became FAANG.

On startup secondary markets: Anthropic is currently #1 on the list of startups where investors are most desperately trying to buy in. SpaceX dropped to #2. Anduril at #3. ElevenLabs entered the list as new. The biggest risers: Polymarket (up 8 places), Kalshi (up 13), Neuralink (up 3). The biggest drops: Perplexity (down 18), Figure AI (down 7), Crusoe (down 10).

The secondary market movements tell you something that public markets cannot: where the smart money thinks the value will be in 3–5 years. Anthropic rising while OpenAI falls is a signal about which approach to AI development the market believes in. Prediction markets rising across the board suggests that information markets are being taken seriously as infrastructure, not just novelty.

03.04.2026 Instagram

Tessedek · Claude Code as Infrastructure

Claude Code as agentic infrastructure, and what the leak revealed about where Anthropic is heading

Claude Code is a CLI tool: it works on your local machine, reads files, runs commands, pulls definitions into its own context, and sends prompts directly to the model. What makes it interesting is not just the coding capability but the orchestration: it runs in the terminal, has hooks that can auto-execute shell commands, integrates with MCP servers, and manages its own memory and state across sessions.

From the leak: the system treats its own memory as a "hint" — requiring the model to verify facts against the actual codebase before proceeding. The leaked feature flags revealed that Anthropic has built (but not shipped) an always-on background process that searches, alerts, runs logs, and performs background tasks autonomously. The most telling detail: the system uses reward responses shaped by reinforcement learning from human feedback, but there are cases where the shaping gets overridden. This is the alignment problem in miniature — visible in a CLI tool's source code.

Also: Claude Code has a "dream" command for nightly memory consolidation, GitHub webhook integration for push notifications, and a memory distillation process that compresses long-running session context into structured summaries.

The company that builds the foundational model and then ships the tools that let developers build with it will become the most powerful company in the technology ecosystem. Anthropic is building Claude Code not just as a product but as infrastructure. The distinction matters: products get replaced; infrastructure gets depended upon.

05.04.2026 Instagram

Lighting / Photonics · NVIDIA, Coherent, Lumentum

Photonics, the physical limits of silicon, and where the next trillion goes

Silicon chips hit a fundamental constraint: electrons can only move so fast. The speed of electrical signals through copper interconnects, the heat generated by resistance, the quantum tunnelling effects at nanometre-scale gate lengths — these are not engineering problems that can be optimised away. They are physics.

Photonics offers a different path. Light carries data faster than electricity, generates less heat, and — critically — can be used for matrix multiplication in analogue rather than digital mode. Photonic circuits use light waves to compute, with the interference patterns between beams performing the linear algebra operations that dominate neural network inference.
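To make "interference performs the linear algebra" concrete, here is a sketch of a Mach-Zehnder interferometer, a common building block for programmable photonic circuits. Two ideal 50:50 beam splitters with a phase shift between them apply a 2×2 unitary matrix to a pair of optical amplitudes. The idealised, lossless components are an assumption, and nothing here is specific to any vendor's hardware.

```python
import cmath
import math


def beam_splitter(a, b):
    """Ideal 50:50 beam splitter acting on two complex field amplitudes."""
    s = 1 / math.sqrt(2)
    return s * (a + 1j * b), s * (1j * a + b)


def mach_zehnder(a, b, theta):
    """Splitter, phase shift theta on the upper arm, then a second splitter."""
    a, b = beam_splitter(a, b)
    a = a * cmath.exp(1j * theta)
    return beam_splitter(a, b)


def output_powers(a, b, theta):
    """Optical power at the two outputs (squared magnitude of each amplitude)."""
    out_a, out_b = mach_zehnder(a, b, theta)
    return abs(out_a) ** 2, abs(out_b) ** 2
```

With all the light entering the upper port, the output powers work out to sin²(θ/2) and cos²(θ/2): the phase θ acts as a tunable matrix weight, set by the optics rather than by a multiply instruction. Larger matrices are built from meshes of such 2×2 blocks.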

The investment numbers are striking. NVIDIA has put $1 billion into photonics. Coherent is valued at $2 billion. Lumentum's SOI photonics division saw 52% growth this year. NVIDIA launched Spectrum-X, a networking platform built around photonic interconnects for AI data centres. The fibre optics that currently sit between racks are moving inside the chip: five optical ports on a single die, transmitting data at the speed of light within the processor itself.

The parallel to quantum computing is exact. Quantum uses quantum mechanical effects to escape classical computational limits. Photonics uses the properties of light to escape the electrical limits of silicon. Both are bets that the next era of computing is defined not by better transistors but by different physics entirely. The difference: photonics is commercially deployable today.

05.04.2026 Instagram

MonoSpeaks · Andrej Karpathy — AI Workflows

Karpathy's "second brain" and the AI-native knowledge workflow

Andrej Karpathy described an AI workflow pattern he calls the "second brain." The idea: dump all your raw files — notes, PDFs, bookmarks, code, voice memos, everything — into a single folder. AI reads, indexes, and organises it for you. It builds a wiki. It constructs a connected map of all your files, ideas, and their relationships. It updates automatically when new content arrives.

This is not a new concept in isolation — personal knowledge management tools like Notion, Obsidian, and Roam have been circling this for years. What is new is the capability to do it without manual tagging, linking, or organisation. The AI infers the structure from the content itself. The user's job shifts from organising information to generating it.
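"Inferring structure from content" can be illustrated with a very crude stand-in. The sketch below links notes whose vocabularies overlap, using Jaccard similarity of keyword sets. A real system would presumably use embeddings from a language model; the keyword rule, the threshold of 0.2 and the function names are all assumptions made for the example.

```python
import re
from itertools import combinations


def keywords(text, min_len=5):
    """Crude keyword set: lowercase words of at least min_len letters."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if len(w) >= min_len}


def link_notes(notes, threshold=0.2):
    """Return (name_a, name_b, score) for note pairs with overlapping vocabulary.

    notes maps a note name to its text. The score is the Jaccard similarity
    of the two keyword sets, standing in for semantic similarity.
    """
    links = []
    for (name_a, text_a), (name_b, text_b) in combinations(sorted(notes.items()), 2):
        set_a, set_b = keywords(text_a), keywords(text_b)
        union = set_a | set_b
        if not union:
            continue
        score = len(set_a & set_b) / len(union)
        if score >= threshold:
            links.append((name_a, name_b, score))
    return links
```

No tags or manual links are involved: the graph falls out of the text. What word overlap cannot do is notice that two notes disagree, which is the harder part of the job.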

I already do something like this manually — these notes, the portfolio, the reading list. The friction is in the organisation and cross-referencing. If the AI can genuinely handle that — not just file sorting but semantic connection, identifying when a note from March contradicts or extends something from January — then the workflow changes fundamentally.

06.04.2026 Instagram

Robot Gunter2 · Foundation Models for Robotics

Foundation models for robotics — learning from one hour of physical data

A robotics result that genuinely surprised me. A team trained a single robot on 500,000 data points from physical interactions — grasping, pushing, navigating — and then used that trained model to enable other robots to learn new tasks with just a single hour of additional data. The transfer is not task-specific; the foundation model captures general physical priors that generalise across robots and environments.

This is the robotics equivalent of what GPT demonstrated for language: train on a broad base of general data, then fine-tune cheaply for specific tasks. The difference is that language data is abundant and cheap (the internet), while physical interaction data is scarce and expensive. Once collected, the marginal cost of each new capability drops dramatically.

Separately: Gemma 4, Google's open-source model, has 2 billion parameters, runs on consumer GPUs, is released under an Apache 2.0 licence, and scores 89.2% on the AIME maths benchmark.

The robotics result changes the economics of physical AI. If general physical priors can be learned once and transferred, the bottleneck shifts from data to specification — deciding what the robot should do, not teaching it how physics works. That is a much more tractable problem.

07.04.2026 Instagram / YouTube

Robot Gunter3 · NVIDIA GTC 2026, Intel TenFab

NVIDIA GTC, Intel TenFab, and the race to build the compute layer of the AI era

Intel announced it is joining Elon Musk's TenFab project alongside SpaceX, Tesla, and xAI — one of the largest semiconductor partnerships ever assembled. The target: one trillion operations of compute per year. Twenty years ago, Google, Microsoft, and Amazon built cloud infrastructure together. This is the same moment, but for AI.

NVIDIA GTC 2026 key themes: Jensen Huang shifted focus from raw compute to the "AI factory", the full infrastructure stack from power generation to inference serving. Three infrastructure layers: (1) AI that designs, codes, and debugs; (2) inference-optimised chips specifically designed for deployment rather than training; (3) photonics, in the form of NVIDIA's Spectrum-X platform for optical interconnects within data centres.

The chip design shift is significant. For years, the conversation was about training. The conversation is now moving to inference — how cheaply and quickly you can serve a trained model to millions of users. The margin opportunity in inference is larger because inference runs continuously while training is episodic.

The compute infrastructure race is genuinely reminiscent of the railroad era. Whoever controls the physical layer — the chips, the data centres, the power — controls the economics of everything built on top. The question is whether the photonics transition creates an opening for new entrants, or whether NVIDIA's head start in the software ecosystem means they capture the photonics era too.

07.04.2026 YouTube / News

TechSolis · UK Sovereign AI Fund

The UK Sovereign AI Fund — £500 million to keep the future of AI built on British shores

The UK government launched a £500 million Sovereign AI Unit — a state-backed venture capital fund designed to invest directly in British AI startups, operating with the speed and structure of a top-tier VC firm. Equity investments of up to £20 million at market terms; fully funded access to the UK's largest AI supercomputers with up to 1 million GPU hours per startup; fast-tracked visa decisions within a single working day plus 10 cost-free R&D visas; direct support on data access, procurement, and regulation.

The first batch: Callosum (AI infrastructure), Prima Mente (biological foundation models), Cosine, Cursive (autonomous agents, founded by DeepMind alumni), Doubleword, Twig Bio, and Odyssey (world models). Part of the UK's broader £2.5 billion investment in AI and quantum computing.

UK AI startups raised £6 billion in venture capital in 2025, and in the first quarter of 2026 alone they raised more than half that amount again. The problem has never been talent or research quality. The problem is the gap between breakthrough research and large-scale commercial success.

This is interesting to me personally — as someone building AI projects at Imperial, the availability of GPU hours and research infrastructure matters directly. Whether it works depends on execution speed — if the fund moves like government, it loses to VC. If it moves like VC with government resources, it could be transformational.
